arXiv preprint arXiv:2401.06730

Zhou, K · 2024 · arXiv 2401.06730

6 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

The Score Granularity Gap in Black-Box LLM Classification: A Comparative Study of Confidence Constructions

cs.CL · 2026-06-20 · unverdicted · novelty 6.0

Comparative evaluation of seven confidence constructions across 25 LLM-dataset pairs reveals that verbalized scores provide good ranking but coarse granularity for thresholding, while multi-query aggregation helps weak models but can harm strong ones.

Synthetic Sources?: Auditing Generative Search Engine Citations for Evidence of AI-Generated Sources

cs.IR · 2026-05-22 · unverdicted · novelty 6.0

Audit of ChatGPT, Copilot, Gemini and Perplexity finds ~16% of cited sources are AI-generated across 712 queries on politics, health and environment.

Beyond Semantic Relevance: Counterfactual Risk Minimization for Robust Retrieval-Augmented Generation

cs.CL · 2026-05-02 · unverdicted · novelty 6.0

CoRM-RAG uses a cognitive perturbation protocol to simulate biases and trains an Evidence Critic to retrieve documents that support correct decisions even under adversarial query changes.

A Roadmap to Pluralistic Alignment

cs.AI · 2024-02-07 · unverdicted · novelty 6.0

The paper formalizes three types of pluralistic AI models and three benchmark classes, arguing that current alignment techniques may reduce rather than increase distributional pluralism.

Calibrating Model-Based Evaluation Metrics for Summarization

cs.CL · 2026-04-19 · unverdicted · novelty 5.0

A reference-free proxy scoring framework combined with GIRB calibration produces better-aligned evaluation metrics for summarization and outperforms baselines across seven datasets.

Measuring and mitigating overreliance to build human-compatible AI

cs.CY · 2025-09-08 · conditional · novelty 5.0

The paper consolidates risks of overreliance on LLMs, identifies gaps in current measurement approaches, and proposes mitigation strategies to keep AI as a human-compatible thought partner.
