Air: A systematic analysis of annotations, instructions, and response pairs in preference dataset

Bingxiang He, Ning Ding, Cheng Qian, Jia Deng, Ganqu Cui, Lifan Yuan, Haiwen Hong, Huan-ang Gao, Longtao Huang, Huimin Chen, et al. · 2025 · arXiv 2504.03612

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Yuvion LLM: An Adversarially-Aware Large Language Model for Content And AI Safety

cs.CL · 2026-06-26 · unverdicted · novelty 5.0

Yuvion LLM applies adversarially aware training and introduces the YLRE benchmark set, claiming superior safety robustness over larger models on multiple tasks.

A Survey of Reinforcement Learning for Large Reasoning Models

cs.CL · 2025-09-10 · accept · novelty 3.0

A survey compiling RL methods, challenges, data resources, and applications for enhancing reasoning in large language models and large reasoning models since DeepSeek-R1.

citing papers explorer

Showing 1 of 1 citing paper after filters.

A Survey of Reinforcement Learning for Large Reasoning Models

cs.CL · 2025-09-10 · accept · none · ref 185

A survey compiling RL methods, challenges, data resources, and applications for enhancing reasoning in large language models and large reasoning models since DeepSeek-R1.
