D Critic-Embed Training Details: Critic-Embed is initialized from the Stella-400M embedding model (Zhang et al., 2025a) and fine-tuned with InfoNCE (temperature τ = 0.02)

test 14,267 Table 6: Evaluation set sizes for the QA datasets used in our experiments · arXiv 4300.6668

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback

cs.IR · 2026-05-30 · unverdicted · novelty 5.0

Critic-R uses a critic model for natural-language introspective feedback to refine queries at inference time and optimize retrievers from successful/failed trajectories on multi-hop QA tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback cs.IR · 2026-05-30 · unverdicted · none · ref 18
