CCRS: A zero-shot LLM-as-a-judge framework for comprehensive RAG evaluation.arXiv preprint arXiv:2506.20128, 2025

Aashiq Muhamed · 2025 · arXiv 2506.20128

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

H2HMem: A Multimodal Memory Benchmark for Agents in Human-Human Interactions

cs.CL · 2026-06-08 · unverdicted · novelty 7.0

H2HMem is a multimodal memory benchmark evaluating LLM agents on recall, reasoning, and application in dyadic and multi-party human-human conversations with phenomena such as anaphora and deixis.

citing papers explorer

Showing 1 of 1 citing paper.

H2HMem: A Multimodal Memory Benchmark for Agents in Human-Human Interactions cs.CL · 2026-06-08 · unverdicted · none · ref 63
H2HMem is a multimodal memory benchmark evaluating LLM agents on recall, reasoning, and application in dyadic and multi-party human-human conversations with phenomena such as anaphora and deixis.

CCRS: A zero-shot LLM-as-a-judge framework for comprehensive RAG evaluation.arXiv preprint arXiv:2506.20128, 2025

fields

years

verdicts

representative citing papers

citing papers explorer