GPT-Fathom: Benchmarking large language models to decipher the evolutionary path towards GPT-4 and beyond.arXiv preprint arXiv:2309.16583

Zheng, L · arXiv 2309.16583

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis

cs.LG · 2026-04-16 · unverdicted · novelty 7.0

RL expands the capability boundary of LLM agents on compositional tool-use tasks, shown by non-converging pass curves at large k with increasing T, while SFT regresses it and the effect is absent on simpler tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis cs.LG · 2026-04-16 · unverdicted · none · ref 20
RL expands the capability boundary of LLM agents on compositional tool-use tasks, shown by non-converging pass curves at large k with increasing T, while SFT regresses it and the effect is absent on simpler tasks.

GPT-Fathom: Benchmarking large language models to decipher the evolutionary path towards GPT-4 and beyond.arXiv preprint arXiv:2309.16583

fields

years

verdicts

representative citing papers

citing papers explorer