Gemini 1.5 models achieve near-perfect recall over up to 10M tokens of multimodal context, improve long-document and long-video QA, and match or exceed prior Gemini 1.0 Ultra performance on standard benchmarks.
Nicholas Carlini, Milad Nasr, Christopher A
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CL 2representative citing papers
TEC is a new public dataset of detailed human trial-and-error trajectories and reflections on web tasks, with humans showing substantially higher accuracy than LLMs.
citing papers explorer
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Gemini 1.5 models achieve near-perfect recall over up to 10M tokens of multimodal context, improve long-document and long-video QA, and match or exceed prior Gemini 1.0 Ultra performance on standard benchmarks.
-
TEC: A Collection of Human Trial-and-error Trajectories for Problem Solving
TEC is a new public dataset of detailed human trial-and-error trajectories and reflections on web tasks, with humans showing substantially higher accuracy than LLMs.