Gemini 3.1 pro model card

Google DeepMind · 2026

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction

cs.CV · 2026-05-17 · conditional · novelty 7.0

Omni-DuplexEval creates a new benchmark and LLM-as-a-Judge framework for real-time duplex omni-modal interaction, revealing that current models score below 40% overall and struggle especially with proactive responses.

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

cs.CL · 2026-05-22 · unverdicted · novelty 6.0

OpenSkillEval automatically builds realistic tasks from evolving artifacts to audit skill effectiveness in LLM agents, finding that skill use depends on model and framework and that many popular skills do not outperform base agents.

citing papers explorer

Showing 2 of 2 citing papers.

Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction cs.CV · 2026-05-17 · conditional · none · ref 2
Omni-DuplexEval creates a new benchmark and LLM-as-a-Judge framework for real-time duplex omni-modal interaction, revealing that current models score below 40% overall and struggle especially with proactive responses.
OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents cs.CL · 2026-05-22 · unverdicted · none · ref 13
OpenSkillEval automatically builds realistic tasks from evolving artifacts to audit skill effectiveness in LLM agents, finding that skill use depends on model and framework and that many popular skills do not outperform base agents.

Gemini 3.1 pro model card

fields

years

verdicts

representative citing papers

citing papers explorer