V-IRL: Grounding Virtual Intelligence in Real Life

Ellis Brown; Jihan Yang; Runyu Ding; Saining Xie; Xiaojuan Qi

arxiv: 2402.03310 · v3 · pith:V24OSFX4new · submitted 2024-02-05 · 💻 cs.AI · cs.CV

V-IRL: Grounding Virtual Intelligence in Real Life

Jihan Yang , Runyu Ding , Ellis Brown , Xiaojuan Qi , Saining Xie This is my paper

classification 💻 cs.AI cs.CV

keywords agentsrealdigitalenvironmenthumansinhabitplatformreal-world

0 comments

read the original abstract

There is a sensory gulf between the Earth that humans inhabit and the digital realms in which modern AI agents are created. To develop AI agents that can sense, think, and act as flexibly as humans in real-world settings, it is imperative to bridge the realism gap between the digital and physical worlds. How can we embody agents in an environment as rich and diverse as the one we inhabit, without the constraints imposed by real hardware and control? Towards this end, we introduce V-IRL: a platform that enables agents to scalably interact with the real world in a virtual yet realistic environment. Our platform serves as a playground for developing agents that can accomplish various practical tasks and as a vast testbed for measuring progress in capabilities spanning perception, decision-making, and interaction with real-world data across the entire globe.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Less Detail, Better Answers: Degradation-Driven Prompting for VQA
cs.CV 2026-04 unverdicted novelty 5.0

Degradation-Driven Prompting improves VQA by intentionally reducing image detail and using masks, lines, and examples to guide models toward essential structures.