The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark for Physically Realistic Embodied AI. arXiv preprint arXiv:2103.14025, 2021
2 Pith papers cite this work. Polarity classification is still indexing.
citing papers explorer
- BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation
  BEHAVIOR-1K introduces a benchmark of 1,000 human everyday activities in realistic simulated scenes together with the OMNIGIBSON physics simulator to evaluate embodied AI.
- MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models
  MEBench is a new benchmark and data-generation pipeline that measures mutual exclusivity bias in VLMs, finding weak bias but some use of spatial context to resolve novel-object ambiguity.