GenSim: Generating Robotic Simulation Tasks Via Large Language Models

· 2024 · arXiv 2310.01361

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

read on arXiv browse 8 citing papers

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

KinDER: A Physical Reasoning Benchmark for Robot Learning and Planning

cs.RO · 2026-04-28 · unverdicted · novelty 7.0

KinDER is a new open-source benchmark that demonstrates substantial gaps in current robot learning and planning methods for handling physical constraints.

KITE: Keyframe-Indexed Tokenized Evidence for VLM-Based Robot Failure Analysis

cs.RO · 2026-04-08 · unverdicted · novelty 7.0

KITE is a training-free method that uses keyframe-indexed tokenized evidence including BEV schematics to enhance VLM performance on robot failure detection, identification, localization, explanation, and correction.

Generative Simulation for Policy Learning in Physical Human-Robot Interaction

cs.RO · 2026-04-09 · unverdicted · novelty 6.0

A text-to-simulation pipeline using LLMs and VLMs generates synthetic pHRI data to train vision-based imitation learning policies that achieve over 80% success in zero-shot sim-to-real transfer on real assistive tasks.

RoboPlayground: Democratizing Robotic Evaluation through Structured Physical Domains

cs.RO · 2026-04-06 · unverdicted · novelty 6.0

RoboPlayground reframes robotic manipulation evaluation as a language-driven process over structured physical domains, letting users author varied yet reproducible tasks that reveal policy generalization failures.

IGen: Scalable Data Generation for Robot Learning from Open-World Images

cs.RO · 2025-12-01 · unverdicted · novelty 6.0

IGen generates realistic visuomotor training data including actions and temporally coherent visuals from unstructured open-world images via 3D reconstruction and VLM reasoning.

To Select or not to Select, that is the Question: Distilling Robot Skill Prediction into a Small Ensemble

cs.RO · 2026-05-20 · unverdicted · novelty 4.0

A 133M-parameter ensemble of fine-tuned mpnet and MiniLM encoders achieves 83.5% accuracy on a 200-task synthetic benchmark for robot skill prediction, beating several larger zero-shot LLMs.

LLM-based Realistic Safety-Critical Driving Video Generation

cs.RO · 2025-07-02 · unverdicted · novelty 4.0

Framework uses LLMs for few-shot CARLA scenario code generation focused on collisions, followed by Cosmos-Transfer1 with ControlNet to produce realistic safety-critical driving videos.

Vision-Language-Action in Robotics: A Survey of Datasets, Benchmarks, and Data Engines

cs.RO · 2026-04-24 · unverdicted · novelty 3.0

A survey of VLA robotics research identifies data infrastructure as the primary bottleneck and distills four open challenges in representation alignment, multimodal supervision, reasoning assessment, and scalable data generation.

citing papers explorer

Showing 8 of 8 citing papers.

KinDER: A Physical Reasoning Benchmark for Robot Learning and Planning cs.RO · 2026-04-28 · unverdicted · none · ref 15
KinDER is a new open-source benchmark that demonstrates substantial gaps in current robot learning and planning methods for handling physical constraints.
KITE: Keyframe-Indexed Tokenized Evidence for VLM-Based Robot Failure Analysis cs.RO · 2026-04-08 · unverdicted · none · ref 41
KITE is a training-free method that uses keyframe-indexed tokenized evidence including BEV schematics to enhance VLM performance on robot failure detection, identification, localization, explanation, and correction.
Generative Simulation for Policy Learning in Physical Human-Robot Interaction cs.RO · 2026-04-09 · unverdicted · none · ref 19
A text-to-simulation pipeline using LLMs and VLMs generates synthetic pHRI data to train vision-based imitation learning policies that achieve over 80% success in zero-shot sim-to-real transfer on real assistive tasks.
RoboPlayground: Democratizing Robotic Evaluation through Structured Physical Domains cs.RO · 2026-04-06 · unverdicted · none · ref 32
RoboPlayground reframes robotic manipulation evaluation as a language-driven process over structured physical domains, letting users author varied yet reproducible tasks that reveal policy generalization failures.
IGen: Scalable Data Generation for Robot Learning from Open-World Images cs.RO · 2025-12-01 · unverdicted · none · ref 59
IGen generates realistic visuomotor training data including actions and temporally coherent visuals from unstructured open-world images via 3D reconstruction and VLM reasoning.
To Select or not to Select, that is the Question: Distilling Robot Skill Prediction into a Small Ensemble cs.RO · 2026-05-20 · unverdicted · none · ref 1
A 133M-parameter ensemble of fine-tuned mpnet and MiniLM encoders achieves 83.5% accuracy on a 200-task synthetic benchmark for robot skill prediction, beating several larger zero-shot LLMs.
LLM-based Realistic Safety-Critical Driving Video Generation cs.RO · 2025-07-02 · unverdicted · none · ref 8
Framework uses LLMs for few-shot CARLA scenario code generation focused on collisions, followed by Cosmos-Transfer1 with ControlNet to produce realistic safety-critical driving videos.
Vision-Language-Action in Robotics: A Survey of Datasets, Benchmarks, and Data Engines cs.RO · 2026-04-24 · unverdicted · none · ref 23
A survey of VLA robotics research identifies data infrastructure as the primary bottleneck and distills four open challenges in representation alignment, multimodal supervision, reasoning assessment, and scalable data generation.

GenSim: Generating Robotic Simulation Tasks Via Large Language Models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer