I-failsense: Towards general robotic failure detection with vision-language models

· 2026 · arXiv 2509.16072

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

representative citing papers

IntentVLM: Open-Vocabulary Intention Recognition through Forward-Inverse Modeling with Video-Language Models

cs.HC · 2026-04-27 · unverdicted · novelty 7.0

IntentVLM uses forward-inverse modeling in a two-stage video-language setup to reach up to 80% accuracy on open-vocabulary intention recognition benchmarks, beating baselines by 30% and matching human performance.

Foresight: Failure Detection for Long-Horizon Robotic Manipulation with Action-Conditioned World Model Latents

cs.RO · 2026-06-22 · unverdicted · novelty 6.0

Foresight detects failures in long-horizon robotic manipulation using latents from action-conditioned world models trained only on task-level labels and calibrated via functional conformal prediction.

A Physical Agentic Loop for Language-Guided Grasping with Execution-State Monitoring

cs.RO · 2026-04-08 · unverdicted · novelty 6.0

A physical agentic loop with execution-state monitoring improves robustness of language-guided grasping over open-loop execution by converting noisy telemetry into discrete outcome events that trigger retries or user escalation.

FAR: Failure-Aware Retry for Test-Time Recovery and Continual Policy Improvement

cs.RO · 2026-07-01 · unverdicted · novelty 4.0

FAR combines failure-contrastive preference adaptation with action perturbations for test-time recovery and continual policy improvement, reporting 17.6% and 11.7% success gains over diffusion policies in simulation and real-world manipulation tasks.

Fail-RAG : A Retrieval Augmented Generation Informed Framework for Robot Failure Identification

cs.RO · 2026-06-17 · unverdicted · novelty 4.0

Fail-RAG is a retrieval-augmented generation framework that detects and describes robot failures in warehouse tasks by querying an embedded failure database and applying VLMs, showing 25 percentage point higher accuracy than off-the-shelf VLMs.

citing papers explorer

Showing 5 of 5 citing papers.

IntentVLM: Open-Vocabulary Intention Recognition through Forward-Inverse Modeling with Video-Language Models cs.HC · 2026-04-27 · unverdicted · none · ref 12
IntentVLM uses forward-inverse modeling in a two-stage video-language setup to reach up to 80% accuracy on open-vocabulary intention recognition benchmarks, beating baselines by 30% and matching human performance.
Foresight: Failure Detection for Long-Horizon Robotic Manipulation with Action-Conditioned World Model Latents cs.RO · 2026-06-22 · unverdicted · none · ref 28
Foresight detects failures in long-horizon robotic manipulation using latents from action-conditioned world models trained only on task-level labels and calibrated via functional conformal prediction.
A Physical Agentic Loop for Language-Guided Grasping with Execution-State Monitoring cs.RO · 2026-04-08 · unverdicted · none · ref 15
A physical agentic loop with execution-state monitoring improves robustness of language-guided grasping over open-loop execution by converting noisy telemetry into discrete outcome events that trigger retries or user escalation.
FAR: Failure-Aware Retry for Test-Time Recovery and Continual Policy Improvement cs.RO · 2026-07-01 · unverdicted · none · ref 19
FAR combines failure-contrastive preference adaptation with action perturbations for test-time recovery and continual policy improvement, reporting 17.6% and 11.7% success gains over diffusion policies in simulation and real-world manipulation tasks.
Fail-RAG : A Retrieval Augmented Generation Informed Framework for Robot Failure Identification cs.RO · 2026-06-17 · unverdicted · none · ref 8
Fail-RAG is a retrieval-augmented generation framework that detects and describes robot failures in warehouse tasks by querying an embedded failure database and applying VLMs, showing 25 percentage point higher accuracy than off-the-shelf VLMs.

I-failsense: Towards general robotic failure detection with vision-language models

fields

years

verdicts

representative citing papers

citing papers explorer