ESARBench is the first unified benchmark for MLLM-driven UAV agents that must explore, locate clues, and decide on victim positions in photorealistic simulated SAR environments.
Canonical reference
Title resolution pending
Canonical reference. 100% of citing Pith papers cite this work as background.
citation-role summary
citation-polarity summary
years
2026 6verdicts
UNVERDICTED 6roles
background 5polarities
background 5representative citing papers
Heterogeneous visual agents form shared symbols via decentralized Metropolis-Hastings captioning, where encoder similarity shapes the content and symmetry of the resulting language.
A conditional point-cloud flow matching model maps motor actuation to 3D geometry of tendon-driven continuum robots and outperforms prior self-modeling methods on simulated and real 2- and 3-module hardware.
Robots discover causal tool features through VLM suggestions and physics-based counterfactual perturbations in simulation, then transfer manipulation skills via conditioned keypoint matching.
Boundary element simulations demonstrate that droplet lens shape on elastic sheets depends on thickness and applied tension, with elongation and folds under uniaxial stretch.
A-ROM delivers competitive MedMNIST performance via pretrained ViT metric spaces, a concept dictionary, and kNN without backpropagation or fine-tuning, framed as interpretable few-shot learning under the Platonic Representation Hypothesis.
citing papers explorer
-
ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue
ESARBench is the first unified benchmark for MLLM-driven UAV agents that must explore, locate clues, and decide on victim positions in photorealistic simulated SAR environments.
-
Emergent Communication between Heterogeneous Visual Agents through Decentralized Learning
Heterogeneous visual agents form shared symbols via decentralized Metropolis-Hastings captioning, where encoder similarity shapes the content and symmetry of the resulting language.
-
Continuum Robot Modeling with Action Conditioned Flow Matching
A conditional point-cloud flow matching model maps motor actuation to 3D geometry of tendon-driven continuum robots and outperforms prior self-modeling methods on simulated and real 2- and 3-module hardware.
-
Creative Robot Tool Use by Counterfactual Reasoning
Robots discover causal tool features through VLM suggestions and physics-based counterfactual perturbations in simulation, then transfer manipulation skills via conditioned keypoint matching.
-
Droplets sitting on thin elastic sheets: A study with the boundary element method
Boundary element simulations demonstrate that droplet lens shape on elastic sheets depends on thickness and applied tension, with elongation and folds under uniaxial stretch.
-
Toward Aristotelian Medical Representations: Backpropagation-Free Layer-wise Analysis for Interpretable Generalized Metric Learning on MedMNIST
A-ROM delivers competitive MedMNIST performance via pretrained ViT metric spaces, a concept dictionary, and kNN without backpropagation or fine-tuning, framed as interpretable few-shot learning under the Platonic Representation Hypothesis.