PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes

DOI: 10 · 2018 · DOI 10.15607/rss.2018.xiv

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

open at publisher browse 9 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Pose Estimation for Non-Cooperative Rendezvous Using Neural Networks

cs.CV · 2019-06-24 · unverdicted · novelty 7.0

SPN is a CNN that detects a spacecraft bounding box, classifies then regresses attitude, and optimizes position via Gauss-Newton, achieving degree-level attitude and cm-level position errors on real images after training only on synthetic data.

AI Coaching for Accelerating Human Skill Development with Reinforcement Learning

cs.RO · 2026-06-24 · unverdicted · novelty 6.0

A reinforcement learning framework for AI coaching, modeled as a non-cooperative game with causal skill models, shows improved human learning outcomes in a drone racing user study over baselines.

HORIZON: Recoverability-Governed Curriculum for Physical-Domain Scaling

cs.RO · 2026-06-03 · unverdicted · novelty 6.0

HORIZON is a recoverability-governed checkpointed frontier curriculum for on-policy physical-domain scaling on quadruped locomotion that identifies three regularities: uneven widening, non-monotonic composition, and the necessity of joint on-policy interaction.

GPU-Parallel Multi-Task Reinforcement Learning with Demonstration Guided Policy Optimization

cs.RO · 2026-06-02 · unverdicted · novelty 6.0

Presents MT-Libero, a GPU-parallel multi-task RL benchmark in Isaac Lab, and DGPO, an on-policy method combining importance-weighted PPO with adaptive behavior cloning from demonstrations.

From Demonstrations to Rewards: Test-Time Prompt Optimization for VLM Reward Models

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

Demo2Reward optimizes VLM reward model language instructions at test time from a few demonstrations to reduce false positives and enable policy learning in simulated and real robotic tasks without manual reward design.

Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations

cs.RO · 2026-05-21 · unverdicted · novelty 6.0

Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.

Creative Robot Tool Use by Counterfactual Reasoning

cs.RO · 2026-05-06 · unverdicted · novelty 6.0

Robots discover causal tool features through VLM suggestions and physics-based counterfactual perturbations in simulation, then transfer manipulation skills via conditioned keypoint matching.

SynManDex: Synthesizing Human-like Dexterous Grasps from Synthetic Human Pre-Grasps

cs.RO · 2026-06-08 · unverdicted · novelty 5.0

SynManDex generates human-like dexterous grasps for robots from synthetic human pre-grasps via retargeting and force-closure optimization, reporting 86.4% stability, 4.67/5 human-likeness, 80.7% sim success, and 83.3% real-robot success.

Mind Your Steps: A General Learning Framework for Accurate Humanoid Foothold Tracking

cs.RO · 2026-06-06 · unverdicted · novelty 5.0

A lightweight RL framework trains terrain-agnostic 3D foothold-tracking policies for humanoids that transfer directly to real-world use as standalone low-level controllers.

citing papers explorer

Showing 9 of 9 citing papers.

Pose Estimation for Non-Cooperative Rendezvous Using Neural Networks cs.CV · 2019-06-24 · unverdicted · none · ref 23
SPN is a CNN that detects a spacecraft bounding box, classifies then regresses attitude, and optimizes position via Gauss-Newton, achieving degree-level attitude and cm-level position errors on real images after training only on synthetic data.
AI Coaching for Accelerating Human Skill Development with Reinforcement Learning cs.RO · 2026-06-24 · unverdicted · none · ref 6
A reinforcement learning framework for AI coaching, modeled as a non-cooperative game with causal skill models, shows improved human learning outcomes in a drone racing user study over baselines.
HORIZON: Recoverability-Governed Curriculum for Physical-Domain Scaling cs.RO · 2026-06-03 · unverdicted · none · ref 9
HORIZON is a recoverability-governed checkpointed frontier curriculum for on-policy physical-domain scaling on quadruped locomotion that identifies three regularities: uneven widening, non-monotonic composition, and the necessity of joint on-policy interaction.
GPU-Parallel Multi-Task Reinforcement Learning with Demonstration Guided Policy Optimization cs.RO · 2026-06-02 · unverdicted · none · ref 18
Presents MT-Libero, a GPU-parallel multi-task RL benchmark in Isaac Lab, and DGPO, an on-policy method combining importance-weighted PPO with adaptive behavior cloning from demonstrations.
From Demonstrations to Rewards: Test-Time Prompt Optimization for VLM Reward Models cs.LG · 2026-05-22 · unverdicted · none · ref 13
Demo2Reward optimizes VLM reward model language instructions at test time from a few demonstrations to reduce false positives and enable policy learning in simulated and real robotic tasks without manual reward design.
Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations cs.RO · 2026-05-21 · unverdicted · none · ref 20
Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.
Creative Robot Tool Use by Counterfactual Reasoning cs.RO · 2026-05-06 · unverdicted · none · ref 11
Robots discover causal tool features through VLM suggestions and physics-based counterfactual perturbations in simulation, then transfer manipulation skills via conditioned keypoint matching.
SynManDex: Synthesizing Human-like Dexterous Grasps from Synthetic Human Pre-Grasps cs.RO · 2026-06-08 · unverdicted · none · ref 52
SynManDex generates human-like dexterous grasps for robots from synthetic human pre-grasps via retargeting and force-closure optimization, reporting 86.4% stability, 4.67/5 human-likeness, 80.7% sim success, and 83.3% real-robot success.
Mind Your Steps: A General Learning Framework for Accurate Humanoid Foothold Tracking cs.RO · 2026-06-06 · unverdicted · none · ref 33
A lightweight RL framework trains terrain-agnostic 3D foothold-tracking policies for humanoids that transfer directly to real-world use as standalone low-level controllers.

PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer