Marcus Rohrbach
Identifiers
No identifiers captured yet.
Papers (51)
- Think Twice, Act Once: Verifier-Guided Action Selection For Embodied Agents cs.AI · 2026 · author #6
- SIEVES: Selective Prediction Generalizes through Visual Evidence Scoring cs.CV · 2026 · author #2
- ReCap: Lightweight Referential Grounding for Coherent Story Visualization cs.CV · 2026 · author #4
- HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models cs.CV · 2026 · author #6
- VeriTaS: The First Dynamic Benchmark for Multimodal Automated Fact-Checking cs.IR · 2026 · author #3
- Variational Visual Question Answering for Uncertainty-Aware Selective Prediction cs.CV · 2025 · author #4
- Towards VQA Models That Can Read cs.CL · 2019 · author #8
- On Tiny Episodic Memories in Continual Learning cs.LG · 2019 · author #2
- Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering cs.LG · 2019 · author #4
- Cycle-Consistency for Robust Visual Question Answering cs.CV · 2019 · author #3
- DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition cs.CV · 2019 · author #5
- Exploring the Challenges towards Lifelong Fact Learning cs.CV · 2018 · author #4
- Grounded Video Description cs.CV · 2018 · author #5
- Adversarial Inference for Multi-Sentence Video Description cs.CV · 2018 · author #2
- Efficient Lifelong Learning with A-GEM cs.LG · 2018 · author #3
- Graph-Based Global Reasoning Networks cs.CV · 2018 · author #2
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks cs.CV · 2018 · author #5
- Pythia v0.1: the Winning Entry to the VQA Challenge 2018 cs.CV · 2018 · author #4
- Selfless Sequential Learning stat.ML · 2018 · author #2
- Multimodal Explanations: Justifying Decisions and Pointing to the Evidence cs.AI · 2018 · author #7
- CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication cs.CV · 2017 · author #4
- Memory Aware Synapses: Learning what (not) to forget cs.CV · 2017 · author #4
- Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract) cs.CV · 2017 · author #7
- Learning to Reason: End-to-End Module Networks for Visual Question Answering cs.CV · 2017 · author #3
- Generating Descriptions with Grounded and Co-Referenced People cs.CV · 2017 · author #2
- Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training cs.CV · 2017 · author #2
- Attentive Explanations: Justifying Decisions and Pointing to the Evidence cs.CV · 2016 · author #6
- Modeling Relationships in Referential Expressions with Compositional Modular Networks cs.CV · 2016 · author #2
- Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions cs.CV · 2016 · author #2
- Captioning Images with Diverse Objects cs.CV · 2016 · author #3
- Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding cs.CV · 2016 · author #6
- Movie Description cs.CV · 2016 · author #3
- Ask Your Neurons: A Deep Learning Approach to Visual Question Answering cs.CV · 2016 · author #2
- Attributes as Semantic Units between Natural Language and Visual Recognition cs.CV · 2016 · author #1
- Generating Visual Explanations cs.CV · 2016 · author #3
- Segmentation from Natural Language Expressions cs.CV · 2016 · author #2
- Learning to Compose Neural Networks for Question Answering cs.CL · 2016 · author #2
- Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data cs.CV · 2015 · author #3
- Natural Language Object Retrieval cs.CV · 2015 · author #3
- Grounding of Textual Phrases in Images by Reconstruction cs.CV · 2015 · author #2
- Neural Module Networks cs.CV · 2015 · author #2
- Spatial Semantic Regularisation for Large Scale Object Detection cs.CV · 2015 · author #2
- The Long-Short Story of Movie Description cs.CV · 2015 · author #2
- A Multi-scale Multiple Instance Video Description Network cs.CV · 2015 · author #4
- Ask Your Neurons: A Neural-based Approach to Answering Questions about Images cs.CV · 2015 · author #2
- Sequence to Sequence -- Video to Text cs.CV · 2015 · author #2
- Recognizing Fine-Grained and Composite Activities using Hand-Centric Features and Script Data cs.CV · 2015 · author #1
- A Dataset for Movie Description cs.CV · 2015 · author #2
- Translating Videos to Natural Language Using Deep Recurrent Neural Networks cs.CV · 2014 · author #4
- Long-term Recurrent Convolutional Networks for Visual Recognition and Description cs.CV · 2014 · author #3
- Coherent Multi-Sentence Video Description with Variable Level of Detail cs.CV · 2014 · author #2
Mentions
No mention provenance yet.
Frequent Coauthors
- Trevor Darrell 19 shared papers
- Anna Rohrbach 13 shared papers
- Bernt Schiele 12 shared papers
- Kate Saenko 10 shared papers
- Lisa Anne Hendricks 8 shared papers
- Ronghang Hu 7 shared papers
- Subhashini Venugopalan 7 shared papers
- Devi Parikh 6 shared papers
- Dhruv Batra 5 shared papers
- Xinlei Chen 5 shared papers
- Dong Huk Park 4 shared papers
- Jacob Andreas 4 shared papers
- Jeff Donahue 4 shared papers
- Mohamed Elhoseiny 4 shared papers
- Raymond Mooney 4 shared papers
- Zeynep Akata 4 shared papers
- Mario Fritz 3 shared papers
- Rahaf Aljundi 3 shared papers
- Tinne Tuytelaars 3 shared papers
- Yannis Kalantidis 3 shared papers