pith. machine review for the scientific record.
sign in

Marcus Rohrbach

Identifiers

No identifiers captured yet.

Papers (51)

  1. Think Twice, Act Once: Verifier-Guided Action Selection For Embodied Agents cs.AI · 2026 · author #6
  2. SIEVES: Selective Prediction Generalizes through Visual Evidence Scoring cs.CV · 2026 · author #2
  3. ReCap: Lightweight Referential Grounding for Coherent Story Visualization cs.CV · 2026 · author #4
  4. HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models cs.CV · 2026 · author #6
  5. VeriTaS: The First Dynamic Benchmark for Multimodal Automated Fact-Checking cs.IR · 2026 · author #3
  6. Variational Visual Question Answering for Uncertainty-Aware Selective Prediction cs.CV · 2025 · author #4
  7. Towards VQA Models That Can Read cs.CL · 2019 · author #8
  8. On Tiny Episodic Memories in Continual Learning cs.LG · 2019 · author #2
  9. Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering cs.LG · 2019 · author #4
  10. Cycle-Consistency for Robust Visual Question Answering cs.CV · 2019 · author #3
  11. DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition cs.CV · 2019 · author #5
  12. Exploring the Challenges towards Lifelong Fact Learning cs.CV · 2018 · author #4
  13. Grounded Video Description cs.CV · 2018 · author #5
  14. Adversarial Inference for Multi-Sentence Video Description cs.CV · 2018 · author #2
  15. Efficient Lifelong Learning with A-GEM cs.LG · 2018 · author #3
  16. Graph-Based Global Reasoning Networks cs.CV · 2018 · author #2
  17. Visual Coreference Resolution in Visual Dialog using Neural Module Networks cs.CV · 2018 · author #5
  18. Pythia v0.1: the Winning Entry to the VQA Challenge 2018 cs.CV · 2018 · author #4
  19. Selfless Sequential Learning stat.ML · 2018 · author #2
  20. Multimodal Explanations: Justifying Decisions and Pointing to the Evidence cs.AI · 2018 · author #7
  21. CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication cs.CV · 2017 · author #4
  22. Memory Aware Synapses: Learning what (not) to forget cs.CV · 2017 · author #4
  23. Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract) cs.CV · 2017 · author #7
  24. Learning to Reason: End-to-End Module Networks for Visual Question Answering cs.CV · 2017 · author #3
  25. Generating Descriptions with Grounded and Co-Referenced People cs.CV · 2017 · author #2
  26. Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training cs.CV · 2017 · author #2
  27. Attentive Explanations: Justifying Decisions and Pointing to the Evidence cs.CV · 2016 · author #6
  28. Modeling Relationships in Referential Expressions with Compositional Modular Networks cs.CV · 2016 · author #2
  29. Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions cs.CV · 2016 · author #2
  30. Captioning Images with Diverse Objects cs.CV · 2016 · author #3
  31. Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding cs.CV · 2016 · author #6
  32. Movie Description cs.CV · 2016 · author #3
  33. Ask Your Neurons: A Deep Learning Approach to Visual Question Answering cs.CV · 2016 · author #2
  34. Attributes as Semantic Units between Natural Language and Visual Recognition cs.CV · 2016 · author #1
  35. Generating Visual Explanations cs.CV · 2016 · author #3
  36. Segmentation from Natural Language Expressions cs.CV · 2016 · author #2
  37. Learning to Compose Neural Networks for Question Answering cs.CL · 2016 · author #2
  38. Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data cs.CV · 2015 · author #3
  39. Natural Language Object Retrieval cs.CV · 2015 · author #3
  40. Grounding of Textual Phrases in Images by Reconstruction cs.CV · 2015 · author #2
  41. Neural Module Networks cs.CV · 2015 · author #2
  42. Spatial Semantic Regularisation for Large Scale Object Detection cs.CV · 2015 · author #2
  43. The Long-Short Story of Movie Description cs.CV · 2015 · author #2
  44. A Multi-scale Multiple Instance Video Description Network cs.CV · 2015 · author #4
  45. Ask Your Neurons: A Neural-based Approach to Answering Questions about Images cs.CV · 2015 · author #2
  46. Sequence to Sequence -- Video to Text cs.CV · 2015 · author #2
  47. Recognizing Fine-Grained and Composite Activities using Hand-Centric Features and Script Data cs.CV · 2015 · author #1
  48. A Dataset for Movie Description cs.CV · 2015 · author #2
  49. Translating Videos to Natural Language Using Deep Recurrent Neural Networks cs.CV · 2014 · author #4
  50. Long-term Recurrent Convolutional Networks for Visual Recognition and Description cs.CV · 2014 · author #3
  51. Coherent Multi-Sentence Video Description with Variable Level of Detail cs.CV · 2014 · author #2

Mentions

No mention provenance yet.

Frequent Coauthors