Mohit Bansal
Identifiers
No identifiers captured yet.
Papers (73)
- PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation cs.CV · 2026 · author #9
- Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty cs.CL · 2026 · author #8
- EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding cs.CV · 2026 · author #9
- Stabilizing Efficient Reasoning with Step-Level Advantage Selection cs.CL · 2026 · author #6
- MERRIN: A Benchmark for Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments cs.CL · 2026 · author #9
- Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind cs.CL · 2026 · author #6
- The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment cs.LG · 2026 · author #8
- Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems cs.LG · 2026 · author #7
- Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style cs.CV · 2026 · author #10
- Multimodal Fact-Level Attribution for Verifiable Reasoning cs.CL · 2026 · author #6
- StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos cs.CV · 2025 · author #9
- One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration cs.AI · 2025 · author #5
- VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing cs.RO · 2025 · author #9
- OpenThoughts: Data Recipes for Reasoning Models cs.LG · 2025 · author #37
- SiLVR: A Simple Language-based Video Reasoning Framework cs.CV · 2025 · author #4
- Self-Correcting Text-to-Video Generation with Misalignment Detection and Localized Refinement cs.CV · 2024 · author #4
- Evaluating Very Long-Term Conversational Memory of LLM Agents cs.CL · 2024 · author #4
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #276
- Expressing Visual Relationships via Language cs.CL · 2019 · author #5
- Avoiding Reasoning Shortcuts: Adversarial Evaluation, Training, and Model Development for Multi-Hop QA cs.CL · 2019 · author #2
- Improving Visual Question Answering by Referring to Generated Paragraph Captions cs.CL · 2019 · author #2
- Continual and Multi-Task Architecture Search cs.CL · 2019 · author #2
- Explore, Propose, and Assemble: An Interpretable Model for Multi-Hop Reading Comprehension cs.CL · 2019 · author #4
- Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation cs.CL · 2019 · author #6
- Multi-Target Embodied Question Answering cs.CV · 2019 · author #4
- Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout cs.CL · 2019 · author #3
- AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning cs.CL · 2019 · author #3
- Combining Fact Extraction and Verification with Neural Semantic Matching Networks cs.CL · 2018 · author #3
- Analyzing Compositionality-Sensitivity of NLI Models cs.CL · 2018 · author #3
- Commonsense for Generative Multi-Hop Question Answering Tasks cs.CL · 2018 · author #3
- SafeCity: Understanding Diverse Forms of Sexual Harassment Personal Stories cs.CL · 2018 · author #2
- Closed-Book Training to Improve Summarization Encoder Memory cs.CL · 2018 · author #2
- Game-Based Video-Context Dialogue cs.CL · 2018 · author #2
- Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models cs.CL · 2018 · author #2
- TVQA: Localized, Compositional Video Question Answering cs.CL · 2018 · author #3
- Dynamic Multi-Level Multi-Task Learning for Sentence Simplification cs.CL · 2018 · author #3
- Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting cs.CL · 2018 · author #2
- Soft Layer-Specific Multi-Task Summarization with Entailment and Question Generation cs.CL · 2018 · author #3
- Polite Dialogue Generation Without Parallel Data cs.CL · 2018 · author #2
- Object Ordering with Bidirectional Matchings for Visual Reasoning cs.CL · 2018 · author #2
- Robust Machine Comprehension Models via Adversarial Training cs.CL · 2018 · author #2
- Multi-Reward Reinforced Summarization with Saliency and Entailment cs.CL · 2018 · author #2
- Detecting Linguistic Characteristics of Alzheimer's Dementia by Interpreting Neural Models cs.CL · 2018 · author #3
- MAttNet: Modular Attention Network for Referring Expression Comprehension cs.CV · 2018 · author #6
- Hierarchically-Attentive RNN for Album Summarization and Storytelling cs.CL · 2017 · author #2
- Shortcut-Stacked Sentence Encoders for Multi-Domain Inference cs.CL · 2017 · author #2
- Reinforced Video Captioning with Entailment Rewards cs.CL · 2017 · author #2
- Video Highlight Prediction Using Audience Chat Reactions cs.CL · 2017 · author #3
- Source-Target Inference Models for Spatial Instruction Understanding cs.CL · 2017 · author #2
- Efficient Generation of Motion Plans from Attribute-Based Natural Language Instructions Using Dynamic Constraint Mapping cs.RO · 2017 · author #3
- Punny Captions: Witty Wordplay in Image Descriptions cs.CL · 2017 · author #3
- Multi-Task Video Captioning with Video and Entailment Generation cs.CL · 2017 · author #2
- Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information cs.CL · 2017 · author #3
- A Joint Speaker-Listener-Reinforcer Model for Referring Expressions cs.CV · 2016 · author #3
- Coherent Dialogue with Attention-based Language Models cs.CL · 2016 · author #2
- Navigational Instruction Generation as Inverse Reinforcement Learning with Neural Machine Translation cs.RO · 2016 · author #2
- Interpreting Neural Networks to Improve Politeness Comprehension cs.CL · 2016 · author #2
- Contextual RNN-GANs for Abstract Reasoning Diagram Generation cs.CV · 2016 · author #5
- Who did What: A Large-Scale Person-Centered Cloze Dataset cs.CL · 2016 · author #3
- Charagram: Embedding Words and Sentences via Character n-grams cs.CL · 2016 · author #2
- Sort Story: Sorting Jumbled Images and Captions into Stories cs.CL · 2016 · author #5
- Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions cs.CV · 2016 · author #3
- The Role of Context Types and Dimensionality in Learning Word Embeddings cs.CL · 2016 · author #4
- End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures cs.CL · 2016 · author #2
- We Are Humor Beings: Understanding and Predicting Visual Humor cs.CV · 2015 · author #4
- Towards Universal Paraphrastic Sentence Embeddings cs.CL · 2015 · author #2
- Learning Articulated Motion Models from Visual and Lingual Signals cs.RO · 2015 · author #2
- Accurate Vision-based Vehicle Localization using Satellite Imagery cs.RO · 2015 · author #3
- Mapping Unseen Words to Task-Trained Embedding Spaces cs.CL · 2015 · author #2
- What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment cs.CL · 2015 · author #2
- Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences cs.CL · 2015 · author #2
- From Paraphrase Database to Compositional Paraphrase Model and Back cs.CL · 2015 · author #2
- Web-scale Surface and Syntactic n-gram Features for Dependency Parsing cs.CL · 2015 · author #2
Mentions
No mention provenance yet.
Frequent Coauthors
- Ramakanth Pasunuru 9 shared papers
- Elias Stengel-Eskin 7 shared papers
- Kevin Gimpel 7 shared papers
- Karen Livescu 6 shared papers
- Licheng Yu 6 shared papers
- Matthew R. Walter 6 shared papers
- Hao Tan 5 shared papers
- Hyunji Lee 5 shared papers
- Zaid Khan 5 shared papers
- Devi Parikh 4 shared papers
- Dhruv Batra 4 shared papers
- Hongyuan Mei 4 shared papers
- Tamara L. Berg 4 shared papers
- Archiki Prasad 3 shared papers
- Arjun Chandrasekaran 3 shared papers
- Han Guo 3 shared papers
- Han Wang 3 shared papers
- Jaehong Yoon 3 shared papers
- Jaemin Cho 3 shared papers
- John Wieting 3 shared papers