pith. sign in

Mohit Bansal

Identifiers

  • name variant Mohit Bansal 0.60 · backfill

Papers (90)

  1. MuseBench: Benchmarking Intent-Level Audiovisual Arts Understanding in MLLMs cs.CV · 2026 · author #5
  2. Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation cs.CV · 2026 · author #5
  3. A History-Aware Visually Grounded Critic for Computer Use Agents cs.AI · 2026 · author #10
  4. GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization cs.LG · 2026 · author #5
  5. Seeing Isn't Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)? cs.CV · 2026 · author #6
  6. STORM: Internalized Modeling for Spatial-Temporal Reasoning in Video-Language Models cs.CV · 2026 · author #10
  7. AVSD: Adaptive-View Self-Distillation by Balancing Consensus and Teacher-Specific Privileged Signals cs.LG · 2026 · author #10
  8. MINTEval: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems cs.CL · 2026 · author #6
  9. PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation cs.CV · 2026 · author #9
  10. Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty cs.CL · 2026 · author #8
  11. EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding cs.CV · 2026 · author #9
  12. Stabilizing Efficient Reasoning with Step-Level Advantage Selection cs.CL · 2026 · author #6
  13. MERRIN: A Benchmark for Multimodal Evidence Retrieval and Reasoning in Noisy Web Environments cs.CL · 2026 · author #9
  14. Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind cs.CL · 2026 · author #6
  15. The Master Key Hypothesis: Unlocking Cross-Model Capability Transfer via Linear Subspace Alignment cs.LG · 2026 · author #8
  16. Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems cs.LG · 2026 · author #7
  17. Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style cs.CV · 2026 · author #10
  18. Multimodal Fact-Level Attribution for Verifiable Reasoning cs.CL · 2026 · author #6
  19. Effective Reasoning Chains Reduce Intrinsic Dimensionality cs.CL · 2026 · author #4
  20. When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning cs.CV · 2026 · author #7
  21. Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding cs.CV · 2025 · author #7
  22. StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos cs.CV · 2025 · author #9
  23. PRInTS: Reward Modeling for Long-Horizon Information Seeking cs.AI · 2025 · author #6
  24. One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration cs.AI · 2025 · author #5
  25. VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing cs.RO · 2025 · author #9
  26. OpenThoughts: Data Recipes for Reasoning Models cs.LG · 2025 · author #37
  27. SiLVR: A Simple Language-based Video Reasoning Framework cs.CV · 2025 · author #4
  28. EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance cs.CV · 2025 · author #7
  29. Skill-Based Mixture-of-Experts: Adaptive Routing for Heterogeneous Reasoning via Inferred Skills cs.CL · 2025 · author #5
  30. On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective cs.CY · 2025 · author #56
  31. Self-Correcting Text-to-Video Generation with Misalignment Detection and Localized Refinement cs.CV · 2024 · author #4
  32. Evaluating Very Long-Term Conversational Memory of LLM Agents cs.CL · 2024 · author #4
  33. TrustLLM: Trustworthiness in Large Language Models cs.CL · 2024 · author #31
  34. Analyzing and Mitigating Object Hallucination in Large Vision-Language Models cs.LG · 2023 · author #7
  35. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #276
  36. Expressing Visual Relationships via Language cs.CL · 2019 · author #5
  37. Avoiding Reasoning Shortcuts: Adversarial Evaluation, Training, and Model Development for Multi-Hop QA cs.CL · 2019 · author #2
  38. Improving Visual Question Answering by Referring to Generated Paragraph Captions cs.CL · 2019 · author #2
  39. Continual and Multi-Task Architecture Search cs.CL · 2019 · author #2
  40. Explore, Propose, and Assemble: An Interpretable Model for Multi-Hop Reading Comprehension cs.CL · 2019 · author #4
  41. Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation cs.CL · 2019 · author #6
  42. Multi-Target Embodied Question Answering cs.CV · 2019 · author #4
  43. Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout cs.CL · 2019 · author #3
  44. AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning cs.CL · 2019 · author #3
  45. Combining Fact Extraction and Verification with Neural Semantic Matching Networks cs.CL · 2018 · author #3
  46. Analyzing Compositionality-Sensitivity of NLI Models cs.CL · 2018 · author #3
  47. Commonsense for Generative Multi-Hop Question Answering Tasks cs.CL · 2018 · author #3
  48. SafeCity: Understanding Diverse Forms of Sexual Harassment Personal Stories cs.CL · 2018 · author #2
  49. Closed-Book Training to Improve Summarization Encoder Memory cs.CL · 2018 · author #2
  50. Game-Based Video-Context Dialogue cs.CL · 2018 · author #2
  51. Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models cs.CL · 2018 · author #2
  52. TVQA: Localized, Compositional Video Question Answering cs.CL · 2018 · author #3
  53. Dynamic Multi-Level Multi-Task Learning for Sentence Simplification cs.CL · 2018 · author #3
  54. Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting cs.CL · 2018 · author #2
  55. Soft Layer-Specific Multi-Task Summarization with Entailment and Question Generation cs.CL · 2018 · author #3
  56. Polite Dialogue Generation Without Parallel Data cs.CL · 2018 · author #2
  57. Object Ordering with Bidirectional Matchings for Visual Reasoning cs.CL · 2018 · author #2
  58. Robust Machine Comprehension Models via Adversarial Training cs.CL · 2018 · author #2
  59. Multi-Reward Reinforced Summarization with Saliency and Entailment cs.CL · 2018 · author #2
  60. Detecting Linguistic Characteristics of Alzheimer's Dementia by Interpreting Neural Models cs.CL · 2018 · author #3
  61. MAttNet: Modular Attention Network for Referring Expression Comprehension cs.CV · 2018 · author #6
  62. Hierarchically-Attentive RNN for Album Summarization and Storytelling cs.CL · 2017 · author #2
  63. Shortcut-Stacked Sentence Encoders for Multi-Domain Inference cs.CL · 2017 · author #2
  64. Reinforced Video Captioning with Entailment Rewards cs.CL · 2017 · author #2
  65. Video Highlight Prediction Using Audience Chat Reactions cs.CL · 2017 · author #3
  66. Source-Target Inference Models for Spatial Instruction Understanding cs.CL · 2017 · author #2
  67. Efficient Generation of Motion Plans from Attribute-Based Natural Language Instructions Using Dynamic Constraint Mapping cs.RO · 2017 · author #3
  68. Punny Captions: Witty Wordplay in Image Descriptions cs.CL · 2017 · author #3
  69. Multi-Task Video Captioning with Video and Entailment Generation cs.CL · 2017 · author #2
  70. Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information cs.CL · 2017 · author #3
  71. A Joint Speaker-Listener-Reinforcer Model for Referring Expressions cs.CV · 2016 · author #3
  72. Coherent Dialogue with Attention-based Language Models cs.CL · 2016 · author #2
  73. Navigational Instruction Generation as Inverse Reinforcement Learning with Neural Machine Translation cs.RO · 2016 · author #2
  74. Interpreting Neural Networks to Improve Politeness Comprehension cs.CL · 2016 · author #2
  75. Contextual RNN-GANs for Abstract Reasoning Diagram Generation cs.CV · 2016 · author #5
  76. Who did What: A Large-Scale Person-Centered Cloze Dataset cs.CL · 2016 · author #3
  77. Charagram: Embedding Words and Sentences via Character n-grams cs.CL · 2016 · author #2
  78. Sort Story: Sorting Jumbled Images and Captions into Stories cs.CL · 2016 · author #5
  79. Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions cs.CV · 2016 · author #3
  80. The Role of Context Types and Dimensionality in Learning Word Embeddings cs.CL · 2016 · author #4
  81. End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures cs.CL · 2016 · author #2
  82. We Are Humor Beings: Understanding and Predicting Visual Humor cs.CV · 2015 · author #4
  83. Towards Universal Paraphrastic Sentence Embeddings cs.CL · 2015 · author #2
  84. Learning Articulated Motion Models from Visual and Lingual Signals cs.RO · 2015 · author #2
  85. Accurate Vision-based Vehicle Localization using Satellite Imagery cs.RO · 2015 · author #3
  86. Mapping Unseen Words to Task-Trained Embedding Spaces cs.CL · 2015 · author #2
  87. What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment cs.CL · 2015 · author #2
  88. Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences cs.CL · 2015 · author #2
  89. From Paraphrase Database to Compositional Paraphrase Model and Back cs.CL · 2015 · author #2
  90. Web-scale Surface and Syntactic n-gram Features for Dependency Parsing cs.CL · 2015 · author #2

Mentions

  • 2606.30026 #5 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2606.25306 #5 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2511.19314 #6 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2606.11078 #10 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 1511.08198 #2 · backfill · confidence 0.70 Mohit Bansal
  • 1511.05526 #2 · backfill · confidence 0.70 Mohit Bansal
  • 1510.09171 #3 · backfill · confidence 0.70 Mohit Bansal
  • 1510.02387 #2 · backfill · confidence 0.70 Mohit Bansal
  • 2512.05774 #7 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 1509.00838 #2 · backfill · confidence 0.70 Mohit Bansal
  • 1506.04089 #2 · backfill · confidence 0.70 Mohit Bansal
  • 1506.03487 #2 · backfill · confidence 0.70 Mohit Bansal
  • 1502.07038 #2 · backfill · confidence 0.70 Mohit Bansal
  • 2602.08236 #7 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2503.05641 #5 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2605.31464 #5 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2605.30557 #6 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2602.09276 #4 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2505.21876 #7 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2605.26014 #10 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2605.20643 #10 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2603.11024 #10 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2605.18565 #6 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2502.14296 #56 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2401.05561 #31 · arxiv_oai · confidence 0.70 Mohit Bansal
  • 2310.00754 #7 · arxiv_oai · confidence 0.70 Mohit Bansal

Frequent Coauthors