Ali Farhadi
Identifiers
- name variant Ali Farhadi 0.60 · backfill
Papers (67)
- MolmoAct2: Action Reasoning Models for Real-world Deployment cs.RO · 2026 · author #27
- VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition cs.CV · 2026 · author #7
- Posterior Augmented Flow Matching cs.CV · 2026 · author #7
- Seeing Fast and Slow: Learning the Flow of Time in Videos cs.CV · 2026 · author #5
- MolmoWeb: Open Visual Web Agent and Open Data for the Open Web cs.CV · 2026 · author #15
- WildDet3D: Scaling Promptable 3D Detection in the Wild cs.CV · 2026 · author #15
- SERA: Soft-Verified Efficient Repository Agents cs.CL · 2026 · author #4
- Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding cs.CV · 2026 · author #20
- Olmo 3 cs.CL · 2025 · author #65
- MolmoAct: Action Reasoning Models that can Reason in Space cs.RO · 2025 · author #17
- Beyond the Frame: Generating 360 Panoramic Videos from Perspective Videos cs.CV · 2025 · author #3
- 2 OLMo 2 Furious cs.CL · 2024 · author #41
- Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models cs.CV · 2024 · author #49
- OLMoE: Open Mixture-of-Experts Language Models cs.CL · 2024 · author #20
- Objaverse-XL: A Universe of 10M+ 3D Objects cs.CV · 2023 · author #17
- Editing Models with Task Arithmetic cs.LG · 2022 · author #7
- Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index cs.CL · 2019 · author #5
- HellaSwag: Can a Machine Really Finish Your Sentence? cs.CL · 2019 · author #4
- Two Body Problem: Collaborative Visual Task Completion cs.CV · 2019 · author #6
- Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph cs.CV · 2019 · author #5
- What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning cs.CV · 2019 · author #3
- ELASTIC: Improving CNNs with Dynamic Scaling Policies cs.CV · 2018 · author #3
- Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning cs.CV · 2018 · author #4
- From Recognition to Cognition: Visual Commonsense Reasoning cs.CV · 2018 · author #3
- Visual Semantic Navigation using Scene Priors cs.CV · 2018 · author #3
- PhotoShape: Photorealistic Materials for Large-Scale Shape Collections cs.GR · 2018 · author #3
- Label Refinery: Improving ImageNet Classification through Label Progression cs.CV · 2018 · author #4
- Actor and Observer: Joint Modeling of First and Third-Person Videos cs.CV · 2018 · author #4
- Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos cs.CV · 2018 · author #4
- Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension cs.CL · 2018 · author #4
- Imagine This! Scripts to Compositions to Videos cs.CV · 2018 · author #3
- YOLOv3: An Incremental Improvement cs.CV · 2018 · author #2
- DOCK: Detecting Objects by transferring Common-sense Knowledge cs.CV · 2018 · author #3
- Who Let The Dogs Out? Modeling Dog Behavior From Visual Data cs.CV · 2018 · author #5
- AI2-THOR: An Interactive 3D Environment for Visual AI cs.CV · 2017 · author #13
- IQA: Visual Question Answering in Interactive Environments cs.CV · 2017 · author #6
- Structured Set Matching Networks for One-Shot Part Labeling cs.CV · 2017 · author #4
- Neural Speed Reading via Skim-RNN cs.CL · 2017 · author #3
- AJILE Movement Prediction: Multimodal Deep Learning for Natural Human Neural Recordings and Video cs.CV · 2017 · author #2
- Visual Semantic Planning using Deep Successor Representations cs.CV · 2017 · author #8
- Re3: Real-Time Recurrent Regression Networks for Visual Tracking of Generic Objects cs.CV · 2017 · author #2
- SeGAN: Segmenting and Generating the Invisible cs.CV · 2017 · author #3
- See the Glass Half Full: Reasoning about Liquid Containers, their Volume and Content cs.CV · 2017 · author #4
- YOLO9000: Better, Faster, Stronger cs.CV · 2016 · author #2
- Asynchronous Temporal Fields for Action Recognition cs.CV · 2016 · author #3
- Commonly Uncommon: Semantic Sparsity in Situation Recognition cs.CV · 2016 · author #4
- LCNN: Lookup-based Convolutional Neural Network cs.CV · 2016 · author #3
- Bidirectional Attention Flow for Machine Comprehension cs.CL · 2016 · author #3
- Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning cs.CV · 2016 · author #7
- Much Ado About Time: Exhaustive Annotation of Temporal Data cs.HC · 2016 · author #3
- Query-Reduction Networks for Question Answering cs.CL · 2016 · author #3
- Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks cs.CV · 2016 · author #3
- Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding cs.CV · 2016 · author #4
- A Diagram Is Worth A Dozen Images cs.CV · 2016 · author #6
- "What happens if..." Learning to Predict the Effect of Forces in Images cs.CV · 2016 · author #4
- XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks cs.CV · 2016 · author #4
- Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects cs.AI · 2016 · author #4
- Toward a Taxonomy and Computational Models of Abnormalities in Images cs.CV · 2015 · author #4
- Actions ~ Transformations cs.CV · 2015 · author #2
- Unsupervised Deep Embedding for Clustering Analysis cs.LG · 2015 · author #3
- Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images cs.CV · 2015 · author #4
- VISALOGY: Answering Visual Analogy Questions cs.CV · 2015 · author #3
- Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing cs.CV · 2015 · author #5
- You Only Look Once: Unified, Real-Time Object Detection cs.CV · 2015 · author #4
- Image Classification and Retrieval from User-Supplied Tags cs.CV · 2014 · author #2
- Abnormal Object Recognition: A Comprehensive Study cs.CV · 2014 · author #2
- Semantic Understanding of Professional Soccer Commentaries cs.CL · 2012 · author #3
Mentions
- 1511.06335 #3 · backfill · confidence 0.70 Ali Farhadi
- 1511.04048 #4 · backfill · confidence 0.70 Ali Farhadi
- 1510.08973 #3 · backfill · confidence 0.70 Ali Farhadi
- 1509.08075 #5 · backfill · confidence 0.70 Ali Farhadi
- 1506.02640 #4 · backfill · confidence 0.70 Ali Farhadi
- 1411.6909 #2 · backfill · confidence 0.70 Ali Farhadi
- 1411.2214 #2 · backfill · confidence 0.70 Ali Farhadi
- 2601.20789 #4 · arxiv_oai · confidence 0.70 Ali Farhadi
- 1210.4854 #3 · backfill · confidence 0.70 Ali Farhadi
- 2307.05663 #17 · arxiv_oai · confidence 0.70 Ali Farhadi
- 2409.02060 #20 · arxiv_oai · confidence 0.70 Ali Farhadi
- 2601.10611 #20 · arxiv_oai · confidence 0.70 Ali Farhadi
Frequent Coauthors
- Hannaneh Hajishirzi 14 shared papers
- Abhinav Gupta 11 shared papers
- Aniruddha Kembhavi 10 shared papers
- Mohammad Rastegari 10 shared papers
- Roozbeh Mottaghi 10 shared papers
- Ranjay Krishna 8 shared papers
- Dieter Fox 7 shared papers
- Winson Han 7 shared papers
- Joseph Redmon 6 shared papers
- Kiana Ehsani 6 shared papers
- Minjoon Seo 6 shared papers
- Taira Anderson 6 shared papers
- Daniel Gordon 5 shared papers
- Eli VanderBilt 5 shared papers
- Eric Kolve 5 shared papers
- Gunnar A. Sigurdsson 5 shared papers
- Hessam Bagherinezhad 5 shared papers
- Matthew Wallingford 5 shared papers
- Yejin Choi 5 shared papers
- Dirk Groeneveld 4 shared papers