pith. sign in

Ali Farhadi

Identifiers

  • name variant Ali Farhadi 0.60 · backfill

Papers (67)

  1. MolmoAct2: Action Reasoning Models for Real-world Deployment cs.RO · 2026 · author #27
  2. VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition cs.CV · 2026 · author #7
  3. Posterior Augmented Flow Matching cs.CV · 2026 · author #7
  4. Seeing Fast and Slow: Learning the Flow of Time in Videos cs.CV · 2026 · author #5
  5. MolmoWeb: Open Visual Web Agent and Open Data for the Open Web cs.CV · 2026 · author #15
  6. WildDet3D: Scaling Promptable 3D Detection in the Wild cs.CV · 2026 · author #15
  7. SERA: Soft-Verified Efficient Repository Agents cs.CL · 2026 · author #4
  8. Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding cs.CV · 2026 · author #20
  9. Olmo 3 cs.CL · 2025 · author #65
  10. MolmoAct: Action Reasoning Models that can Reason in Space cs.RO · 2025 · author #17
  11. Beyond the Frame: Generating 360 Panoramic Videos from Perspective Videos cs.CV · 2025 · author #3
  12. 2 OLMo 2 Furious cs.CL · 2024 · author #41
  13. Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models cs.CV · 2024 · author #49
  14. OLMoE: Open Mixture-of-Experts Language Models cs.CL · 2024 · author #20
  15. Objaverse-XL: A Universe of 10M+ 3D Objects cs.CV · 2023 · author #17
  16. Editing Models with Task Arithmetic cs.LG · 2022 · author #7
  17. Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index cs.CL · 2019 · author #5
  18. HellaSwag: Can a Machine Really Finish Your Sentence? cs.CL · 2019 · author #4
  19. Two Body Problem: Collaborative Visual Task Completion cs.CV · 2019 · author #6
  20. Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph cs.CV · 2019 · author #5
  21. What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning cs.CV · 2019 · author #3
  22. ELASTIC: Improving CNNs with Dynamic Scaling Policies cs.CV · 2018 · author #3
  23. Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning cs.CV · 2018 · author #4
  24. From Recognition to Cognition: Visual Commonsense Reasoning cs.CV · 2018 · author #3
  25. Visual Semantic Navigation using Scene Priors cs.CV · 2018 · author #3
  26. PhotoShape: Photorealistic Materials for Large-Scale Shape Collections cs.GR · 2018 · author #3
  27. Label Refinery: Improving ImageNet Classification through Label Progression cs.CV · 2018 · author #4
  28. Actor and Observer: Joint Modeling of First and Third-Person Videos cs.CV · 2018 · author #4
  29. Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos cs.CV · 2018 · author #4
  30. Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension cs.CL · 2018 · author #4
  31. Imagine This! Scripts to Compositions to Videos cs.CV · 2018 · author #3
  32. YOLOv3: An Incremental Improvement cs.CV · 2018 · author #2
  33. DOCK: Detecting Objects by transferring Common-sense Knowledge cs.CV · 2018 · author #3
  34. Who Let The Dogs Out? Modeling Dog Behavior From Visual Data cs.CV · 2018 · author #5
  35. AI2-THOR: An Interactive 3D Environment for Visual AI cs.CV · 2017 · author #13
  36. IQA: Visual Question Answering in Interactive Environments cs.CV · 2017 · author #6
  37. Structured Set Matching Networks for One-Shot Part Labeling cs.CV · 2017 · author #4
  38. Neural Speed Reading via Skim-RNN cs.CL · 2017 · author #3
  39. AJILE Movement Prediction: Multimodal Deep Learning for Natural Human Neural Recordings and Video cs.CV · 2017 · author #2
  40. Visual Semantic Planning using Deep Successor Representations cs.CV · 2017 · author #8
  41. Re3 : Real-Time Recurrent Regression Networks for Visual Tracking of Generic Objects cs.CV · 2017 · author #2
  42. SeGAN: Segmenting and Generating the Invisible cs.CV · 2017 · author #3
  43. See the Glass Half Full: Reasoning about Liquid Containers, their Volume and Content cs.CV · 2017 · author #4
  44. YOLO9000: Better, Faster, Stronger cs.CV · 2016 · author #2
  45. Asynchronous Temporal Fields for Action Recognition cs.CV · 2016 · author #3
  46. Commonly Uncommon: Semantic Sparsity in Situation Recognition cs.CV · 2016 · author #4
  47. LCNN: Lookup-based Convolutional Neural Network cs.CV · 2016 · author #3
  48. Bidirectional Attention Flow for Machine Comprehension cs.CL · 2016 · author #3
  49. Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning cs.CV · 2016 · author #7
  50. Much Ado About Time: Exhaustive Annotation of Temporal Data cs.HC · 2016 · author #3
  51. Query-Reduction Networks for Question Answering cs.CL · 2016 · author #3
  52. Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks cs.CV · 2016 · author #3
  53. Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding cs.CV · 2016 · author #4
  54. A Diagram Is Worth A Dozen Images cs.CV · 2016 · author #6
  55. "What happens if..." Learning to Predict the Effect of Forces in Images cs.CV · 2016 · author #4
  56. XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks cs.CV · 2016 · author #4
  57. Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects cs.AI · 2016 · author #4
  58. Toward a Taxonomy and Computational Models of Abnormalities in Images cs.CV · 2015 · author #4
  59. Actions ~ Transformations cs.CV · 2015 · author #2
  60. Unsupervised Deep Embedding for Clustering Analysis cs.LG · 2015 · author #3
  61. Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images cs.CV · 2015 · author #4
  62. VISALOGY: Answering Visual Analogy Questions cs.CV · 2015 · author #3
  63. Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing cs.CV · 2015 · author #5
  64. You Only Look Once: Unified, Real-Time Object Detection cs.CV · 2015 · author #4
  65. Image Classification and Retrieval from User-Supplied Tags cs.CV · 2014 · author #2
  66. Abnormal Object Recognition: A Comprehensive Study cs.CV · 2014 · author #2
  67. Semantic Understanding of Professional Soccer Commentaries cs.CL · 2012 · author #3

Mentions

  • 1511.06335 #3 · backfill · confidence 0.70 Ali Farhadi
  • 1511.04048 #4 · backfill · confidence 0.70 Ali Farhadi
  • 1510.08973 #3 · backfill · confidence 0.70 Ali Farhadi
  • 1509.08075 #5 · backfill · confidence 0.70 Ali Farhadi
  • 1506.02640 #4 · backfill · confidence 0.70 Ali Farhadi
  • 1411.6909 #2 · backfill · confidence 0.70 Ali Farhadi
  • 1411.2214 #2 · backfill · confidence 0.70 Ali Farhadi
  • 2601.20789 #4 · arxiv_oai · confidence 0.70 Ali Farhadi
  • 1210.4854 #3 · backfill · confidence 0.70 Ali Farhadi
  • 2307.05663 #17 · arxiv_oai · confidence 0.70 Ali Farhadi
  • 2409.02060 #20 · arxiv_oai · confidence 0.70 Ali Farhadi
  • 2601.10611 #20 · arxiv_oai · confidence 0.70 Ali Farhadi

Frequent Coauthors