pith. sign in

Li Fei-Fei

Identifiers

  • name variant Li Fei-Fei 0.60 · backfill

Papers (98)

  1. SimFoundry: Modular and Automated Scene Generation for Policy Learning and Evaluation cs.RO · 2026 · author #15
  2. T-Rex: Tactile-Reactive Dexterous Manipulation cs.RO · 2026 · author #27
  3. GPIC: A Giant Permissive Image Corpus for Visual Generation cs.CV · 2026 · author #9
  4. Planning with the Views cs.AI · 2026 · author #6
  5. ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop cs.CV · 2026 · author #6
  6. StereoPolicy: Improving Robotic Manipulation Policies via Stereo Perception cs.RO · 2026 · author #8
  7. HumanScore: Benchmarking Human Motions in Generated Videos cs.CV · 2026 · author #6
  8. RAGEN-2: Reasoning Collapse in Agentic RL cs.LG · 2026 · author #13
  9. Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs cs.LG · 2026 · author #4
  10. Cambrian-S: Towards Spatial Supersensing in Video cs.CV · 2025 · author #14
  11. RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning cs.LG · 2025 · author #15
  12. s1: Simple test-time scaling cs.CL · 2025 · author #5
  13. Why Automate This? Exploring Correlations Between Desire for Robotic Automation, Invested Time and Well-Being cs.HC · 2025 · author #4
  14. Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces cs.CV · 2024 · author #5
  15. ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation cs.RO · 2024 · author #5
  16. BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation cs.RO · 2024 · author #35
  17. Agent AI: Surveying the Horizons of Multimodal Interaction cs.AI · 2024 · author #13
  18. Open X-Embodiment: Robotic Learning Datasets and RT-X Models cs.RO · 2023 · author #151
  19. VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models cs.RO · 2023 · author #6
  20. On the Opportunities and Risks of Foundation Models cs.LG · 2021 · author #26
  21. What Matters in Learning from Offline Human Demonstrations for Robot Manipulation cs.RO · 2021 · author #7
  22. Information Maximizing Visual Question Generation cs.CV · 2019 · author #3
  23. Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks cs.LG · 2019 · author #3
  24. Audio-Linguistic Embeddings for Spoken Sentences cs.SD · 2019 · author #4
  25. Peeking into the Future: Predicting Future Person Activities and Locations in Videos cs.CV · 2019 · author #5
  26. DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion cs.CV · 2019 · author #6
  27. Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation cs.CV · 2019 · author #7
  28. D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation cs.CV · 2019 · author #4
  29. Composing Text and Image for Image Retrieval - An Empirical Odyssey cs.CV · 2018 · author #6
  30. Vision-Based Gait Analysis for Senior Care cs.CV · 2018 · author #10
  31. Faster CryptoNets: Leveraging Sparsity for Real-World Encrypted Inference cs.CR · 2018 · author #6
  32. A Fully Private Pipeline for Deep Learning on Electronic Health Records cs.CR · 2018 · author #5
  33. Privacy-Preserving Action Recognition for Smart Hospitals using Low-Resolution Depth Images cs.CV · 2018 · author #7
  34. Measuring Depression Symptom Severity from Spoken Language and 3D Facial Expressions cs.CV · 2018 · author #4
  35. RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through Imitation cs.RO · 2018 · author #12
  36. Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks cs.RO · 2018 · author #6
  37. HiDDeN: Hiding Data With Deep Networks cs.CV · 2018 · author #4
  38. Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration cs.CV · 2018 · author #6
  39. Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision cs.RO · 2018 · author #6
  40. Flexible Neural Representation for Physics Prediction cs.AI · 2018 · author #5
  41. Learning to Decompose and Disentangle Representations for Video Prediction cs.LG · 2018 · author #4
  42. Image Generation from Scene Graphs cs.CV · 2018 · author #3
  43. Iterative Visual Reasoning Beyond Convolutions cs.CV · 2018 · author #3
  44. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks cs.CV · 2018 · author #3
  45. Referring Relationships cs.CV · 2018 · author #4
  46. Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks cs.CV · 2018 · author #7
  47. Emergence of Structured Behaviors from Curiosity-Based Intrinsic Motivation cs.LG · 2018 · author #3
  48. Learning to Play with Intrinsically-Motivated Self-Aware Agents cs.LG · 2018 · author #3
  49. MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels cs.CV · 2017 · author #5
  50. Progressive Neural Architecture Search cs.CV · 2017 · author #7
  51. Label Efficient Learning of Transferable Representations across Domains and Tasks stat.ML · 2017 · author #4
  52. Graph Distillation for Action Detection with Privileged Modalities cs.CV · 2017 · author #5
  53. Thoracic Disease Identification and Localization with Limited Supervision cs.CV · 2017 · author #7
  54. Neural Task Programming: Learning to Generalize Across Hierarchical Tasks cs.AI · 2017 · author #6
  55. Scalable Annotation of Fine-Grained Categories Without Experts cs.HC · 2017 · author #4
  56. Fine-Grained Car Detection for Visual Census Estimation cs.CV · 2017 · author #6
  57. Fine-grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach cs.CV · 2017 · author #3
  58. Towards Vision-Based Smart Hospitals: A System for Tracking and Monitoring Hand Hygiene Compliance cs.CV · 2017 · author #13
  59. ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems cs.RO · 2017 · author #6
  60. Learning to Learn from Noisy Web Videos cs.CV · 2017 · author #6
  61. Tackling Over-pruning in Variational Autoencoders cs.LG · 2017 · author #4
  62. Visual Semantic Planning using Deep Successor Representations cs.CV · 2017 · author #5
  63. Inferring and Executing Programs for Visual Reasoning cs.CV · 2017 · author #5
  64. Characterizing and Improving Stability in Neural Style Transfer cs.CV · 2017 · author #4
  65. Dense-Captioning Events in Videos cs.CV · 2017 · author #4
  66. Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos cs.CV · 2017 · author #3
  67. Scene Graph Generation by Iterative Message Passing cs.CV · 2017 · author #4
  68. Unsupervised Learning of Long-Term Motion Dynamics for Videos cs.CV · 2017 · author #5
  69. CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning cs.CV · 2016 · author #4
  70. Recurrent Attention Models for Depth-Based Person Identification cs.CV · 2016 · author #3
  71. A Hierarchical Approach for Generating Descriptive Image Paragraphs cs.CV · 2016 · author #4
  72. Crowdsourcing in Computer Vision cs.CV · 2016 · author #3
  73. Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning cs.CV · 2016 · author #6
  74. A Glimpse Far into the Future: Understanding Long-term Crowd Worker Quality cs.HC · 2016 · author #3
  75. Visual Relationship Detection with Language Priors cs.CV · 2016 · author #4
  76. Connectionist Temporal Modeling for Weakly Supervised Action Labeling cs.CV · 2016 · author #2
  77. Locally-Optimized Inter-Subject Alignment of Functional Cortical Regions q-bio.NC · 2016 · author #4
  78. Perceptual Losses for Real-Time Style Transfer and Super-Resolution cs.CV · 2016 · author #3
  79. Towards Viewpoint Invariant 3D Human Pose Estimation cs.CV · 2016 · author #6
  80. Embracing Error to Enable Rapid Crowdsourcing cs.HC · 2016 · author #6
  81. DenseCap: Fully Convolutional Localization Networks for Dense Captioning cs.CV · 2015 · author #3
  82. End-to-end Learning of Action Detection from Frame Glimpses in Videos cs.CV · 2015 · author #4
  83. The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition cs.CV · 2015 · author #8
  84. Visual7W: Grounded Question Answering in Images cs.CV · 2015 · author #4
  85. Detecting events and key actors in multi-person videos cs.CV · 2015 · author #6
  86. SentenceRacer: A Game with a Purpose for Image Sentence Annotation cs.HC · 2015 · author #5
  87. Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos cs.CV · 2015 · author #6
  88. Building a Large-scale Multimodal Knowledge Base System for Answering Visual Queries cs.CV · 2015 · author #4
  89. What's the Point: Semantic Segmentation with Point Supervision cs.CV · 2015 · author #4
  90. Visualizing and Understanding Recurrent Networks cs.LG · 2015 · author #3
  91. Improving Image Classification with Location Context cs.CV · 2015 · author #3
  92. Learning Temporal Embeddings for Complex Video Analysis cs.CV · 2015 · author #4
  93. Deep Visual-Semantic Alignments for Generating Image Descriptions cs.CV · 2014 · author #2
  94. Affordances Provide a Fundamental Categorization Principle for Visual Scenes q-bio.NC · 2014 · author #5
  95. Visual Noise from Natural Scene Statistics Reveals Human Scene Category Representations cs.CV · 2014 · author #4
  96. ImageNet Large Scale Visual Recognition Challenge cs.CV · 2014 · author #12
  97. VideoSET: Video Summary Evaluation through Text cs.CV · 2014 · author #3
  98. Deep Fragment Embeddings for Bidirectional Image Sentence Mapping cs.CV · 2014 · author #3

Mentions

  • 2606.28276 #15 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2606.17055 #27 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2501.06348 #4 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 1511.07571 #3 · backfill · confidence 0.70 Li Fei-Fei
  • 1511.06984 #4 · backfill · confidence 0.70 Li Fei-Fei
  • 1511.06789 #8 · backfill · confidence 0.70 Li Fei-Fei
  • 1511.03416 #4 · backfill · confidence 0.70 Li Fei-Fei
  • 1511.02917 #6 · backfill · confidence 0.70 Li Fei-Fei
  • 2605.09989 #8 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 1508.07053 #5 · backfill · confidence 0.70 Li Fei-Fei
  • 1507.05738 #6 · backfill · confidence 0.70 Li Fei-Fei
  • 1507.05670 #4 · backfill · confidence 0.70 Li Fei-Fei
  • 1506.02106 #4 · backfill · confidence 0.70 Li Fei-Fei
  • 1506.02078 #3 · backfill · confidence 0.70 Li Fei-Fei
  • 1505.03873 #3 · backfill · confidence 0.70 Li Fei-Fei
  • 1505.00315 #4 · backfill · confidence 0.70 Li Fei-Fei
  • 1412.2306 #2 · backfill · confidence 0.70 Li Fei-Fei
  • 1411.5340 #5 · backfill · confidence 0.70 Li Fei-Fei
  • 1411.5331 #4 · backfill · confidence 0.70 Li Fei-Fei
  • 1409.0575 #12 · backfill · confidence 0.70 Li Fei-Fei
  • 1406.5824 #3 · backfill · confidence 0.70 Li Fei-Fei
  • 1406.5679 #3 · backfill · confidence 0.70 Li Fei-Fei
  • 2605.30341 #9 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2605.29563 #6 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2602.21198 #4 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2412.14171 #5 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2605.18746 #6 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2401.03568 #13 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2511.04670 #14 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2403.09227 #35 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2409.01652 #5 · arxiv_oai · confidence 0.70 Li Fei-Fei

Frequent Coauthors