pith. machine review for the scientific record.

Li Fei-Fei

Identifiers

  • name variant Li Fei-Fei 0.60 · backfill

Papers (90)

  1. StereoPolicy: Improving Robotic Manipulation Policies via Stereo Perception cs.RO · 2026 · author #8
  2. HumanScore: Benchmarking Human Motions in Generated Videos cs.CV · 2026 · author #6
  3. RAGEN-2: Reasoning Collapse in Agentic RL cs.LG · 2026 · author #13
  4. Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs cs.LG · 2026 · author #4
  5. Cambrian-S: Towards Spatial Supersensing in Video cs.CV · 2025 · author #14
  6. RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning cs.LG · 2025 · author #15
  7. s1: Simple test-time scaling cs.CL · 2025 · author #5
  8. ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation cs.RO · 2024 · author #5
  9. BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation cs.RO · 2024 · author #35
  10. Open X-Embodiment: Robotic Learning Datasets and RT-X Models cs.RO · 2023 · author #151
  11. VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models cs.RO · 2023 · author #6
  12. On the Opportunities and Risks of Foundation Models cs.LG · 2021 · author #26
  13. What Matters in Learning from Offline Human Demonstrations for Robot Manipulation cs.RO · 2021 · author #7
  14. Information Maximizing Visual Question Generation cs.CV · 2019 · author #3
  15. Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks cs.LG · 2019 · author #3
  16. Audio-Linguistic Embeddings for Spoken Sentences cs.SD · 2019 · author #4
  17. Peeking into the Future: Predicting Future Person Activities and Locations in Videos cs.CV · 2019 · author #5
  18. DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion cs.CV · 2019 · author #6
  19. Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation cs.CV · 2019 · author #7
  20. D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation cs.CV · 2019 · author #4
  21. Composing Text and Image for Image Retrieval - An Empirical Odyssey cs.CV · 2018 · author #6
  22. Vision-Based Gait Analysis for Senior Care cs.CV · 2018 · author #10
  23. Faster CryptoNets: Leveraging Sparsity for Real-World Encrypted Inference cs.CR · 2018 · author #6
  24. A Fully Private Pipeline for Deep Learning on Electronic Health Records cs.CR · 2018 · author #5
  25. Privacy-Preserving Action Recognition for Smart Hospitals using Low-Resolution Depth Images cs.CV · 2018 · author #7
  26. Measuring Depression Symptom Severity from Spoken Language and 3D Facial Expressions cs.CV · 2018 · author #4
  27. RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through Imitation cs.RO · 2018 · author #12
  28. Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks cs.RO · 2018 · author #6
  29. HiDDeN: Hiding Data With Deep Networks cs.CV · 2018 · author #4
  30. Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration cs.CV · 2018 · author #6
  31. Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision cs.RO · 2018 · author #6
  32. Flexible Neural Representation for Physics Prediction cs.AI · 2018 · author #5
  33. Learning to Decompose and Disentangle Representations for Video Prediction cs.LG · 2018 · author #4
  34. Image Generation from Scene Graphs cs.CV · 2018 · author #3
  35. Iterative Visual Reasoning Beyond Convolutions cs.CV · 2018 · author #3
  36. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks cs.CV · 2018 · author #3
  37. Referring Relationships cs.CV · 2018 · author #4
  38. Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks cs.CV · 2018 · author #7
  39. Emergence of Structured Behaviors from Curiosity-Based Intrinsic Motivation cs.LG · 2018 · author #3
  40. Learning to Play with Intrinsically-Motivated Self-Aware Agents cs.LG · 2018 · author #3
  41. MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels cs.CV · 2017 · author #5
  42. Progressive Neural Architecture Search cs.CV · 2017 · author #7
  43. Label Efficient Learning of Transferable Representations across Domains and Tasks stat.ML · 2017 · author #4
  44. Graph Distillation for Action Detection with Privileged Modalities cs.CV · 2017 · author #5
  45. Thoracic Disease Identification and Localization with Limited Supervision cs.CV · 2017 · author #7
  46. Neural Task Programming: Learning to Generalize Across Hierarchical Tasks cs.AI · 2017 · author #6
  47. Scalable Annotation of Fine-Grained Categories Without Experts cs.HC · 2017 · author #4
  48. Fine-Grained Car Detection for Visual Census Estimation cs.CV · 2017 · author #6
  49. Fine-grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach cs.CV · 2017 · author #3
  50. Towards Vision-Based Smart Hospitals: A System for Tracking and Monitoring Hand Hygiene Compliance cs.CV · 2017 · author #13
  51. ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems cs.RO · 2017 · author #6
  52. Learning to Learn from Noisy Web Videos cs.CV · 2017 · author #6
  53. Tackling Over-pruning in Variational Autoencoders cs.LG · 2017 · author #4
  54. Visual Semantic Planning using Deep Successor Representations cs.CV · 2017 · author #5
  55. Inferring and Executing Programs for Visual Reasoning cs.CV · 2017 · author #5
  56. Characterizing and Improving Stability in Neural Style Transfer cs.CV · 2017 · author #4
  57. Dense-Captioning Events in Videos cs.CV · 2017 · author #4
  58. Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos cs.CV · 2017 · author #3
  59. Scene Graph Generation by Iterative Message Passing cs.CV · 2017 · author #4
  60. Unsupervised Learning of Long-Term Motion Dynamics for Videos cs.CV · 2017 · author #5
  61. CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning cs.CV · 2016 · author #4
  62. Recurrent Attention Models for Depth-Based Person Identification cs.CV · 2016 · author #3
  63. A Hierarchical Approach for Generating Descriptive Image Paragraphs cs.CV · 2016 · author #4
  64. Crowdsourcing in Computer Vision cs.CV · 2016 · author #3
  65. Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning cs.CV · 2016 · author #6
  66. A Glimpse Far into the Future: Understanding Long-term Crowd Worker Quality cs.HC · 2016 · author #3
  67. Visual Relationship Detection with Language Priors cs.CV · 2016 · author #4
  68. Connectionist Temporal Modeling for Weakly Supervised Action Labeling cs.CV · 2016 · author #2
  69. Locally-Optimized Inter-Subject Alignment of Functional Cortical Regions q-bio.NC · 2016 · author #4
  70. Perceptual Losses for Real-Time Style Transfer and Super-Resolution cs.CV · 2016 · author #3
  71. Towards Viewpoint Invariant 3D Human Pose Estimation cs.CV · 2016 · author #6
  72. Embracing Error to Enable Rapid Crowdsourcing cs.HC · 2016 · author #6
  73. DenseCap: Fully Convolutional Localization Networks for Dense Captioning cs.CV · 2015 · author #3
  74. End-to-end Learning of Action Detection from Frame Glimpses in Videos cs.CV · 2015 · author #4
  75. The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition cs.CV · 2015 · author #8
  76. Visual7W: Grounded Question Answering in Images cs.CV · 2015 · author #4
  77. Detecting events and key actors in multi-person videos cs.CV · 2015 · author #6
  78. SentenceRacer: A Game with a Purpose for Image Sentence Annotation cs.HC · 2015 · author #5
  79. Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos cs.CV · 2015 · author #6
  80. Building a Large-scale Multimodal Knowledge Base System for Answering Visual Queries cs.CV · 2015 · author #4
  81. What's the Point: Semantic Segmentation with Point Supervision cs.CV · 2015 · author #4
  82. Visualizing and Understanding Recurrent Networks cs.LG · 2015 · author #3
  83. Improving Image Classification with Location Context cs.CV · 2015 · author #3
  84. Learning Temporal Embeddings for Complex Video Analysis cs.CV · 2015 · author #4
  85. Deep Visual-Semantic Alignments for Generating Image Descriptions cs.CV · 2014 · author #2
  86. Affordances Provide a Fundamental Categorization Principle for Visual Scenes q-bio.NC · 2014 · author #5
  87. Visual Noise from Natural Scene Statistics Reveals Human Scene Category Representations cs.CV · 2014 · author #4
  88. ImageNet Large Scale Visual Recognition Challenge cs.CV · 2014 · author #12
  89. VideoSET: Video Summary Evaluation through Text cs.CV · 2014 · author #3
  90. Deep Fragment Embeddings for Bidirectional Image Sentence Mapping cs.CV · 2014 · author #3

Mentions

  • 2511.04670 #14 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2403.09227 #35 · arxiv_oai · confidence 0.70 Li Fei-Fei
  • 2409.01652 #5 · arxiv_oai · confidence 0.70 Li Fei-Fei

Frequent Coauthors