Jifeng Dai

Identifiers

  • name variant Jifeng Dai 0.60 · backfill

Papers (33)

  1. AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond cs.RO · 2026 · author #7
  2. Driving Intents Amplify Planning-Oriented Reinforcement Learning cs.RO · 2026 · author #5
  3. MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving cs.RO · 2026 · author #7
  4. Action Emergence from Streaming Intent cs.RO · 2026 · author #4
  5. MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling cs.CL · 2025 · author #9
  6. GenExam: A Multidisciplinary Text-to-Image Exam cs.CV · 2025 · author #7
  7. InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency cs.CV · 2025 · author #69
  8. InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models cs.CV · 2025 · author #50
  9. Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling cs.CV · 2024 · author #41
  10. Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization cs.CL · 2024 · author #11
  11. InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output cs.CV · 2024 · author #24
  12. How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites cs.CV · 2024 · author #34
  13. InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks cs.CV · 2023 · author #15
  14. Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory cs.AI · 2023 · author #13
  15. Deformable DETR: Deformable Transformers for End-to-End Object Detection cs.CV · 2020 · author #6
  16. MMDetection: Open MMLab Detection Toolbox and Benchmark cs.CV · 2019 · author #20
  17. An Empirical Study of Spatial Attention Mechanisms in Deep Networks cs.CV · 2019 · author #5
  18. Deformable ConvNets v2: More Deformable, Better Results cs.CV · 2018 · author #4
  19. Integrated Object Detection and Tracking with Tracklet-Conditioned Detection cs.CV · 2018 · author #5
  20. Towards High Performance Video Object Detection for Mobiles cs.CV · 2018 · author #2
  21. Learning Region Features for Object Detection cs.CV · 2018 · author #5
  22. Towards High Performance Video Object Detection cs.CV · 2017 · author #2
  23. Relation Networks for Object Detection cs.CV · 2017 · author #4
  24. Flow-Guided Feature Aggregation for Video Object Detection cs.CV · 2017 · author #3
  25. Deformable Convolutional Networks cs.CV · 2017 · author #1
  26. Deep Feature Flow for Video Recognition cs.CV · 2016 · author #3
  27. Fully Convolutional Instance-aware Semantic Segmentation cs.CV · 2016 · author #3
  28. ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation cs.CV · 2016 · author #2
  29. Instance-sensitive Fully Convolutional Networks cs.CV · 2016 · author #1
  30. Instance-aware Semantic Segmentation via Multi-task Network Cascades cs.CV · 2015 · author #1
  31. BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation cs.CV · 2015 · author #1
  32. Generative Modeling of Convolutional Neural Networks cs.CV · 2014 · author #1
  33. Convolutional Feature Masking for Joint Object and Stuff Segmentation cs.CV · 2014 · author #1

Mentions

  • 2605.26113 #7 · arxiv_oai · confidence 0.70 Jifeng Dai
  • 2407.03320 #24 · arxiv_oai · confidence 0.70 Jifeng Dai
  • 2411.10442 #11 · arxiv_oai · confidence 0.70 Jifeng Dai
  • 2305.17144 #13 · arxiv_oai · confidence 0.70 Jifeng Dai

Frequent Coauthors