pith. sign in

Gang Yu

Identifiers

  • name variant Gang Yu 0.60 · backfill

Papers (34)

  1. High-order synchrosqueezed wavelet-chirplet transform for instantaneous frequency and chirprate estimation eess.SP · 2026 · author #4
  2. Rethinking Memory as Continuously Evolving Connectivity cs.CL · 2026 · author #9
  3. StepAudio 2.5 Technical Report eess.AS · 2026 · author #99
  4. Vision Foundation Models as Generalist Tokenizers for Image Generation cs.CV · 2026 · author #6
  5. Head Forcing: Long Autoregressive Video Generation via Head Heterogeneity cs.CV · 2026 · author #3
  6. Step-Audio-R1.5 Technical Report eess.AS · 2026 · author #17
  7. MMPhysVideo: Scaling Physical Plausibility in Video Generation via Joint Multimodal Modeling cs.CV · 2026 · author #5
  8. Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks cs.CL · 2026 · author #14
  9. Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models cs.CL · 2025 · author #11
  10. Step-Audio 2 Technical Report cs.CL · 2025 · author #8
  11. Step1X-Edit: A Practical Framework for General Image Editing cs.CV · 2025 · author #23
  12. Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model cs.CV · 2025 · author #36
  13. ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment cs.CV · 2024 · author #6
  14. AppAgent: Multimodal Agents as Smartphone Users cs.CV · 2023 · author #8
  15. TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection cs.CV · 2019 · author #3
  16. Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2019 · author #6
  17. An End-to-End Network for Panoptic Segmentation cs.CV · 2019 · author #6
  18. WIDER Face and Pedestrian Challenge 2018: Methods and Results cs.CV · 2019 · author #17
  19. Rethinking on Multi-Stage Networks for Human Pose Estimation cs.CV · 2019 · author #7
  20. Scene Text Detection with Supervised Pyramid Context Network cs.CV · 2018 · author #4
  21. Modeling Local Geometric Structure of 3D Point Clouds using Geo-CNN cs.CV · 2018 · author #3
  22. BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation cs.CV · 2018 · author #5
  23. CrowdHuman: A Benchmark for Detecting Human in a Crowd cs.CV · 2018 · author #5
  24. Learning a Discriminative Feature Network for Semantic Segmentation cs.CV · 2018 · author #5
  25. SFace: An Efficient Network for Face Detection in Large Scale Variations cs.CV · 2018 · author #4
  26. DetNet: A Backbone network for Object Detection cs.CV · 2018 · author #3
  27. SOT for MOT cs.CV · 2017 · author #3
  28. Cascaded Pyramid Network for Multi-Person Pose Estimation cs.CV · 2017 · author #5
  29. Light-Head R-CNN: In Defense of Two-Stage Object Detector cs.CV · 2017 · author #3
  30. Face Attention Network: An Effective Face Detector for the Occluded Faces cs.CV · 2017 · author #3
  31. MegDet: A Large Mini-Batch Object Detector cs.CV · 2017 · author #7
  32. Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network cs.CV · 2017 · author #3
  33. Provable Secure Identity Based Generalized Signcryption Scheme cs.CR · 2010 · author #1
  34. Sieving by large integers and covering systems of congruences math.NT · 2005 · author #5

Mentions

  • 2606.01965 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 2604.25719 #17 · arxiv_oai · confidence 0.70 Gang Yu
  • 2605.28773 #9 · arxiv_oai · confidence 0.70 Gang Yu
  • 2605.23463 #99 · arxiv_oai · confidence 0.70 Gang Yu
  • 2605.18390 #6 · arxiv_oai · confidence 0.70 Gang Yu
  • 2502.10248 #36 · arxiv_oai · confidence 0.70 Gang Yu
  • 1004.1304 #1 · backfill · confidence 0.70 Gang Yu
  • 2312.13771 #8 · arxiv_oai · confidence 0.70 Gang Yu
  • 2507.16632 #8 · arxiv_oai · confidence 0.70 Gang Yu

Frequent Coauthors