Xiangyu Zhang
Identifiers
- name variant: Xiangyu Zhang · 0.60 · backfill
Papers (53)
- MemoryVLA++: Temporal Modeling via Memory and Imagination in Vision-Language-Action Models · cs.RO · 2026 · author #7
- The WER Trap: Shattering the Illusion of Unified Tokens in Speech Language Models · eess.AS · 2026 · author #1
- Diagnosing Live Within-Policy Instruction Conflicts in LLM Agents with Witnessed Resolution Profiles · cs.AI · 2026 · author #3
- AndroidDaily: A Verifiable Benchmark for Mobile GUI Agents on Real-World Closed-Source Applications · cs.CV · 2026 · author #15
- StepAudio 2.5 Technical Report · eess.AS · 2026 · author #100
- Vision Foundation Models as Generalist Tokenizers for Image Generation · cs.CV · 2026 · author #7
- Step-Audio-R1.5 Technical Report · eess.AS · 2026 · author #18
- Spike-NVPT: Learning Robust Visual Prompts via Bio-Inspired Temporal Filtering and Discretization · cs.CV · 2026 · author #5
- Breaking the Training Barrier of Billion-Parameter Universal Machine Learning Interatomic Potentials · cs.DC · 2026 · author #7
- SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments · cs.CV · 2026 · author #15
- Why Your Tokenizer Fails in Information Fusion: A Timing-Aware Pre-Quantization Fusion for Video-Enhanced Audio Tokenization · eess.AS · 2026 · author #1
- MemoPhishAgent: Memory-Augmented Multi-Modal LLM Agent for Phishing URL Detection · cs.CR · 2026 · author #6
- DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder · cs.AI · 2026 · author #13
- Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models · cs.CL · 2025 · author #9
- MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation · cs.RO · 2025 · author #9
- WISCA: A Lightweight Model Transition Method to Improve LLM Training via Weight Scaling · cs.LG · 2025 · author #7
- A New Class of Asymptotically Distribution-Free Smooth Tests · math.ST · 2025 · author #1
- Step-Audio 2 Technical Report · cs.CL · 2025 · author #108
- BugScope: Learn to Find Bugs Like Human · cs.SE · 2025 · author #6
- VERA: Variational Inference Framework for Jailbreaking Large Language Models · cs.CR · 2025 · author #4
- Mixture-of-Experts Can Surpass Dense LLMs Under Strictly Equal Resource · cs.CL · 2025 · author #9
- Raw Pointer Rewriting with LLMs for Translating C to Safer Rust · cs.SE · 2025 · author #6
- Step1X-Edit: A Practical Framework for General Image Editing · cs.CV · 2025 · author #22
- Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model · cs.LG · 2025 · author #5
- Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction · cs.CL · 2025 · author #143
- Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model · cs.CV · 2025 · author #111
- Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety · cs.CR · 2025 · author #39
- NESA: Relational Neuro-Symbolic Static Program Analysis · cs.PL · 2024 · author #8
- General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model · cs.CV · 2024 · author #12
- Poisoning with A Pill: Circumventing Detection in Federated Learning · cs.LG · 2024 · author #7
- Arbitrage of Energy Storage in Electricity Markets with Deep Reinforcement Learning · cs.LG · 2019 · author #3
- Meta-SR: A Magnification-Arbitrary Network for Super-Resolution · cs.CV · 2019 · author #3
- Attacks Meet Interpretability: Attribute-steered Detection of Adversarial Samples · cs.LG · 2018 · author #4
- Bounding Box Regression with Uncertainty for Accurate Object Detection · cs.CV · 2018 · author #5
- ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design · cs.CV · 2018 · author #2
- MetaAnchor: Learning to Detect Objects with Customized Anchors · cs.CV · 2018 · author #2
- CrowdHuman: A Benchmark for Detecting Human in a Crowd · cs.CV · 2018 · author #6
- DetNet: A Backbone network for Object Detection · cs.CV · 2018 · author #4
- ExFuse: Enhancing Feature Fusion for Semantic Segmentation · cs.CV · 2018 · author #2
- Light-Head R-CNN: In Defense of Two-Stage Object Detector · cs.CV · 2017 · author #4
- MegDet: A Large Mini-Batch Object Detector · cs.CV · 2017 · author #5
- Channel Pruning for Accelerating Very Deep Neural Networks · cs.CV · 2017 · author #2
- ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices · cs.CV · 2017 · author #1
- Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network · cs.CV · 2017 · author #2
- Identity Mappings in Deep Residual Networks · cs.CV · 2016 · author #2
- Deep Residual Learning for Image Recognition · cs.CV · 2015 · author #2
- Accelerating Very Deep Convolutional Networks for Classification and Detection · cs.CV · 2015 · author #1
- Discrete solitons in self-defocusing systems with $\mathcal{PT}$-symmetric defects · nlin.PS · 2015 · author #4
- Object Detection Networks on Convolutional Feature Maps · cs.CV · 2015 · author #4
- Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification · cs.CV · 2015 · author #2
- Efficient and Accurate Approximations of Nonlinear Convolutional Networks · cs.CV · 2014 · author #1
- Discrete solitons and scattering of lattice waves in guiding arrays with a nonlinear $\mathcal{PT}$-symmetric defect · physics.optics · 2014 · author #1
- Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition · cs.CV · 2014 · author #2
Mentions
- 2606.09827 #7 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 1505.06798 #1 · backfill · confidence 0.70 · Xiangyu Zhang
- 1504.06191 #4 · backfill · confidence 0.70 · Xiangyu Zhang
- 1504.06066 #4 · backfill · confidence 0.70 · Xiangyu Zhang
- 1502.01852 #2 · backfill · confidence 0.70 · Xiangyu Zhang
- 2604.25719 #18 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2508.01973 #1 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2506.22666 #4 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 1411.4229 #1 · backfill · confidence 0.70 · Xiangyu Zhang
- 1411.3944 #1 · backfill · confidence 0.70 · Xiangyu Zhang
- 1406.4729 #2 · backfill · confidence 0.70 · Xiangyu Zhang
- 2605.29209 #1 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2605.27784 #3 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2605.27761 #15 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2605.23463 #100 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2605.18390 #7 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2506.12119 #9 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2502.10248 #111 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2502.11946 #143 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2409.01704 #12 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2507.16632 #108 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
- 2508.19236 #9 · arxiv_oai · confidence 0.70 · Xiangyu Zhang
Frequent Coauthors
- Jian Sun 18 shared papers
- Gang Yu 12 shared papers
- Daxin Jiang 10 shared papers
- Kaiming He 7 shared papers
- Haoyang Zhang 6 shared papers
- Zheng Ge 6 shared papers
- Chao Peng 5 shared papers
- Fei Tian 5 shared papers
- Liang Zhao 5 shared papers
- Shaoqing Ren 5 shared papers
- Xuerui Yang 5 shared papers
- Yibo Zhu 5 shared papers
- Binxing Jiao 4 shared papers
- Brian Li 4 shared papers
- Heung-Yeung Shum 4 shared papers
- Jianjian Sun 4 shared papers
- Kang An 4 shared papers
- Mingliang Li 4 shared papers
- Na Wang 4 shared papers
- Qi Han 4 shared papers