Xiangyu Zhang — Pith Author Registry

Identifiers

name variant Xiangyu Zhang 0.60 · backfill

Papers (53)

MemoryVLA++: Temporal Modeling via Memory and Imagination in Vision-Language-Action Models cs.RO · 2026 · author #7
The WER Trap: Shattering the Illusion of Unified Tokens in Speech Language Models eess.AS · 2026 · author #1
Diagnosing Live Within-Policy Instruction Conflicts in LLM Agents with Witnessed Resolution Profiles cs.AI · 2026 · author #3
AndroidDaily: A Verifiable Benchmark for Mobile GUI Agents on Real-World Closed-Source Applications cs.CV · 2026 · author #15
StepAudio 2.5 Technical Report eess.AS · 2026 · author #100
Vision Foundation Models as Generalist Tokenizers for Image Generation cs.CV · 2026 · author #7
Step-Audio-R1.5 Technical Report eess.AS · 2026 · author #18
Spike-NVPT: Learning Robust Visual Prompts via Bio-Inspired Temporal Filtering and Discretization cs.CV · 2026 · author #5
Breaking the Training Barrier of Billion-Parameter Universal Machine Learning Interatomic Potentials cs.DC · 2026 · author #7
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments cs.CV · 2026 · author #15
Why Your Tokenizer Fails in Information Fusion: A Timing-Aware Pre-Quantization Fusion for Video-Enhanced Audio Tokenization eess.AS · 2026 · author #1
MemoPhishAgent: Memory-Augmented Multi-Modal LLM Agent for Phishing URL Detection cs.CR · 2026 · author #6
DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder cs.AI · 2026 · author #13
Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models cs.CL · 2025 · author #9
MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation cs.RO · 2025 · author #9
WISCA: A Lightweight Model Transition Method to Improve LLM Training via Weight Scaling cs.LG · 2025 · author #7
A New Class of Asymptotically Distribution-Free Smooth Tests math.ST · 2025 · author #1
Step-Audio 2 Technical Report cs.CL · 2025 · author #108
BugScope: Learn to Find Bugs Like Human cs.SE · 2025 · author #6
VERA: Variational Inference Framework for Jailbreaking Large Language Models cs.CR · 2025 · author #4
Mixture-of-Experts Can Surpass Dense LLMs Under Strictly Equal Resource cs.CL · 2025 · author #9
Raw Pointer Rewriting with LLMs for Translating C to Safer Rust cs.SE · 2025 · author #6
Step1X-Edit: A Practical Framework for General Image Editing cs.CV · 2025 · author #22
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model cs.LG · 2025 · author #5
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction cs.CL · 2025 · author #143
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model cs.CV · 2025 · author #111
Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety cs.CR · 2025 · author #39
NESA: Relational Neuro-Symbolic Static Program Analysis cs.PL · 2024 · author #8
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model cs.CV · 2024 · author #12
Poisoning with A Pill: Circumventing Detection in Federated Learning cs.LG · 2024 · author #7
Arbitrage of Energy Storage in Electricity Markets with Deep Reinforcement Learning cs.LG · 2019 · author #3
Meta-SR: A Magnification-Arbitrary Network for Super-Resolution cs.CV · 2019 · author #3
Attacks Meet Interpretability: Attribute-steered Detection of Adversarial Samples cs.LG · 2018 · author #4
Bounding Box Regression with Uncertainty for Accurate Object Detection cs.CV · 2018 · author #5
ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design cs.CV · 2018 · author #2
MetaAnchor: Learning to Detect Objects with Customized Anchors cs.CV · 2018 · author #2
CrowdHuman: A Benchmark for Detecting Human in a Crowd cs.CV · 2018 · author #6
DetNet: A Backbone network for Object Detection cs.CV · 2018 · author #4
ExFuse: Enhancing Feature Fusion for Semantic Segmentation cs.CV · 2018 · author #2
Light-Head R-CNN: In Defense of Two-Stage Object Detector cs.CV · 2017 · author #4
MegDet: A Large Mini-Batch Object Detector cs.CV · 2017 · author #5
Channel Pruning for Accelerating Very Deep Neural Networks cs.CV · 2017 · author #2
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices cs.CV · 2017 · author #1
Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network cs.CV · 2017 · author #2
Identity Mappings in Deep Residual Networks cs.CV · 2016 · author #2
Deep Residual Learning for Image Recognition cs.CV · 2015 · author #2
Accelerating Very Deep Convolutional Networks for Classification and Detection cs.CV · 2015 · author #1
Discrete solitons in self-defocusing systems with $\mathcal{PT}$-symmetric defects nlin.PS · 2015 · author #4
Object Detection Networks on Convolutional Feature Maps cs.CV · 2015 · author #4
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification cs.CV · 2015 · author #2
Efficient and Accurate Approximations of Nonlinear Convolutional Networks cs.CV · 2014 · author #1
Discrete solitons and scattering of lattice waves in guiding arrays with a nonlinear $\mathcal{PT}$-symmetric defect physics.optics · 2014 · author #1
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition cs.CV · 2014 · author #2

Mentions

2606.09827 #7 · arxiv_oai · confidence 0.70 Xiangyu Zhang
1505.06798 #1 · backfill · confidence 0.70 Xiangyu Zhang
1504.06191 #4 · backfill · confidence 0.70 Xiangyu Zhang
1504.06066 #4 · backfill · confidence 0.70 Xiangyu Zhang
1502.01852 #2 · backfill · confidence 0.70 Xiangyu Zhang
2604.25719 #18 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2508.01973 #1 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2506.22666 #4 · arxiv_oai · confidence 0.70 Xiangyu Zhang
1411.4229 #1 · backfill · confidence 0.70 Xiangyu Zhang
1411.3944 #1 · backfill · confidence 0.70 Xiangyu Zhang
1406.4729 #2 · backfill · confidence 0.70 Xiangyu Zhang
2605.29209 #1 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2605.27784 #3 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2605.27761 #15 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2605.23463 #100 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2605.18390 #7 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2506.12119 #9 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2502.10248 #111 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2502.11946 #143 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2409.01704 #12 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2507.16632 #108 · arxiv_oai · confidence 0.70 Xiangyu Zhang
2508.19236 #9 · arxiv_oai · confidence 0.70 Xiangyu Zhang

Frequent Coauthors

Jian Sun 18 shared papers
Gang Yu 12 shared papers
Daxin Jiang 10 shared papers
Kaiming He 7 shared papers
Haoyang Zhang 6 shared papers
Zheng Ge 6 shared papers
Chao Peng 5 shared papers
Fei Tian 5 shared papers
Liang Zhao 5 shared papers
Shaoqing Ren 5 shared papers
Xuerui Yang 5 shared papers
Yibo Zhu 5 shared papers
Binxing Jiao 4 shared papers
Brian Li 4 shared papers
Heung-Yeung Shum 4 shared papers
Jianjian Sun 4 shared papers
Kang An 4 shared papers
Mingliang Li 4 shared papers
Na Wang 4 shared papers
Qi Han 4 shared papers