Zhiyong Wu
Identifiers
- name variant Zhiyong Wu 0.60 · backfill
Papers (23)
- LoSATok: Low-dimensional Semantic-Acoustic Tokenizer for Cross-Domain Audio Understanding and Generation eess.AS · 2026 · author #6
- UniSRM: A Unified Speech Reward Model for Reasoning-Based Fine-grained Assessment eess.AS · 2026 · author #4
- OpenCompass: A Universal Evaluation Platform for Large Language Models cs.CL · 2026 · author #17
- How Should LLMs Listen While Speaking? A Study of User-Stream Routing in Full-Duplex Spoken Dialogue cs.CL · 2026 · author #7
- SPG-Codec: Exploring the Role and Boundaries of Semantic Priors in Ultra-Low-Bitrate Neural Speech Coding eess.AS · 2026 · author #4
- TTS-PRISM: A Perceptual Reasoning and Interpretable Speech Model for Fine-Grained Diagnosis cs.CL · 2026 · author #11
- Towards Streaming Target Speaker Extraction via Chunk-wise Interleaved Splicing of Autoregressive Language Model cs.SD · 2026 · author #11
- SongBench: A Fine-Grained Multi-Aspect Benchmark for Song Quality Assessment eess.AS · 2026 · author #8
- BugForge: Constructing and Utilizing DBMS Bug Repository to Enhance DBMS Testing cs.SE · 2026 · author #5
- UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning cs.AI · 2025 · author #16
- ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows cs.AI · 2025 · author #21
- Seed1.5-VL Technical Report cs.CV · 2025 · author #192
- Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling cs.CV · 2024 · author #29
- OS-ATLAS: A Foundation Action Model for Generalist GUI Agents cs.CL · 2024 · author #1
- SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents cs.HC · 2024 · author #7
- A Survey on In-context Learning cs.CL · 2022 · author #9
- DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models cs.CL · 2022 · author #4
- Strong Exciton-Photon Coupling and lasing behavior in All-Inorganic CsPbBr3 Micro/nanowire Fabry-Perot cavity cond-mat.mes-hall · 2017 · author #5
- Exciton-Polaritons in Hybrid Inorganic-organic Perovskite Fabry-P\'erot Microcavity cond-mat.mes-hall · 2017 · author #7
- NEXT: A Neural Network Framework for Next POI Recommendation cs.IR · 2017 · author #3
- Study on Feature Subspace of Archetypal Emotions for Speech Emotion Recognition cs.LG · 2016 · author #2
- Measuring and Maximizing Influence via Random Walk in Social Activity Networks cs.SI · 2016 · author #4
- Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition cs.CL · 2013 · author #2
Mentions
- 1309.6176 #2 · backfill · confidence 0.70 Zhiyong Wu
- 2605.27840 #6 · arxiv_oai · confidence 0.70 Zhiyong Wu
- 2605.23261 #4 · arxiv_oai · confidence 0.70 Zhiyong Wu
- 2210.08933 #4 · arxiv_oai · confidence 0.70 Zhiyong Wu
- 2605.19276 #17 · arxiv_oai · confidence 0.70 Zhiyong Wu
- 2401.10935 #7 · arxiv_oai · confidence 0.70 Zhiyong Wu
Frequent Coauthors
- Fangzhi Xu 3 shared papers
- Helen Meng 3 shared papers
- Kanzhi Cheng 3 shared papers
- Qiushi Sun 3 shared papers
- Faming Wu 2 shared papers
- Fuxing Leng 2 shared papers
- Guang Shi 2 shared papers
- Haihua Yang 2 shared papers
- Haobin Chen 2 shared papers
- Haoming Wang 2 shared papers
- Hui Lu 2 shared papers
- Huimeng Wang 2 shared papers
- Jia Shi 2 shared papers
- Jie Chen 2 shared papers
- Jingjia Huang 2 shared papers
- Junjie Fang 2 shared papers
- Junting Lu 2 shared papers
- Kai Chen 2 shared papers
- Kai Shen 2 shared papers
- Lei Li 2 shared papers