pith. sign in

Zhiyong Wu

Identifiers

  • name variant Zhiyong Wu 0.60 · backfill

Papers (23)

  1. LoSATok: Low-dimensional Semantic-Acoustic Tokenizer for Cross-Domain Audio Understanding and Generation eess.AS · 2026 · author #6
  2. UniSRM: A Unified Speech Reward Model for Reasoning-Based Fine-grained Assessment eess.AS · 2026 · author #4
  3. OpenCompass: A Universal Evaluation Platform for Large Language Models cs.CL · 2026 · author #17
  4. How Should LLMs Listen While Speaking? A Study of User-Stream Routing in Full-Duplex Spoken Dialogue cs.CL · 2026 · author #7
  5. SPG-Codec: Exploring the Role and Boundaries of Semantic Priors in Ultra-Low-Bitrate Neural Speech Coding eess.AS · 2026 · author #4
  6. TTS-PRISM: A Perceptual Reasoning and Interpretable Speech Model for Fine-Grained Diagnosis cs.CL · 2026 · author #11
  7. Towards Streaming Target Speaker Extraction via Chunk-wise Interleaved Splicing of Autoregressive Language Model cs.SD · 2026 · author #11
  8. SongBench: A Fine-Grained Multi-Aspect Benchmark for Song Quality Assessment eess.AS · 2026 · author #8
  9. BugForge: Constructing and Utilizing DBMS Bug Repository to Enhance DBMS Testing cs.SE · 2026 · author #5
  10. UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning cs.AI · 2025 · author #16
  11. ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows cs.AI · 2025 · author #21
  12. Seed1.5-VL Technical Report cs.CV · 2025 · author #192
  13. Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling cs.CV · 2024 · author #29
  14. OS-ATLAS: A Foundation Action Model for Generalist GUI Agents cs.CL · 2024 · author #1
  15. SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents cs.HC · 2024 · author #7
  16. A Survey on In-context Learning cs.CL · 2022 · author #9
  17. DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models cs.CL · 2022 · author #4
  18. Strong Exciton-Photon Coupling and lasing behavior in All-Inorganic CsPbBr3 Micro/nanowire Fabry-Perot cavity cond-mat.mes-hall · 2017 · author #5
  19. Exciton-Polaritons in Hybrid Inorganic-organic Perovskite Fabry-P\'erot Microcavity cond-mat.mes-hall · 2017 · author #7
  20. NEXT: A Neural Network Framework for Next POI Recommendation cs.IR · 2017 · author #3
  21. Study on Feature Subspace of Archetypal Emotions for Speech Emotion Recognition cs.LG · 2016 · author #2
  22. Measuring and Maximizing Influence via Random Walk in Social Activity Networks cs.SI · 2016 · author #4
  23. Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition cs.CL · 2013 · author #2

Mentions

  • 1309.6176 #2 · backfill · confidence 0.70 Zhiyong Wu
  • 2605.27840 #6 · arxiv_oai · confidence 0.70 Zhiyong Wu
  • 2605.23261 #4 · arxiv_oai · confidence 0.70 Zhiyong Wu
  • 2210.08933 #4 · arxiv_oai · confidence 0.70 Zhiyong Wu
  • 2605.19276 #17 · arxiv_oai · confidence 0.70 Zhiyong Wu
  • 2401.10935 #7 · arxiv_oai · confidence 0.70 Zhiyong Wu

Frequent Coauthors