Lei Xie

Identifiers

  • name variant Lei Xie 0.60 · backfill

Papers (48)

  1. FlashTTS: Fast Streaming TTS with MTP Acceleration and X-pred Mean Flow Distillation eess.AS · 2026 · author #13
  2. MeanVC 2: Robust Low-Latency Streaming Zero-Shot Voice Conversion eess.AS · 2026 · author #8
  3. G-MaP-SE: Guided Speech Enhancement via GMM-Based Prior Matching eess.AS · 2026 · author #8
  4. Towards Unified Song Generation and Singing Voice Conversion with Accompaniment Co-Generation cs.SD · 2026 · author #13
  5. Beyond Semantic Dominance: Cognitive Affective Reasoning and Empathetic Response Alignment in Audio Language Models eess.AS · 2026 · author #6
  6. Self-Optimizing Control of Continuous Processes Based on Reinforcement Learning eess.SY · 2026 · author #3
  7. SoulX-Transcriber: A Robust End-to-End Framework for Multi-Speaker Speech Transcription eess.AS · 2026 · author #12
  8. InfoMerge: Information-aware Token Compression for Efficient Video Large Language Models cs.CV · 2026 · author #5
  9. UrduSpeech: A 156-Hour Urdu Speech Corpus with 12-Dimension Paralinguistic Annotations eess.AS · 2026 · author #5
  10. S2Accompanist: A Semantic-Aware and Structure-Guided Diffusion Model for Music Accompaniment Generation eess.AS · 2026 · author #10
  11. Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model eess.AS · 2026 · author #12
  12. Listening with Time: Precise Temporal Awareness for Long-Form Audio Understanding eess.AS · 2026 · author #9
  13. Full-Duplex Interaction in Spoken Dialogue Systems: A Comprehensive Study from the ICASSP 2026 HumDial Challenge eess.AS · 2026 · author #9
  14. MINT-Bench: A Comprehensive Multilingual Benchmark for Instruction-Following Text-to-Speech eess.AS · 2026 · author #15
  15. Audio-Cogito: Towards Deep Audio Reasoning in Large Audio Language Models eess.AS · 2026 · author #7
  16. HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models eess.AS · 2026 · author #8
  17. EvoTSE: Evolving Enrollment for Target Speaker Extraction eess.AS · 2026 · author #7
  18. Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR eess.AS · 2026 · author #8
  19. FastTurn: Unifying Acoustic and Streaming Semantic Cues for Low-Latency and Robust Turn Detection cs.SD · 2026 · author #11
  20. YingMusic-Singer-Plus: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance eess.AS · 2026 · author #9
  21. AugVLA-3D: Depth-Driven Feature Augmentation for Vision-Language-Action Models cs.CV · 2026 · author #3
  22. SongFormer: Scaling Music Structure Analysis with Heterogeneous Supervision eess.AS · 2025 · author #8
  23. SenSE: Semantic-Aware High-Fidelity Universal Speech Enhancement eess.AS · 2025 · author #7
  24. Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages cs.SD · 2025 · author #7
  25. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens cs.SD · 2025 · author #23
  26. An Overtaking Trajectory Planning Framework Based on Spatio-temporal Topology and Reachable Set Analysis Ensuring Time Efficiency cs.RO · 2024 · author #4
  27. Memristive Devices for Computation-In-Memory cs.ET · 2019 · author #3
  28. A New GAN-based End-to-End TTS Training Algorithm cs.CL · 2019 · author #4
  29. Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS cs.CL · 2019 · author #4
  30. Improved Speaker-Dependent Separation for CHiME-5 Challenge eess.AS · 2019 · author #6
  31. Structural insights into characterizing binding sites in EGFR kinase mutants q-bio.MN · 2018 · author #2
  32. Exploring RNN-Transducer for Chinese Speech Recognition cs.CL · 2018 · author #5
  33. Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition cs.CL · 2018 · author #3
  34. Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search cs.CL · 2018 · author #3
  35. Domain Adversarial Training for Accented Speech Recognition cs.CL · 2018 · author #5
  36. Training Augmentation with Adversarial Examples for Robust Speech Recognition cs.CL · 2018 · author #5
  37. Attention-based End-to-End Models for Small-Footprint Keyword Spotting cs.SD · 2018 · author #4
  38. Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model cs.SD · 2018 · author #4
  39. Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition cs.SD · 2018 · author #6
  40. Attention-Based End-to-End Speech Recognition on Voice Search cs.CL · 2017 · author #4
  41. Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning Framework cs.SD · 2017 · author #2
  42. Empirical Evaluation of Parallel Training Algorithms on Acoustic Modeling cs.CL · 2017 · author #3
  43. Spin-Valley Beam Splitter in Graphene cond-mat.mes-hall · 2016 · author #2
  44. Energy-aware Traffic Engineering in Hybrid SDN/IP Backbone Networks cs.NI · 2016 · author #3
  45. Automatic Prosody Prediction for Chinese Speech Synthesis using BLSTM-RNN and Embedding Features cs.CL · 2015 · author #2
  46. A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis cs.SD · 2015 · author #4
  47. Tunable giant exchange bias in single-phase rare earth-transition metal intermetallics YMn12-xFex with highly homogenous inter-sublattice exchange coupling cond-mat.str-el · 2015 · author #8
  48. Bi-objective Optimization for Robust RGB-D Visual Odometry cs.RO · 2014 · author #4

Mentions

  • 2606.09141 #13 · arxiv_oai · confidence 0.70 Lei Xie
  • 2606.09050 #8 · arxiv_oai · confidence 0.70 Lei Xie
  • 2606.08580 #8 · arxiv_oai · confidence 0.70 Lei Xie
  • 2606.07015 #13 · arxiv_oai · confidence 0.70 Lei Xie
  • 2606.06940 #6 · arxiv_oai · confidence 0.70 Lei Xie
  • 1511.00360 #2 · backfill · confidence 0.70 Lei Xie
  • 1510.01443 #4 · backfill · confidence 0.70 Lei Xie
  • 1508.07833 #8 · backfill · confidence 0.70 Lei Xie
  • 2606.04471 #3 · arxiv_oai · confidence 0.70 Lei Xie
  • 2606.02400 #12 · arxiv_oai · confidence 0.70 Lei Xie
  • 2606.02161 #5 · arxiv_oai · confidence 0.70 Lei Xie
  • 1411.7445 #4 · backfill · confidence 0.70 Lei Xie
  • 2605.17846 #5 · arxiv_oai · confidence 0.70 Lei Xie
  • 2605.17414 #10 · arxiv_oai · confidence 0.70 Lei Xie
  • 2503.01710 #23 · arxiv_oai · confidence 0.70 Lei Xie

Frequent Coauthors