pith. sign in

Yonghui Wu

Identifiers

  • name variant Yonghui Wu 0.60 · backfill

Papers (55)

  1. Prompt, Plan, Extract: Zero-Shot Agentic LLMs Workflows for Lung Pathology Extraction from Clinical Narratives cs.CL · 2026 · author #11
  2. Agentic AI Enhances Physician Trust in Clinical Decision Making cs.CY · 2026 · author #12
  3. MARCH: Multi-Agent Radiology Clinical Hierarchy for CT Report Generation cs.AI · 2026 · author #3
  4. Seedance 2.0: Advancing Video Generation for World Complexity cs.CV · 2026 · author #127
  5. Detecting HIV-Related Stigma in Clinical Narratives Using Large Language Models cs.CL · 2026 · author #10
  6. A Parameter-Efficient Transfer Learning Approach through Multitask Prompt Distillation and Decomposition for Clinical NLP cs.CL · 2026 · author #4
  7. Retrieval-Augmented LLMs for Evidence Localization in Clinical Trial Recruitment from Longitudinal EHR Narratives cs.CL · 2026 · author #4
  8. Improving Automatic Summarization of Radiology Reports through Mid-Training of Large Language Models cs.CL · 2026 · author #6
  9. Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model cs.CV · 2025 · author #139
  10. Seedream 4.0: Toward Next-generation Multimodal Image Generation cs.CV · 2025 · author #34
  11. Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference cs.CL · 2025 · author #21
  12. Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #1454
  13. Seed1.5-VL Technical Report cs.CV · 2025 · author #170
  14. VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks cs.AI · 2025 · author #26
  15. DAPO: An Open-Source LLM Reinforcement Learning System at Scale cs.LG · 2025 · author #34
  16. Narrative Feature or Structured Feature? A Study of Large Language Models to Identify Cancer Patients at Risk of Heart Failure cs.LG · 2024 · author #7
  17. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context cs.CL · 2024 · author #1132
  18. Gemini: A Family of Highly Capable Multimodal Models cs.CL · 2023 · author #1347
  19. PaLM 2 Technical Report cs.CL · 2023 · author #128
  20. Scaling Autoregressive Models for Content-Rich Text-to-Image Generation cs.CV · 2022 · author #17
  21. CoCa: Contrastive Captioners are Image-Text Foundation Models cs.CV · 2022 · author #6
  22. Vector-quantized Image Modeling with Improved VQGAN cs.CV · 2021 · author #10
  23. GSPMD: General and Scalable Parallelization for ML Computation Graphs cs.DC · 2021 · author #15
  24. Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges cs.CL · 2019 · author #13
  25. Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning cs.CL · 2019 · author #4
  26. Gmail Smart Compose: Real-Time Assisted Writing cs.CL · 2019 · author #12
  27. Direct speech-to-speech translation with a sequence-to-sequence model cs.CL · 2019 · author #7
  28. LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech cs.SD · 2019 · author #8
  29. Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling cs.LG · 2019 · author #3
  30. Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes eess.AS · 2018 · author #4
  31. GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism cs.CV · 2018 · author #10
  32. Streaming End-to-end Speech Recognition For Mobile Devices cs.CL · 2018 · author #9
  33. Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation cs.CL · 2018 · author #9
  34. Hierarchical Generative Modeling for Controllable Speech Synthesis cs.CL · 2018 · author #5
  35. Training Deeper Neural Machine Translation Models with Transparent Attention cs.CL · 2018 · author #5
  36. A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition eess.AS · 2018 · author #4
  37. Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis cs.CL · 2018 · author #11
  38. The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation cs.CL · 2018 · author #11
  39. Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions cs.CL · 2017 · author #13
  40. An analysis of incorporating an external language model into a sequence-to-sequence model eess.AS · 2017 · author #2
  41. No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models cs.CL · 2017 · author #10
  42. Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models cs.CL · 2017 · author #3
  43. Improving the Performance of Online Neural Transducer Models cs.CL · 2017 · author #5
  44. State-of-the-art Speech Recognition With Sequence-to-Sequence Models cs.CL · 2017 · author #3
  45. Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model eess.AS · 2017 · author #8
  46. Speech recognition for medical conversations cs.CL · 2017 · author #13
  47. Tacotron: Towards End-to-End Speech Synthesis cs.CL · 2017 · author #4
  48. Sequence-to-Sequence Models Can Directly Translate Foreign Speech cs.CL · 2017 · author #4
  49. Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation cs.CL · 2016 · author #5
  50. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation cs.CL · 2016 · author #1
  51. Reward Augmented Maximum Likelihood for Neural Structured Prediction cs.LG · 2016 · author #6
  52. Exploring the Limits of Language Modeling cs.CL · 2016 · author #5
  53. Barcoding-free BAC Pooling Enables Combinatorial Selective Sequencing of the Barley Gene Space q-bio.GN · 2011 · author #7
  54. Prisoner's dilemma in structured scale-free networks physics.soc-ph · 2009 · author #2
  55. A unified model for Sierpinski networks with scale-free scaling and small-world effect cond-mat.dis-nn · 2009 · author #5

Mentions

  • 2606.30658 #12 · arxiv_oai · confidence 0.70 Yonghui Wu
  • 2403.11425 #7 · arxiv_oai · confidence 0.70 Yonghui Wu
  • 2606.19852 #11 · arxiv_oai · confidence 0.70 Yonghui Wu
  • 1112.4438 #7 · backfill · confidence 0.70 Yonghui Wu
  • 2105.04663 #15 · arxiv_oai · confidence 0.70 Yonghui Wu
  • 2110.04627 #10 · arxiv_oai · confidence 0.70 Yonghui Wu
  • 0905.2724 #2 · backfill · confidence 0.70 Yonghui Wu
  • 2512.13507 #139 · arxiv_oai · confidence 0.70 Yonghui Wu
  • 0903.3997 #5 · backfill · confidence 0.70 Yonghui Wu
  • 2508.02193 #21 · arxiv_oai · confidence 0.70 Yonghui Wu

Frequent Coauthors