pith. sign in

Baosong Yang

Identifiers

  • name variant Baosong Yang 0.60 · backfill

Papers (24)

  1. A Data-Efficient Path to Multilingual LLMs: Language Expansion via Post-training PARAM$\Delta$ Integration into Upcycled MoE cs.CL · 2026 · author #7
  2. Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models cs.CL · 2026 · author #6
  3. Language as a Latent Variable for Reasoning Optimization cs.CL · 2026 · author #5
  4. TEMPO: Scaling Test-time Training for Large Reasoning Models cs.LG · 2026 · author #6
  5. Judge Like Human Examiners: A Weighted Importance Multi-Point Evaluation Framework for Generative Tasks with Long-form Answers cs.CL · 2026 · author #7
  6. Towards Cross-lingual Values Judgment: A Consensus-Pluralism Perspective cs.CL · 2026 · author #8
  7. Qwen3-ASR Technical Report cs.CL · 2026 · author #10
  8. Qwen3-TTS Technical Report cs.SD · 2026 · author #13
  9. Qwen3Guard Technical Report cs.CL · 2025 · author #11
  10. Qwen3-Omni Technical Report cs.CL · 2025 · author #20
  11. Direct Simultaneous Translation Activation for Large Audio-Language Models cs.SD · 2025 · author #4
  12. Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models cs.CL · 2025 · author #6
  13. Qwen3 Technical Report cs.CL · 2025 · author #3
  14. Qwen2.5 Technical Report cs.CL · 2024 · author #2
  15. Qwen2 Technical Report cs.CL · 2024 · author #2
  16. Assessing the Ability of Self-Attention Networks to Learn Word Order cs.CL · 2019 · author #1
  17. Convolutional Self-Attention Networks cs.CL · 2019 · author #1
  18. Information Aggregation for Multi-Head Attention with Routing-by-Agreement cs.CL · 2019 · author #2
  19. Modeling Recurrence for Transformer cs.CL · 2019 · author #3
  20. Context-Aware Self-Attention Networks cs.CL · 2019 · author #1
  21. Convolutional Self-Attention Network cs.CL · 2018 · author #1
  22. Multi-Head Attention with Disagreement Regularization cs.CL · 2018 · author #3
  23. Modeling Localness for Self-Attention Networks cs.CL · 2018 · author #1
  24. Towards Bidirectional Hierarchical Representations for Attention-Based Neural Machine Translation cs.CL · 2017 · author #1

Mentions

  • 2605.18083 #7 · arxiv_oai · confidence 0.70 Baosong Yang
  • 2601.15621 #13 · arxiv_oai · confidence 0.70 Baosong Yang
  • 2505.09388 #3 · backfill · confidence 0.70 Baosong Yang
  • 2412.15115 #2 · backfill · confidence 0.70 Baosong Yang
  • 2605.11887 #6 · backfill · confidence 0.70 Baosong Yang
  • 2407.10671 #2 · backfill · confidence 0.70 Baosong Yang

Frequent Coauthors