Di He

Identifiers

  • name variant Di He 0.60 · backfill

Papers (26)

  1. One LR Doesn't Fit All: Heavy-Tail Guided Layerwise Learning Rates for LLMs cs.LG · 2026 · author #1
  2. Lossless Anti-Distillation Sampling cs.LG · 2026 · author #6
  3. Quotient-Space Diffusion Models cs.LG · 2026 · author #6
  4. Evaluation-driven Scaling for Scientific Discovery cs.LG · 2026 · author #19
  5. In-Place Test-Time Training cs.LG · 2026 · author #5
  6. Towards Solving the Gilbert-Pollak Conjecture via Large Language Models cs.DM · 2026 · author #4
  7. Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value cs.LG · 2025 · author #4
  8. Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View cs.LG · 2019 · author #3
  9. Multilingual Neural Machine Translation with Knowledge Distillation cs.CL · 2019 · author #3
  10. Non-Autoregressive Machine Translation with Auxiliary Regularization cs.CL · 2019 · author #3
  11. Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input cs.CL · 2018 · author #3
  12. Sentence-wise Smooth Regularization for Sequence to Sequence Learning cs.CL · 2018 · author #3
  13. When CTC Training Meets Acoustic Landmarks eess.AS · 2018 · author #1
  14. Joint bi-modal image reconstruction of DOT and XCT with an extended Mumford-Shah functional eess.IV · 2018 · author #1
  15. Augmenting Input Method Language Model with user Location Type Information cs.SI · 2018 · author #1
  16. Beyond Error Propagation in Neural Machine Translation: Characteristics of Language Also Matter cs.CL · 2018 · author #3
  17. Double Path Networks for Sequence to Sequence Learning cs.CL · 2018 · author #3
  18. Towards Binary-Valued Gates for Robust LSTM Training cs.LG · 2018 · author #2
  19. Dense Information Flow for Neural Machine Translation cs.CL · 2018 · author #3
  20. Improved ASR for Under-Resourced Languages Through Multi-Task Learning with Acoustic Landmarks cs.CL · 2018 · author #1
  21. Acoustic Landmarks Contain More Information About the Phone String than Other Frames for Automatic Speech Recognition with Deep Neural Network Acoustic Model eess.AS · 2017 · author #1
  22. Dual Learning for Machine Translation cs.CL · 2016 · author #2
  23. Sentence Level Recurrent Topic Model: Letting Topics Speak for Themselves cs.LG · 2016 · author #3
  24. A Game-theoretic Machine Learning Approach for Revenue Maximization in Sponsored Search cs.GT · 2014 · author #1
  25. Generalized Second Price Auction with Probabilistic Broad Match cs.GT · 2014 · author #2
  26. A Theoretical Analysis of NDCG Type Ranking Measures cs.LG · 2013 · author #4

Mentions

  • 1406.0728 #1 · backfill · confidence 0.70 Di He
  • 1404.3828 #2 · backfill · confidence 0.70 Di He
  • 1304.6480 #4 · backfill · confidence 0.70 Di He
  • 2605.22297 #1 · arxiv_oai · confidence 0.70 Di He
  • 2601.22365 #4 · arxiv_oai · confidence 0.70 Di He
  • 2605.18829 #6 · arxiv_oai · confidence 0.70 Di He

Frequent Coauthors