pith. machine review for the scientific record.

Weizhu Chen

Identifiers

No identifiers captured yet.

Papers (16)

  1. Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation · cs.CL · 2026 · author #6
  2. Rethinking Language Model Scaling under Transferable Hypersphere Optimization · cs.LG · 2026 · author #4
  3. Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs · cs.CL · 2025 · author #14
  4. Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone · cs.CL · 2024 · author #20
  5. CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing · cs.CL · 2023 · author #7
  6. AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning · cs.CL · 2023 · author #7
  7. LoRA: Low-Rank Adaptation of Large Language Models · cs.CL · 2021 · author #8
  8. DeBERTa: Decoding-enhanced BERT with Disentangled Attention · cs.CL · 2020 · author #4
  9. Lessons from Contextual Bandit Learning in a Customer Support Bot · cs.LG · 2019 · author #6
  10. Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding · cs.CL · 2019 · author #3
  11. Multi-Task Deep Neural Networks for Natural Language Understanding · cs.CL · 2019 · author #3
  12. IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles · cs.CL · 2018 · author #6
  13. Learning to Attend On Essential Terms: An Enhanced Retriever-Reader Model for Open-domain Question Answering · cs.CL · 2018 · author #3
  14. FusionNet: Fusing via Fully-Aware Attention with Application to Machine Comprehension · cs.CL · 2017 · author #4
  15. DSCOVR: Randomized Primal-Dual Block Coordinate Algorithms for Asynchronous Distributed Optimization · math.OC · 2017 · author #4
  16. ReasoNet: Learning to Stop Reading in Machine Comprehension · cs.LG · 2016 · author #4

Mentions

No mention provenance yet.

Frequent Coauthors