Alexandre Muzio

Identifiers

  • name variant Alexandre Muzio 0.60 · backfill

Papers (8)

  1. SEER-MoE: Sparse Expert Efficiency through Regularization for Mixture-of-Experts cs.CL · 2024 · author #1
  2. Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers cs.LG · 2022 · author #3
  3. Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task cs.CL · 2021 · author #7
  4. Scalable and Efficient MoE Training for Multitask Multilingual Models cs.CL · 2021 · author #3
  5. Improving Multilingual Translation by Representation and Gradient Regularization cs.CL · 2021 · author #3
  6. Discovering Representation Sprachbund For Multilingual Pre-Training cs.CL · 2021 · author #3
  7. DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders cs.CL · 2021 · author #5
  8. XLM-T: Scaling up Multilingual Machine Translation with Pretrained Cross-lingual Transformer Encoders cs.CL · 2020 · author #8

Mentions

  • 2404.05089 #1 · arxiv_oai · confidence 0.70 Alexandre Muzio
  • 2205.14336 #3 · arxiv_oai · confidence 0.70 Alexandre Muzio
  • 2109.04778 #3 · arxiv_oai · confidence 0.70 Alexandre Muzio
  • 2111.02086 #7 · arxiv_oai · confidence 0.70 Alexandre Muzio
  • 2109.10465 #3 · arxiv_oai · confidence 0.70 Alexandre Muzio
  • 2109.00271 #3 · arxiv_oai · confidence 0.70 Alexandre Muzio
  • 2106.13736 #5 · arxiv_oai · confidence 0.70 Alexandre Muzio
  • 2012.15547 #8 · arxiv_oai · confidence 0.70 Alexandre Muzio

Frequent Coauthors