Mike Lewis

Identifiers

  • name variant Mike Lewis 0.60 · backfill

Papers (26)

  1. Compute Optimal Tokenization cs.CL · 2026 · author #4
  2. Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models cs.CL · 2024 · author #8
  3. The Llama 3 Herd of Models cs.AI · 2024 · author #135
  4. Efficient Streaming Language Models with Attention Sinks cs.CL · 2023 · author #5
  5. LIMA: Less Is More for Alignment cs.CL · 2023 · author #13
  6. REPLUG: Retrieval-Augmented Black-Box Language Models cs.CL · 2023 · author #6
  7. Measuring and Narrowing the Compositionality Gap in Language Models cs.CL · 2022 · author #6
  8. LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale cs.LG · 2022 · author #2
  9. InCoder: A Generative Model for Code Infilling and Synthesis cs.SE · 2022 · author #10
  10. Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? cs.CL · 2022 · author #5
  11. Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation cs.CL · 2021 · author #3
  12. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks cs.CL · 2020 · author #8
  13. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension cs.CL · 2019 · author #1
  14. RoBERTa: A Robustly Optimized BERT Pretraining Approach cs.CL · 2019 · author #8
  15. MelNet: A Generative Model for Audio in the Frequency Domain eess.AS · 2019 · author #2
  16. Improving Semantic Parsing for Task Oriented Dialog cs.CL · 2019 · author #6
  17. Strategies for Structuring Story Generation cs.CL · 2019 · author #2
  18. Cross-Lingual Transfer Learning for Multilingual Task Oriented Dialog cs.CL · 2018 · author #4
  19. Semantic Parsing for Task Oriented Dialog using Hierarchical Representations cs.CL · 2018 · author #5
  20. Neural Compositional Denotational Semantics for Question Answering cs.CL · 2018 · author #2
  21. Community Regularization of Visually-Grounded Dialog cs.CV · 2018 · author #4
  22. Hierarchical Neural Story Generation cs.CL · 2018 · author #2
  23. Hierarchical Text Generation and Planning for Strategic Dialogue cs.CL · 2017 · author #2
  24. End-to-end Neural Coreference Resolution cs.CL · 2017 · author #3
  25. Deal or No Deal? End-to-End Learning for Negotiation Dialogues cs.AI · 2017 · author #1
  26. Global Neural CCG Parsing with Optimality Guarantees cs.CL · 2016 · author #2

Mentions

  • 2605.01188 #4 · arxiv_oai · confidence 0.70 Mike Lewis
  • 2411.04996 #8 · arxiv_oai · confidence 0.70 Mike Lewis
  • 2210.03350 #6 · arxiv_oai · confidence 0.70 Mike Lewis
  • 2301.12652 #6 · arxiv_oai · confidence 0.70 Mike Lewis
  • 2305.11206 #13 · arxiv_oai · confidence 0.70 Mike Lewis
  • 2204.05999 #10 · arxiv_oai · confidence 0.70 Mike Lewis

Frequent Coauthors