pith. sign in

Colin Raffel

Identifiers

  • name variant Colin Raffel 0.60 · backfill

Papers (26)

  1. How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data cs.CL · 2026 · author #10
  2. SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model cs.CL · 2025 · author #20
  3. The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale cs.CL · 2024 · author #6
  4. Uncovering Model Processing Strategies with Non-Negative Per-Example Fisher Factorization cs.LG · 2023 · author #2
  5. Scaling Data-Constrained Language Models cs.CL · 2023 · author #9
  6. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model cs.CL · 2022 · author #34
  7. Emergent Abilities of Large Language Models cs.CL · 2022 · author #4
  8. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #85
  9. Multitask Prompted Training Enables Zero-Shot Task Generalization cs.LG · 2021 · author #3
  10. How Much Knowledge Can You Pack Into the Parameters of a Language Model? cs.CL · 2020 · author #2
  11. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer cs.LG · 2019 · author #1
  12. Monotonic Infinite Lookback Attention for Simultaneous Machine Translation cs.CL · 2019 · author #8
  13. Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition eess.AS · 2019 · author #5
  14. Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling cs.LG · 2019 · author #61
  15. Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer cs.LG · 2018 · author #2
  16. Learning a Latent Space of Multitrack Measures stat.ML · 2018 · author #3
  17. Realistic Evaluation of Deep Semi-Supervised Learning Algorithms cs.LG · 2018 · author #3
  18. Is Generator Conditioning Causally Related to GAN Performance? stat.ML · 2018 · author #6
  19. Monotonic Chunkwise Attention cs.CL · 2017 · author #2
  20. Onsets and Frames: Dual-Objective Piano Transcription cs.SD · 2017 · author #6
  21. Learning Hard Alignments with Variational Inference cs.AI · 2017 · author #4
  22. Online and Linear-Time Attention by Enforcing Monotonic Alignments cs.LG · 2017 · author #1
  23. Training a Subsampling Mechanism in Expectation cs.LG · 2017 · author #1
  24. Theano: A Python framework for fast computation of mathematical expressions cs.SC · 2016 · author #80
  25. Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems cs.LG · 2015 · author #1
  26. Poker-CNN: A Pattern Learning Strategy for Making Draws and Bets in Poker Games cs.AI · 2015 · author #3

Mentions

  • 1509.06731 #3 · backfill · confidence 0.70 Colin Raffel
  • 2310.04649 #2 · arxiv_oai · confidence 0.70 Colin Raffel
  • 2305.16264 #9 · arxiv_oai · confidence 0.70 Colin Raffel

Frequent Coauthors