Colin Raffel
Identifiers
- name variant Colin Raffel 0.60 · backfill
Papers (26)
- How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data cs.CL · 2026 · author #10
- SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model cs.CL · 2025 · author #20
- The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale cs.CL · 2024 · author #6
- Uncovering Model Processing Strategies with Non-Negative Per-Example Fisher Factorization cs.LG · 2023 · author #2
- Scaling Data-Constrained Language Models cs.CL · 2023 · author #9
- BLOOM: A 176B-Parameter Open-Access Multilingual Language Model cs.CL · 2022 · author #34
- Emergent Abilities of Large Language Models cs.CL · 2022 · author #4
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #85
- Multitask Prompted Training Enables Zero-Shot Task Generalization cs.LG · 2021 · author #3
- How Much Knowledge Can You Pack Into the Parameters of a Language Model? cs.CL · 2020 · author #2
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer cs.LG · 2019 · author #1
- Monotonic Infinite Lookback Attention for Simultaneous Machine Translation cs.CL · 2019 · author #8
- Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition eess.AS · 2019 · author #5
- Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling cs.LG · 2019 · author #61
- Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer cs.LG · 2018 · author #2
- Learning a Latent Space of Multitrack Measures stat.ML · 2018 · author #3
- Realistic Evaluation of Deep Semi-Supervised Learning Algorithms cs.LG · 2018 · author #3
- Is Generator Conditioning Causally Related to GAN Performance? stat.ML · 2018 · author #6
- Monotonic Chunkwise Attention cs.CL · 2017 · author #2
- Onsets and Frames: Dual-Objective Piano Transcription cs.SD · 2017 · author #6
- Learning Hard Alignments with Variational Inference cs.AI · 2017 · author #4
- Online and Linear-Time Attention by Enforcing Monotonic Alignments cs.LG · 2017 · author #1
- Training a Subsampling Mechanism in Expectation cs.LG · 2017 · author #1
- Theano: A Python framework for fast computation of mathematical expressions cs.SC · 2016 · author #80
- Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems cs.LG · 2015 · author #1
- Poker-CNN: A Pattern Learning Strategy for Making Draws and Bets in Poker Games cs.AI · 2015 · author #3
Mentions
- 1509.06731 #3 · backfill · confidence 0.70 Colin Raffel
- 2310.04649 #2 · arxiv_oai · confidence 0.70 Colin Raffel
- 2305.16264 #9 · arxiv_oai · confidence 0.70 Colin Raffel
Frequent Coauthors
- Thomas Wolf 6 shared papers
- Adam Roberts 5 shared papers
- Chung-Cheng Chiu 4 shared papers
- Ian Goodfellow 4 shared papers
- Leandro Von Werra 4 shared papers
- Alexander M. Rush 3 shared papers
- Andrea Santilli 3 shared papers
- Debajyoti Datta 3 shared papers
- Douglas Eck 3 shared papers
- Guilherme Penedo 3 shared papers
- Hynek Kydl\'i\v{c}ek 3 shared papers
- Jesse Engel 3 shared papers
- Jos Rozen 3 shared papers
- Leo Gao 3 shared papers
- Loubna Ben Allal 3 shared papers
- Niklas Muennighoff 3 shared papers
- Ryan Teehan 3 shared papers
- Stella Biderman 3 shared papers
- Trishala Neeraj 3 shared papers
- Abheesht Sharma 2 shared papers