Colin Raffel — Pith Author Registry

Identifiers

name variant Colin Raffel 0.60 · backfill

Papers (26)

How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data cs.CL · 2026 · author #10
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model cs.CL · 2025 · author #20
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale cs.CL · 2024 · author #6
Uncovering Model Processing Strategies with Non-Negative Per-Example Fisher Factorization cs.LG · 2023 · author #2
Scaling Data-Constrained Language Models cs.CL · 2023 · author #9
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model cs.CL · 2022 · author #34
Emergent Abilities of Large Language Models cs.CL · 2022 · author #4
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #85
Multitask Prompted Training Enables Zero-Shot Task Generalization cs.LG · 2021 · author #3
How Much Knowledge Can You Pack Into the Parameters of a Language Model? cs.CL · 2020 · author #2
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer cs.LG · 2019 · author #1
Monotonic Infinite Lookback Attention for Simultaneous Machine Translation cs.CL · 2019 · author #8
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition eess.AS · 2019 · author #5
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling cs.LG · 2019 · author #61
Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer cs.LG · 2018 · author #2
Learning a Latent Space of Multitrack Measures stat.ML · 2018 · author #3
Realistic Evaluation of Deep Semi-Supervised Learning Algorithms cs.LG · 2018 · author #3
Is Generator Conditioning Causally Related to GAN Performance? stat.ML · 2018 · author #6
Monotonic Chunkwise Attention cs.CL · 2017 · author #2
Onsets and Frames: Dual-Objective Piano Transcription cs.SD · 2017 · author #6
Learning Hard Alignments with Variational Inference cs.AI · 2017 · author #4
Online and Linear-Time Attention by Enforcing Monotonic Alignments cs.LG · 2017 · author #1
Training a Subsampling Mechanism in Expectation cs.LG · 2017 · author #1
Theano: A Python framework for fast computation of mathematical expressions cs.SC · 2016 · author #80
Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems cs.LG · 2015 · author #1
Poker-CNN: A Pattern Learning Strategy for Making Draws and Bets in Poker Games cs.AI · 2015 · author #3

Mentions

1509.06731 #3 · backfill · confidence 0.70 Colin Raffel
2310.04649 #2 · arxiv_oai · confidence 0.70 Colin Raffel
2305.16264 #9 · arxiv_oai · confidence 0.70 Colin Raffel

Frequent Coauthors

Thomas Wolf 6 shared papers
Adam Roberts 5 shared papers
Chung-Cheng Chiu 4 shared papers
Ian Goodfellow 4 shared papers
Leandro Von Werra 4 shared papers
Alexander M. Rush 3 shared papers
Andrea Santilli 3 shared papers
Debajyoti Datta 3 shared papers
Douglas Eck 3 shared papers
Guilherme Penedo 3 shared papers
Hynek Kydl\'i\v{c}ek 3 shared papers
Jesse Engel 3 shared papers
Jos Rozen 3 shared papers
Leo Gao 3 shared papers
Loubna Ben Allal 3 shared papers
Niklas Muennighoff 3 shared papers
Ryan Teehan 3 shared papers
Stella Biderman 3 shared papers
Trishala Neeraj 3 shared papers
Abheesht Sharma 2 shared papers