Multi-source transfer learning incurs an intrinsic adaptation cost that can exceed one, with phase transitions separating regimes where bias-agnostic estimators match oracle performance from those where they cannot.
Angelopoulos, Stephen Bates, Clara Fannjiang, Michael I
6 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
method 2polarities
use method 2representative citing papers
NaiAD is a new dataset and framework for LLM-native advertising that uses decoupled generation and calibrated scoring to identify four semantic strategies for balancing user and commercial utilities.
LLMs show a consistent performance drop on arithmetic, spatial, and temporal reasoning tasks when framed in multi-turn dialogues versus isolated settings, demonstrated by the new BOULDER benchmark across eight travel-related tasks.
Response times modeled as drift-diffusion processes enable consistent estimation of population-average preferences from heterogeneous anonymous binary choices.
PPI++ yields easy-to-compute confidence sets for any-dimensional parameters that always improve on classical intervals from labeled data alone by leveraging abundant ML predictions.
citing papers explorer
-
The Statistical Cost of Adaptation in Multi-Source Transfer Learning
Multi-source transfer learning incurs an intrinsic adaptation cost that can exceed one, with phase transitions separating regimes where bias-agnostic estimators match oracle performance from those where they cannot.
-
NaiAD: Initiate Data-Driven Research for LLM Advertising
NaiAD is a new dataset and framework for LLM-native advertising that uses decoupled generation and calibrated scoring to identify four semantic strategies for balancing user and commercial utilities.
-
Reasoning Gets Harder for LLMs Inside A Dialogue
LLMs show a consistent performance drop on arithmetic, spatial, and temporal reasoning tasks when framed in multi-turn dialogues versus isolated settings, demonstrated by the new BOULDER benchmark across eight travel-related tasks.
-
Response Time Enhances Alignment with Heterogeneous Preferences
Response times modeled as drift-diffusion processes enable consistent estimation of population-average preferences from heterogeneous anonymous binary choices.
-
PPI++: Efficient Prediction-Powered Inference
PPI++ yields easy-to-compute confidence sets for any-dimensional parameters that always improve on classical intervals from labeled data alone by leveraging abundant ML predictions.
- Real vs. Semi-Simulated: Rethinking Evaluation for Treatment Effect Estimation