2023. Foundation Models for Natural Language Processing
2 Pith papers cite this work. Polarity classification is still indexing.
years: 2026 (2)
verdicts: UNVERDICTED (2)
representative citing papers
Foundation models slightly outperform task-specific models on probabilistic electricity price forecasts but the gap narrows or reverses with extra features or few-shot adaptation, showing that efficiency often outweighs marginal accuracy gains.
citing papers explorer
- Can Continual Pre-training Bridge the Performance Gap between General-purpose and Specialized Language Models in the Medical Domain?
  Continual pre-training on a German medical corpus lets 7B models close much of the performance gap with 24B general models on medical benchmarks, though merging introduces some language mixing and verbosity.
- Assessing the Performance-Efficiency Trade-off of Foundation Models in Probabilistic Electricity Price Forecasting
  Foundation models slightly outperform task-specific models on probabilistic electricity price forecasts but the gap narrows or reverses with extra features or few-shot adaptation, showing that efficiency often outweighs marginal accuracy gains.