For the LoRA adapter, we specified a rank of 128, an 𝛼 value of 512, and a dropout rate of 0.1 and applied it across all attention matrices

G Instruction Tuning Details All experiments were conducted with parameter-efficient finetuning method LoRA (Hu et al · 2021

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods

cs.LG · 2026-04-19 · unverdicted · novelty 5.0

ADAPT is an online reweighting framework for LLM training that outperforms offline data selection and mixing methods in cross-benchmark generalization under equal compute.

citing papers explorer

Showing 1 of 1 citing paper.

Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods cs.LG · 2026-04-19 · unverdicted · none · ref 59
ADAPT is an online reweighting framework for LLM training that outperforms offline data selection and mixing methods in cross-benchmark generalization under equal compute.

For the LoRA adapter, we specified a rank of 128, an 𝛼 value of 512, and a dropout rate of 0.1 and applied it across all attention matrices

fields

years

verdicts

representative citing papers

citing papers explorer