Triviaqa: A large scale distantly supervised challenge dataset for reading comprehension

Mandar Joshi, Eunsol Choi, Daniel S Weld, Luke Zettlemoyer · 2017

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Enhancing Trust in Large Language Models via Uncertainty-Calibrated Fine-Tuning

cs.CL · 2024-12-03 · unverdicted · novelty 5.0

Uncertainty-aware fine-tuning with a decision-theory-based loss produces better-calibrated uncertainty estimates than standard training on free-form QA tasks.

Junk DNA Hypothesis: Pruning Small Pre-Trained Weights Irreversibly and Monotonically Impairs "Difficult" Downstream Tasks in LLMs

cs.LG · 2023-09-29 · unverdicted · novelty 5.0

Pruning small-magnitude weights from pre-trained LLMs causes monotonic irreversible performance degradation on difficult downstream tasks, supporting the Junk DNA Hypothesis that these weights hold essential knowledge.

citing papers explorer

Showing 2 of 2 citing papers.

Enhancing Trust in Large Language Models via Uncertainty-Calibrated Fine-Tuning cs.CL · 2024-12-03 · unverdicted · none · ref 24
Uncertainty-aware fine-tuning with a decision-theory-based loss produces better-calibrated uncertainty estimates than standard training on free-form QA tasks.
Junk DNA Hypothesis: Pruning Small Pre-Trained Weights Irreversibly and Monotonically Impairs "Difficult" Downstream Tasks in LLMs cs.LG · 2023-09-29 · unverdicted · none · ref 24
Pruning small-magnitude weights from pre-trained LLMs causes monotonic irreversible performance degradation on difficult downstream tasks, supporting the Junk DNA Hypothesis that these weights hold essential knowledge.

Triviaqa: A large scale distantly supervised challenge dataset for reading comprehension

fields

years

verdicts

representative citing papers

citing papers explorer