Cat.” indicates language-heavy (lang) or OCR/document (ocr) sources. “%>1024

Biao Zhang, Paul Suganthan, Ga¨el Liu, Ilya Philippov, Sahil Dua, Ben Hora, Kat Black, Gus Martins, Omar Sanseviero, Shreya Pathak, et al · 2025 · arXiv 2512.14856

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

background 2 method 1

citation-polarity summary

background 2 use method 1

representative citing papers

BoostLLM: Boosting-inspired LLM Fine-tuning for Few-shot Tabular Classification

cs.LG · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

BoostLLM trains sequential PEFT adapters in a boosting framework with tree path inputs to improve LLM performance on few-shot tabular classification, matching or exceeding XGBoost.

Retrieval from Within: An Intrinsic Capability of Attention-Based Models

cs.LG · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

Attention-based models can retrieve evidence intrinsically by using decoder attention to score and reuse their own pre-encoded chunks, outperforming separate retrieval pipelines on QA benchmarks.

LinguDistill: Recovering Linguistic Ability in Vision-Language Models via Selective Cross-Modal Distillation

cs.CV · 2026-04-01 · unverdicted · novelty 6.0

LinguDistill recovers approximately 10% of lost performance on language benchmarks in VLMs by selectively distilling from a frozen LM teacher using KV-cache sharing, while preserving vision performance.

Motif-Video 2B: Technical Report

cs.CV · 2026-04-14 · unverdicted · novelty 4.0 · 2 refs

Motif-Video 2B reaches 83.76% on VBench, outperforming a 14B-parameter model with 7x fewer parameters and far less training data through shared cross-attention and a three-part backbone.

LLMs and Speech: Integration vs. Combination

eess.AS · 2026-03-16 · unverdicted · novelty 4.0

Tight integration of acoustic models with LLMs for ASR is ablated against shallow fusion across label units, fine-tuning strategies, LLM sizes, and joint CTC decoding to mitigate hallucinations.

citing papers explorer

Showing 5 of 5 citing papers.

BoostLLM: Boosting-inspired LLM Fine-tuning for Few-shot Tabular Classification cs.LG · 2026-05-07 · unverdicted · none · ref 53 · 2 links
BoostLLM trains sequential PEFT adapters in a boosting framework with tree path inputs to improve LLM performance on few-shot tabular classification, matching or exceeding XGBoost.
Retrieval from Within: An Intrinsic Capability of Attention-Based Models cs.LG · 2026-05-07 · unverdicted · none · ref 41 · 2 links
Attention-based models can retrieve evidence intrinsically by using decoder attention to score and reuse their own pre-encoded chunks, outperforming separate retrieval pipelines on QA benchmarks.
LinguDistill: Recovering Linguistic Ability in Vision-Language Models via Selective Cross-Modal Distillation cs.CV · 2026-04-01 · unverdicted · none · ref 4
LinguDistill recovers approximately 10% of lost performance on language benchmarks in VLMs by selectively distilling from a frozen LM teacher using KV-cache sharing, while preserving vision performance.
Motif-Video 2B: Technical Report cs.CV · 2026-04-14 · unverdicted · none · ref 51 · 2 links
Motif-Video 2B reaches 83.76% on VBench, outperforming a 14B-parameter model with 7x fewer parameters and far less training data through shared cross-attention and a three-part backbone.
LLMs and Speech: Integration vs. Combination eess.AS · 2026-03-16 · unverdicted · none · ref 39
Tight integration of acoustic models with LLMs for ASR is ablated against shallow fusion across label units, fine-tuning strategies, LLM sizes, and joint CTC decoding to mitigate hallucinations.

Cat.” indicates language-heavy (lang) or OCR/document (ocr) sources. “%>1024

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer