Title resolution pending

doi: 10 · 2022 · DOI 10.1145/3735633

15 Pith papers cite this work. Polarity classification is still indexing.

15 Pith papers citing it

open at publisher browse 15 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

When the Same Musical Knowledge Forgets Differently: A Clean Probe of Pathway-Dependent Forgetting

cs.SD · 2026-06-13 · unverdicted · novelty 6.0

Acquisition route affects forgetting rates in multimodal models, with text-pathway knowledge forgetting faster than audio-pathway knowledge in music understanding tasks.

Dynamic Proxy-Mixing: Transferring Replay Controllers from Small to Large Models for Continual Instruction Tuning

cs.LG · 2026-05-29 · unverdicted · novelty 6.0

PROXYMIX learns a dynamic replay controller on a small proxy model and transfers it to a large target model, improving accuracy by 3.4 points and reducing forgetting by 3.5 points on LLaMA-3-8B continual tuning sequences.

Dissociative Identity: Language Model Agents Lack Grounding for Reputation Mechanisms

cs.CY · 2026-05-28 · unverdicted · novelty 6.0

LM agents' changeable modules prevent persistent identity and sanction sensitivity, making reputation mechanisms structurally inapplicable and requiring protocol-based behavioral harnesses instead.

Make LLM Learn to Synthesize from Streaming Experiences through Feedback

cs.AI · 2026-05-28 · unverdicted · novelty 6.0

SynLearner lets LLMs improve synthetic data generation on later tasks in a stream by learning reusable patterns and balancing quality with diversity from feedback on earlier tasks.

Always Learning, Always Mixing: Efficient and Simple Data Mixing All The Time

cs.CL · 2026-05-13 · conditional · novelty 6.0

OP-Mix is an on-policy data mixing method that uses low-rank adapter interpolation to find near-optimal data mixtures throughout language model training with reduced compute.

Low-Rank Adapters Initialization via Gradient Surgery for Continual Learning

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

SLICE applies gradient surgery via projection and truncated SVD to initialize LoRA adapters, yielding better stability-plasticity trade-offs on continual learning benchmarks including adversarial task sequences.

HypEHR: Hyperbolic Modeling of Electronic Health Records for Efficient Question Answering

cs.AI · 2026-04-22 · unverdicted · novelty 6.0

HypEHR is a hyperbolic embedding model for EHR data that uses Lorentzian geometry and hierarchy-aware pretraining to answer clinical questions nearly as well as large language models but with much smaller size.

On the Shelf Life of Fine-Tuned LLM-Judges: Future-Proofing, Backward-Compatibility, and Question Generalization

cs.CL · 2025-09-28 · unverdicted · novelty 6.0

Fine-tuned LLM judges struggle with future-proofing to newer generators but maintain backward-compatibility more easily; DPO training and continual learning improve adaptation while all models degrade on unseen questions.

CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations

cs.CL · 2026-05-25 · unverdicted · novelty 5.0

CroCo applies English-reward-ranked self-generations for contrastive preference tuning that improves two LLMs on structured and open-ended tasks across 14 languages without language-specific annotations.

RocketSmith: Agentic Additive Manufacturing of High-Powered Rockets

cs.RO · 2026-05-25 · unverdicted · novelty 5.0

RocketSmith is an LLM-based agentic system that designs four high-powered rockets via additive manufacturing, with two achieving stable launches and recovery after reaching 80% of simulated apogee.

Boosting Automatic Java-to-Cangjie Translation with Multi-Stage LLM Training and Error Repair

cs.SE · 2026-05-08 · unverdicted · novelty 5.0

Multi-stage LLM training plus compiler-guided error repair boosts functional equivalence in Java-to-Cangjie translation by 6.06% over prior methods despite scarce parallel data.

Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics

cs.LG · 2026-05-06 · unverdicted · novelty 5.0

Memini is introduced as a graph-based external memory using multi-timescale edge dynamics to enable emergent episodic sensitivity, consolidation, and selective forgetting in LLM systems.

Federated continual learning: A comprehensive survey on lifelong and privacy-preserving learning over distributed and non-stationary data

cs.LG · 2026-06-09 · unverdicted · novelty 3.0

This survey defines the Federated Continual Learning problem, proposes a taxonomy for approaches, reviews applications and metrics, and identifies open challenges in lifelong privacy-preserving learning on non-stationary distributed data.

Position: Anthropomorphic Misalignment Research Needs Stronger Evidence

cs.CY · 2026-05-29 · unverdicted · novelty 3.0

Position paper calling for stronger evidentiary standards and a diagnostic checklist in anthropomorphic misalignment research.

Improve Large Language Model Systems with User Logs

cs.CL · 2026-02-06

citing papers explorer

Showing 15 of 15 citing papers.

When the Same Musical Knowledge Forgets Differently: A Clean Probe of Pathway-Dependent Forgetting cs.SD · 2026-06-13 · unverdicted · none · ref 27
Acquisition route affects forgetting rates in multimodal models, with text-pathway knowledge forgetting faster than audio-pathway knowledge in music understanding tasks.
Dynamic Proxy-Mixing: Transferring Replay Controllers from Small to Large Models for Continual Instruction Tuning cs.LG · 2026-05-29 · unverdicted · none · ref 40
PROXYMIX learns a dynamic replay controller on a small proxy model and transfers it to a large target model, improving accuracy by 3.4 points and reducing forgetting by 3.5 points on LLaMA-3-8B continual tuning sequences.
Dissociative Identity: Language Model Agents Lack Grounding for Reputation Mechanisms cs.CY · 2026-05-28 · unverdicted · none · ref 121
LM agents' changeable modules prevent persistent identity and sanction sensitivity, making reputation mechanisms structurally inapplicable and requiring protocol-based behavioral harnesses instead.
Make LLM Learn to Synthesize from Streaming Experiences through Feedback cs.AI · 2026-05-28 · unverdicted · none · ref 20
SynLearner lets LLMs improve synthetic data generation on later tasks in a stream by learning reusable patterns and balancing quality with diversity from feedback on earlier tasks.
Always Learning, Always Mixing: Efficient and Simple Data Mixing All The Time cs.CL · 2026-05-13 · conditional · none · ref 41
OP-Mix is an on-policy data mixing method that uses low-rank adapter interpolation to find near-optimal data mixtures throughout language model training with reduced compute.
Low-Rank Adapters Initialization via Gradient Surgery for Continual Learning cs.LG · 2026-05-12 · unverdicted · none · ref 10
SLICE applies gradient surgery via projection and truncated SVD to initialize LoRA adapters, yielding better stability-plasticity trade-offs on continual learning benchmarks including adversarial task sequences.
HypEHR: Hyperbolic Modeling of Electronic Health Records for Efficient Question Answering cs.AI · 2026-04-22 · unverdicted · none · ref 137
HypEHR is a hyperbolic embedding model for EHR data that uses Lorentzian geometry and hierarchy-aware pretraining to answer clinical questions nearly as well as large language models but with much smaller size.
On the Shelf Life of Fine-Tuned LLM-Judges: Future-Proofing, Backward-Compatibility, and Question Generalization cs.CL · 2025-09-28 · unverdicted · none · ref 59
Fine-tuned LLM judges struggle with future-proofing to newer generators but maintain backward-compatibility more easily; DPO training and continual learning improve adaptation while all models degrade on unseen questions.
CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations cs.CL · 2026-05-25 · unverdicted · none · ref 47
CroCo applies English-reward-ranked self-generations for contrastive preference tuning that improves two LLMs on structured and open-ended tasks across 14 languages without language-specific annotations.
RocketSmith: Agentic Additive Manufacturing of High-Powered Rockets cs.RO · 2026-05-25 · unverdicted · none · ref 7
RocketSmith is an LLM-based agentic system that designs four high-powered rockets via additive manufacturing, with two achieving stable launches and recovery after reaching 80% of simulated apogee.
Boosting Automatic Java-to-Cangjie Translation with Multi-Stage LLM Training and Error Repair cs.SE · 2026-05-08 · unverdicted · none · ref 26
Multi-stage LLM training plus compiler-guided error repair boosts functional equivalence in Java-to-Cangjie translation by 6.06% over prior methods despite scarce parallel data.
Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics cs.LG · 2026-05-06 · unverdicted · none · ref 3
Memini is introduced as a graph-based external memory using multi-timescale edge dynamics to enable emergent episodic sensitivity, consolidation, and selective forgetting in LLM systems.
Federated continual learning: A comprehensive survey on lifelong and privacy-preserving learning over distributed and non-stationary data cs.LG · 2026-06-09 · unverdicted · none · ref 29
This survey defines the Federated Continual Learning problem, proposes a taxonomy for approaches, reviews applications and metrics, and identifies open challenges in lifelong privacy-preserving learning on non-stationary distributed data.
Position: Anthropomorphic Misalignment Research Needs Stronger Evidence cs.CY · 2026-05-29 · unverdicted · none · ref 113
Position paper calling for stronger evidentiary standards and a diagnostic checklist in anthropomorphic misalignment research.
Improve Large Language Model Systems with User Logs cs.CL · 2026-02-06 · unreviewed · ref 31

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer