K-means++: The advantages of careful seeding
8 Pith papers cite this work.
representative citing papers
A survey that taxonomizes data mixing strategies for LLM pretraining into static rule-based, learning-based, and dynamic adaptive families while highlighting transferability challenges and evaluation gaps.
citing papers explorer
- Polarizable Embedding QM/MM for Periodic Systems
  A new polarizable QM/MM method for periodic systems uses SCME for water with multipoles up to hexadecapole and anisotropic polarizabilities, achieving full QM accuracy via careful near/far-field expansions and damping.
- CRUMB: Efficient Prior Fitted Network Inference via Distributionally Matched Context Batching
  CRUMB speeds up PFN inference on large tabular datasets by clustering queries and selecting MMD-matched context subsets, outperforming prior selection methods on the 51-dataset TabArena benchmark across three architectures while handling covariate drift.
- EpiCache: Episodic KV Cache Management for Long-Term Conversation on Resource-Constrained Environments
  EpiCache clusters long conversation history into coherent episodes for per-episode KV cache eviction, delivering up to 30% accuracy gains and 3.7x peak memory reduction on LongConvQA tasks under fixed budgets.
- Projecting dynamical systems via a support bound
  New bound on Newton polytope support for minimal DEs in polynomial systems enables evaluation-interpolation projection algorithm outperforming prior software.
- Benchmarking on Tasks That Matter: Dataset Selection for Preserving Model Rankings
  Framework for dataset subset selection via clustering, A/D-optimality, and FAFI with bootstrap intervals to preserve model rankings, showing high Spearman correlation (0.95 with 5 datasets) in TSC but limited gains in recommender systems.