K-means++: The advantages of careful seeding

David Arthur, Sergei Vassilvitskii · 2007 · arXiv 3383.128349

8 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 1 · method: 1

citation-polarity summary

background: 1 · use method: 1

representative citing papers

Polarizable Embedding QM/MM for Periodic Systems

physics.chem-ph · 2026-05-10 · unverdicted · novelty 7.0

A new polarizable QM/MM method for periodic systems uses SCME for water with multipoles up to hexadecapole and anisotropic polarizabilities, achieving full QM accuracy via careful near/far-field expansions and damping.

CRUMB: Efficient Prior Fitted Network Inference via Distributionally Matched Context Batching

cs.LG · 2026-06-09 · unverdicted · novelty 6.0

CRUMB speeds up PFN inference on large tabular datasets by clustering queries and selecting MMD-matched context subsets, outperforming prior selection methods on the 51-dataset TabArena benchmark across three architectures while handling covariate drift.

EpiCache: Episodic KV Cache Management for Long-Term Conversation on Resource-Constrained Environments

cs.CL · 2025-09-22 · unverdicted · novelty 6.0

EpiCache clusters long conversation history into coherent episodes for per-episode KV cache eviction, delivering up to 30% accuracy gains and 3.7x peak memory reduction on LongConvQA tasks under fixed budgets.

Projecting dynamical systems via a support bound

cs.SC · 2025-01-23 · unverdicted · novelty 6.0

A new bound on the Newton polytope support of minimal differential equations in polynomial systems enables an evaluation-interpolation projection algorithm that outperforms prior software.

Benchmarking on Tasks That Matter: Dataset Selection for Preserving Model Rankings

cs.LG · 2026-06-26 · unverdicted · novelty 4.0

A framework for dataset subset selection via clustering, A/D-optimality, and FAFI with bootstrap intervals that aims to preserve model rankings. It shows high Spearman correlation (0.95 with 5 datasets) in TSC but limited gains in recommender systems.

Data Mixing for Large Language Models Pretraining: A Survey and Outlook

cs.CL · 2026-03-25 · accept · novelty 4.0

A survey that taxonomizes data mixing strategies for LLM pretraining into static rule-based, learning-based, and dynamic adaptive families while highlighting transferability challenges and evaluation gaps.

Much of Geospatial Web Search Is Beyond Traditional GIS

cs.IR · 2026-05-11

Computing k-means in mixed precision

math.NA · 2024-07-16

citing papers explorer

Polarizable Embedding QM/MM for Periodic Systems

physics.chem-ph · 2026-05-10 · unverdicted · none · ref 69

CRUMB: Efficient Prior Fitted Network Inference via Distributionally Matched Context Batching

cs.LG · 2026-06-09 · unverdicted · none · ref 28

EpiCache: Episodic KV Cache Management for Long-Term Conversation on Resource-Constrained Environments

cs.CL · 2025-09-22 · unverdicted · none · ref 47

Projecting dynamical systems via a support bound

cs.SC · 2025-01-23 · unverdicted · none · ref 5

Benchmarking on Tasks That Matter: Dataset Selection for Preserving Model Rankings

cs.LG · 2026-06-26 · unverdicted · none · ref 1
