Active multiple matrix completion with adaptive confidence sets

Alexandra Carpentier; Andrea Locatelli; Michal Valko

arxiv: 2605.02458 · v1 · submitted 2026-05-04 · 📊 stat.ML · cs.LG

Active multiple matrix completion with adaptive confidence sets

Andrea Locatelli , Alexandra Carpentier , Michal Valko This is my paper

Pith reviewed 2026-05-08 18:59 UTC · model grok-4.3

classification 📊 stat.ML cs.LG

keywords active learningmatrix completionmulti-task learningadaptive algorithmsminimax optimalityconfidence setsrank adaptation

0 comments

The pith

The MAlocate algorithm solves multiple matrix completion tasks simultaneously by adapting to each matrix's unknown rank.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper defines a multi-task active learning setting where the learner must complete several matrices at once and can choose which matrix to query for a uniformly random entry at each step. Matrices may differ in size and in rank, and the ranks are unknown in advance. The authors introduce the MAlocate algorithm that uses adaptive confidence sets to allocate queries efficiently across the matrices without prior rank knowledge. They prove a matching lower bound establishing minimax optimality of the approach and support the analysis with synthetic experiments. A sympathetic reader would care because the result gives a principled way to collect data for applications such as market segmentation across regions with different preference structures.

Core claim

The paper formulates the active multiple matrix completion problem and presents the MAlocate algorithm that adapts its query allocation to the unknown ranks of the different matrices. It analyzes the algorithm's performance and establishes a lower bound showing that the strategy is minimax-optimal for this setting.

What carries the argument

The MAlocate algorithm, which maintains adaptive confidence sets to decide which matrix to query next without knowing the ranks in advance.

If this is right

The algorithm achieves optimal query complexity without requiring advance knowledge of ranks.
It handles matrices of arbitrary sizes and differing ranks in a single unified procedure.
The matching lower bound shows that no algorithm can do substantially better in the worst case over rank configurations.
Synthetic experiments confirm that the adaptive strategy performs close to rank-aware baselines.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same adaptive-confidence-set idea could be tested in other multi-task active learning problems where task difficulty parameters are unknown.
In recommendation systems spanning multiple domains, this approach might reduce total data collection cost by automatically focusing on harder matrices.
Extending the analysis to noisy observations or non-uniform sampling distributions would be a natural next step left open by the current uniform-sampling assumption.

Load-bearing premise

Entries are observed uniformly at random from each chosen matrix and each matrix has a fixed but unknown low rank.

What would settle it

A concrete counter-example where MAlocate requires asymptotically more total queries than an oracle that knows all ranks in advance would falsify the minimax-optimality claim.

Figures

Figures reproduced from arXiv: 2605.02458 by Alexandra Carpentier, Andrea Locatelli, Michal Valko.

**Figure 1.** Figure 1: Results for the first experiment Second experiment We fix dk ≜ d ≜ 200 and K ≜ 15. The ranks rk are given by rk ≜ 18+ 0.0015k 4 . Note that the hardest instance is such that r15 = 76 and half of the sub-problems have rank at most 22. This set of problems is more varied than the previous one and shows the adaptivity of our strategy ( view at source ↗

**Figure 2.** Figure 2: Results for the second experiment run. Finally, as explained in Section 3, instead of splitting the entire sample, we use entries that have been observed once to train the estimator, and the other entries (sampled at least twice) to estimate the error. Across the experiments, we see that MALocate run with the proper loss parameter p indeed performs better on the associated loss L p . For the max loss, we … view at source ↗

read the original abstract

In this work, we formulate a new multi-task active learning setting in which the learner's goal is to solve multiple matrix completion problems simultaneously. At each round, the learner can choose from which matrix it receives a sample from an entry drawn uniformly at random. Our main practical motivation is market segmentation, where the matrices represent different regions with different preferences of the customers. The challenge in this setting is that each of the matrices can be of a different size and also of a different rank which is unknown. We provide and analyze a new algorithm, MAlocate that is able to adapt to the unknown ranks of the different matrices. We then give a lower-bound showing that our strategy is minimax-optimal and demonstrate its performance with synthetic experiments.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper defines a new multi-matrix active completion setting with an adaptive algorithm and matching lower bound.

read the letter

The main takeaway is a new setting for active completion of multiple matrices where the algorithm picks which matrix to query at each step and adapts to their unknown ranks, backed by a matching lower bound. MAlocate is the algorithm they analyze for this. It handles matrices of different sizes and ranks without prior knowledge of the ranks. The market segmentation example makes the motivation concrete, and they include synthetic experiments to back up the performance. This is genuinely new in combining the multi-matrix choice with rank adaptation and optimality. The lower bound is presented separately, which avoids circularity. The soft spots are minor but worth noting. The uniform random sampling and fixed low-rank assumptions are standard, yet they restrict how far the results extend to messier data. Without the full proofs, it's difficult to assess how the adaptive confidence sets are built and whether the bounds have reasonable constants. The experiments are synthetic, so real data tests would strengthen the practical claim. This work is aimed at researchers in active learning and matrix completion who want to handle several related but heterogeneous problems at once. Anyone thinking about adaptive data collection for low-rank structures would get something from the formulation and the algorithm. I recommend putting it through peer review. The new problem and the optimality result are substantive enough to merit referee attention.

Referee Report

2 major / 2 minor

Summary. The paper formulates a multi-task active learning setting for simultaneously solving multiple matrix completion problems. At each round the learner selects which matrix to sample from, receiving a uniformly random entry; the matrices may have different sizes and unknown fixed ranks. The authors introduce the MAlocate algorithm that adapts to these unknown ranks via adaptive confidence sets, analyze its sample complexity, provide a matching lower bound establishing minimax optimality, and report synthetic experiments motivated by market segmentation.

Significance. If the analysis and matching bounds hold, the result is significant: it extends active matrix completion to a heterogeneous multi-matrix regime while achieving rank-adaptivity and information-theoretic optimality. The combination of an adaptive algorithm with a non-trivial lower bound is a strength, and the setting has clear practical relevance for multi-region preference modeling or other multi-task low-rank problems.

major comments (2)

[§4] §4 (MAlocate analysis): the upper bound on the number of samples needed to achieve a given error appears to depend on the construction of the adaptive confidence sets; the paper should explicitly verify that these sets can be formed without knowledge of the rank or the singular-value gap, otherwise the claimed adaptation is not fully parameter-free.
[§5] §5 (lower bound): the minimax lower bound is stated to match the upper bound, but the precise rate (including any dependence on the number of matrices, their dimensions, and the target accuracy) is not compared side-by-side; without this explicit matching it is difficult to confirm that the lower bound is tight for the adaptive multi-matrix allocation problem.

minor comments (2)

[Experiments] The experimental section should report the precise error metric (e.g., Frobenius or entrywise) and the baselines used for comparison; currently only “synthetic experiments” are mentioned.
[§2] Notation for the per-matrix ranks r_i and dimensions m_i × n_i is introduced late; defining them in the problem statement would improve readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive evaluation and the recommendation of minor revision. The comments are helpful for strengthening the presentation of the rank-adaptivity and the tightness of the bounds. We address each major comment below.

read point-by-point responses

Referee: [§4] §4 (MAlocate analysis): the upper bound on the number of samples needed to achieve a given error appears to depend on the construction of the adaptive confidence sets; the paper should explicitly verify that these sets can be formed without knowledge of the rank or the singular-value gap, otherwise the claimed adaptation is not fully parameter-free.

Authors: We thank the referee for this observation. The adaptive confidence sets in the MAlocate analysis (Section 4) are constructed in a fully data-driven manner using only the collected entries: we maintain a collection of candidate singular-value thresholds that are refined successively via empirical estimates and a union bound over a logarithmic number of possible rank values (up to the matrix dimensions). No a priori knowledge of the true rank or the minimal singular-value gap is used; the sets are guaranteed to contain the true low-rank matrix with high probability by construction. To make this explicit, we will add a short clarifying remark and a supporting lemma in the revised Section 4 that verifies the parameter-free nature of the confidence-set construction. revision: yes
Referee: [§5] §5 (lower bound): the minimax lower bound is stated to match the upper bound, but the precise rate (including any dependence on the number of matrices, their dimensions, and the target accuracy) is not compared side-by-side; without this explicit matching it is difficult to confirm that the lower bound is tight for the adaptive multi-matrix allocation problem.

Authors: We agree that an explicit side-by-side comparison improves readability. The upper bound (Theorem 4.1) and matching lower bound (Theorem 5.1) both scale as the sum over matrices of (n_i + m_i) r_i / ε² times polylog factors in the dimensions and number of matrices M; the lower bound holds even when the algorithm must adapt to unknown ranks. In the revision we will insert a dedicated paragraph and a compact table in Section 5 that aligns the two bounds term-by-term, explicitly displaying the dependence on M, the per-matrix dimensions, the ranks, and the accuracy parameter ε. This will make the minimax optimality for the adaptive multi-matrix setting immediate. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper introduces MAlocate for simultaneous active matrix completion across multiple matrices of varying sizes and unknown ranks, with uniform random entry sampling. It analyzes the algorithm's adaptation to ranks and provides a separate lower bound establishing minimax optimality. These elements rely on standard low-rank assumptions and random sampling explicitly stated in the abstract; the upper-bound analysis and lower bound are presented as independent results without reduction to fitted parameters, self-definitions, or load-bearing self-citations. The multi-matrix allocation strategy follows directly from the problem formulation without circular renaming or ansatz smuggling.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no explicit free parameters, axioms, or invented entities are stated beyond standard low-rank matrix assumptions implicit in the matrix completion literature.

pith-pipeline@v0.9.0 · 5416 in / 1052 out tokens · 50504 ms · 2026-05-08T18:59:44.019904+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

Cost.FunctionalEquation (J-cost uniqueness) washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

the matrix square-root lasso estimator ... ρ(r,n,d) = O(rd log d / n)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

24 extracted references · 24 canonical work pages

[1]

Bubeck, S., Munos, R., and Stoltz, G. (2011). Pure exploration in finitely-armed and continuous-armed bandits . Theoretical Computer Science , 412(19):1832--1852

work page 2011
[2]

Cand \` e s, E. J. and Recht, B. (2009). Exact matrix completion via convex optimization . Foundations of Computational Mathematics , 9(6):717--772

work page 2009
[3]

Cand \` e s, E. J. and Tao, T. (2006). Near-optimal signal recovery from random projections: universal encoding strategies? IEEE Transactions on Information Theory , 52(12):5406--5425

work page 2006
[4]

Carpentier, A., Klopp, O., L \" o ffler, M., and Nickl, R. (2017). Adaptive confidence sets for matrix completion . Bernoulli

work page 2017
[5]

Castro, R. M. and Nowak, R. D. (2008). Minimax bounds for active learning . IEEE Transactions on Information Theory , 54(5):2339--2353

work page 2008
[6]

Chatterjee, S. (2015). Matrix estimation by universal singular value thresholding . Annals of Statistics , 43(1):177--214

work page 2015
[7]

Dhanjal, C., Gaudel, R., and Cl \'e men c on, S. (2014). Online matrix completion through nuclear norm regularisation. In Proceedings of the 2014 SIAM International Conference on Data Mining , pages 623--631. SIAM

work page 2014
[8]

Gabillon, V., Ghavamzadeh, M., Lazaric, A., and Bubeck, S. (2011). Multi-bandit best arm identification. In Neural Information Processing Systems (NeurIPS) , pages 2222--2230

work page 2011
[9]

and Lecu \' e , G

Ga \" i ffas, S. and Lecu \' e , G. (2011). Sharp oracle inequalities for high-dimensional matrix prediction . IEEE Transactions on Information Theory , 57(10):6942--6957

work page 2011
[10]

Gilbert, E. N. (1952). A comparison of signalling alphabets . Bell System Technical Journal , 31(3):504--522

work page 1952
[11]

M., and Netrapalli, P

Jin, C., Kakade, S. M., and Netrapalli, P. (2016). Provable efficient online matrix completion via non-convex stochastic gradient descent. In Neural Information Processing Systems (NeurIPS) , pages 4520--4528

work page 2016
[12]

Katariya, S., Kveton, B., Szepesv \' a ri, C., Vernade, C., and Wen, Z. (2017a). Bernoulli rank-1 bandits for click feedback . In International Joint Conference on Artificial Intelligence (IJCAI)

work page
[13]

Katariya, S., Kveton, B., Szepesv \' a ri, C., Vernade, C., and Wen, Z. (2017b). Stochastic rank-1 bandits . In International Conference on Artificial Intelligence and Statistics (AISTATS)

work page
[14]

Klopp, O. (2014). Noisy low-rank matrix completion with general sampling distribution . Bernoulli

work page 2014
[15]

Klopp, O. (2015). Matrix completion by singular value thresholding: Sharp bounds . Electronic Journal of Statistics , 9(2):2348--2369

work page 2015
[16]

Koltchinskii, V., Lounici, K., and Tsybakov, A. B. (2011). Nuclear-norm penalization and optimal rates for noisy low-rank matrix completion . The Annals of Statistics , 39(5):2302--2329

work page 2011
[17]

and Vaswani, N

Lois, B. and Vaswani, N. (2015). Online Matrix Completion and Online Robust PCA . In IEEE International Symposium on Information Theory (ISIT)

work page 2015
[18]

Mazumder, R., Hastie, T., and Tibshirani, R. (2010). Spectral regularization algorithms for learning large incomplete matrices. Journal of Machine Learning Research , 11(Aug):2287--2322

work page 2010
[19]

and Wainwright, M

Negahban, S. and Wainwright, M. J. (2012). Restricted strong convexity and weighted matrix completion: Optimal bounds with noise . Journal of Machine Learning Research , 13:1665--1697

work page 2012
[20]

Riquelme, C., Ghavamzadeh, M., and Lazaric, A. (2017). Active learning for accurate estimation of linear models . In International Conference on Machine Learning (ICML)

work page 2017
[21]

and Tsybakov, A

Rohde, A. and Tsybakov, A. B. (2011). Estimation of high-dimensional low-rank matrices . Annals of Statistics , 39(2):887--930

work page 2011
[22]

Tsybakov, A. B. (2009). Introduction to Nonparametric Estimation . Springer Series in Statistics. Springer New York, New York, NY

work page 2009
[23]

Varshamov, R. R. (1957). Estimate of the number of signals in error correcting codes . Doklady Akademii Nauk SSSR , 117:739--741

work page 1957
[24]

and Kamakura, W

Wedel, M. and Kamakura, W. A. (2000). Market segmentation : Conceptual and methodological foundations . Springer US

work page 2000

[1] [1]

Bubeck, S., Munos, R., and Stoltz, G. (2011). Pure exploration in finitely-armed and continuous-armed bandits . Theoretical Computer Science , 412(19):1832--1852

work page 2011

[2] [2]

Cand \` e s, E. J. and Recht, B. (2009). Exact matrix completion via convex optimization . Foundations of Computational Mathematics , 9(6):717--772

work page 2009

[3] [3]

Cand \` e s, E. J. and Tao, T. (2006). Near-optimal signal recovery from random projections: universal encoding strategies? IEEE Transactions on Information Theory , 52(12):5406--5425

work page 2006

[4] [4]

Carpentier, A., Klopp, O., L \" o ffler, M., and Nickl, R. (2017). Adaptive confidence sets for matrix completion . Bernoulli

work page 2017

[5] [5]

Castro, R. M. and Nowak, R. D. (2008). Minimax bounds for active learning . IEEE Transactions on Information Theory , 54(5):2339--2353

work page 2008

[6] [6]

Chatterjee, S. (2015). Matrix estimation by universal singular value thresholding . Annals of Statistics , 43(1):177--214

work page 2015

[7] [7]

Dhanjal, C., Gaudel, R., and Cl \'e men c on, S. (2014). Online matrix completion through nuclear norm regularisation. In Proceedings of the 2014 SIAM International Conference on Data Mining , pages 623--631. SIAM

work page 2014

[8] [8]

Gabillon, V., Ghavamzadeh, M., Lazaric, A., and Bubeck, S. (2011). Multi-bandit best arm identification. In Neural Information Processing Systems (NeurIPS) , pages 2222--2230

work page 2011

[9] [9]

and Lecu \' e , G

Ga \" i ffas, S. and Lecu \' e , G. (2011). Sharp oracle inequalities for high-dimensional matrix prediction . IEEE Transactions on Information Theory , 57(10):6942--6957

work page 2011

[10] [10]

Gilbert, E. N. (1952). A comparison of signalling alphabets . Bell System Technical Journal , 31(3):504--522

work page 1952

[11] [11]

M., and Netrapalli, P

Jin, C., Kakade, S. M., and Netrapalli, P. (2016). Provable efficient online matrix completion via non-convex stochastic gradient descent. In Neural Information Processing Systems (NeurIPS) , pages 4520--4528

work page 2016

[12] [12]

Katariya, S., Kveton, B., Szepesv \' a ri, C., Vernade, C., and Wen, Z. (2017a). Bernoulli rank-1 bandits for click feedback . In International Joint Conference on Artificial Intelligence (IJCAI)

work page

[13] [13]

Katariya, S., Kveton, B., Szepesv \' a ri, C., Vernade, C., and Wen, Z. (2017b). Stochastic rank-1 bandits . In International Conference on Artificial Intelligence and Statistics (AISTATS)

work page

[14] [14]

Klopp, O. (2014). Noisy low-rank matrix completion with general sampling distribution . Bernoulli

work page 2014

[15] [15]

Klopp, O. (2015). Matrix completion by singular value thresholding: Sharp bounds . Electronic Journal of Statistics , 9(2):2348--2369

work page 2015

[16] [16]

Koltchinskii, V., Lounici, K., and Tsybakov, A. B. (2011). Nuclear-norm penalization and optimal rates for noisy low-rank matrix completion . The Annals of Statistics , 39(5):2302--2329

work page 2011

[17] [17]

and Vaswani, N

Lois, B. and Vaswani, N. (2015). Online Matrix Completion and Online Robust PCA . In IEEE International Symposium on Information Theory (ISIT)

work page 2015

[18] [18]

Mazumder, R., Hastie, T., and Tibshirani, R. (2010). Spectral regularization algorithms for learning large incomplete matrices. Journal of Machine Learning Research , 11(Aug):2287--2322

work page 2010

[19] [19]

and Wainwright, M

Negahban, S. and Wainwright, M. J. (2012). Restricted strong convexity and weighted matrix completion: Optimal bounds with noise . Journal of Machine Learning Research , 13:1665--1697

work page 2012

[20] [20]

Riquelme, C., Ghavamzadeh, M., and Lazaric, A. (2017). Active learning for accurate estimation of linear models . In International Conference on Machine Learning (ICML)

work page 2017

[21] [21]

and Tsybakov, A

Rohde, A. and Tsybakov, A. B. (2011). Estimation of high-dimensional low-rank matrices . Annals of Statistics , 39(2):887--930

work page 2011

[22] [22]

Tsybakov, A. B. (2009). Introduction to Nonparametric Estimation . Springer Series in Statistics. Springer New York, New York, NY

work page 2009

[23] [23]

Varshamov, R. R. (1957). Estimate of the number of signals in error correcting codes . Doklady Akademii Nauk SSSR , 117:739--741

work page 1957

[24] [24]

and Kamakura, W

Wedel, M. and Kamakura, W. A. (2000). Market segmentation : Conceptual and methodological foundations . Springer US

work page 2000