Routine Modeling with Time Series Metric Learning
Pith reviewed 2026-05-25 01:05 UTC · model grok-4.3
The pith
A sequence-to-sequence model learns distances between inertial time series so that clustering recovers daily routines.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Training a sequence-to-sequence model on inertial time series produces a distance function whose values, when supplied to an unsupervised clustering algorithm, allow the algorithm to recover the recurrent activity patterns that constitute daily routines.
What carries the argument
The SS2S architecture, a sequence-to-sequence model trained to embed inertial time series so that Euclidean distance in the embedding space reflects routine similarity.
If this is right
- Daily routines become discoverable without any predefined activity labels or supervised training data.
- Routine modeling works from inertial sensors alone, keeping the system non-intrusive.
- The same learned distance can be reused by any clustering or nearest-neighbor method that operates on time series.
- The approach applies to any domain where recurrent patterns appear in unlabeled time series.
Where Pith is reading between the lines
- Wearable devices could run this pipeline continuously to surface changes in a user's routines over weeks or months.
- The learned embedding might serve as a drop-in feature representation for other unsupervised tasks such as anomaly detection in behavior.
- Extending the training to include multiple users could produce a shared routine space that still respects individual privacy.
Load-bearing premise
The distance learned by the SS2S model on inertial time series alone captures the structure of recurrent routines sufficiently well for unsupervised clustering to recover them without additional labels or context.
What would settle it
Running the clustering algorithm on the learned distance and finding that the resulting groups do not align with the actual daily routines present in the recorded time series.
Figures
read the original abstract
Traditionally, the automatic recognition of human activities is performed with supervised learning algorithms on limited sets of specific activities. This work proposes to recognize recurrent activity patterns, called routines, instead of precisely defined activities. The modeling of routines is defined as a metric learning problem, and an architecture, called SS2S, based on sequence-to-sequence models is proposed to learn a distance between time series. This approach only relies on inertial data and is thus non intrusive and preserves privacy. Experimental results show that a clustering algorithm provided with the learned distance is able to recover daily routines.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper frames routine modeling as a metric learning task on inertial time series rather than supervised classification of discrete activities. It introduces the SS2S sequence-to-sequence architecture to learn a distance between time series and reports that feeding this learned distance to a standard clustering algorithm recovers daily routines. The method is positioned as non-intrusive and privacy-preserving because it uses only inertial sensor data.
Significance. If the experimental claim holds, the work would demonstrate that an unsupervised metric-learning pipeline on raw inertial streams can surface recurrent behavioral structure without activity labels, offering a scalable alternative to supervised activity recognition while mitigating privacy risks associated with labeled datasets.
major comments (2)
- [Experimental results (as referenced in the abstract)] The central experimental claim (that clustering with the SS2S distance recovers daily routines) is load-bearing, yet the manuscript provides no description of the datasets used, the number of subjects or days recorded, the choice of clustering algorithm and its hyperparameters, the evaluation metrics, or any baseline comparisons. Without these elements it is impossible to determine whether the reported recovery is attributable to the learned metric or to dataset artifacts.
- [Method and Experiments] The weakest assumption—that the embedding produced by SS2S on inertial series alone encodes routine structure strongly enough for off-the-shelf clustering to recover it without additional supervision or context—is not tested by any ablation that removes the metric-learning component or substitutes a standard distance (e.g., DTW or Euclidean). Such a control is required to establish that the SS2S distance, rather than generic time-series similarity, is responsible for the observed clustering performance.
minor comments (2)
- [SS2S architecture] Notation for the SS2S encoder-decoder and the learned distance function should be introduced with explicit equations rather than prose descriptions.
- [Data processing] The abstract states that the approach 'only relies on inertial data' but does not clarify whether any preprocessing (filtering, windowing, or normalization) is applied before the sequence-to-sequence model; this detail belongs in the methods section.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. The comments highlight important gaps in the experimental reporting and validation. We address each point below and commit to revisions that provide the requested details and controls without altering the core claims.
read point-by-point responses
-
Referee: [Experimental results (as referenced in the abstract)] The central experimental claim (that clustering with the SS2S distance recovers daily routines) is load-bearing, yet the manuscript provides no description of the datasets used, the number of subjects or days recorded, the choice of clustering algorithm and its hyperparameters, the evaluation metrics, or any baseline comparisons. Without these elements it is impossible to determine whether the reported recovery is attributable to the learned metric or to dataset artifacts.
Authors: We agree that the experimental description is insufficient as presented. The manuscript text focuses on the SS2S architecture and high-level results but omits the concrete dataset statistics, subject count, recording length, clustering implementation details, metrics, and baselines. In the revised version we will expand the Experiments section with a full dataset description (including number of subjects and days), specify the clustering algorithm and hyperparameters, define the evaluation metrics for routine recovery, and add baseline comparisons. This will make it possible to assess whether the observed structure arises from the learned metric. revision: yes
-
Referee: [Method and Experiments] The weakest assumption—that the embedding produced by SS2S on inertial series alone encodes routine structure strongly enough for off-the-shelf clustering to recover it without additional supervision or context—is not tested by any ablation that removes the metric-learning component or substitutes a standard distance (e.g., DTW or Euclidean). Such a control is required to establish that the SS2S distance, rather than generic time-series similarity, is responsible for the observed clustering performance.
Authors: We concur that an ablation isolating the contribution of the learned metric is necessary. The current manuscript does not report comparisons against standard distances. We will add these controls in the revision by re-running the clustering pipeline with Euclidean distance and DTW on the same inertial data and reporting the resulting routine recovery performance, thereby demonstrating whether the SS2S distance provides a measurable advantage over generic time-series measures. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper frames routine modeling explicitly as a metric-learning task solved by the proposed SS2S sequence-to-sequence architecture on inertial time series; the learned distance is then supplied to an off-the-shelf clustering algorithm whose ability to recover daily routines is assessed experimentally. No equation or claim reduces a derived quantity to its own fitted parameters by construction, no load-bearing premise rests on a self-citation chain, and the central empirical result is not a renaming or re-derivation of the input metric itself. The derivation chain therefore remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
modeling of routines is defined as a metric learning problem... SS2S... learn a distance between time series... almost-periodic functions... d(fS(t),fS(t+T))≤ε
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
CRL(S,Ŝ)=l−∑cos(S(t),Ŝ(t))... contrastive loss... KISSME... Mahalanobis
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Autowarp: Learning a Warping Distance from Unlabeled Time Series Using Sequence Autoencoders
Abid, A., Zou, J.: Autowarp: Learning a warping distance from unlabeled time series using sequence autoencoders. arXiv preprint arXiv:1810.10107 (2018)
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[2]
Information Systems 53, 16–38 (2015)
Aghabozorgi, S., Shirkhorshidi, A.S., Wah, T.Y.: Time-series clustering–a decade review. Information Systems 53, 16–38 (2015)
work page 2015
- [3]
-
[4]
Neurocomputing 273, 47–56 (2018)
Berlemont, S., Lefebvre, G., Duffner, S., Garcia, C.: Class-balanced siamese neural networks. Neurocomputing 273, 47–56 (2018)
work page 2018
- [5]
-
[6]
Acta Mathematica 46(1-2), 101–214 (1925)
Bohr, H.: Zur theorie der fastperiodischen funktionen. Acta Mathematica 46(1-2), 101–214 (1925)
work page 1925
- [7]
-
[8]
Chatzaki, C., Pediaditis, M., Vavoulas, G., Tsiknakis, M.: Human daily activity and fall recognition using a smartphone’s acceleration sensor. In: ICT4AWE. pp. 100–118. Springer (2016)
work page 2016
-
[9]
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Cho, K., Van Merri¨ enboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y.: Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014)
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[10]
Cumin, J., Lefebvre, G., Ramparany, F., Crowley, J.L.: Human activity recognition using place-based decision fusion in smart homes. In: CONTEXT. pp. 137–150. Springer (2017)
work page 2017
-
[11]
Esling, P., Agon, C.: Time-series data mining. ACM CSUR 45(1), 12 (2012)
work page 2012
-
[12]
Faraki, M., Harandi, M.T., Porikli, F.: Large-scale metric learning: A voyage from shallow to deep. IEEE TNNLS 29(9), 4339–4346 (2018) 14 Paul Compagnon, Gr´ egoire Lefebvre, Stefan Duffner and Christophe Garcia
work page 2018
- [13]
-
[14]
Gonzalez, M.C., Hidalgo, C.A., Barabasi, A.L.: Understanding individual human mobility patterns. nature 453(7196), 779 (2008)
work page 2008
- [15]
-
[16]
Neural computation 9(8), 1735–1780 (1997)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural computation 9(8), 1735–1780 (1997)
work page 1997
-
[17]
Jaeger, H.: The ”echo state” approach to analysing and training recurrent neu- ral networks-with an erratum note. Bonn, Germany: German National Research Center for Information Technology GMD Technical Report 148(34), 13 (2001)
work page 2001
-
[18]
Kalpakis, K., Gada, D., Puttagunta, V.: Distance measures for effective clustering of arima time-series. In: IEEE ICDM. pp. 273–280. IEEE (2001)
work page 2001
- [19]
-
[20]
European journal of social psychology 40(6), 998–1009 (2010)
Lally, P., Van Jaarsveld, C.H., Potts, H.W., Wardle, J.: How are habits formed: Modelling habit formation in the real world. European journal of social psychology 40(6), 998–1009 (2010)
work page 2010
-
[21]
Predicting structured data 1(0) (2006)
LeCun, Y., Chopra, S., Hadsell, R., Ranzato, M., Huang, F.: A tutorial on energy- based learning. Predicting structured data 1(0) (2006)
work page 2006
- [22]
-
[23]
Martin, R.J.: A metric for ARMA processes. IEEE Trans. Signal Process. 48(4), 1164–1170 (2000)
work page 2000
- [24]
-
[25]
Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acou. Speech Signal Process. 26(1), 43–49 (1978)
work page 1978
-
[26]
Intelligent Data Analysis 11(5), 561–580 (2007)
Salvador, S., Chan, P.: Toward accurate dynamic time warping in linear time and space. Intelligent Data Analysis 11(5), 561–580 (2007)
work page 2007
- [27]
-
[28]
In: Proceedings of the 25th interna- tional conference on Machine learning
Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.A.: Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th interna- tional conference on Machine learning. pp. 1096–1103. ACM (2008)
work page 2008
-
[29]
Weinberger, K.Q., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. JMLR 10(Feb), 207–244 (2009)
work page 2009
-
[30]
Neurorehabilitation and neural repair 27(8), 742–752 (2013)
Weiss, A., Brozgol, M., Dorfman, M., Herman, T., Shema, S., Giladi, N., Hausdorff, J.M.: Does the evaluation of gait quality during daily life provide insight into fall risk? a novel approach using 3-day accelerometer recordings. Neurorehabilitation and neural repair 27(8), 742–752 (2013)
work page 2013
-
[31]
In: Proceedings of the 23rd international conference on Machine learning
Xi, X., Keogh, E., Shelton, C., Wei, L., Ratanamahatana, C.A.: Fast time series classification using numerosity reduction. In: Proceedings of the 23rd international conference on Machine learning. pp. 1033–1040. ACM (2006)
work page 2006
- [32]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.