CapyMOA: Efficient Machine Learning for Data Streams and Online Continual Learning in Python

Albert Bifet; Anton Lee; Bernhard Pfahringer; Guilherme Weigert Cassales; Heitor Murilo Gomes; Justin Liu; Marco Heyden; Maroua Bahri; Nuwan Gunasekara; Vitor Cerqueira

arxiv: 2502.07432 · v2 · submitted 2025-02-11 · 💻 cs.LG

CapyMOA: Efficient Machine Learning for Data Streams and Online Continual Learning in Python

Heitor Murilo Gomes , Anton Lee , Nuwan Gunasekara , Yibin Sun , Guilherme Weigert Cassales , Justin Liu , Marco Heyden , Vitor Cerqueira

show 4 more authors

Maroua Bahri Yun Sing Koh Bernhard Pfahringer Albert Bifet

This is my paper

Pith reviewed 2026-05-23 03:24 UTC · model grok-4.3

classification 💻 cs.LG

keywords data streamsonline learningcontinual learningPython librarymachine learningadaptive modelsreal-time learning

0 comments

The pith

CapyMOA supplies a Python library that integrates online algorithms with deep learning for data streams.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

CapyMOA is presented as an open-source Python library for machine learning on data streams and online continual learning. It offers a framework for models that adapt in real time as new data arrives. The library is built to work together with MOA for fast online methods, scikit-learn, and PyTorch for neural networks. This combination is meant to help users address changing data problems in many areas through better efficiency and ease of use.

Core claim

The central claim is that CapyMOA supplies a structured Python framework for real-time learning on streams, with an architecture that supports combining high-performance online algorithms from MOA with deep learning from PyTorch and scikit-learn, thereby enabling adaptive models for dynamic tasks.

What carries the argument

The CapyMOA architecture for integrating stream learning components with external machine learning libraries.

If this is right

Users can build systems that mix traditional online algorithms with modern deep learning models.
Adaptive models become easier to implement for evolving data environments.
Research and practice in dynamic learning tasks gain from improved scalability and usability.
Domains requiring real-time updates benefit from the library's design.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the integrations succeed, hybrid models could become standard for handling non-stationary data.
Future work might focus on specific performance benchmarks to quantify the efficiency gains.
Applications in fields like fraud detection or sensor data analysis could expand with easier access to these tools.

Load-bearing premise

The architecture delivers actual efficiency and usability gains beyond just connecting existing libraries.

What would settle it

An experiment comparing training time and accuracy on a stream dataset using CapyMOA versus separate use of MOA and PyTorch showing no improvement or added overhead.

read the original abstract

CapyMOA is an open-source Python library for efficient machine learning on data streams and online continual learning. It provides a structured framework for real-time learning, supporting adaptive models that evolve over time. CapyMOA's architecture allows integration with frameworks such as MOA, scikit-learn and PyTorch, enabling the combination of high-performance online algorithms with modern deep learning techniques. By emphasizing efficiency, scalability, and usability, CapyMOA allows researchers and practitioners to tackle dynamic learning challenges across various domains. Website: https://capymoa.org. GitHub: https://github.com/adaptive-machine-learning/CapyMOA.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The manuscript presents CapyMOA, an open-source Python library for machine learning on data streams and online continual learning. It describes a structured framework supporting adaptive models, with an architecture that integrates MOA, scikit-learn, and PyTorch to combine high-performance online algorithms with deep learning techniques, while emphasizing efficiency, scalability, and usability for dynamic learning tasks.

Significance. If the claimed integrations and design choices deliver measurable efficiency and usability gains on streaming tasks, the library could serve as a practical bridge between established stream mining tools and modern deep learning frameworks, lowering barriers for researchers working on continual and online learning problems.

major comments (2)

[Abstract] Abstract: The central claim that the architecture 'allows integration with frameworks such as MOA, scikit-learn and PyTorch, enabling the combination of high-performance online algorithms with modern deep learning techniques' is presented without any API details, code snippets, architectural diagrams, or performance measurements showing that the integrations are more than thin compatibility layers.
[Abstract] Abstract: Assertions that CapyMOA emphasizes 'efficiency, scalability, and usability' and 'allows researchers and practitioners to tackle dynamic learning challenges' are unsupported by any benchmarks, runtime comparisons, memory profiles, or user studies; the provided text contains no experimental validation of these properties.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on the abstract. We address each point below and will revise the manuscript to strengthen the presentation of claims.

read point-by-point responses

Referee: [Abstract] Abstract: The central claim that the architecture 'allows integration with frameworks such as MOA, scikit-learn and PyTorch, enabling the combination of high-performance online algorithms with modern deep learning techniques' is presented without any API details, code snippets, architectural diagrams, or performance measurements showing that the integrations are more than thin compatibility layers.

Authors: We agree that the abstract is concise and omits these supporting elements. The full manuscript details the integrations in Section 3 (Architecture), including API descriptions, code examples, and an architectural diagram (Figure 2), with performance measurements in Section 5 demonstrating non-trivial benefits over thin wrappers. We will revise the abstract to include a brief reference to these sections and a short integration example to better substantiate the claim. revision: yes
Referee: [Abstract] Abstract: Assertions that CapyMOA emphasizes 'efficiency, scalability, and usability' and 'allows researchers and practitioners to tackle dynamic learning challenges' are unsupported by any benchmarks, runtime comparisons, memory profiles, or user studies; the provided text contains no experimental validation of these properties.

Authors: We acknowledge that the abstract does not contain benchmarks or validation data. The manuscript provides runtime and memory comparisons in Section 5 (Experiments), along with usability discussion in Section 4. We will revise the abstract to qualify these assertions by referencing the experimental results presented in the paper, ensuring the claims are directly supported. revision: yes

Circularity Check

0 steps flagged

No circularity: library description paper has no derivations or predictions

full rationale

The paper is a software library announcement with no mathematical derivations, equations, predictions, fitted parameters, or uniqueness theorems. Claims about architecture and integrations with MOA/scikit-learn/PyTorch are presented as design statements rather than results derived from prior steps. No load-bearing argument reduces to self-definition, self-citation, or renaming. The reader's assessment of score 0.0 is confirmed; this is a standard non-finding for descriptive software papers.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No mathematical content, free parameters, axioms, or invented entities appear in the abstract; the work is a software engineering contribution.

pith-pipeline@v0.9.0 · 5683 in / 1066 out tokens · 29503 ms · 2026-05-23T03:24:53.348328+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

12 extracted references · 12 canonical work pages

[1]

Survey on feature transformation techniques for data streams

Maroua Bahri, Albert Bifet, Silviu Maniu, and Heitor Murilo Gomes. Survey on feature transformation techniques for data streams. In Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence, pages 4796--4802, 2021

work page 2021
[2]

Moa: Massive online analysis, a framework for stream classification and clustering

Albert Bifet, Geoff Holmes, Bernhard Pfahringer, Philipp Kranen, Hardy Kremer, Timm Jansen, and Thomas Seidl. Moa: Massive online analysis, a framework for stream classification and clustering. In Proceedings of the first workshop on applications of pattern analysis, pages 44--50. PMLR, 2010

work page 2010
[3]

Density-based clustering over an evolving data stream with noise

Feng Cao, Martin Estert, Weining Qian, and Aoying Zhou. Density-based clustering over an evolving data stream with noise. In 2006 SIAM international conference on data mining, pages 328--339, 2006

work page 2006
[4]

Improving the performance of bagging ensembles for data streams through mini-batching

Guilherme Cassales, Heitor Gomes, Albert Bifet, Bernhard Pfahringer, and Hermes Senger. Improving the performance of bagging ensembles for data streams through mini-batching. Information Sciences, 580: 0 260--282, 2021

work page 2021
[5]

Adaptive random forests for evolving data stream classification

Heitor M Gomes, Albert Bifet, Jesse Read, Jean Paul Barddal, Fabr \' cio Enembreck, Bernhard Pfahringer, Geoff Holmes, and Talel Abdessalem. Adaptive random forests for evolving data stream classification. Machine Learning, 106: 0 1469--1495, 2017

work page 2017
[6]

Streaming random patches for evolving data stream classification

Heitor Murilo Gomes, Jesse Read, and Albert Bifet. Streaming random patches for evolving data stream classification. In 2019 IEEE international conference on data mining (ICDM), pages 240--249. IEEE, 2019 a

work page 2019
[7]

Machine learning for streaming data: state of the art, challenges, and opportunities

Heitor Murilo Gomes, Jesse Read, Albert Bifet, Jean Paul Barddal, and Jo \ a o Gama. Machine learning for streaming data: state of the art, challenges, and opportunities. ACM SIGKDD Explorations Newsletter, 21 0 (2): 0 6--22, 2019 b

work page 2019
[8]

Mining time-changing data streams

Geoff Hulten, Laurie Spencer, and Pedro Domingos. Mining time-changing data streams. In Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, pages 97--106, 2001

work page 2001
[9]

Temporal silhouette: validation of stream clustering robust to concept drift

F \'e lix Iglesias V \'a zquez and Tanja Zseby. Temporal silhouette: validation of stream clustering robust to concept drift. Machine Learning, 113 0 (4): 0 2067--2091, 2024

work page 2067
[10]

Extremely fast decision tree

Chaitanya Manapragada, Geoffrey I Webb, and Mahsa Salehi. Extremely fast decision tree. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1953--1962, 2018

work page 1953
[11]

Scikit-multiflow: A multi-output streaming framework

Jacob Montiel, Jesse Read, Albert Bifet, and Talel Abdessalem. Scikit-multiflow: A multi-output streaming framework. JMLR, 19 0 (72), 2018

work page 2018
[12]

River: machine learning for streaming data in python

Jacob Montiel, Max Halford, Saulo Martiello Mastelini, Geoffrey Bolmier, Raphael Sourty, Robin Vaysse, Adil Zouitine, Heitor Murilo Gomes, Jesse Read, Talel Abdessalem, et al. River: machine learning for streaming data in python. Journal of Machine Learning Research, 22 0 (110): 0 1--8, 2021

work page 2021

[1] [1]

Survey on feature transformation techniques for data streams

Maroua Bahri, Albert Bifet, Silviu Maniu, and Heitor Murilo Gomes. Survey on feature transformation techniques for data streams. In Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence, pages 4796--4802, 2021

work page 2021

[2] [2]

Moa: Massive online analysis, a framework for stream classification and clustering

Albert Bifet, Geoff Holmes, Bernhard Pfahringer, Philipp Kranen, Hardy Kremer, Timm Jansen, and Thomas Seidl. Moa: Massive online analysis, a framework for stream classification and clustering. In Proceedings of the first workshop on applications of pattern analysis, pages 44--50. PMLR, 2010

work page 2010

[3] [3]

Density-based clustering over an evolving data stream with noise

Feng Cao, Martin Estert, Weining Qian, and Aoying Zhou. Density-based clustering over an evolving data stream with noise. In 2006 SIAM international conference on data mining, pages 328--339, 2006

work page 2006

[4] [4]

Improving the performance of bagging ensembles for data streams through mini-batching

Guilherme Cassales, Heitor Gomes, Albert Bifet, Bernhard Pfahringer, and Hermes Senger. Improving the performance of bagging ensembles for data streams through mini-batching. Information Sciences, 580: 0 260--282, 2021

work page 2021

[5] [5]

Adaptive random forests for evolving data stream classification

Heitor M Gomes, Albert Bifet, Jesse Read, Jean Paul Barddal, Fabr \' cio Enembreck, Bernhard Pfahringer, Geoff Holmes, and Talel Abdessalem. Adaptive random forests for evolving data stream classification. Machine Learning, 106: 0 1469--1495, 2017

work page 2017

[6] [6]

Streaming random patches for evolving data stream classification

Heitor Murilo Gomes, Jesse Read, and Albert Bifet. Streaming random patches for evolving data stream classification. In 2019 IEEE international conference on data mining (ICDM), pages 240--249. IEEE, 2019 a

work page 2019

[7] [7]

Machine learning for streaming data: state of the art, challenges, and opportunities

Heitor Murilo Gomes, Jesse Read, Albert Bifet, Jean Paul Barddal, and Jo \ a o Gama. Machine learning for streaming data: state of the art, challenges, and opportunities. ACM SIGKDD Explorations Newsletter, 21 0 (2): 0 6--22, 2019 b

work page 2019

[8] [8]

Mining time-changing data streams

Geoff Hulten, Laurie Spencer, and Pedro Domingos. Mining time-changing data streams. In Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, pages 97--106, 2001

work page 2001

[9] [9]

Temporal silhouette: validation of stream clustering robust to concept drift

F \'e lix Iglesias V \'a zquez and Tanja Zseby. Temporal silhouette: validation of stream clustering robust to concept drift. Machine Learning, 113 0 (4): 0 2067--2091, 2024

work page 2067

[10] [10]

Extremely fast decision tree

Chaitanya Manapragada, Geoffrey I Webb, and Mahsa Salehi. Extremely fast decision tree. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1953--1962, 2018

work page 1953

[11] [11]

Scikit-multiflow: A multi-output streaming framework

Jacob Montiel, Jesse Read, Albert Bifet, and Talel Abdessalem. Scikit-multiflow: A multi-output streaming framework. JMLR, 19 0 (72), 2018

work page 2018

[12] [12]

River: machine learning for streaming data in python

Jacob Montiel, Max Halford, Saulo Martiello Mastelini, Geoffrey Bolmier, Raphael Sourty, Robin Vaysse, Adil Zouitine, Heitor Murilo Gomes, Jesse Read, Talel Abdessalem, et al. River: machine learning for streaming data in python. Journal of Machine Learning Research, 22 0 (110): 0 1--8, 2021

work page 2021