Attractive and Repulsive Pattern Control in Sequence Generation

Francois Pachet

arxiv: 2606.24911 · v1 · pith:INM3KZLAnew · submitted 2026-06-19 · 💻 cs.SD · cs.AI

Attractive and Repulsive Pattern Control in Sequence Generation

Francois Pachet This is my paper

Pith reviewed 2026-06-26 13:10 UTC · model grok-4.3

classification 💻 cs.SD cs.AI

keywords variable-order Markovpattern controlsequence generationbelief propagationself-reusemusic generationsigned couplingrecurrence automaton

0 comments

The pith

Signed pattern control via weighted recurrence automata lets variable-order Markov models repel overactive patterns to cut high-order self-reuse.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Variable-order Markov models preserve local syntax but can lock into recurring high-order tunnels during long generation. The paper adds signed pattern control: a weighted recurrence automaton computes activation R for a chosen family of target patterns, and belief propagation samples from P_beta(x) proportional to P_0(x) exp(beta R(x)). Negative beta makes the targets costly, reducing generated 8-gram self-reuse on six duration-bearing monophonic sources including Bach and Telemann while increasing the effective number of distinct 8-grams and 4-gram context coverage. The same mechanism with positive beta turns the patterns into controlled attractors for probing basins and transitions. The target family is typically mined online from overactive material in the sampling history.

Core claim

A weighted recurrence automaton computes an activation R for a family of target patterns, and BP-Regular sampling draws exactly from P_beta(x) proportional to P_0(x) exp(beta R(x)). Negative coupling penalizes the targets during generation; positive coupling rewards them. On six monophonic sources the negative branch reduces 8-gram self-reuse, raises the count of distinct 8-grams produced, and widens coverage of training-supported 4-gram contexts while retaining substantial lower-order support. The identical signed mechanism on five Weimar Jazz solos reproduces the anti-reuse effect outside Baroque material.

What carries the argument

The signed coupling beta applied to the activation R of a weighted recurrence automaton inside BP-Regular sampling.

If this is right

Negative control increases the number of distinct 8-grams generated from the same model.
Coverage of training-supported 4-gram contexts rises while lower-order statistics remain largely intact.
The same signed mechanism supplies a positive branch for controlled attractor probing and phase-transition studies.
The approach transfers from Baroque monophonic material to Weimar jazz solos with the same anti-reuse signature.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The mechanism could be applied to text or symbolic sequence tasks outside music by mining different pattern families.
Supplying patterns from an external style vocabulary instead of online mining would test whether the control remains effective when targets are chosen independently of the generation history.
Extending the recurrence automaton to track polyphonic or multi-stream patterns would reveal whether the signed control generalizes beyond monophonic duration sequences.

Load-bearing premise

The online homeostatic selection of target patterns from overactive generated material produces a representative and unbiased family that the automaton can penalize without distorting the underlying variable-order statistics.

What would settle it

Re-running the negative branch on the same six monophonic sources and measuring no drop in 8-gram self-reuse, or no rise in effective 8-gram count, would falsify the reported experimental outcome.

Figures

Figures reproduced from arXiv: 2606.24911 by Francois Pachet.

**Figure 2.** Figure 2: The same 20-note absolute-pitch passage appears twice in the baseline seed-17 run, at [PITH_FULL_IMAGE:figures/full_fig_p010_2.png] view at source ↗

**Figure 3.** Figure 3: The longest repeated passage in the matched seed-17 penalty run has length 15 pitch [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗

**Figure 4.** Figure 4: Partita seed 17. Left: one occurrence of the strongest baseline 8-gram corridor, which [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: The same 14-note absolute-pitch passage appears twice in the Partita baseline seed-17 [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗

**Figure 6.** Figure 6: Contiguous fixed-motif phase probe, seed 17. Top: the four-note pitch-class recognizer [PITH_FULL_IMAGE:figures/full_fig_p013_6.png] view at source ↗

read the original abstract

Variable-order Markov models preserve local symbolic syntax by adapting context length, but long continuations can enter recurring high-order "tunnels": repeated suffixes, locally periodic passages, or copied fragments longer than the formal Markov order. This paper introduces signed pattern control for variable-order Markov generation with BP-Regular sampling. A weighted recurrence automaton computes an activation R for a chosen family of target patterns, and belief propagation samples exactly from P_beta(x) proportional to P_0(x) exp(beta R(x)). Negative coupling makes the target patterns costly during sampling; positive coupling rewards the same patterns and turns them into controlled attractors. The target family may be mined online from overactive generated material, supplied by a score or style vocabulary, or designed as an experimental probe. The main experiments use the online homeostatic case, choosing patterns that become overactive in the sampling history. On six duration-bearing monophonic sources, including Bach and Telemann material, the negative branch reduces generated 8-gram self-reuse, increases the effective number of generated 8-grams, and increases coverage of training-supported 4-gram contexts while preserving substantial lower-order support. A pitch-sequence replication on five Weimar Jazz Database solos gives the same anti-reuse signature outside Baroque material. The same signed mechanism also provides a positive branch for probing attractor basins, phase transitions, and hysteresis in the underlying variable-order model.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper adds a signed coupling layer on variable-order Markov models that reduces high-order self-reuse in music sequences when the negative branch is active, but the homeostatic mining step lacks controls to rule out bias.

read the letter

This paper gives a practical way to steer variable-order Markov generators away from or toward chosen patterns using a weighted recurrence automaton and belief propagation sampling.

What is new is the signed attractive/repulsive mechanism combined with online homeostatic pattern selection from the generation history itself. The negative setting is shown to cut 8-gram self-reuse, raise the count of distinct 8-grams, and improve coverage of training 4-gram contexts on six monophonic sources including Bach and Telemann, with a similar pattern on jazz solos. The positive branch is available for attractor studies.

The work does a clear job of keeping the lower-order Markov support intact while applying the control. The directional effects are consistent across the reported datasets.

The soft spots are the absence of error bars, statistical tests, or numerical values in the abstract, and the reliance on online mining without an ablation against a fixed external pattern set. If the mined patterns correlate with the tunnels being suppressed, the coupling could shift the effective context distribution in ways the current metrics do not catch.

This is for people building controllable symbolic music tools or sequence models who need a tunable avoidance mechanism. A reader working on Markov continuation would find the sampling construction and the empirical signature useful.

It deserves peer review so the implementation details and the mining bias can be checked against the full runs.

Referee Report

3 major / 1 minor

Summary. The manuscript introduces signed pattern control for variable-order Markov models using BP-Regular sampling. A weighted recurrence automaton computes activation R(x) for a family of target patterns (mined online from overactive generated material in the main regime), and sequences are sampled exactly from P_beta(x) proportional to P_0(x) exp(beta R(x)). Negative beta penalizes the patterns to reduce self-reuse; positive beta rewards them as attractors. On six duration-bearing monophonic sources (Bach, Telemann, etc.) and a Weimar Jazz pitch replication, the negative branch is reported to reduce 8-gram self-reuse, increase the effective number of distinct 8-grams, and improve coverage of training-supported 4-gram contexts while preserving lower-order support.

Significance. If the empirical claims hold after proper controls and statistical validation, the signed mechanism supplies a lightweight, post-training knob for modulating high-order reuse and attractor structure in variable-order generators. This is potentially useful for music and symbolic sequence tasks where one wishes to suppress tunnels or probe phase transitions without retraining. The approach is distinguished by its exact sampling guarantee and the homeostatic online mining option, but the current evidence base is too thin to support strong claims about unbiased pattern control.

major comments (3)

[Abstract] Abstract: the directional effects (reduced 8-gram self-reuse, increased effective 8-gram count, improved 4-gram context coverage) are stated without error bars, statistical tests, or any description of how the 8-gram counts, effective-number metric, or context-coverage quantities were computed across the six sources; this prevents assessment of whether the reported consistency is robust or an artifact of implementation choices.
[Abstract / Experiments] Main experimental regime (online homeostatic case): the central claim that negative coupling reduces reuse while preserving lower-order statistics rests on the assumption that patterns mined online from overactive generated material form an unbiased family for the weighted recurrence automaton. No ablation (fixed external pattern set, random selection, or removal of the homeostatic rule) is described to isolate the mining step from the signed coupling R(x), leaving open the possibility that selection preferentially samples patterns already correlated with high-order tunnels and thereby distorts the effective context distribution.
[Method] Method section: the weighted recurrence automaton and beta are external to the base variable-order model; the reported metrics compare generated statistics against training data rather than against an internal, parameter-free baseline derived from the same model, so the anti-reuse signature could partly reflect the external penalty rather than a genuine change in the variable-order dynamics.

minor comments (1)

[Method] Notation for the belief-propagation sampler and the precise definition of the effective-number metric should be expanded for reproducibility.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below and indicate the revisions we will make to strengthen the manuscript.

read point-by-point responses

Referee: [Abstract] Abstract: the directional effects (reduced 8-gram self-reuse, increased effective 8-gram count, improved 4-gram context coverage) are stated without error bars, statistical tests, or any description of how the 8-gram counts, effective-number metric, or context-coverage quantities were computed across the six sources; this prevents assessment of whether the reported consistency is robust or an artifact of implementation choices.

Authors: We agree that the abstract lacks sufficient detail on metric definitions and variability. In the revised manuscript we will expand the abstract to briefly describe the computation of the 8-gram self-reuse, effective-number, and context-coverage metrics, and we will state that results are aggregated across sources with standard deviations reported in the main text and figures. Statistical significance tests will be added to the experiments section. revision: yes
Referee: [Abstract / Experiments] Main experimental regime (online homeostatic case): the central claim that negative coupling reduces reuse while preserving lower-order statistics rests on the assumption that patterns mined online from overactive generated material form an unbiased family for the weighted recurrence automaton. No ablation (fixed external pattern set, random selection, or removal of the homeostatic rule) is described to isolate the mining step from the signed coupling R(x), leaving open the possibility that selection preferentially samples patterns already correlated with high-order tunnels and thereby distorts the effective context distribution.

Authors: The online homeostatic mining is presented as an integral component of the method rather than an optional add-on. To directly address the concern about possible selection bias, the revised version will include an ablation study that compares the homeostatic case against (i) a fixed external pattern set and (ii) random pattern selection, while keeping the signed coupling fixed. This will isolate the contribution of the mining rule. revision: yes
Referee: [Method] Method section: the weighted recurrence automaton and beta are external to the base variable-order model; the reported metrics compare generated statistics against training data rather than against an internal, parameter-free baseline derived from the same model, so the anti-reuse signature could partly reflect the external penalty rather than a genuine change in the variable-order dynamics.

Authors: We will revise the experiments section to make the internal baseline explicit: all reported metrics will be shown relative to the base variable-order model at beta = 0 (the same model without the weighted recurrence automaton). This comparison will be added alongside the training-data reference to clarify that the observed changes are attributable to the signed control rather than solely to the external penalty term. revision: yes

Circularity Check

0 steps flagged

No circularity in derivation or empirical claims

full rationale

The paper introduces an explicit sampling procedure (weighted recurrence automaton computing R(x), then exact sampling from P_beta(x) ∝ P_0(x) exp(beta R(x))) and applies it to external monophonic corpora. Reported metrics (8-gram self-reuse, effective 8-gram count, 4-gram context coverage) are computed by direct comparison of generated output against the same training data used to build the base variable-order model; these quantities are not algebraically forced by the definition of R(x) or the homeostatic mining rule. No self-citation supplies a uniqueness theorem, no parameter is fitted then relabeled as a prediction, and no ansatz is smuggled via prior work. The derivation chain therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 1 invented entities

The method rests on the new weighted recurrence automaton and the assumption that belief propagation exactly samples the modified distribution; beta is a free parameter controlling coupling strength.

free parameters (1)

beta
Coupling strength that scales the effect of pattern activation R on the sampling distribution.

axioms (1)

standard math Belief propagation exactly samples from P_beta(x) proportional to P_0(x) exp(beta R(x))
Invoked to justify the BP-Regular sampling procedure.

invented entities (1)

weighted recurrence automaton no independent evidence
purpose: Computes activation R for chosen target pattern families
New component introduced to track and weight pattern recurrences during generation.

pith-pipeline@v0.9.1-grok · 5767 in / 1328 out tokens · 23405 ms · 2026-06-26T13:10:37.215961+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

12 extracted references · 6 canonical work pages

[1]

D. J. Amit, G. Parisi, and L. Peliti. Asymptotic behavior of the “true” self-avoiding walk. Physical Review B, 27(3):1635–1645, 1983. https://doi.org/10.1103/PhysRevB.27.1635

work page doi:10.1103/physrevb.27.1635 1983
[2]

Papadopoulos, F

A. Papadopoulos, F. Pachet, P. Roy, and J. Sakellariou. Exact sampling for regular and Markov constraints with belief propagation. InPrinciples and Practice of Constraint Programming, 2015

2015
[3]

F. Pachet. Exact regular-constrained variable-order Markov generation via sparse context- state belief propagation.arXiv preprint arXiv:2605.07839, 2026. https://doi.org/10.48550/ arXiv.2605.07839

Pith/arXiv arXiv 2026
[4]

Papadopoulos, P

A. Papadopoulos, P. Roy, and F. Pachet. Avoiding plagiarism in Markov sequence generation. InProceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2014
[5]

D. Conklin. Antipattern discovery in folk tunes.Journal of New Music Research, 42(2):161– 169, 2013. https://doi.org/10.1080/09298215.2013.809125

work page doi:10.1080/09298215.2013.809125 2013
[6]

Conklin and S

D. Conklin and S. Weisser. Pattern and antipattern discovery in Ethiopian bagana songs. In D. Meredith, editor,Computational Music Analysis, pages 425–443. Springer, 2016. https://doi.org/10.1007/978-3-319-25931-4_16. 15

work page doi:10.1007/978-3-319-25931-4_16 2016
[7]

G. Ritchie. Some empirical criteria for attributing creativity to a computer program.Minds and Machines, 17:67–99, 2007. https://doi.org/10.1007/s11023-007-9066-2

work page doi:10.1007/s11023-007-9066-2 2007
[8]

Sakellariou, F

J. Sakellariou, F. Tria, V. Loreto, and F. Pachet. Maximum entropy models capture melodic styles.Scientific Reports, 7:9172, 2017. https://doi.org/10.1038/s41598-017-08028-4

work page doi:10.1038/s41598-017-08028-4 2017
[9]

doi: 10.1109/TAMD.2010.2056368

J. Schmidhuber. Formal theory of creativity, fun, and intrinsic motivation (1990–2010). IEEE Transactions on Autonomous Mental Development, 2(3):230–247, 2010. https://doi. org/10.1109/TAMD.2010.2056368

work page doi:10.1109/tamd.2010.2056368 1990
[10]

Pfleiderer, K

M. Pfleiderer, K. Frieler, J. Abeßer, W.-G. Zaddach, and B. Burkhart, editors.Inside the Jazzomat: New Perspectives for Jazz Research. Schott Campus, 2017. Weimar Jazz Database: https://jazzomat.hfm-weimar.de/

2017
[11]

The power of amnesia: learning probabilistic automata with variable memory length.Machine Learning, 1996

Ron, D., Singer, Y., and Tishby, N. The power of amnesia: learning probabilistic automata with variable memory length.Machine Learning, 1996

1996
[12]

Pearce, M. T. and Wiggins, G. A. Improved methods for statistical modelling of monophonic music.Journal of New Music Research, 2004. 16

2004

[1] [1]

D. J. Amit, G. Parisi, and L. Peliti. Asymptotic behavior of the “true” self-avoiding walk. Physical Review B, 27(3):1635–1645, 1983. https://doi.org/10.1103/PhysRevB.27.1635

work page doi:10.1103/physrevb.27.1635 1983

[2] [2]

Papadopoulos, F

A. Papadopoulos, F. Pachet, P. Roy, and J. Sakellariou. Exact sampling for regular and Markov constraints with belief propagation. InPrinciples and Practice of Constraint Programming, 2015

2015

[3] [3]

F. Pachet. Exact regular-constrained variable-order Markov generation via sparse context- state belief propagation.arXiv preprint arXiv:2605.07839, 2026. https://doi.org/10.48550/ arXiv.2605.07839

Pith/arXiv arXiv 2026

[4] [4]

Papadopoulos, P

A. Papadopoulos, P. Roy, and F. Pachet. Avoiding plagiarism in Markov sequence generation. InProceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2014

[5] [5]

D. Conklin. Antipattern discovery in folk tunes.Journal of New Music Research, 42(2):161– 169, 2013. https://doi.org/10.1080/09298215.2013.809125

work page doi:10.1080/09298215.2013.809125 2013

[6] [6]

Conklin and S

D. Conklin and S. Weisser. Pattern and antipattern discovery in Ethiopian bagana songs. In D. Meredith, editor,Computational Music Analysis, pages 425–443. Springer, 2016. https://doi.org/10.1007/978-3-319-25931-4_16. 15

work page doi:10.1007/978-3-319-25931-4_16 2016

[7] [7]

G. Ritchie. Some empirical criteria for attributing creativity to a computer program.Minds and Machines, 17:67–99, 2007. https://doi.org/10.1007/s11023-007-9066-2

work page doi:10.1007/s11023-007-9066-2 2007

[8] [8]

Sakellariou, F

J. Sakellariou, F. Tria, V. Loreto, and F. Pachet. Maximum entropy models capture melodic styles.Scientific Reports, 7:9172, 2017. https://doi.org/10.1038/s41598-017-08028-4

work page doi:10.1038/s41598-017-08028-4 2017

[9] [9]

doi: 10.1109/TAMD.2010.2056368

J. Schmidhuber. Formal theory of creativity, fun, and intrinsic motivation (1990–2010). IEEE Transactions on Autonomous Mental Development, 2(3):230–247, 2010. https://doi. org/10.1109/TAMD.2010.2056368

work page doi:10.1109/tamd.2010.2056368 1990

[10] [10]

Pfleiderer, K

M. Pfleiderer, K. Frieler, J. Abeßer, W.-G. Zaddach, and B. Burkhart, editors.Inside the Jazzomat: New Perspectives for Jazz Research. Schott Campus, 2017. Weimar Jazz Database: https://jazzomat.hfm-weimar.de/

2017

[11] [11]

The power of amnesia: learning probabilistic automata with variable memory length.Machine Learning, 1996

Ron, D., Singer, Y., and Tishby, N. The power of amnesia: learning probabilistic automata with variable memory length.Machine Learning, 1996

1996

[12] [12]

Pearce, M. T. and Wiggins, G. A. Improved methods for statistical modelling of monophonic music.Journal of New Music Research, 2004. 16

2004