pith. sign in

arxiv: 1906.08481 · v1 · pith:SDQI5PWZnew · submitted 2019-06-20 · 🧬 q-bio.QM · q-bio.CB

Detecting long-range attraction between migrating cells based on p-value distributions

Pith reviewed 2026-05-25 19:10 UTC · model grok-4.3

classification 🧬 q-bio.QM q-bio.CB
keywords cell migrationp-value distributionchemotaxistarget-directed behaviorstatistical testimmune cellssimulation
0
0 comments X

The pith

A p-value distribution test on cell steps distinguishes random migration from target-directed hunting in simulations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a statistical approach to decide whether immune cells reach targets by chance alone or by directed movement. For each recorded step, it computes a p-value as the probability that a move at least as well aimed at the target would arise under purely random, target-independent motion. The collection of these p-values is then compared with the distribution obtained after the target positions have been randomly reshuffled. When the procedure is run on simulated trajectories generated from several chemotactic search rules, the real-data p-value histogram deviates from the reference in a way that flags the presence of goal-directed behavior.

Core claim

By assigning to every migration step the probability that an equally or more target-aligned displacement would occur under target-independent motion, and by testing whether the resulting p-value histogram differs from the histogram produced by randomized target locations, the method separates blind migration from target-directed hunting on simulated data drawn from multiple chemotactic mechanisms.

What carries the argument

The per-step p-value (probability of a move at least as target-directed under null random motion) together with its empirical distribution compared against the reference distribution from randomized target positions.

If this is right

  • The procedure can be applied directly to time-lapse recordings to test for chemotactic attraction.
  • It remains effective across a range of simulated chemotactic search strategies.
  • Deviation of the observed p-value histogram from the randomized reference signals the presence of long-range target influence.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If validated on real data, the same comparison could quantify how strongly cells respond to distant targets.
  • The approach could be adapted to analyze directed motion in other systems such as bacterial chemotaxis or collective animal movement.

Load-bearing premise

Randomizing the locations of the targets produces a reference distribution that matches the p-value statistics expected when cells truly ignore the targets.

What would settle it

If real trajectories known to be purely random yield p-value distributions that differ systematically from the randomized-target reference, or if known directed trajectories fail to produce such a difference, the test would be falsified.

Figures

Figures reproduced from arXiv: 1906.08481 by Claus Metzner.

Figure 1
Figure 1. Figure 1: FIG. 1. Explanation of our method to detect goal-directed migration. [PITH_FULL_IMAGE:figures/full_fig_p012_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2. Application of our method to four different types of surrogate data. Shown are in each [PITH_FULL_IMAGE:figures/full_fig_p013_2.png] view at source ↗
read the original abstract

Immune cells have evolved to recognize and eliminate pathogens, and the efficiency of this process can be measured in a Petri dish. Yet, even if the cells are time-lapse recorded and tracked with high resolution, it is difficult to judge whether the immune cells find their targets by mere chance, or if they approach them in a goal-directed way, perhaps using remote sensing mechanisms such as chemotaxis. To answer this question, we assign to each step of an immune cell a 'p-value', the probability that a move, at least as target-directed as observed, can be explained with target-independent migration behavior. The resulting distribution of p-values is compared to the distribution of a reference system with randomized target positions. By using simulated data, based on various chemotactic search mechanisms, we demonstrate that our method can reliably distinguish between blind migration and target-directed 'hunting' behavior.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces a statistical method to detect target-directed ('hunting') migration versus blind migration in cell tracks. Each step is assigned a p-value representing the probability of observing a move at least as target-directed under a model of target-independent behavior; the empirical p-value distribution is then compared against a reference distribution generated by randomizing target positions. Validation is performed exclusively on simulated trajectories generated from various chemotactic search mechanisms, with the central claim that the method reliably distinguishes blind from directed behavior.

Significance. If the central claim holds, the method supplies a mechanism-agnostic statistical test for long-range attraction in migration data that could be applied to experimental cell tracks. A strength is the controlled simulation framework using multiple chemotactic mechanisms to test separation; this provides a falsifiable benchmark. However, the significance is tempered by the dependence on an unverified match between the p-value null model and the blind-migration generative process used in simulations.

major comments (2)
  1. [Methods / Simulation setup and p-value definition] The central claim (abstract and results) that the method 'reliably distinguishes' rests on simulations in which blind-migration p-value distributions match the randomized-target reference. However, computing each p-value requires an explicit probabilistic model of target-independent moves (step lengths, turning angles, persistence). No section demonstrates that this model is identical to (or even calibrated against) the generative process used to create the 'blind' simulation trajectories; a mismatch would make the observed separation an artifact rather than evidence of the statistic's validity.
  2. [Results] Results section: the paper reports that the method distinguishes behaviors on simulated data, but provides no quantitative performance metrics (e.g., ROC-AUC, false-positive rate at chosen thresholds, or sensitivity analysis to model misspecification). Without these, it is impossible to assess how 'reliable' the distinction is or whether it survives small changes to the null model.
minor comments (2)
  1. [Abstract] Abstract: the precise formula or algorithm for the per-step p-value is not stated, even at a high level; this should be added for reproducibility.
  2. [Introduction / Methods] Notation: the term 'target-directed' is used without an explicit mathematical definition (e.g., angle or distance metric) in the early sections; this should be formalized before the simulation results.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thoughtful comments. We provide point-by-point responses below and will make revisions to address the concerns regarding model matching and quantitative metrics.

read point-by-point responses
  1. Referee: [Methods / Simulation setup and p-value definition] The central claim (abstract and results) that the method 'reliably distinguishes' rests on simulations in which blind-migration p-value distributions match the randomized-target reference. However, computing each p-value requires an explicit probabilistic model of target-independent moves (step lengths, turning angles, persistence). No section demonstrates that this model is identical to (or even calibrated against) the generative process used to create the 'blind' simulation trajectories; a mismatch would make the observed separation an artifact rather than evidence of the statistic's validity.

    Authors: The p-value is computed using a general model of target-independent migration that does not assume a specific generative process beyond the absence of target direction. The blind simulations are also generated under target-independent rules. While the manuscript does not include an explicit side-by-side parameter comparison, the randomized-target reference is the key null, and the p-value model is meant to be broadly applicable. Nevertheless, to strengthen the paper, we will revise the Methods to include the specific distributions used in both the p-value calculation and the simulations, and demonstrate that they are consistent or calibrated from the data. revision: yes

  2. Referee: [Results] Results section: the paper reports that the method distinguishes behaviors on simulated data, but provides no quantitative performance metrics (e.g., ROC-AUC, false-positive rate at chosen thresholds, or sensitivity analysis to model misspecification). Without these, it is impossible to assess how 'reliable' the distinction is or whether it survives small changes to the null model.

    Authors: We agree that adding quantitative metrics would improve the assessment of reliability. The manuscript currently presents the distinction through the p-value distributions and their comparison to the reference. In the revised version, we will include performance metrics such as the area under the ROC curve for distinguishing the two behaviors, false-positive rates, and an analysis of sensitivity to the choice of null model parameters. revision: yes

Circularity Check

0 steps flagged

No significant circularity; null model via external randomization is independent

full rationale

The paper's core procedure assigns p-values using an explicit probabilistic model of target-independent migration and obtains the reference distribution by randomizing target positions in the observed trajectories. This randomization step is an external operation performed on the data rather than a fit or self-referential definition internal to the statistic. Validation relies on separate simulations generated from chemotactic mechanisms, which are not used to define or tune the null model. No self-citations, fitted parameters renamed as predictions, or ansatzes smuggled via prior work appear in the derivation. The method is therefore self-contained against the external benchmark of randomized targets and simulated trajectories.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The method depends on a domain assumption that a well-defined probability can be assigned to each observed step under a target-independent migration model; no free parameters or invented entities are mentioned in the abstract.

axioms (1)
  • domain assumption A p-value can be computed for each cell step as the probability of observing a move at least as target-directed under target-independent migration.
    This definition is required to generate the p-value distribution that is then compared to the randomized reference.

pith-pipeline@v0.9.0 · 5674 in / 1303 out tokens · 29659 ms · 2026-05-25T19:10:07.553612+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Inferring long-range interactions between immune and tumor cells -- pitfalls and (partial) solutions

    q-bio.QM 2019-07 unverdicted novelty 6.0

    Maximum likelihood fitting of distance-dependent turning probabilities to cell trajectories recovers true interaction parameters even when migration behavior switches over time, unlike force-based models.

Reference graph

Works this paper leans on

16 extracted references · 16 canonical work pages · cited by 1 Pith paper · 2 internal anchors

  1. [1]

    Immunobiology: the immune system in health and disease , volume 7

    Charles A Janeway, Paul Travers, Mark Walport, Mark Shlomchik, et al. Immunobiology: the immune system in health and disease , volume 7. Current Biology London, 1996

  2. [2]

    Pathogen recognition by the innate immune system

    Himanshu Kumar, Taro Kawai, and Shizuo Akira. Pathogen recognition by the innate immune system. International reviews of immunology , 30(1):16–34, 2011

  3. [3]

    Cancer immunotherapy

    Manfred Schuster, Andreas Nechansky, and Ralf Kircheis. Cancer immunotherapy. Biotech- nology Journal: Healthcare Nutrition Technology , 1(2):138–147, 2006

  4. [4]

    Decade in reviewcancer immunotherapy: entering the mainstream of cancer treatment

    Steven A Rosenberg. Decade in reviewcancer immunotherapy: entering the mainstream of cancer treatment. Nature Reviews Clinical Oncology , 11(11):630, 2014

  5. [5]

    Breathing new life into immunother- apy: review of melanoma, lung and kidney cancer

    Charles G Drake, Evan J Lipson, and Julie R Brahmer. Breathing new life into immunother- apy: review of melanoma, lung and kidney cancer. Nature reviews Clinical oncology, 11(1):24, 2014

  6. [6]

    Chemotaxis

    Michael Eisenbach. Chemotaxis. World Scientific Publishing Company, 2004

  7. [7]

    Resampling-based multiple testing: Examples and methods for p-value adjustment , volume 279

    Peter H Westfall, S Stanley Young, et al. Resampling-based multiple testing: Examples and methods for p-value adjustment , volume 279. John Wiley & Sons, 1993

  8. [8]

    Principles of efficient chemotactic pursuit

    Claus Metzner. Principles of efficient chemotactic pursuit. arXiv, 1902.10589:1–27, 2019

  9. [9]

    Superstatistical analysis and modelling of heterogeneous random walks

    Claus Metzner, Christoph Mark, Julian Steinwachs, Lena Lautscham, Franz Stadler, and 9 Ben Fabry. Superstatistical analysis and modelling of heterogeneous random walks. Nature communications, 6(May):7516, jun 2015

  10. [10]

    Statistical distributions

    Merran Evans, Nicholas Hastings, and Brian Peacock. Statistical distributions. 2000

  11. [11]

    The difference between significant and not significant is not itself statistically significant

    Andrew Gelman and Hal Stern. The difference between significant and not significant is not itself statistically significant. The American Statistician , 60(4):328–331, 2006

  12. [12]

    A dirty dozen: twelve p-value misconceptions

    Steven Goodman. A dirty dozen: twelve p-value misconceptions. In Seminars in hematology , volume 45, pages 135–140. Elsevier, 2008

  13. [13]

    Revised standards for statistical evidence

    Valen E Johnson. Revised standards for statistical evidence. Proceedings of the National Academy of Sciences , 110(48):19313–19317, 2013

  14. [14]

    The enduring evolution of the p value

    Demetrios N Kyriacou. The enduring evolution of the p value. Jama, 315(11):1113–1115, 2016

  15. [15]

    P values as random variables - expected p values

    Harold Sackrowitz and Ester Samuel-Cahn. P values as random variables - expected p values. The American Statistician , 53(4):326–331, 1999

  16. [16]

    A Short Note on P-Value Hacking

    Nassim Nicholas Taleb. A short note on p-value hacking. arXiv preprint arXiv:1603.07532 , 2016. AUTHOR CONTRIBUTIONS ST A TEMENT CM developed the concept, implemented the method, and wrote the paper. ADDITIONAL INFORMA TION F unding This work was funded by the Grant ME1260/11-1 of the German Research Foundation DFG. Competing interests statement The autho...