A method for statistical research on binary stars using radial velocities

Liu Chao; Luo Feng; Zhao YongHeng

arxiv: 2502.18577 · v2 · submitted 2025-02-25 · 🌌 astro-ph.SR · astro-ph.GA· astro-ph.IM

A method for statistical research on binary stars using radial velocities

Luo Feng , Zhao YongHeng , Liu Chao This is my paper

Pith reviewed 2026-05-23 02:26 UTC · model grok-4.3

classification 🌌 astro-ph.SR astro-ph.GAastro-ph.IM

keywords binary starsradial velocitystatistical analysisAPOGEEred giantsstellar evolutionmetallicity

0 comments

The pith

The DVCD method analyzes binary star fractions from radial velocity data with high efficiency.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes the Differential Velocity Cumulative Distribution method as a new way to statistically research binary stars using radial velocity measurements. It claims this approach is far more computationally efficient than previous methods while providing accurate results. Application to red giant stars in the APOGEE DR16 survey shows the binary fraction decreases as surface gravity drops and metallicity increases. If correct, this offers a practical tool for handling large astronomical datasets on binary systems. The findings constrain how binary stars evolve over time in different stellar populations.

Core claim

The central claim is that the DVCD algorithm recovers binary fractions from radial velocity data more accurately and with computation time reduced by factors of 10^{-4} to 10^{-5} compared to existing approaches, and its application to 16 subsets of APOGEE DR16 red giant data divided by log g and M/H reveals that the binary fraction decreases with decreasing surface gravity and increasing metallicity.

What carries the argument

The Differential Velocity Cumulative Distribution (DVCD) algorithm that uses the cumulative distribution of differential radial velocities to infer binary star fractions.

If this is right

The binary fraction can be measured efficiently for large samples from surveys like APOGEE.
Binary fractions in red giants vary systematically with evolutionary stage indicated by surface gravity.
Metallicity influences the observed binary fraction in stellar populations.
Constraints are placed on evolutionary processes affecting binary stars.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the method works, it could be extended to other radial velocity surveys to study binary fractions across different stellar types.
Trends observed might help model how binaries are disrupted or formed in galactic environments.
Further validation with simulated data would strengthen confidence in the recovered fractions.

Load-bearing premise

The DVCD method correctly recovers the true binary fractions from the radial velocity data without significant biases from observational effects or orbital degeneracies.

What would settle it

Simulating a population of binary and single stars with known fractions, applying realistic observational errors, and checking if the DVCD method returns the input binary fraction within expected errors.

Figures

Figures reproduced from arXiv: 2502.18577 by Liu Chao, Luo Feng, Zhao YongHeng.

**Figure 1.** Figure 1: Result of the prediction of orbital parameters using the MCMC approach. The black dashed lines in each 1d subgraph respectively indicate the position of the 15th, 50th, and 85th percentiles [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗

**Figure 2.** Figure 2: Result of the prediction of orbital parameters using the MCMC approach. The black dashed lines in each 1d subgraph respectively indicate the position of the 15th, 50th, and 85th percentiles. 30 20 10 0 10 20 30 DtajrvList (km s 1 ) 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 Normalized probability epochs = 2 epochs = 15 epochs = 30 [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 5.** Figure 5: CDC of 𝐷𝑡 𝑎 𝑗𝑟 𝑣𝐿𝑖𝑠𝑡 of single star samples with different 𝜀. 3.4 Distribution laws of orbital parameters In the case of multiple samples, the orbital period 𝑃, mass ratio 𝑞, and eccentricity 𝑒 of binary systems may follow certain distribution laws. Previous studies have investigated the distributions of these orbital parameters and proposed different hypotheses, such as assuming that the orbital period 𝑃 … view at source ↗

**Figure 8.** Figure 8: The difference of the cumulative distribution curves of 𝐷𝑡 𝑎 𝑗𝑟 𝑣𝐿𝑖𝑠𝑡 of binary star samples generated with different ( 𝜋, 𝜅, 𝜂) and 𝑓𝑏𝑖𝑛 [PITH_FULL_IMAGE:figures/full_fig_p005_8.png] view at source ↗

**Figure 7.** Figure 7: The difference of the cumulative distribution curves of 𝐷𝑡 𝑎 𝑗𝑟 𝑣𝐿𝑖𝑠𝑡 of binary star samples generated with different 𝜋, 𝜅 and 𝜂. These variations correspondingly affect the cumulative distribution characteristics of 𝐷𝑡𝑎 𝑗𝑟𝑣𝐿𝑖𝑠𝑡. To verify the above theoretical analysis, we first examined the influence of 𝜋 on 𝐷𝑡𝑎 𝑗𝑟𝑣𝐿𝑖𝑠𝑡. Let 𝜋 be equal to (−1.5, −1, 0.5), 𝜅 = 0, 𝜂 = 0, 𝜀 = 1.0, and 𝑒 𝑝𝑜𝑐ℎ𝑠 = 11. We gener… view at source ↗

**Figure 9.** Figure 9: Calculated results of samples whose 𝜋 equals −1.5, the black solid lines in the graph represent the position of true values. Red point in the lower left sub-image indicated the peak position. 2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 0.0 0.2 0.4 0.6 0.8 1.0 fbin 2 1 0 1 2 0.0 0.1 0.2 0.3 0.4 0.5 likelihood P15=0.00 P50=0.70 P85=1.60 peak=0.40 true =0.50 0.0 0.2 0.4 0.6 0.8 1.0 fbin 0.0 2.5 5.0 7.5 10.0 12.5 like… view at source ↗

**Figure 10.** Figure 10: Calculated results of samples whose 𝜋 equals 0.5, the black solid lines in the graph represent the position of true values. Red point in the lower left sub-image indicated the peak position. be written as Equation 5. Then we can study the binary characteristics by analyzing the performance of 𝑃𝑣𝑎𝑙𝑢𝑒 in the parameter space. 𝑃𝑣𝑎𝑙𝑢𝑒(𝜋, 𝜅, 𝜂, 𝜀, 𝑓𝑏𝑖𝑛) = 𝑃𝐾𝑆 (𝐷𝑡𝑎 𝑗𝑟𝑣𝐿𝑖𝑠𝑡𝑜𝑏𝑠, 𝐷𝑡𝑎 𝑗𝑟𝑣𝐿𝑖𝑠𝑡𝑡𝑚𝑝𝑙(𝜋, 𝜅, 𝜂, 𝜀, 𝑓𝑏𝑖𝑛)) … view at source ↗

**Figure 13.** Figure 13: The predictions of 𝜋 and 𝑓𝑏𝑖𝑛 with different observational epochs. 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.0 0.2 0.4 0.6 0.8 1.0 fbin true fbin 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 true fbin 2 1 0 1 2 true = 1.5 [PITH_FULL_IMAGE:figures/full_fig_p007_13.png] view at source ↗

**Figure 14.** Figure 14: The predictions of 𝜋 and 𝑓𝑏𝑖𝑛 with different truths of 𝑓𝑏𝑖𝑛. bars also shrink significantly. As the binary fraction increases, the error bars continue to shorten, indicating that the accuracy of the prediction results is getting better and better. This finding emphasizes that when analyzing datasets with relatively small sample sizes and obtaining low binary fractions ( 𝑓𝑏𝑖𝑛), the accuracy of orbital par… view at source ↗

**Figure 15.** Figure 15: Red giant samples from APOGEE DR16. (i) 𝑆𝑁 𝑅 ≥ 20; (ii) There are at least 6 repeated observations; (iii) log 𝑔 < 4 when 4600 < 𝑇eff < 5600 or log 𝑔 < 3.5 when 𝑇eff < 4600; (iv) Remove spectra satisfying: In “APOGEE_STARFLAG” field, the “LOW_SNR”, “VERY_BRIGHT_NEIGHBOR”, “PERSIST_JUMP_NEG”, “PERSIST_JUMP_POS”, “BAD_PIXELS”, “PERSIST_HIGH” markers equal to 1. In “APOGEE_ASPCAPFLAG” field the “STAR_BAD” mar… view at source ↗

**Figure 16.** Figure 16: Basic information of the selected red giant samples. when 𝜀 = 0.074. Then we adopted 0.074 km s−1 as the uniform RV measurement error of all observational samples. 5.3 Lower limits of orbital period detection Samples whose 2.2 < log𝑔 < 2.8 were not included since this area may contain many red clumps. We divided the rest 7607 samples into 16 parts according to M/H and log𝑔, see [PITH_FULL_IMAGE:figures/f… view at source ↗

**Figure 17.** Figure 17: Estimating the error of RV through generations of single star samples with different 𝜀. 2.5 2.0 1.5 1.0 0.5 0.0 0.5 M/H 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 logg 890 614 239 211 568 582 291 326 258 700 505 520 173 547 564 619 20 40 60 80 100 120 140 R Vm a x (min = 0.0 2, m a x = 1 4 8.2 9) [PITH_FULL_IMAGE:figures/full_fig_p009_17.png] view at source ↗

**Figure 19.** Figure 19: The obtained 1959 primary masses from Ness et al. (2016) of mock data. Our analysis reveals that 𝑓𝑏𝑖𝑛 estimates demonstrate high accuracy across nearly all test cases. However, 𝜋 predictions exhibit reduced accuracy when 𝑓𝑏𝑖𝑛 = 0.1, irrespective of sample size or number of observational epochs. As 𝑓𝑏𝑖𝑛 increases to 0.3 or higher, the results progressively converge toward their respective true values. Over… view at source ↗

**Figure 18.** Figure 18: Meshes according M/H and log𝑔, the number in each block is the amount of observational sources included in which. we can see the maximum of the observation interval between any pairs of observations is about 1452 days. It means that we can not detect the binaries with orbital periods longer than 1452 days from these RGB data. Then the upper limit of the detected range of 𝑃 was determined as 1400 days. Exc… view at source ↗

**Figure 21.** Figure 21: Estimating the error of RV through generations of single star samples with different 𝜀. single star, the binary system is devastated (Ivanova & Podsiadlowski 2002; Exter 2010; Jermyn & Cantiello 2020; Nakauchi et al. 2020). For another, if there is a non-negligible mass loss in a binary system (Sana 2022; Pejcha et al. 2021; Ekström 2021; Doikov & Yushchenko 2021; Farrell et al. 2020; Brinkman et al. 2019… view at source ↗

**Figure 20.** Figure 20: Simulated experiments with random samples that mimic the real data. Each cross-point of two dotted lines indicated the true value position of a group of mock data. Green, red and blue lines represent the true 𝜋 of -1.5, 0, 1.5, respectively. when 3.2 <= log𝑔 < 4.3 𝑎𝑛𝑑 2.5 <= M/H < −0.5, and the min is about 0.05 when 1.8 <= log𝑔 < 2.2 𝑎𝑛𝑑 0.0 <= M/H < 0.6. Generally, the 𝑓𝑏𝑖𝑛 estimations show a tendency o… view at source ↗

read the original abstract

Binary stars are fundamental to astrophysics, providing critical insights into stellar evolution, galactic dynamics, and fundamental physics. However, the high dimensionality of orbital parameters and observational constraints present significant challenges in statistically characterizing their properties. In this study, we propose and implement a novel algorithm, the Differential Velocity Cumulative Distribution (DVCD), to analyze binary star systems using radial velocity data. The DVCD method demonstrates superior accuracy and computational efficiency compared to existing approaches, reducing computation time by factors of $10^{-4}$ to $10^{-5}$ under comparable conditions. We applied the DVCD algorithm to red giant samples from APOGEE DR16, dividing the dataset into 16 subsets based on $\log g$ and M/H. Our findings reveal that the binary fraction decreases with decreasing surface gravity and increasing metallicity, offering valuable constraints on the evolutionary processes of binary stars. This study underscores the potential of the DVCD method for large-scale statistical analyzes of binary systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

DVCD claims 10,000x speedups and new binary fraction trends but the abstract supplies zero derivation, simulation tests, or method comparisons.

read the letter

The paper introduces DVCD as a new algorithm for statistical binary-star work from radial velocities and applies it to APOGEE DR16 red giants split by log g and metallicity. It reports that binary fraction falls with lower surface gravity and higher metallicity, plus large claimed gains in speed and accuracy over prior approaches. That is the core of what is new here: a purportedly much faster tool plus one set of survey trends. The application itself is straightforward and targets a real bottleneck in handling large RV catalogs, which is useful in principle for people doing population statistics on existing surveys. Credit for trying to tackle computational cost on real data. The problems sit in the missing pieces. The abstract states superior accuracy and efficiency with specific factors but shows no equations, no mock-data recovery tests, no bias checks against orbital degeneracies or sampling cadence, and no head-to-head numbers against standard tools. Without those, the reported trends cannot be separated from possible methodological artifacts, and the efficiency claim stays untestable. The circularity burden is also high because nothing is shown about how the method reduces the high-dimensional orbital space or handles selection effects. This is aimed at stellar astrophysicists who analyze binary fractions in large spectroscopic surveys. A reader already working on APOGEE-style data might skim it for the trend direction, but the lack of validation means it does not yet supply a result worth building on or citing. I would not bring it to reading group in this state. I would not cite it. It does not look ready for serious peer review until the validation work is added and shown.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces the Differential Velocity Cumulative Distribution (DVCD) algorithm for statistical characterization of binary stars from radial velocity data. It asserts that DVCD achieves superior accuracy and computational efficiency relative to prior methods, with reported speedups of 10^{-4} to 10^{-5}. The method is applied to 16 subsets of APOGEE DR16 red giants binned by log g and [M/H], yielding the result that binary fraction declines with decreasing surface gravity and rising metallicity.

Significance. A validated, parameter-light method for large-scale binary population statistics would be useful for constraining binary evolution and selection effects in spectroscopic surveys. The reported trends with log g and metallicity, if shown to be free of recovery bias, would supply observational constraints on evolutionary processes. However, the absence of any simulation-based validation, bias tests, or error budgets means the claimed accuracy advantage and the astrophysical trends cannot yet be assessed as robust.

major comments (2)

[Abstract] Abstract: the central claim that DVCD demonstrates 'superior accuracy' and reduces computation time by factors of 10^{-4} to 10^{-5} 'under comparable conditions' is presented without any derivation of the speedup, definition of the comparison baseline, error bars, or quantitative recovery statistics on mock catalogs.
[Results] Results section (application to APOGEE DR16): the reported decline in binary fraction with decreasing log g and increasing metallicity is stated without recovery fractions, bias tests on simulated binaries, or an error budget that accounts for sampling cadence, orbital degeneracies, or observational selection. This leaves open whether the trend is distinguishable from methodological artifacts.

minor comments (1)

[Method] Notation for the DVCD statistic and its relation to the cumulative distribution of velocity differences should be defined explicitly before the application to real data.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. The comments highlight important gaps in validation and presentation that we agree require attention. We provide point-by-point responses below and will incorporate the necessary additions and clarifications in a revised manuscript.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that DVCD demonstrates 'superior accuracy' and reduces computation time by factors of 10^{-4} to 10^{-5} 'under comparable conditions' is presented without any derivation of the speedup, definition of the comparison baseline, error bars, or quantitative recovery statistics on mock catalogs.

Authors: We agree that the abstract states these performance claims without the supporting details requested. The speedup estimate arises from the O(N) scaling of cumulative distribution construction versus the higher-dimensional sampling required by traditional orbit-fitting methods (e.g., MCMC), with the baseline being standard radial-velocity binary codes applied to the same data volume. However, the manuscript as written does not include the explicit derivation, baseline definition, or mock-catalog recovery statistics in the abstract or main text. We will revise the abstract to remove the unqualified 'superior accuracy' phrasing and add a concise statement of the complexity argument, while moving quantitative timing and recovery results to a new methods subsection with error bars. revision: yes
Referee: [Results] Results section (application to APOGEE DR16): the reported decline in binary fraction with decreasing log g and increasing metallicity is stated without recovery fractions, bias tests on simulated binaries, or an error budget that accounts for sampling cadence, orbital degeneracies, or observational selection. This leaves open whether the trend is distinguishable from methodological artifacts.

Authors: The referee is correct that the reported trends lack accompanying recovery tests and an explicit error budget. The current analysis applies DVCD directly to the observed APOGEE subsets without injecting synthetic binaries to quantify completeness or bias as a function of log g and [M/H]. We will add a dedicated validation subsection that (1) generates mock catalogs matching the APOGEE cadence and uncertainties, (2) reports recovery fractions and bias in recovered binary fractions, and (3) discusses how sampling cadence and orbital degeneracies propagate into the final error budget. These additions will allow readers to assess whether the observed trends exceed methodological artifacts. revision: yes

Circularity Check

0 steps flagged

No circularity detected; derivation chain not present in visible text

full rationale

The provided abstract and reader summary contain no equations, derivations, fitted parameters presented as predictions, or self-citations. The DVCD method is introduced as novel without reference to prior author work that would create a load-bearing chain. Claims of accuracy and application results are stated without any reduction to inputs by construction. Per rules, absent any quotable reduction or self-citation dependency in the visible material, the score is 0 and steps array is empty. The central claims rest on the algorithm's performance on external data (APOGEE DR16) rather than internal redefinition.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Only abstract available; no explicit free parameters, axioms, or invented entities are described. The method implicitly assumes radial velocity differences can be cumulatively distributed to isolate binary signals without detailed modeling of selection effects.

axioms (1)

domain assumption Radial velocity data from APOGEE DR16 can be partitioned by log g and M/H to reveal evolutionary trends in binary fractions without significant observational bias.
Invoked when applying DVCD to the 16 subsets and interpreting the trends.

pith-pipeline@v0.9.0 · 5698 in / 1260 out tokens · 26181 ms · 2026-05-23T02:26:26.327638+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

3 extracted references · 3 canonical work pages

[1]

, " * write output.state after.block = add.period write newline

ENTRY address archivePrefix author booktitle chapter doi edition editor eprint howpublished institution journal key month number organization pages publisher school series title misctitle type volume year version url label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.consts ...

work page
[2]

write newline

" write newline "" before.all 'output.state := FUNCTION format.url url empty "" new.block "" url * "" * if FUNCTION format.eprint eprint empty "" archivePrefix empty "" archivePrefix "arXiv" = new.block " " eprint * " " * new.block " " eprint * " " * if if if FUNCTION format.doi doi empty "" " " doi * " " * if FUNCTION format.pid doi empty eprint empty ur...

work page
[3]

write newline

" write newline "" before.all 'output.state := FUNCTION fin.entry write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION new.sentence output.state after.block = 'skip output.state before.all = 'skip after.sentence 'output.state := if if FUNCTION not #0 #1 if FUNCTION and 'skip pop #0 if FUNCTION or pop #1...

work page

[1] [1]

, " * write output.state after.block = add.period write newline

ENTRY address archivePrefix author booktitle chapter doi edition editor eprint howpublished institution journal key month number organization pages publisher school series title misctitle type volume year version url label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.consts ...

work page

[2] [2]

write newline

" write newline "" before.all 'output.state := FUNCTION format.url url empty "" new.block "" url * "" * if FUNCTION format.eprint eprint empty "" archivePrefix empty "" archivePrefix "arXiv" = new.block " " eprint * " " * new.block " " eprint * " " * if if if FUNCTION format.doi doi empty "" " " doi * " " * if FUNCTION format.pid doi empty eprint empty ur...

work page

[3] [3]

write newline

" write newline "" before.all 'output.state := FUNCTION fin.entry write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION new.sentence output.state after.block = 'skip output.state before.all = 'skip after.sentence 'output.state := if if FUNCTION not #0 #1 if FUNCTION and 'skip pop #0 if FUNCTION or pop #1...

work page