hub Canonical reference

30 Leland McInnes, John Healy, and Steve Astels

author Smirnov, N · 1948 · arXiv aoms/1177730

Canonical reference. 39 Pith papers cite this work; 71% of classified citations (5 of 7) cite it as background.

citation-role summary

background: 5 · dataset: 1 · method: 1

citation-polarity summary

background: 5 · use dataset: 1 · use method: 1

representative citing papers

Information on hidden birth events restores identifiability in phylodynamic inference

q-bio.PE · 2026-04-20 · unverdicted · novelty 8.0

Hidden birth event information restores identifiability to time-dependent birth-death phylodynamic models; mutation-at-birth models make sequences sufficient to recover it.

Scale-Calibrated Median-of-Means for Robust Distributed Principal Component Analysis

stat.ME · 2026-05-20 · unverdicted · novelty 7.0

Proposes a scale-calibrated median-of-means estimator for robust aggregation of distributed PCA estimates on the product of Euclidean space and Grassmann manifold.

Solving linear-rate ODE hierarchies (like master equations) using closures and operator splitting

math.NA · 2026-05-16 · unverdicted · novelty 7.0

For linear-rate master equations the generating function admits an exact composition-multiplier representation whose Taylor coefficients on any finite window are obtained from a closed lower-triangular ODE of size 2(N+1), independent of the truncation cap N; the same closure is combined with Strang–

Zombies in Alternate Realities: The Afterlife of Domain Names in DNS Integrations

cs.CR · 2026-05-07 · unverdicted · novelty 7.0

Zombie domain linkages persist after ownership changes in DNS integrations at rates of 3% in Web PKI, 24% in ENS, and 15% in Maven Central, with validate-once designs accumulating long-term risks while per-use validation prevents them.

Profile Likelihood Inference for Anisotropic Hyperbolic Wrapped Normal Models on Hyperbolic Space

math.ST · 2026-05-01 · unverdicted · novelty 7.0

The profile maximum likelihood estimator for the location in anisotropic hyperbolic wrapped normal models is strongly consistent, asymptotically normal, and attains the Hájek-Le Cam minimax lower bound under squared geodesic loss.

A Large-Scale Empirical Study of AI-Generated Code in Real-World Repositories

cs.SE · 2026-03-28 · unverdicted · novelty 7.0

A large-scale study of real-world repositories finds that AI-generated code differs from human-written code in complexity, structural traits, defect indicators, and commit-level activity patterns.

The Dynamical Origin of Millimetre-Sized Sporadic Meteoroids

astro-ph.EP · 2026-06-25 · unverdicted · novelty 6.0

Dynamical simulations show mm-sized meteoroids impacting Earth below 17 km/s are mostly asteroidal if released in the last 150-200 kyr, with cometary fraction rising above that speed and dominating above 27 km/s.

The Chandra-Gaia Catalog of Counterparts: Resolving ambiguous Gaia matches to X-ray sources in the Chandra Source Catalog using Machine Learning

astro-ph.IM · 2026-06-17 · unverdicted · novelty 6.0

A LightGBM classifier trained on NWAY Bayesian matches identifies true Chandra-Gaia counterparts for 113k X-ray sources, flags 7k ambiguous cases, and attributes half of 20k separation-only matches to chance coincidences, validated at 95% on COUP without positional features.

The Dynamics of Human and AI-Generated Language: How Semantics Fluctuates across Different Timescales

cs.CL · 2026-06-09 · unverdicted · novelty 6.0

Develops ACW-based semantic timescale features showing longer autocorrelation windows associate with generic vocabulary and shorter ones with specific words in both human and LLM speech, with the pattern abolished by randomizing word order and timing.

The Nonparametric Kiefer-Weiss Problem

math.ST · 2026-05-29 · unverdicted · novelty 6.0

The nonparametric Kiefer-Weiss problem is solved by deriving an optimal stopping policy based on a two-dimensional statistic (likelihood ratio plus expected remaining sample size) whose randomization rule maps the likelihood ratio to an integer sample size.

Comparing Two Categorical Gini Correlations with Applications to Classification Problems

stat.ME · 2026-05-18 · unverdicted · novelty 6.0

Proposes an inferential framework to test differences in categorical Gini correlations for predictor importance in classification, establishing asymptotic normality and consistency while accommodating unequal dimensions and dependence.

Semantic Feature Segmentation for Interpretable Predictive Maintenance in Complex Systems

cs.AI · 2026-05-14 · unverdicted · novelty 6.0

Semantic segmentation decomposes monitoring features into canonical and residual components that concentrate fault-predictive information while preserving operational meaning in predictive maintenance.

Pattern-based tests for two-dimensional copulas

math.ST · 2026-05-13 · unverdicted · novelty 6.0

A functional central limit theorem for pattern frequencies in 2D samples enables nonparametric goodness-of-fit, two-sample, and symmetry tests for copulas, with bootstrap critical values and parametric examples.

What Software Engineering Looks Like to AI Agents? -- An Empirical Study of AI-Only Technical Discourse on MoltBook

cs.SE · 2026-05-08 · unverdicted · novelty 6.0 · 2 refs

Empirical analysis of 4707 MoltBook posts shows AI-only technical discourse focuses on security, trust, and abstract topics while lacking concrete runtime and project details found in human GitHub discussions.

Scale selection for geometric medians on product manifolds

math.ST · 2026-05-08 · unverdicted · novelty 6.0

Joint location-scale minimization for geometric medians on product manifolds degenerates to marginal medians, and three new scale-selection methods restore identifiability with asymptotic guarantees.

Sketching the Readout of Large Language Models for Scalable Data Attribution and Valuation

cs.LG · 2026-04-17 · unverdicted · novelty 6.0

RISE applies CountSketch to dual lexical and semantic channels derived from output-layer gradient outer products, cutting data attribution storage by up to 112x and enabling retrospective and prospective influence analysis on LLMs up to 32B parameters.

A test for normality based on self-similarity

stat.ME · 2026-04-04 · conditional · novelty 6.0

The SSTN detects non-normality by tracking how the standardized empirical characteristic function changes under repeated self-similarity transformations, with the null distribution calibrated by Monte Carlo simulation.

Do Good, Stay Longer? Temporal Patterns and Predictors of Newcomer-to-Core Transitions in Conventional OSS and OSS4SG

cs.SE · 2026-01-30 · unverdicted · novelty 6.0

OSS4SG projects retain contributors at 2.2X higher rates with 19.6% higher core status probability than conventional OSS, and a late-spike temporal pattern enables faster core achievement (21 weeks) than early intensive contributions.

Automation and Reuse Practices in GitHub Actions Workflows: A Practitioner's Perspective

cs.SE · 2026-01-16 · conditional · novelty 6.0

A survey of 419 practitioners shows strong reliance on reusable GitHub Actions for core CI/CD tasks but limited adoption of reusable workflows, with copy-pasting remaining common due to versioning and trust issues.

Sensitivity Analysis on the Sphere and a Spherical ANOVA Decomposition

math.NA · 2025-12-29 · unverdicted · novelty 6.0

A parity-augmented ANOVA decomposition is established for functions on the sphere using orthogonal bases to capture geometry-induced variable dependencies.

Constrained Co-evolutionary Metamorphic Differential Testing for Autonomous Systems with an Interpretability Approach

cs.SE · 2025-09-20 · unverdicted · novelty 6.0

CoCoMagic applies constrained cooperative co-evolution to metamorphic and differential testing to find up to 287% more distinct behavioral divergences in an end-to-end ADS than baseline search methods.

Understanding the Challenges and Opportunities of Generative AI Apps: An Empirical Study

cs.SE · 2025-06-19 · unverdicted · novelty 6.0

Large-scale review mining of 1M+ comments from 171 Gen-AI apps using an LLM framework reveals top topics plus three opportunities and three challenges for developers.

Line Drawings using LightBenders: Authoring and Illuminating

cs.GR · 2026-06-21 · unverdicted · novelty 5.0

Hardware-software architecture for drone swarms illuminating line drawings mid-air, including Blender add-on, SVG import, and user study validating misalignment tolerance.

Exploring Statistical Change Point Detection Techniques for Performance Anomaly Detection at Mozilla

cs.SE · 2026-06-16 · unverdicted · novelty 5.0

Ensemble voting strategies for change point detection improve F1-score by 11% over Mozilla's T-test method on a new ground-truth dataset of 174 performance time series annotated by practitioners.

citing papers explorer

Showing 39 of 39 citing papers.

Information on hidden birth events restores identifiability in phylodynamic inference

q-bio.PE · 2026-04-20 · unverdicted · none · ref 3

Hidden birth event information restores identifiability to time-dependent birth-death phylodynamic models; mutation-at-birth models make sequences sufficient to recover it.

Scale-Calibrated Median-of-Means for Robust Distributed Principal Component Analysis

stat.ME · 2026-05-20 · unverdicted · none · ref 278

Proposes a scale-calibrated median-of-means estimator for robust aggregation of distributed PCA estimates on the product of Euclidean space and Grassmann manifold.

Solving linear-rate ODE hierarchies (like master equations) using closures and operator splitting

math.NA · 2026-05-16 · unverdicted · none · ref 111

For linear-rate master equations the generating function admits an exact composition-multiplier representation whose Taylor coefficients on any finite window are obtained from a closed lower-triangular ODE of size 2(N+1), independent of the truncation cap N; the same closure is combined with Strang–

Zombies in Alternate Realities: The Afterlife of Domain Names in DNS Integrations

cs.CR · 2026-05-07 · unverdicted · none · ref 31

Zombie domain linkages persist after ownership changes in DNS integrations at rates of 3% in Web PKI, 24% in ENS, and 15% in Maven Central, with validate-once designs accumulating long-term risks while per-use validation prevents them.

Profile Likelihood Inference for Anisotropic Hyperbolic Wrapped Normal Models on Hyperbolic Space

math.ST · 2026-05-01 · unverdicted · none · ref 213

The profile maximum likelihood estimator for the location in anisotropic hyperbolic wrapped normal models is strongly consistent, asymptotically normal, and attains the Hájek-Le Cam minimax lower bound under squared geodesic loss.

A Large-Scale Empirical Study of AI-Generated Code in Real-World Repositories

cs.SE · 2026-03-28 · unverdicted · none · ref 23

A large-scale study of real-world repositories finds that AI-generated code differs from human-written code in complexity, structural traits, defect indicators, and commit-level activity patterns.

The Dynamical Origin of Millimetre-Sized Sporadic Meteoroids

astro-ph.EP · 2026-06-25 · unverdicted · none · ref 69

Dynamical simulations show mm-sized meteoroids impacting Earth below 17 km/s are mostly asteroidal if released in the last 150-200 kyr, with cometary fraction rising above that speed and dominating above 27 km/s.

The Chandra-Gaia Catalog of Counterparts: Resolving ambiguous Gaia matches to X-ray sources in the Chandra Source Catalog using Machine Learning

astro-ph.IM · 2026-06-17 · unverdicted · none · ref 65

A LightGBM classifier trained on NWAY Bayesian matches identifies true Chandra-Gaia counterparts for 113k X-ray sources, flags 7k ambiguous cases, and attributes half of 20k separation-only matches to chance coincidences, validated at 95% on COUP without positional features.

The Dynamics of Human and AI-Generated Language: How Semantics Fluctuates across Different Timescales

cs.CL · 2026-06-09 · unverdicted · none · ref 66

Develops ACW-based semantic timescale features showing longer autocorrelation windows associate with generic vocabulary and shorter ones with specific words in both human and LLM speech, with the pattern abolished by randomizing word order and timing.

The Nonparametric Kiefer-Weiss Problem

math.ST · 2026-05-29 · unverdicted · none · ref 1

The nonparametric Kiefer-Weiss problem is solved by deriving an optimal stopping policy based on a two-dimensional statistic (likelihood ratio plus expected remaining sample size) whose randomization rule maps the likelihood ratio to an integer sample size.

Comparing Two Categorical Gini Correlations with Applications to Classification Problems

stat.ME · 2026-05-18 · unverdicted · none · ref 21

Proposes an inferential framework to test differences in categorical Gini correlations for predictor importance in classification, establishing asymptotic normality and consistency while accommodating unequal dimensions and dependence.

Semantic Feature Segmentation for Interpretable Predictive Maintenance in Complex Systems

cs.AI · 2026-05-14 · unverdicted · none · ref 9

Semantic segmentation decomposes monitoring features into canonical and residual components that concentrate fault-predictive information while preserving operational meaning in predictive maintenance.

Pattern-based tests for two-dimensional copulas

math.ST · 2026-05-13 · unverdicted · none · ref 28

A functional central limit theorem for pattern frequencies in 2D samples enables nonparametric goodness-of-fit, two-sample, and symmetry tests for copulas, with bootstrap critical values and parametric examples.

What Software Engineering Looks Like to AI Agents? -- An Empirical Study of AI-Only Technical Discourse on MoltBook

cs.SE · 2026-05-08 · unverdicted · none · ref 10 · 2 links

Empirical analysis of 4707 MoltBook posts shows AI-only technical discourse focuses on security, trust, and abstract topics while lacking concrete runtime and project details found in human GitHub discussions.

Scale selection for geometric medians on product manifolds

math.ST · 2026-05-08 · unverdicted · none · ref 258

Joint location-scale minimization for geometric medians on product manifolds degenerates to marginal medians, and three new scale-selection methods restore identifiability with asymptotic guarantees.

Sketching the Readout of Large Language Models for Scalable Data Attribution and Valuation

cs.LG · 2026-04-17 · unverdicted · none · ref 51

RISE applies CountSketch to dual lexical and semantic channels derived from output-layer gradient outer products, cutting data attribution storage by up to 112x and enabling retrospective and prospective influence analysis on LLMs up to 32B parameters.

A test for normality based on self-similarity

stat.ME · 2026-04-04 · conditional · none · ref 7

The SSTN detects non-normality by tracking how the standardized empirical characteristic function changes under repeated self-similarity transformations, with the null distribution calibrated by Monte Carlo simulation.

Do Good, Stay Longer? Temporal Patterns and Predictors of Newcomer-to-Core Transitions in Conventional OSS and OSS4SG

cs.SE · 2026-01-30 · unverdicted · none · ref 36

OSS4SG projects retain contributors at 2.2X higher rates with 19.6% higher core status probability than conventional OSS, and a late-spike temporal pattern enables faster core achievement (21 weeks) than early intensive contributions.

Automation and Reuse Practices in GitHub Actions Workflows: A Practitioner's Perspective

cs.SE · 2026-01-16 · conditional · none · ref 37

A survey of 419 practitioners shows strong reliance on reusable GitHub Actions for core CI/CD tasks but limited adoption of reusable workflows, with copy-pasting remaining common due to versioning and trust issues.

Sensitivity Analysis on the Sphere and a Spherical ANOVA Decomposition

math.NA · 2025-12-29 · unverdicted · none · ref 13

A parity-augmented ANOVA decomposition is established for functions on the sphere using orthogonal bases to capture geometry-induced variable dependencies.

Constrained Co-evolutionary Metamorphic Differential Testing for Autonomous Systems with an Interpretability Approach

cs.SE · 2025-09-20 · unverdicted · none · ref 96

CoCoMagic applies constrained cooperative co-evolution to metamorphic and differential testing to find up to 287% more distinct behavioral divergences in an end-to-end ADS than baseline search methods.

Understanding the Challenges and Opportunities of Generative AI Apps: An Empirical Study

cs.SE · 2025-06-19 · unverdicted · none · ref 48

Large-scale review mining of 1M+ comments from 171 Gen-AI apps using an LLM framework reveals top topics plus three opportunities and three challenges for developers.

Line Drawings using LightBenders: Authoring and Illuminating

cs.GR · 2026-06-21 · unverdicted · none · ref 27

Hardware-software architecture for drone swarms illuminating line drawings mid-air, including Blender add-on, SVG import, and user study validating misalignment tolerance.

Exploring Statistical Change Point Detection Techniques for Performance Anomaly Detection at Mozilla

cs.SE · 2026-06-16 · unverdicted · none · ref 54

Ensemble voting strategies for change point detection improve F1-score by 11% over Mozilla's T-test method on a new ground-truth dataset of 174 performance time series annotated by practitioners.

Deep Slice Interpolation for Reducing Through-Plane Anisotropy and Noise in Head CT

eess.IV · 2026-06-08 · unverdicted · none · ref 38

Deep learning system synthesizes intermediate head CT slices to halve through-plane anisotropy while providing implicit denoising, outperforming baselines on structural metrics.

DXA-Derived Skeletal Phenotypes and Hip Fracture Risk: A Backdoor-Adjusted Causal Analysis

q-bio.QM · 2026-05-29 · unverdicted · none · ref 25

Backdoor-adjusted ATEs on 21,098 UK Biobank participants showed total femur BMC and BMD with the largest hip fracture risk reductions (-0.0047 per SD), and adding the top 11 phenotypes to clinical variables raised AUC to 0.842 versus FRAX 0.709.

Social Policy of Large Language Models: How GPT, Claude, DeepSeek and Grok Allocate Social Budgets in Spain and Germany

cs.CY · 2026-05-11 · unverdicted · none · ref 12

Four LLMs exhibit a shared implicit social policy that under-allocates pensions by a factor of three and over-allocates housing by four compared to OECD budgets, with only Claude showing meaningful response to national context.

Reduced-Precision Stochastic Simulation for Mathematical Biology

q-bio.QM · 2026-05-01 · unverdicted · none · ref 35

Mixed-precision SSA with stochastic rounding preserves ensemble statistics across five biological models while cutting memory use by 2-4x and delivering up to 1.5x CPU speedup.

Buoyancy-dependent induced flow by vertically migrating swimmers

physics.flu-dyn · 2025-12-10 · conditional · none · ref 4

Induced flow velocity from vertically migrating Artemia salina swarms scales with the product of swimmer number and buoyancy-driven density difference.

Exploring the Grassroots Understanding and Practices of Collective Memory Co-Contribution in a University Community

cs.HC · 2025-12-09 · unverdicted · none · ref 52

University community members split between reflecting on past events or recording today's experiences as future history when contributing to collective memory, yielding design considerations for community platforms.

Optimal control of the future via prospective learning with control

stat.ML · 2025-11-11 · unverdicted · none · ref 8

Prospective Learning with Control proves ERM asymptotically achieves the Bayes optimal policy in non-stationary reset-free settings and outperforms time-aware RL on a 1D foraging benchmark.

Network Inequality through Preferential Attachment, Triadic Closure, and Homophily

physics.soc-ph · 2025-09-27 · unverdicted · none · ref 33

PATCH model simulations show preferential attachment and homophily increase segregation and degree inequality while triadic closure reduces segregation but amplifies overall inequality, and the model accounts for observed gender disparities in 50 years of physics and CS collaboration networks.

Cumulative Advantage of Brokerage in Academia

physics.soc-ph · 2024-07-16 · unverdicted · none · ref 33

Early brokerage in academic networks produces cumulative advantage in later participation and career impact for physicists, equally for men and women.

sumoITScontrol: Traffic Controller Collection for SUMO Traffic Simulations

eess.SY · 2026-04-25 · unverdicted · none · ref 46

sumoITScontrol provides a collection of traffic controllers for SUMO simulations and stresses the importance of variance-aware evaluation methods for reproducible research.

Time-dependent structural equation modeling of fans' football fever using activity tracking data during the 2025 DFB Cup final

stat.AP · 2026-04-22 · unverdicted · none · ref 236

Football fever in spectators follows a V-shaped time course captured as a latent process from heart rate and stress data via time-dependent structural equation modeling.

Empirical Comparison of Agent Communication Protocols for Task Orchestration

cs.AI · 2026-03-24 · unverdicted · none · ref 34

This work provides an empirical comparison of tool integration, multi-agent delegation, and hybrid architectures for LLM task orchestration, measuring response time, context consumption, cost, error recovery, and implementation complexity.

Community-Based Early-Stage Chronic Kidney Disease Screening using Explainable Machine Learning for Low-Resource Settings

cs.LG · 2026-01-03 · unverdicted · none · ref 58

Machine learning models trained on Bangladeshi community data achieve 89-90% balanced accuracy for early CKD detection using few accessible features, outperforming traditional screening tools and generalizing across external datasets from India, UAE, and Bangladesh.

Bayesian inference for compact binary coalescences with BILBY: Validation and application to the first LIGO--Virgo gravitational-wave transient catalogue

astro-ph.IM · 2020-06-01 · unverdicted · none · ref 151

BILBY is validated on simulated compact binary signals and reproduces the eleven GWTC-1 results with configuration and output files provided for reproduction.

Multi-Task Optimization over Networks of Tasks

cs.LG · 2026-04-23 · unreviewed · ref 36
