Extended Radio Galaxies in EMU: A Comparative Look at Source-Finding Techniques

Andrew M. Hopkins; Duncan Farrah; Gary Segal; Heinz Andernach; Lachlan J. Barnes; Michael J. I. Brown; Nikhel Gupta; O. Ivy Wong; Sarah V. White; Stanislav S. Shabala

arxiv: 2603.10579 · v2 · submitted 2026-03-11 · 🌌 astro-ph.GA

Extended Radio Galaxies in EMU: A Comparative Look at Source-Finding Techniques

Lachlan J. Barnes , Andrew M. Hopkins , Yjan Gordon , Nikhel Gupta , Gary Segal , Heinz Andernach , Michael J. I. Brown , Duncan Farrah

show 3 more authors

Stanislav S. Shabala Sarah V. White O. Ivy Wong

This is my paper

Pith reviewed 2026-05-15 13:42 UTC · model grok-4.3

classification 🌌 astro-ph.GA

keywords extended radio sourcessource findingEMU surveyDRAGNsmachine learningradio galaxiescomplex emissionGAMA field

0 comments

The pith

Three automated detection methods recover nearly all extended radio sources in EMU-G09 but capture mostly distinct subsets.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper applies three automatic approaches to detect complex extended radio emission in the EMU observations of the GAMA 09 field. These are DRAGNHunter for identifying likely DRAGNs from component catalogues, a coarse-grained complexity metric to highlight regions of complex emission, and RG-CAT, a machine learning pipeline trained on radio sources. The three methods together recover nearly all extended sources yet identify largely distinct, partially overlapping subsets, with only 375 sources found by all three. A sympathetic reader would care because wide-field surveys like EMU generate enormous data volumes where complete catalogues of extended sources are needed to study galaxy evolution and active galactic nuclei. The comparison prepares for scaling these techniques across future EMU releases by showing that complementary methods are required.

Core claim

The three automatic approaches to detect complex radio emission in EMU-G09—DRAGNHunter, coarse-grained complexity, and RG-CAT—recover nearly all extended sources but identify largely distinct, partially-overlapping subsets, with only 375 sources identified by all finders. This demonstrates that a combination of complementary techniques will be required to achieve a complete census of extended radio sources in future large-scale surveys.

What carries the argument

Comparison of three source-finding methods (DRAGNHunter for DRAGNs, coarse-grained complexity metric, and RG-CAT machine learning pipeline) applied to EMU-G09 data to evaluate their detection of extended radio sources.

If this is right

No single automated method captures all extended radio sources in wide-field surveys.
Complementary techniques are essential for a complete census in future EMU data releases.
Each method detects unique aspects of source morphology and complexity.
Partial overlap between the methods indicates that diverse source types require tailored detection strategies.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Ensemble voting across the three methods could improve overall completeness for large surveys.
Cross-matching with multi-wavelength data might reveal extended sources missed by all radio-only approaches.
Efficient implementation of these combined methods will be needed to process the full EMU dataset volume.
Similar comparative evaluations could optimize source finding in other upcoming radio surveys.

Load-bearing premise

The claim that the three methods together recover nearly all extended sources assumes an independent way exists to determine the true total number of such sources in the field.

What would settle it

Identification of a substantial population of extended radio sources in the EMU-G09 field that none of the three methods detected would falsify the near-complete recovery claim.

read the original abstract

Extended radio sources present unique challenges for automated detection and classification in wide-field radio surveys. With current surveys such as the Evolutionary Map of the Universe (EMU), robust and scalable methods are essential to identify and catalogue these complex sources. We apply three automatic approaches to detect complex radio emission in EMU observations of the Galaxy And Mass Assembly (GAMA) 09 field (EMU-G09) in order to evaluate their relative strengths and limitations in preparation for large-scale application across future EMU data releases. These include DRAGNHunter, designed to detect likely DRAGNs (Double Radio sources associated with Active Galactic Nuclei) from a component catalogue; coarse-grained complexity, a metric designed to highlight regions of complex emission; and RG-CAT, a machine learning pipeline trained on radio sources identified in the EMU pilot survey. We find that together, the three methods recover nearly all extended sources in EMU-G09 but identify largely distinct, partially-overlapping subsets, with only 375 sources identified by all finders. This demonstrates that a combination of complementary techniques will be required to achieve a complete census of extended radio sources in future large-scale surveys.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper runs three finders on EMU-G09 and shows they catch mostly different extended sources, with only 375 in common, but the claim that their union recovers nearly all such sources rests on an unstated reference set.

read the letter

The key point is that DRAGNHunter, the coarse-grained complexity measure, and RG-CAT applied to the same EMU-G09 field return largely separate lists of extended radio sources, intersecting at just 375 objects. The authors conclude that pipelines for the full EMU survey will need to combine techniques to get a complete census. That overlap number is the concrete new result here, and it is worth knowing for anyone planning source extraction on wide-field radio data. The work is a direct side-by-side test on real survey observations, which is the right way to expose how the methods differ in practice. It gives a clear picture of complementarity without overclaiming any single method as superior. The main weakness is the assertion that the three together recover nearly all extended sources. The abstract gives no description of the reference catalog, visual inspection protocol, or completeness calculation used to support that statement. Without an independent way to know the true total, the union result is hard to interpret beyond the tautology that the methods together find what they find. The stress-test note identifies this gap correctly, and nothing in the provided text resolves it. The paper is aimed at teams building detection pipelines for EMU or similar surveys. A reader who needs to know how different automated approaches behave on complex radio galaxies will find the overlap counts useful. It is solid enough on the empirical comparison to merit peer review, though the completeness section will need expansion before acceptance.

Referee Report

2 major / 2 minor

Summary. The manuscript evaluates three automated techniques—DRAGNHunter, coarse-grained complexity, and RG-CAT—for detecting extended radio sources in the EMU-G09 field. It reports that the union of their outputs recovers nearly all such sources while identifying largely distinct subsets, with only 375 sources common to all three, and concludes that complementary methods will be required for complete censuses in future EMU releases.

Significance. If the completeness assessment is robust, the result would demonstrate the partial overlap of current detection algorithms for complex radio emission and supply concrete evidence that single-method pipelines are insufficient for wide-field surveys, directly informing source-finding strategies for EMU and similar projects.

major comments (2)

[Abstract] Abstract and §3: The central claim that the three methods 'recover nearly all extended sources' requires an independent reference catalog or validation set whose construction, size, and completeness are not described; without this, the statement that the union captures 'nearly all' cannot be evaluated and reduces to a description of the union itself.
[§4.2] §4.2 and Table 2: The reported overlap statistics (only 375 sources common to all three) and the conclusion of complementarity depend on the cross-matching radius, flux thresholds, and handling of fragmented components; these parameters and any sensitivity tests are not specified, preventing assessment of whether the distinct-subset finding is robust.

minor comments (2)

[Figure 3] Figure 3: The Venn diagram would benefit from explicit labeling of the total number of unique sources recovered by the union and the field area used for normalization.
[§2.3] §2.3: The training set size and feature list for RG-CAT should be stated explicitly rather than referenced only to the pilot survey.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thorough review and constructive feedback on our manuscript. We address each of the major comments below and have revised the manuscript to improve clarity and robustness of the presented results.

read point-by-point responses

Referee: [Abstract] Abstract and §3: The central claim that the three methods 'recover nearly all extended sources' requires an independent reference catalog or validation set whose construction, size, and completeness are not described; without this, the statement that the union captures 'nearly all' cannot be evaluated and reduces to a description of the union itself.

Authors: We agree with the referee that additional details regarding the independent reference catalog or validation set are required to substantiate the claim that the three methods recover nearly all extended sources. In the revised manuscript, we will expand §3 to describe the construction, size, and completeness of this validation set. The abstract has also been updated to better contextualize this claim. revision: yes
Referee: [§4.2] §4.2 and Table 2: The reported overlap statistics (only 375 sources common to all three) and the conclusion of complementarity depend on the cross-matching radius, flux thresholds, and handling of fragmented components; these parameters and any sensitivity tests are not specified, preventing assessment of whether the distinct-subset finding is robust.

Authors: The referee correctly notes that the overlap statistics and complementarity conclusion depend on specific parameters that were not detailed in the original submission. We have revised §4.2 to specify the cross-matching radius, flux thresholds, and method for handling fragmented components. Sensitivity tests on these parameters have been added to demonstrate that the finding of largely distinct subsets is robust. Table 2 has been updated with the relevant information. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical comparison of detection methods on observational data

full rationale

The paper conducts a direct empirical comparison of three independent source-finding techniques (DRAGNHunter, coarse-grained complexity, and RG-CAT) applied to EMU-G09 observational data. No mathematical derivations, equations, fitted parameters renamed as predictions, or self-referential steps exist in the provided text. The central claim of recovering nearly all extended sources via their union is an observational statement about detection overlap (with 375 sources common to all), not a reduction by construction to inputs or a self-citation load-bearing premise. The analysis is self-contained against the survey data without circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is an observational comparison study with no new theoretical constructs; no free parameters, axioms, or invented entities are introduced.

pith-pipeline@v0.9.0 · 5549 in / 1215 out tokens · 57727 ms · 2026-05-15T13:42:05.351353+00:00 · methodology

Extended Radio Galaxies in EMU: A Comparative Look at Source-Finding Techniques

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)