hub
Data analysis recipes: Fitting a model to data
14 Pith papers cite this work.
abstract
We go through the many considerations involved in fitting a model to data, using as an example the fit of a straight line to a set of points in a two-dimensional plane. Standard weighted least-squares fitting is only appropriate when there is a dimension along which the data points have negligible uncertainties, and another along which all the uncertainties can be described by Gaussians of known variance; these conditions are rarely met in practice. We consider cases of general, heterogeneous, and arbitrarily covariant two-dimensional uncertainties, and situations in which there are bad data (large outliers), unknown uncertainties, and unknown but expected intrinsic scatter in the linear relationship being fit. Above all we emphasize the importance of having a "generative model" for the data, even an approximate one. Once there is a generative model, the subsequent fitting is non-arbitrary because the model permits direct computation of the likelihood of the parameters or the posterior probability distribution. Construction of a posterior probability distribution is indispensable if there are "nuisance parameters" to marginalize away.
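As an illustration of the simplest case the abstract describes (negligible uncertainties in x, Gaussian uncertainties of known variance in y), the following is a minimal sketch of a weighted least-squares straight-line fit, together with the Gaussian log-likelihood implied by that generative model. The function names and the example numbers are assumptions made here for illustration; they are not taken from the paper.

```python
import numpy as np


def weighted_least_squares_line(x, y, sigma_y):
    """Fit y = m*x + b assuming exact x and Gaussian y-errors of known sigma_y.

    Returns (params, cov), where params = [b, m] and cov is the 2x2
    covariance matrix of those parameters.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    sigma_y = np.asarray(sigma_y, dtype=float)

    # Design matrix: a column of ones (intercept) and a column of x (slope).
    A = np.vstack([np.ones_like(x), x]).T
    # Inverse variances act as per-point weights (diagonal data covariance).
    inv_var = 1.0 / sigma_y**2

    # Normal equations: params = (A^T C^-1 A)^-1 A^T C^-1 y
    At_Cinv = A.T * inv_var
    cov = np.linalg.inv(At_Cinv @ A)
    params = cov @ (At_Cinv @ y)
    return params, cov


def log_likelihood(params, x, y, sigma_y):
    """Gaussian log-likelihood of the line parameters [b, m] given the data."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    sigma_y = np.asarray(sigma_y, dtype=float)
    b, m = params
    resid = y - (m * x + b)
    return -0.5 * np.sum(resid**2 / sigma_y**2 + np.log(2.0 * np.pi * sigma_y**2))


if __name__ == "__main__":
    # Synthetic data (hypothetical values chosen only for this demo).
    rng = np.random.default_rng(0)
    x = np.linspace(0.0, 10.0, 20)
    sigma_y = rng.uniform(0.5, 1.5, size=x.size)
    y = 2.0 * x + 1.0 + sigma_y * rng.normal(size=x.size)

    params, cov = weighted_least_squares_line(x, y, sigma_y)
    print("b, m =", params)
    print("parameter uncertainties =", np.sqrt(np.diag(cov)))
    print("log-likelihood at best fit =", log_likelihood(params, x, y, sigma_y))
```

Under these assumptions the weighted least-squares solution is also the maximum of the likelihood, which is why the two functions agree on the best-fit line. The more general situations listed in the abstract (two-dimensional uncertainties, outliers, intrinsic scatter) need a different likelihood and are not covered by this sketch.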
citing papers explorer
- Probabilistic Spectral Reconstruction of Trans-Neptunian Objects from Sparse Photometry: A Framework for Taxonomy, Survey Optimization, and Outlier Detection
  Probabilistic PCA latent-space model with Bayesian inference reconstructs TNO near-IR spectra from photometry, achieving 95% credible-interval coverage and supporting taxonomy plus survey optimization.
- AMIGO: a Data-Driven Calibration of the JWST Interferometer
  AMIGO is an end-to-end differentiable forward model of JWST AMI that corrects detector systematics to recover high-precision astrometry and detect close high-contrast companions.
- Black-hole mass estimation through accretion disk spectral fitting for high-redshift blazars
  MCMC fits to accretion disk SEDs of 23 high-z blazars give black hole masses of 10^8-10^10 solar masses and Eddington ratios 0.04-1, showing that ignoring IGM attenuation overestimates masses with larger bias at higher redshift.
- emcee: The MCMC Hammer
  emcee delivers a stable Python implementation of the affine-invariant ensemble MCMC algorithm that requires minimal hand-tuning and supports easy parallelization.
- A Census of Na D-traced neutral ISM and outflows at $0.6<z<4$
  A JWST census detects neutral ISM absorption in 76 of 309 galaxies at 0.6<z<4 and outflows in 26, indicating AGN-driven neutral outflows dominate in quiescent systems at cosmic noon.
- Probabilistic Analysis of Event-Mode Experimental Data
  Direct probabilistic modeling of raw event-mode scattering data claims greater efficiency and lower systematic error than histogram-plus-least-squares methods.
- A 3D Dust Map Based on Gaia, Pan-STARRS 1 and 2MASS
  A new 3D dust reddening map with finer distance resolution, a spatial correlation prior, and Gaia-based distances covering the sky north of -30 degrees declination out to several kiloparsecs.
- Inferring the star-formation histories of massive quiescent galaxies with BAGPIPES: Evidence for multiple quenching mechanisms
  BAGPIPES fitting of 9289 massive quiescent galaxies shows most SFHs rise gradually then quench in 1-2 Gyr, with faster quenching at z>1 and slower at z<1, interpreted as multiple AGN feedback and gas-supply mechanisms.
- JWST Advanced Deep Extragalactic Survey (JADES) Data Release 5: stellar population catalogue for galaxies in GOODS-N and GOODS-S
  JADES DR5 delivers a public catalog of Bayesian-inferred stellar masses, SFRs, SFHs, dust, metallicities, and AGN contributions for ~500k galaxies via Prospector with an evolving SFMS prior.
- The DESIRED electron temperature relations in star-forming regions of the local Universe
  A homogeneous analysis of 699 spectra yields Te-Te relations for ions including N II, O II, O III, S II, S III and Ar III, with lower dispersions for relations involving Te([N II]).
- Constraining the Molecular Kennicutt-Schmidt Relation with Multi-Transition CO Observations of Nearby Galaxies
  Multi-transition CO observations reveal that the star formation-molecular gas relation becomes more linear for denser gas tracers, implying a volume density power-law index of approximately 1.5.
- Markov chain Monte Carlo (MCMC) based Likelihood Extraction of Chiral-Odd Compton Form Factors from Deeply Virtual Exclusive Experiments
  MCMC-based joint likelihood extraction constrains chiral-odd CFFs from DVMP cross-sections and asymmetries.
- Correlation Between X-Ray and Cosmic Neutrino Sources: From Obscured AGN to Blazars
  Bayesian calibration of neutrino-hard X-ray luminosity relation on six AGN shows seven blazars are consistent, with joint permutation test indicating non-random association at 3.23 sigma.
- Stellar Population Inference with Prospector
  Prospector is a flexible code for Bayesian inference of stellar population parameters from multi-wavelength photometry and spectroscopy via forward modeling and posterior sampling.