Data-driven photometric redshift estimation from type Ia supernovae light curves
Autor: | Felipe M F de Oliveira, Marcelo Vargas dos Santos, Ribamar R R Reis |
---|---|
Přispěvatelé: | Institut de Recherches sur les lois Fondamentales de l'Univers (IRFU), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay |
Rok vydání: | 2022 |
Předmět: |
transients: supernovae
Cosmology and Nongalactic Astrophysics (astro-ph.CO) software: data analysis FOS: Physical sciences Astronomy and Astrophysics techniques: photometric Space and Planetary Science cosmology: observations [PHYS.PHYS.PHYS-INS-DET]Physics [physics]/Physics [physics]/Instrumentation and Detectors [physics.ins-det] cosmology: miscellaneous [PHYS.ASTR]Physics [physics]/Astrophysics [astro-ph] Astrophysics - Instrumentation and Methods for Astrophysics Instrumentation and Methods for Astrophysics (astro-ph.IM) Astrophysics - Cosmology and Nongalactic Astrophysics |
Zdroj: | Monthly Notices of the Royal Astronomical Society Monthly Notices of the Royal Astronomical Society, 2022, 518 (2), pp.2385-2397. ⟨10.1093/mnras/stac3202⟩ |
ISSN: | 0035-8711 1365-2966 |
DOI: | 10.48550/arxiv.2212.14668 |
Popis: | Redshift measurement has always been a constant need in modern astronomy and cosmology. And as new surveys have been providing an immense amount of data on astronomical objects, the need to process such data automatically proves to be increasingly necessary. In this article, we use simulated data from the Dark Energy Survey, and from a pipeline originally created to classify supernovae, we developed a linear regression algorithm optimized through novel automated machine learning (AutoML) frameworks achieving an error score better than ordinary data pre-processing methods when compared with other modern algorithms (such as xgboost). Numerically, the photometric prediction RMSE of type Ia supernovae events was reduced from 0.16 to 0.09 and the RMSE of all supernovae types decreased from 0.20 to 0.14. Our pipeline consists of four steps: through spectroscopic data points we interpolate the light curve using Gaussian process fitting algorithm, then using a wavelet transform we extract the most important features of such curves; in sequence we reduce the dimensionality of such features through principal component analysis, and in the end we applied super learning techniques (stacked ensemble methods) through an AutoML framework dedicated to optimize the parameters of several different machine learning models, better resolving the problem. As a final check, we obtained probability distribution functions (PDFs) using Gaussian kernel density estimations through the predictions of more than 50 models trained and optimized by AutoML. Those PDFs were calculated to replicate the original curves that used SALT2 model, a model used for the simulation of the raw data itself. |
Databáze: | OpenAIRE |
Externí odkaz: |