Improving the performance of photometric regression models via massive parallel feature selection

Abstract:

Regression tasks are common in astronomy, for instance, the estimation of the redshift or the metallicity of galaxies. Generating regression models, however, is often hindered by the heterogeneity of the available input catalogs, which leads to missing data and/or features of differing explanatory power. In this work, we show how simple but effective feature selection schemes from data mining can be used to significantly improve the performance of regression models for photometric redshift and metallicity estimation (even without any particular knowledge of the input parameters' physical properties). Our framework tests huge amounts of possible feature combinations. Since corresponding (single-core) implementations are computationally very demanding, we make use of the massive computational resources provided by nowadays graphics processing units to significantly reduce the overall runtime. This renders an exhaustive search possible, as we demonstrate in our experimental evaluation. We conclude the work by discussing further applications of our approach in the context of large-scale astronomical learning settings.

SEEK ID: https://publications.h-its.org/publications/447

Research Groups: Astroinformatics

Publication type: InProceedings

Book Title: Astronomical Data Analysis Software and Systems XXIII

Editors: N. Manset and P. Forshay

Citation: Astronomical Data Analysis Software and Systems XXIII, 485:425

Date Published: 1st May 2014

Registered Mode: imported from a bibtex file

Authors: K. L. Polsterer, F. Gieseke, Christian Igel, Tomotsugu Goto

help Submitter
Activity

Views: 5835

Created: 18th Oct 2019 at 09:33

Last updated: 5th Mar 2024 at 21:23

help Tags

This item has not yet been tagged.

help Attributions

None

Powered by
(v.1.14.2)
Copyright © 2008 - 2023 The University of Manchester and HITS gGmbH