Nowadays astronomical catalogs contain patterns of hundreds of millions of objects with data volumes in the terabyte range. Upcoming projects will gather such patterns for several billions of objects with peta- and exabytes of data. From a machine learning point of view, these settings often yield unsupervised, semi-supervised, or fully supervised tasks, with large training and huge test sets. Recent studies have demonstrated the effectiveness of prototype-based learning schemes such as simple nearest neighbor models. However, although being among the most computationally efficient methods for such settings (if implemented via spatial data structures), applying these models on all remaining patterns in a given catalog can easily take hours or even days. In this work, we investigate the practical effectiveness of GPU-based approaches to accelerate such nearest neighbor queries in this context. Our experiments indicate that carefully tuned implementations of spatial search structures for such multi-core devices can significantly reduce the practical runtime. This renders the resulting frameworks an important algorithmic tool for current and upcoming data analyses in astronomy.
SEEK ID: https://publications.h-its.org/publications/448
Research Groups: Astroinformatics
Publication type: InProceedings
Book Title: KI 2013: Advances in Artificial Intelligence
Editors: Timm, Ingo J. and Thimm, Matthias
Publisher: Springer Berlin Heidelberg
Citation: In KI 2013: Advances in Artificial Intelligence, pp. 86–97, Springer Berlin Heidelberg, Berlin, Heidelberg
Date Published: 2013
Registered Mode: imported from a bibtex file
Views: 6363
Created: 18th Oct 2019 at 09:35
Last updated: 5th Mar 2024 at 21:23
This item has not yet been tagged.
None