Dataset of artefacts for machine learning applications in astronomy
Dataset of artefacts for machine learning applications in astronomy
Accurate photometry in astronomical surveys is challenged by image artefacts, which affect measurements and degrade data quality. Due to the large amount of available data, this task is increasingly handled using machine learning algorithms, which often require a labelled training set to learn data patterns. We present an expert-labelled dataset of 1127 artefacts with 1213 labels from 26 fields in ZTF DR3, along with a complementary set of nominal objects. The artefact dataset was compiled using the active anomaly detection algorithm PineForest, developed by the SNAD team. These datasets can serve as valuable resources for real-bogus classification, catalogue cleaning, anomaly detection, and educational purposes. Both artefacts and nominal images are provided in FITS format in two sizes (28 x 28 and 63 x 63 pixels). The datasets are publicly available for further scientific applications.
Sreevarsha Sreejith、Maria V. Pruzhinskaya、Alina A. Volnova、Vadim V. Krushinsky、Konstantin L. Malanchev、Emille E. O. Ishida、Anastasia D. Lavrukhina、Timofey A. Semenikhin、Emmanuel Gangler、Matwey V. Kornilov、Vladimir S. Korolev
天文学
Sreevarsha Sreejith,Maria V. Pruzhinskaya,Alina A. Volnova,Vadim V. Krushinsky,Konstantin L. Malanchev,Emille E. O. Ishida,Anastasia D. Lavrukhina,Timofey A. Semenikhin,Emmanuel Gangler,Matwey V. Kornilov,Vladimir S. Korolev.Dataset of artefacts for machine learning applications in astronomy[EB/OL].(2025-04-10)[2025-05-28].https://arxiv.org/abs/2504.08053.点此复制
评论