Machine Learning for Exoplanet Detection: A Comparative Analysis Using Kepler Data
Machine Learning for Exoplanet Detection: A Comparative Analysis Using Kepler Data
The discovery of exoplanets has expanded our understanding of planetary systems and opened new avenues for astronomical research. In this study, we present a machine learning (ML) framework for exoplanet identification using a time-series photometric dataset from the Kepler Space Telescope, comprising 3,198 flux measurements across 5,074 stars. We investigate the performance of four supervised classification algorithms, namely Random Forest, k-Nearest Neighbors (KNN), Decision Tree, and Logistic Regression, using a comprehensive set of evaluation metrics such as accuracy, precision, recall, F1-score, Area Under the Receiver Operating Characteristic Curve (AUC-ROC), confusion matrices, and learning curves. Among the models, Random Forest achieves the highest accuracy (99.8\%) and near-perfect F1-scores, demonstrating superior generalization and robustness. KNN also performs strongly, achieving 99.3\% accuracy, while Decision Tree demonstrates moderate performance with 97.1\% accuracy, and Logistic Regression trails behind with the lowest accuracy and generalization at 95.8\%. Notably, the application of the Synthetic Minority Over-sampling Technique (SMOTE) significantly improves performance across all models by addressing class imbalance. These findings underscore the effectiveness of ensemble-based machine learning techniques, particularly Random Forest, in handling large volumes of photometric data for automated exoplanet detection. This approach holds significant potential for implementation at ground-based facilities, such as the Iranian National Observatory (INO), where such extensive and precise datasets can further advance exoplanet discovery and characterization efforts.
Reihaneh Karimi、Mahdiyar Mousavi-Sadr、Mohammad H. Zhoolideh Haghighi、Fatemeh S. Tabatabaei
天文学计算技术、计算机技术
Reihaneh Karimi,Mahdiyar Mousavi-Sadr,Mohammad H. Zhoolideh Haghighi,Fatemeh S. Tabatabaei.Machine Learning for Exoplanet Detection: A Comparative Analysis Using Kepler Data[EB/OL].(2025-08-13)[2025-08-24].https://arxiv.org/abs/2508.09689.点此复制
评论