Automated Classification of Celestial Objects Using Machine Learning

Authors

  • Muhammad Aiman Haris Bin Muhamad Suwaid Department of Computer Science, International Islamic University Malaysia, Kuala Lumpur, Malaysia
  • Muhammad ‘Ilyas Amierrullah Ab Karim Department of Computer Science, International Islamic University Malaysia, Kuala Lumpur, Malaysia
  • Raini Hassan Department of Computer Science, International Islamic University Malaysia, Kuala Lumpur, Malaysia
  • Azni Abdul Aziz Department of Physics, Kulliyyah of Science, International Islamic University Malaysia, Pahang, Malaysia

DOI:

https://doi.org/10.31436/ijpcc.v11i2.537

Keywords:

SDSS, Astronomy, Machine Learning, Random Forest, Classification

Abstract

The swift expansion of astronomical data requires the automated classification of celestial objects for practical use. Because of its manual and monotonous nature, classification is more prone to errors and is rapidly losing its viability. This study performs the classification of stars, galaxies, and quasars from SDSS (Sloan Digital Sky Survey) data using the Random Forest, XGBoost, Decision Tree, Gradient Boosting, Linear SVM, KNN, and Logistic Regression. In order to fix the imbalance in the data, the SMOTE algorithm was used, making the model more robust. Random Forest topped the models with their accuracy and reliability across many multiple data releases, hitting an astonishing 99.12% accuracy in SDSS DR18. This work shows how much machine deep learning can change astronomical surveys, providing readily available, precise techniques that are much more effective than manual approaches. The results add towards the development of astrophysics while simultaneously meeting Sustainable Development Goal 9 on fostering innovation through the need for infrastructure

References

I. B. Vavilova, D. V. Dobrycheva, M. Yu. Vasylenko, A. A. Elyiv, O. V. Melnyk, and V. Khramtsov, “Machine learning technique for morphological classification of galaxies from the SDSS,” Astronomy and Astrophysics, vol. 648, p. A122, Feb. 2021, doi: 10.1051/0004-6361/202038981.

M. Ashai, R. G. Mukherjee, S. P. Mundharikar, V. D. Kuanr, and R. Harikrishnan, “Classification of Astronomical Objects using KNN Algorithm,” in Smart innovation, systems and technologies, 2022, pp. 377–387. doi: 10.1007/978-981-16-9669-5_34.

M. Wierzbi?ski, P. P?awiak, M. Hammad, and U. R. Acharya, “Development of accurate classification of heavenly bodies using novel machine learning techniques,” Soft Computing, vol. 25, no. 10, pp. 7213–7228, Mar. 2021, doi: 10.1007/s00500-021-05687-4.

G. M. Hungund, “Computational Astronomy,” 2020. doi: 10.31979/etd.vavn-e3xc.

J.-L. Solorio-Ramírez, R. Jiménez-Cruz, Y. Villuendas-Rey, and C. Yáñez-Márquez, “Random forest Algorithm for the Classification of Spectral Data of Astronomical Objects,” Algorithms, vol. 16, no. 6, p. 293, Jun. 2023, doi: 10.3390/a16060293.

SDSS, “Data release 18 - SDSS,” SDSS - Mapping the Universe, Jul. 26, 2023. https://www.sdss.org/dr18/

A. T. Hassina, "Using machine learning to classify and localize stellar objects,"2023. https://www.researchgate.net/publication/373219104_Using_machine_learning_to_classify_and_localize_stellar_objects

E. Yoshino, B. Juarto, and F. I. Kurniadi, “Exploring XGBOOST as an effective machine learning algorithm for stellar spectral data classification in astronomy,” 2020 International Seminar on Application for Technology of Information and Communication (iSemantic), pp. 187–191, Sep. 2023, doi: 10.1109/isemantic59612.2023.10295329.

S. Sharma and R. Sharma, “Classification of astronomical objects using various machine learning techniques,” in Lecture notes in electrical engineering, 2019, pp. 275–283. doi: 10.1007/978-981-15-0372-6_21.

M. A. T. Rony, D. S. a. A. Reza, R. Mostafa, and Md. A. Ullah, “Application of machine learning to interpret predictability of different models: Approach to Classification for SDSS sources,” 2021 International Conference on Electronics, Communications and Information Technology (ICECIT), pp. 1–4, Sep. 2021, doi: 10.1109/icecit54077.2021.9641238.

K. Smita, Sneha, B. Hafeeza, and D. Sandhya, “Machine learning for classification of stars, galaxies through exploring the SDSS Space Observation Dataset,” Journal of Interdisciplinary Cycle Research, vol. 16, no. 1, pp. 497–510, Jan. 2024.

GeeksforGeeks, “Logistic regression in machine learning,” GeeksforGeeks, Jun. 20, 2024. https://www.geeksforgeeks.org/understanding-logistic-regression/

GeeksforGeeks, “Decision tree,” GeeksforGeeks, Jan. 16, 2025. https://www.geeksforgeeks.org/decision-tree/

“XGBoost Documentation — xgboost 2.1.3 documentation.” https://xgboost.readthedocs.io/en/stable/

GeeksforGeeks, “XGBoost,” GeeksforGeeks, Jan. 16, 2025. https://www.geeksforgeeks.org/xgboost/

GeeksforGeeks, "Gradient boosting in ML," GeeksforGeeks, Mar. 31, 2023. https://www.geeksforgeeks.org/ml-gradient-boosting/

Analytics Vidhya, “Guide to K-Nearest Neighbors (KNN) Algorithm [2025 Edition],” Analytics Vidhya, Nov. 18, 2024. https://www.analyticsvidhya.com/blog/2018/03/introduction-k-neighbours-algorithm-clustering/

"Machine learning random forest algorithm," JavatPoint. https://www.javatpoint.com/machine-learning-random-forest-algorithm

"Support vector machine (SVM) algorithm," JavatPoint. https://www.javatpoint.com/machine-learning-support-vector-machine-algorithm

GeeksforGeeks, “Evaluation metrics in machine learning,” GeeksforGeeks, Jul. 03, 2024.

https://www.geeksforgeeks.org/metrics-for-machine-learning-model/

"What is the genetic algorithm?" MATLAB & Simulink. https://www.mathworks.com/help/gads/what-is-the-genetic-algorithm.html

J. Brownlee, "SMOTE Oversampling for Imbalanced Classification,", Machine Learning Mastery. https://machinelearningmastery.com/smote-oversampling-for-imbalanced-classification/

“Sloan Digital Sky Survey - DR18,” Kaggle, Jul. 29, 2023. https://www.kaggle.com/datasets/diraf0/sloan-digital-sky-survey-dr18/data

“Home—SkyServer SDSS.” https://skyserver.sdss.org/dr18

“StandardScaler,” Scikit-learn. https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html

“Sloan Digital Sky Survey - DR17,” Kaggle, Jan. 14, 2023. https://www.kaggle.com/datasets/brsdincer/sloan-digital-sky-survey-dr17

“SDSS DR16 Plus (Sloan Digital Sky Survey),” Kaggle, Dec. 15, 2020. https://www.kaggle.com/datasets/kriegersaurusrex/sdss-dr16-sloan-digital-sky-survey

“Sloan Digital Sky Survey DR14,” Kaggle, Sep. 20, 2018. https://www.kaggle.com/datasets/lucidlenn/sloan-digital-sky-survey

“Sloan Digital Sky Survey DR12 Server Data,” Kaggle, Jan. 03, 2019. https://www.kaggle.com/datasets/ashishsaxena2209/sloan-digital-sky-survey-dr12-server-data

A. Gholamy, V. Kreinovich, and O. Kosheleva, “Why 70/30 or 80/20 relation between training and testing sets: A pedagogical explanation,” ScholarWorks@UTEP.

https://scholarworks.utep.edu/cs_techrep/1209

A. Chatzi and O. Doody, “The one-way ANOVA test explained,” Nurse Researcher, vol. 31, no. 3, pp. 8–14, Sep. 2023, doi: 10.7748/nr.2023.e1885.

O. J. P. Cruz, C. A. M. Pinto, S. G. N. Jiménez, L. J. C. Escobedo, and M. M. Outeiro, “Analyzing supervised machine learning models for classifying astronomical objects using GAIA DR3 spectral features,” Applied Sciences, vol. 14, no. 19, p. 9058, Oct. 2024, doi: 10.3390/app14199058.

M. F. Er and T. T. Bilgin, “Performance comparison of supervised machine learning methods in classifying celestial objects,” Black Sea Journal of Engineering and Science, vol. 7, no. 5, pp. 960–970, Sep. 2024, doi: 10.34248/bsengineering.1517904.

Y. Zhang, “Classification of quasars, galaxies, and stars by using XGBOOST in SDSS-DR16,” International Conference on Machine Learning and Knowledge Engineering, Feb. 2022, doi: 10.1109/mlke55170.2022.00058.

F. Z. Zeraatgari et al., “Machine learning-based photometric classification of galaxies, quasars, emission-line galaxies, and stars,” arXiv (Cornell University), Jan. 2023, doi: 10.48550/arxiv.2311.02951.

Downloads

Published

30-07-2025

How to Cite

Bin Muhamad Suwaid, M. A. H., Ab Karim, M. ‘Ilyas A., Hassan, R., & Abdul Aziz, A. (2025). Automated Classification of Celestial Objects Using Machine Learning. International Journal on Perceptive and Cognitive Computing, 11(2), 22–41. https://doi.org/10.31436/ijpcc.v11i2.537

Issue

Section

Articles

Most read articles by the same author(s)

<< < 1 2