Stellar Classification based on Various Star Characteristics using Machine Learning Algorithms

Authors

  • Roberto Tamez The Woodlands High School
  • Sophia Barton Inspirit AI and Stanford University

DOI:

https://doi.org/10.47611/jsrhs.v12i1.4375

Keywords:

Stellar, Stellar Classification, Artificial Intelligence, Machine Learning

Abstract

The task of stellar classification can be tedious and lengthy when done manually. One can expedite stellar classification by creating an artificial intelligence model to automate the process. As we as a species continue to explore the frontier of the observable universe, we should seek to automate time-intensive problems like stellar classification. The current stellar classification model serves to effectively categorize stars for research purposes regarding their distribution around the universe, so automating the development of this resource would allow professionals to allocate more time to explore the bounds of our current understanding of space and the universe. After finding and analyzing a dataset containing numerical and categorical features, a supervised learning approach was then used to train and test different models on their ability to classify the stars. A Decision Tree Classifier, Random Forest Classifier, Ridge Classifier, and Support Vector Classifier were trained and tested. The most successful models were the Decision Tree Classifier and Random Forest Classifier, each with about a 94 percent prediction accuracy across different accuracy metrics on the test data. Despite some drawbacks in regard to the availability of usable data, four models were trained and two were proven to be consistently and successfully accurate. Any future attempts at developing models for stellar classification should concentrate more on gathering data as to have a more thoroughly trained set of models.

Downloads

Download data is not yet available.

References or Bibliography

Encyclopædia Britannica, inc. (n.d.). Stellar classification. Encyclopædia Britannica. Retrieved February 18, 2023, from http://www.britannica.com/science/stellar-classification

Xiao-Qing, W., & Jin-Meng, Y. (2021). Classification of star/galaxy/QSO and star spectral types from LAMOST data release 5 with Machine Learning Approaches. Chinese Journal of Physics, 69, 303–311. https://doi.org/10.1016/j.cjph.2020.03.008

Paulycyclic>. (n.d.). Paul Bacher: Notebooks expert. Kaggle. Retrieved February 18, 2023, from http://www.kaggle.com/paulbacher/code

Sklearn.tree.decisiontreeclassifier. scikit. (n.d.). Retrieved February 18, 2023, from https://scikit-learn.org/stable/modules/generated/sklearn.tree.DecisionTreeClassifier.html

Decision tree. CORP-MIDS1 (MDS). (2022, July 11). Retrieved February 19, 2023, from https://www.mastersindatascience.org/learning/machine-learning-algorithms/decision-tree/

Decision tree algorithm in Machine Learning - Javatpoint. www.javatpoint.com. (n.d.). Retrieved February 19, 2023, from http://www.javatpoint.com/machine-learning-decision-tree-classification-algorithm

Sklearn.ensemble.randomforestclassifier. scikit. (n.d.). Retrieved February 19, 2023, from https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html

Meltzer, R. (2021, July 15). What is Random Forest? [beginner's guide + examples]. CareerFoundry. Retrieved February 19, 2023, from https://careerfoundry.com/en/blog/data-analytics/what-is-random-forest/

Chauhan, A. (2021, February 23). Random Forest classifier and its hyperparameters. Medium. Retrieved February 19, 2023, from https://medium.com/analytics-vidhya/random-forest-classifier-and-its-hyperparameters-8467bec755f6

Sklearn.linear_model.ridgeclassifier. scikit. (n.d.). Retrieved February 19, 2023, from https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.RidgeClassifier.html

Starmer, J. (2018, September 24). Regularization part 1: Ridge (L2) regression. YouTube. Retrieved February 19, 2023, from http://www.youtube.com/watch?v=Q81RR3yKn30

Sklearn.svm.SVC. scikit. (n.d.). Retrieved February 19, 2023, from https://scikit-learn.org/stable/modules/generated/sklearn.svm.SVC.html

How SVM works. (n.d.). Retrieved February 19, 2023, from http://www.ibm.com/docs/en/spss-modeler/saas?topic=models-how-svm-works

Gandhi, R. (2018, July 5). Support Vector Machine - introduction to machine learning algorithms. Medium. Retrieved February 19, 2023, from https://towardsdatascience.com/support-vector-machine-introduction-to-machine-learning-algorithms-934a444fca47

Published

02-28-2023

How to Cite

Tamez Villarreal, J., & Barton, S. (2023). Stellar Classification based on Various Star Characteristics using Machine Learning Algorithms. Journal of Student Research, 12(1). https://doi.org/10.47611/jsrhs.v12i1.4375

Issue

Section

HS Research Projects