Enhancing Automated Autism Detection With Improved Word Embeddings

Authors

  • Yao Chen The Quarry Lane School
  • Darnell Granberry

DOI:

https://doi.org/10.47611/jsrhs.v14i1.8845

Keywords:

Artificial intelligence, Machine learning, Autism spectrum disorder, Early diagnosis, Screening, Word embeddings, Hyperparameter optimization

Abstract

Autism Spectrum Disorder (ASD) is a developmental disorder that affects a significant amount of people. Unfortunately, ASD manifests in a large number of ways, meaning that diagnosing ASD is both time-consuming and inaccurate, which results in many children with ASD not being diagnosed until later childhood. One advantage of an early diagnosis is that it allows for early intervention, which typically leads to much better results. We investigated multiple different machine learning models as potential methods of predicting ASD in children from text segments provided by the child's caregiver. Two promising models are multilayer perceptrons and logistic regression. We investigated different hyperparameters for multilayer perceptrons, such as the number of layers and size of hidden layers. We then conclude that text descriptions of a toddler's behavior given by a caregiver are highly accurate for predicting autism when combined with a multilayer perceptron.

Downloads

Download data is not yet available.

References or Bibliography

American Psychiatric Association, Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition. American Psychiatric Association, 2013. doi: 10.1176/appi.books.9780890425596.

Chistol, Mihaela; Danubianu, Mirela, “Automated Detection of Autism Spectrum Disorder Symptoms using Text Mining and Machine Learning for Early Diagnosis” International Journal of Advanced Computer Science and Applications(IJACSA), 15(2), 2024. http://dx.doi.org/10.14569/IJACSA.2024.0150264

Hachemi, Rania; Degha, Houssem Eddine (2024), “TASD-Dataset: Text-based Early Autism Spectrum Disorder Detection Dataset for Toddlers”, Mendeley Data, V2, doi: 10.17632/87s2br3ptb.2

Mikolov, Tomas; Chen, Kai; Corrado, Greg; Dean, Jeffrey (16 January 2013). "Efficient Estimation of Word Representations in Vector Space". arXiv:1301.3781

“Logistic Regression in Machine Learning” GeeksforGeeks, 9 May 2017, www.geeksforgeeks.org/understanding-logistic-regression/. Accessed 26 Oct. 2024.

“Random Forest Algorithm in Machine Learning” GeeksforGeeks, 22 Feb. 2024, www.geeksforgeeks.org/random-forest-algorithm-in-machine-learning/. Accessed 26 Oct. 2024.

“Classification Using Sklearn Multi-layer Perceptron” GeeksforGeeks, 11 Oct. 2023, www.geeksforgeeks.org/classification-using-sklearn-multi-layer-perceptron/. Accessed 26 Oct. 2024.

“Boosting in Machine Learning | Boosting and AdaBoost” GeeksforGeeks, 3 May 2019, www.geeksforgeeks.org/boosting-in-machine-learning-boosting-and-adaboost/. Accessed 26 Oct. 2024.

“Support Vector Machine (SVM) Algorithm” GeeksforGeeks, 20 Jan. 2021, www.geeksforgeeks.org/support-vector-machine-algorithm/. Accessed 26 Oct. 2024.

Li Yang, Abdallah Shami, On hyperparameter optimization of machine learning algorithms: Theory and practice, Neurocomputing, Volume 415, 2020, Pages 295-316, ISSN 0925-2312, https://doi.org/10.1016/j.neucom.2020.07.061.

Erickson, Bradley J., and Felipe Kitamura. "Magician’s corner: 9. Performance metrics for machine learning models." Radiology: Artificial Intelligence 3.3 (2021): e200126. https://doi.org/10.1148/ryai.2021200126

Published

02-28-2025

How to Cite

Chen, Y., & Granberry, D. (2025). Enhancing Automated Autism Detection With Improved Word Embeddings. Journal of Student Research, 14(1). https://doi.org/10.47611/jsrhs.v14i1.8845

Issue

Section

HS Research Projects