Enhancing Automated Autism Detection With Improved Word Embeddings
DOI:
https://doi.org/10.47611/jsrhs.v14i1.8845Keywords:
Artificial intelligence, Machine learning, Autism spectrum disorder, Early diagnosis, Screening, Word embeddings, Hyperparameter optimizationAbstract
Autism Spectrum Disorder (ASD) is a developmental disorder that affects a significant amount of people. Unfortunately, ASD manifests in a large number of ways, meaning that diagnosing ASD is both time-consuming and inaccurate, which results in many children with ASD not being diagnosed until later childhood. One advantage of an early diagnosis is that it allows for early intervention, which typically leads to much better results. We investigated multiple different machine learning models as potential methods of predicting ASD in children from text segments provided by the child's caregiver. Two promising models are multilayer perceptrons and logistic regression. We investigated different hyperparameters for multilayer perceptrons, such as the number of layers and size of hidden layers. We then conclude that text descriptions of a toddler's behavior given by a caregiver are highly accurate for predicting autism when combined with a multilayer perceptron.
Downloads
References or Bibliography
American Psychiatric Association, Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition. American Psychiatric Association, 2013. doi: 10.1176/appi.books.9780890425596.
Chistol, Mihaela; Danubianu, Mirela, “Automated Detection of Autism Spectrum Disorder Symptoms using Text Mining and Machine Learning for Early Diagnosis” International Journal of Advanced Computer Science and Applications(IJACSA), 15(2), 2024. http://dx.doi.org/10.14569/IJACSA.2024.0150264
Hachemi, Rania; Degha, Houssem Eddine (2024), “TASD-Dataset: Text-based Early Autism Spectrum Disorder Detection Dataset for Toddlers”, Mendeley Data, V2, doi: 10.17632/87s2br3ptb.2
Mikolov, Tomas; Chen, Kai; Corrado, Greg; Dean, Jeffrey (16 January 2013). "Efficient Estimation of Word Representations in Vector Space". arXiv:1301.3781
“Logistic Regression in Machine Learning” GeeksforGeeks, 9 May 2017, www.geeksforgeeks.org/understanding-logistic-regression/. Accessed 26 Oct. 2024.
“Random Forest Algorithm in Machine Learning” GeeksforGeeks, 22 Feb. 2024, www.geeksforgeeks.org/random-forest-algorithm-in-machine-learning/. Accessed 26 Oct. 2024.
“Classification Using Sklearn Multi-layer Perceptron” GeeksforGeeks, 11 Oct. 2023, www.geeksforgeeks.org/classification-using-sklearn-multi-layer-perceptron/. Accessed 26 Oct. 2024.
“Boosting in Machine Learning | Boosting and AdaBoost” GeeksforGeeks, 3 May 2019, www.geeksforgeeks.org/boosting-in-machine-learning-boosting-and-adaboost/. Accessed 26 Oct. 2024.
“Support Vector Machine (SVM) Algorithm” GeeksforGeeks, 20 Jan. 2021, www.geeksforgeeks.org/support-vector-machine-algorithm/. Accessed 26 Oct. 2024.
Li Yang, Abdallah Shami, On hyperparameter optimization of machine learning algorithms: Theory and practice, Neurocomputing, Volume 415, 2020, Pages 295-316, ISSN 0925-2312, https://doi.org/10.1016/j.neucom.2020.07.061.
Erickson, Bradley J., and Felipe Kitamura. "Magician’s corner: 9. Performance metrics for machine learning models." Radiology: Artificial Intelligence 3.3 (2021): e200126. https://doi.org/10.1148/ryai.2021200126
Published
How to Cite
Issue
Section
Copyright (c) 2025 Yao Chen; Darnell Granberry

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.


