Preprint / Version 1

Construction of the Chi-square Goodness-of-Fit Test

##article.authors##

  • JunYang (Michael) Ma

Keywords:

Chi-square, Goodness-of-Fit test, Hypothesis testing, Multinomial distribution, Maximum likelihood estimation, Fisher information, Central limit theorem, Monte Carlo Simulation, Mendelian inheritance, Overparametrisation

Abstract

The goal of statistical tests is to use data to learn about the nature of the system from which it is collected. It is an important bridge between raw data and interpreting its meaning/drawing conclusions from it. One recent use of statistical tests is in COVID-19 tests. The results were analysed to determine whether the patient is tested positive or negative, and statistics is used to determine the false positive rate. In this paper we will focus on one of the most commonly used statistical tests, the Chi-square test, and why it works. We will first discuss the general procedure of hypothesis testing, followed by the construction of an appropriate estimator for statistical models before finally, finding the limiting distribution of the estimator. Using these properties, we will construct the Chi-square distribution and discuss its relevance to hypothesis testing. Python simulations using the Chi-square test will be generated to investigate the effect of multiple variables on its statistical power. 

References or Bibliography

Biologywise.com, Codominance Explained with Examples.

Geeks for Geeks.org, Greeshmanala, Plotting Multiple Bar Charts Using Matplotlib in Python.

Larry Wasserman, All of Statistics, Springer Science and Business Media, 2004.

MIT Open CourseWare, 4, Parametric Inference (cont.) and Maximum Likelihood Estimation, 2017.

Nicholas J. Higham, What is the Sherman-Morrison-Woodbury Formula?

Nikhil Kumar, Graph Plotting in Python. Sep 1, 2021.

Numpy.Random.Multinomial, 2008.

Statology.org, Zach: How to Plot a Chi-Square Distribution in Python, 2021.

Victor Chernozhukov, Nonlinear Econometrics, 2007.

Wikipedia, Square Root of a Matrix, 2005.

Wikipedia, Total Variation Distance of Probability Measures, 2004.

Downloads

Posted

10-24-2023

Categories