Analyzing the GPT-3 AI’s Ability to Predict the Answer to Algebraical Questions

Authors

  • Daniil Novak Castro Valley High School

DOI:

https://doi.org/10.47611/jsrhs.v12i1.3998

Keywords:

GPT-3, math problems, T-test, artificial intelligence

Abstract

AI Algorithms have been getting increasingly smart and complex since their conception. The Most Modern AI’s currently available are able to solve incredibly complex problems and are even able to solve abstract problems. It is only a matter of time until AI is able to supersede humans in Mathematical Calculations. (MGI et al. 2022). In this paper we took the most advanced AI currently available for commercial usage, GPT-3, and programmed it to use randomly generated mathematical input in order to test its solving abilities. By feeding this input, we evaluated whether the AIs ability to solve problems is solely based on complexity. We, however, found out that AI was also influenced by the type of operation. The distance and similarity metrics of many different AI Generated answers were taken and then compared with the actual ones.We conducted a series of T-tests to determine which results are statistically significant. We found out that the type of arithmetical expression in the problem, which was either addition, multiplication, division or subtraction, was significant (p=.0000027), suggesting that the type of problem matters more than previously expected.

Downloads

Download data is not yet available.

References or Bibliography

: “Ai's Golden Age.” Artificial Intelligence, UBS, https://www.ubs.com/microsites/artificial-intelligence/en/golden-age.html.

:Sagar, Ram. “What's next for AI: Solving Advanced Math Problems.” Analytics India Magazine, Analytics India Mag, 15 Feb. 2021, https://analyticsindiamag.com/whats-next-for-ai-solving-advanced-math-problems/.

:Freeland, Devon. “Modern Algebra.” How Has Algebra's Methods Changed Over Time?, https://algebrasmethods.weebly.com/modern-algebra.html#:~:text=Algebra%20has%20really%20affected%20the%20human%20race%20more,huge%20scale%20to%20create%20extremely%20complex%20computer%20system.

:Prompt Engineering GPT-3 to Solve Project Euler Problems. https://towardsdatascience.com/prompt-engineering-gpt-3-to-solve-project-euler-problems-1ff3b12f7d56.

:Khanam, Sana, et al. “Artificial Intelligence Surpassing Human Intelligence: Factual or Hoax.” OUP Academic, Oxford University Press, 2 Jan. 2020, https://academic.oup.com/comjnl/article/64/12/1832/5688168.

:Gkionaki, Melina. “How Does Artificial Intelligence Work?” European Investment Bank, European Investment Bank, 9 Feb. 2022,

Brown, et .al “Language Models are Few-Shot Learners”

“Competitive Programming with AlphaCode.” RSS, https://www.deepmind.com/blog/competitive-programming-with-alphacode.

Kathryn Rich, Aman Yadav “Applying Levels of Abstraction to Mathematics Word Problems”

Sumrak, Jesse. “What Is GPT-3: How It Works and Why You Should Care.” Twilio Blog, Twilio, 27 July 2021, https://www.twilio.com/blog/what-is-gpt-3.

“What Is GPT-3 and Why Is It Important?” RSS, https://www.genei.io/blog/what-is-gpt-3-and-why-is-it-important?genei_segment_id=327ae72a-198b-4acc-85d7-133b7ca31445.

:https://pypi.org/project/bert/

:https://pypi.org/project/statsmodels/

:DeepMath - Deep Sequence Models for Premise Selection. https://arxiv.org/pdf/1606.04442.pdf.

:Prabhakaran, S. (2022) Cosine similarity - understanding the math and how it works? (with python), Machine Learning Plus. Available at: https://www.machinelearningplus.com/nlp/cosine-similarity/ (Accessed: November 12, 2022).

:Možina, Martin, et al. “Argument Based Machine Learning.” Artificial Intelligence, Elsevier, 29 Apr. 2007, https://www.sciencedirect.com/science/article/pii/S0004370207000690.

:“A Comparison of GPT-3 and Existing Conversational AI Solutions.” HackerNoon, https://hackernoon.com/a-comparison-of-gpt-3-and-existing-conversational-ai-solutions-0q2z3z9x.

:Heaven, Will Douglas. “OpenAI's New Language Generator GPT-3 Is Shockingly Good-and Completely Mindless.” MIT Technology Review, MIT Technology Review, 20 Oct. 2021, https://www.technologyreview.com/2020/07/20/1005454/openai-machine-learning-language-generator-gpt-3-nlp/.

Published

02-28-2023

How to Cite

Novak, D. (2023). Analyzing the GPT-3 AI’s Ability to Predict the Answer to Algebraical Questions. Journal of Student Research, 12(1). https://doi.org/10.47611/jsrhs.v12i1.3998

Issue

Section

HS Research Articles