Knowledge Retrieval-Based Intelligent Question and Answer Generation Framework for Education

Authors

  • Anay Pardasani Prospect High school
  • Mrs Maleski
  • Sayantan Roy

DOI:

https://doi.org/10.47611/jsrhs.v14i1.8684

Keywords:

question generation, educational technology, large language models, retrieval augmented generation, examination preparation

Abstract

The paper introduces a Knowledge Retrieval-Based Intelligent Question and Answer Generation Framework for Education, leveraging Retrieval Augmented Generation (RAG) to enhance Large Language Models (LLMs) in producing high-quality, contextually relevant examination questions and answers across subjects. The framework addresses challenges in ensuring comprehensive subject coverage, educational standards, and evaluation metrics. Key components include Optical Character Recognition (OCR), data chunking, vectorization using multilingual BERT, and custom retrieval functions. The RAG system's effectiveness is evaluated using the RAGAs metric, covering metrics like faithfulness, answer relevancy, context recall, and context precision. Comparative results reveal OpenAI models surpassing Google's Gemini in coherence and context relevance due to differing architectures and training methods. Concluding, the paper highlights potential for both large and small language models in tailored educational applications, noting limitations such as hallucinations, high resource demands, and contextual drift.

Downloads

Download data is not yet available.

References or Bibliography

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. *arXiv*. https://doi.org/10.48550/arXiv.1706.03762

Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang (2020). Retrieval Augmented Generation for knowledge intensive NLP tasks. https://doi.org/10.48550/arXiv.2005.11401

Shahul Es, Jithin James, Luis Espinosa-Anke, Steven Schockaert (2023). RAGAS: Automated evaluation for Retrieval Augmented Generation. https://doi.org/10.48550/arXiv.2309.15217

Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova (2019). BERT: Pre Training of Deep Bidirectional Transformer for Language understanding. https://doi.org/10.48550/arXiv.1810.04805

P. Joshi, A. Gupta, P. Kumar and M. Sisodia, "Robust Multi Model RAG Pipeline For Documents Containing Text, Table & Images," 2024 3rd International Conference on Applied Artificial Intelligence and Computing (ICAAIC), Salem, India, 2024, pp. 993-999, doi: 10.1109/ICAAIC60222.2024.10574972.

S. Kukreja, T. Kumar, V. Bharate, A. Purohit, A. Dasgupta and D. Guha, "Vector Databases and Vector Embeddings-Review," 2023 International Workshop on Artificial Intelligence and Image Processing (IWAIIP), Yogyakarta, Indonesia, 2023, pp. 231-236, doi: 10.1109/IWAIIP58158.2023.10462847.

Archit Parnami and Minwoo Lee (2022). Learning from Few Examples: A Summary of Approaches to Few-Shot Learning.https://doi.org/10.48550/arXiv.2203.04291

P. Omrani, A. Hosseini, K. Hooshanfar, Z. Ebrahimian, R. Toosi and M. Ali Akhaee, "Hybrid Retrieval-Augmented Generation Approach for LLMs Query Response Enhancement," 2024 10th International Conference on Web Research (ICWR), Tehran, Iran, Islamic Republic of, 2024, pp. 22-26, doi: 10.1109/ICWR61162.2024.10533345.

S. Selva Kumar, A. K. M. A. Khan, I. A. Banday, M. Gada and V. V. Shanbhag, "Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease Remediation," 2024 International Conference on Emerging Technologies in Computer Science for Interdisciplinary Applications (ICETCS), Bengaluru, India, 2024, pp. 1-6, doi: 10.1109/ICETCS61022.2024.10543859.

Jules White, Quchen Fu, Sam Hays, Michael Sandborn, Carlos Olea, Henry Gilbert, Ashraf Elnashar, Jesse Spencer-Smith, Douglas C. Schmidt (2023). A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT. https://doi.org/10.48550/arXiv.2302.11382

Published

02-28-2025

How to Cite

Pardasani, A., Maleski, K., & Roy, S. (2025). Knowledge Retrieval-Based Intelligent Question and Answer Generation Framework for Education. Journal of Student Research, 14(1). https://doi.org/10.47611/jsrhs.v14i1.8684

Issue

Section

HS Research Projects