Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation

#1 Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation [PDF¹] [Copy] [Kimi¹⁰] [REL]

Authors: Yucheng Chu, Peng He, Hang Li, Haoyu Han, Kaiqi Yang, Yu Xue, Tingting Li, Joseph Krajcik, Jiliang Tang

Short answer assessment is a vital component of science education, allowing evaluation of students' complex three-dimensional understanding. Large language models (LLMs) that possess human-like ability in linguistic tasks are increasingly popular in assisting human graders to reduce their workload. However, LLMs' limitations in domain knowledge restrict their understanding in task-specific requirements and hinder their ability to achieve satisfactory performance. Retrieval-augmented generation (RAG) emerges as a promising solution by enabling LLMs to access relevant domain-specific knowledge during assessment. In this work, we propose an adaptive RAG framework for automated grading that dynamically retrieves and incorporates domain-specific knowledge based on the question and student answer context. Our approach combines semantic search and curated educational sources to retrieve valuable reference materials. Experimental results in a science education dataset demonstrate that our system achieves an improvement in grading accuracy compared to baseline LLM approaches. The findings suggest that RAG-enhanced grading systems can serve as reliable support with efficient performance gains.

Subject: Computation and Language

Publish: 2025-04-07 17:17:41 UTC

2504.05276

#1 Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation [PDF1] [Copy] [Kimi10] [REL]

#1 Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation [PDF¹] [Copy] [Kimi¹⁰] [REL]