BegoniaGPT: Cultivating the large language model to be an exceptional K-12 English teacher.
Journal:
Neural networks : the official journal of the International Neural Network Society
Published Date:
Sep 1, 2025
Abstract
Large language models (LLMs) have taken the natural language processing (NLP) domain by storm, and their transformative momentum has surged into the domain of education, giving rise to a nascent wave of education-tailored LLMs. Despite their potential to facilitate homework assistance, such LLMs fall short in the fine-grained domain of elementary and secondary school (i.e., K-12) education. They often indiscriminately incorporate broad knowledge across diverse disciplines, overlooking the stark disparities in cognitive demands and curricular content among elementary, middle, and high school phases. To fill this gap, we propose a new English teaching LLM, called BegoniaGPT, which discards irrelevant knowledge from other disciplines, and shapes the general LLM to be an exceptional English teacher by emphasizing four key aspects: foundational English knowledge, professional proficiency, international vision, and psychological support. In particular, we build a large-scale English corpus named EngCorpus, including 35,000 instructions and conversations tailored towards three roles: students, teachers, and parents, as well as 30,000 emotional conversations. By continued pre-training and supervised fine-tuning the general LLM on the carefully curated EngCorpus and aligning it with reinforcement learning with expert feedback, BegoniaGPT could provide refined, specialized, personalized and compassionate English education. Through a comprehensive empirical comparison on four English benchmarks, e.g., E-EVAL, 2023-2024 the PEP edition of entrance examination for middle school in China (EEM), 2024 the PEP edition of entrance examination for high school (EEH), 2024 Gaokao, National Paper I (Eng-Gaokao), we show that BegoniaGPT achieves the state-of-the-art performance over 10 SOTA LLMs. Further Claude 3-opus and expert manual evaluations further validate BegoniaGPT's teaching advantages.