Towards Responsible Development of Generative AI for Education: An Evaluation-Driven Approach

Here we present our work collaborating with learners and educators to translate high-level principles from learning science into a pragmatic set of seven diverse educational benchmarks, spanning quantitative, qualitative, automatic and human evaluations; and to develop a new set of fine-tuning datasets to improve the pedagogical capabilities of Gemini, introducing LearnLM-Tutor.