Once Upon a Time: Interactive Learning for Storytelling with Small Language Models

  • 2025-09-19 07:45:34
  • Jonas Mayer Martins, Ali Hamza Bashir, Muhammad Rehan Khalid, Lisa Beinborn
  • 0

Abstract

Children efficiently acquire language not just by listening, but byinteracting with others in their social environment. Conversely, large languagemodels are typically trained with next-word prediction on massive amounts oftext. Motivated by this contrast, we investigate whether language models can betrained with less data by learning not only from next-word prediction but alsofrom high-level, cognitively inspired feedback. We train a student model togenerate stories, which a teacher model rates on readability, narrativecoherence, and creativity. By varying the amount of pretraining before thefeedback loop, we assess the impact of this interactive learning on formal andfunctional linguistic competence. We find that the high-level feedback ishighly data efficient: With just 1 M words of input in interactive learning,storytelling skills can improve as much as with 410 M words of next-wordprediction.

 

Quick Read (beta)

loading the full paper ...