Can Large Language Models Invent Algorithms to Improve Themselves?

Abstract

Large Language Models (LLMs) have shown remarkable performance improvementsand are rapidly gaining adoption in industry. However, the methods forimproving LLMs are still designed by humans, which restricts the invention ofnew model-improving algorithms to human expertise and imagination. To addressthis, we propose the Self-Developing framework, which enables LLMs toautonomously generate and learn model-improvement algorithms. In thisframework, the seed model generates, applies, and learns model-improvingalgorithms, continuously improving both the seed model and the algorithmsthemselves. In mathematical reasoning tasks, Self-Developing not only createsmodels that surpass the seed model but also consistently outperforms modelscreated using human-designed algorithms. Additionally, these LLM-discoveredalgorithms demonstrate strong effectiveness, including transferability toout-of-domain models.

Quick Read (beta)

loading the full paper ...