On the creation of narrow AI: hierarchy and nonlocality of neural network skills

Abstract

We study the problem of creating strong, yet narrow, AI systems. While recentAI progress has been driven by the training of large general-purpose foundationmodels, the creation of smaller models specialized for narrow domains could bevaluable for both efficiency and safety. In this work, we explore twochallenges involved in creating such systems, having to do with basicproperties of how neural networks learn and structure their representations.The first challenge regards when it is possible to train narrow models fromscratch. Through experiments on a synthetic task, we find that it is sometimesnecessary to train networks on a wide distribution of data to learn certainnarrow skills within that distribution. This effect arises when skills dependon each other hierarchically, and training on a broad distribution introduces acurriculum which substantially accelerates learning. The second challengeregards how to transfer particular skills from large general models into smallspecialized models. We find that model skills are often not perfectly localizedto a particular set of prunable components. However, we find that methods basedon pruning can still outperform distillation. We investigate the use of aregularization objective to align desired skills with prunable components whileunlearning unnecessary skills.

Quick Read (beta)

loading the full paper ...