Abstract
This study explores the capacity of large language models (LLMs) for explicit learning, a process involving the assimilation of metalinguistic explanations to carry out language tasks. Using constructed languages generated by cryptographic means as controlled test environments, we designed experiments to assess an LLM's ability to explicitly learn and apply grammar rules. Our results demonstrate that while LLMs possess a measurable capacity for explicit learning, this ability diminishes as the complexity of the linguistic phenomena at hand increases. Supervised fine-tuning on chains of thought significantly enhances LLM performance but struggles to generalize to typologically novel or more complex linguistic features. These findings point to the need for more diverse training sets and alternative fine-tuning strategies to further improve explicit learning by LLMs.