Abstract
Historical linguists have identified regularities in the process of historic sound change. The comparative method utilizes those regularities to reconstruct proto-words based on observed forms in daughter languages. Can this process be efficiently automated? We address the task of proto-word reconstruction, in which the model is exposed to cognates in contemporary daughter languages and has to predict the proto-word in the ancestor language. We provide a novel dataset for this task, encompassing over 8,000 comparative entries, and show that neural sequence models outperform conventional methods applied to this task so far. Error analysis reveals variability in the ability of neural models to capture different phonological changes, correlating with the complexity of the changes. Analysis of learned embeddings reveals that the models learn phonologically meaningful generalizations, corresponding to well-attested phonological shifts documented by historical linguistics.