Abstract
We present a computational analysis of cognate effects on the spontaneouslinguistic productions of advanced non-native speakers. Introducing a largecorpus of highly competent non-native English speakers, and using a set ofcarefully selected lexical items, we show that the lexical choices ofnon-natives are affected by cognates in their native language. This effect isso powerful that we are able to reconstruct the phylogenetic language tree ofthe Indo-European language family solely from the frequencies of specificlexical items in the English of authors with various native languages. Wequantitatively analyze non-native lexical choice, highlighting cognatefacilitation as one of the important phenomena shaping the language ofnon-native speakers.