Pretrained multilingual models (PMMs) enable zero-shot learning viacross-lingual transfer, performing best for languages seen during pretraining.While methods exist to improve performance for unseen languages, they havealmost exclusively been evaluated using amounts of raw text only available fora small fraction of the world's languages. In this paper, we evaluate theperformance of existing methods to adapt PMMs to new languages using a resourceavailable for over 1600 languages: the New Testament. This is challenging fortwo reasons: (1) the small corpus size, and (2) the narrow domain. Whileperformance drops for all approaches, we surprisingly still see gains of up to$17.69\%$ accuracy for part-of-speech tagging and $6.29$ F1 for NER on averageover all languages as compared to XLM-R. Another unexpected finding is thatcontinued pretraining, the simplest approach, performs best. Finally, weperform a case study to disentangle the effects of domain and size and to shedlight on the influence of the finetuning source language.