Abstract
Online news outlets operate predominantly on an advertising-based revenuemodel, compelling journalists to create headlines that are often scandalous,intriguing, and provocative -- commonly referred to as clickbait. Automaticdetection of clickbait headlines is essential for preserving informationquality and reader trust in digital media and requires both contextualunderstanding and world knowledge. For this task, particularly inless-resourced languages, it remains unclear whether fine-tuned methods orin-context learning (ICL) yield better results. In this paper, we compile CLIC,a novel dataset for clickbait detection of Croatian news headlines spanning a20-year period and encompassing mainstream and fringe outlets. We fine-tune theBERTi\'c model on this task and compare its performance to LLM-based ICLmethods with prompts both in Croatian and English. Finally, we analyze thelinguistic properties of clickbait. We find that nearly half of the analyzedheadlines contain clickbait, and that finetuned models deliver better resultsthan general LLMs.