A Computational Framework to Identify Self-Aspects in Text

  • 2025-07-17 13:31:04
  • Jaya Caporusso, Matthew Purver, Senja Pollak
  • 0

Abstract

This Ph.D. proposal introduces a plan to develop a computational framework toidentify Self-aspects in text. The Self is a multifaceted construct and it isreflected in language. While it is described across disciplines like cognitivescience and phenomenology, it remains underexplored in natural languageprocessing (NLP). Many of the aspects of the Self align with psychological andother well-researched phenomena (e.g., those related to mental health),highlighting the need for systematic NLP-based analysis. In line with this, weplan to introduce an ontology of Self-aspects and a gold-standard annotateddataset. Using this foundation, we will develop and evaluate conventionaldiscriminative models, generative large language models, and embedding-basedretrieval approaches against four main criteria: interpretability, ground-truthadherence, accuracy, and computational efficiency. Top-performing models willbe applied in case studies in mental health and empirical phenomenology.

 

Quick Read (beta)

loading the full paper ...