Anaphora Resolution in Dialogue Systems for South Asian Languages

Abstract

Anaphora resolution is a challenging task which has been the interest of NLPresearchers for a long time. Traditional resolution techniques like eliminativeconstraints and weighted preferences were successful in many languages.However, they are ineffective in free word order languages like most SouthAsianlanguages.Heuristic and rule-based techniques were typical in these languages,which are constrained to context and domain.In this paper, we venture a newstrategy us-ing neural networks for resolving anaphora in human-humandialogues. The architecture chiefly consists of three components, a shallowparser for extracting features, a feature vector generator which produces theword embed-dings, and a neural network model which will predict the antecedentmention of an anaphora.The system has been trained and tested on Teluguconversation corpus we generated. Given the advantage of the semanticinformation in word embeddings and appending actor, gender, number, person andpart of plural features the model has reached an F1-score of 86.

Quick Read (beta)

loading the full paper ...