OntoSenseNet: A Verb-Centric Ontological Resource for Indian Languages

  • 2018-08-02 07:18:55
  • Jyoti Jha, Sreekavitha Parupalli, Navjyoti Singh
  • 6

Abstract

Following approaches for understanding lexical meaning developed by Yaska, Patanjali and Bhartrihari from Indian linguistic traditions, and extending approaches developed by Leibniz and Brentano in modern times, a framework of formal ontology of language was developed. This framework proposes that the meanings of words are in-formed by intrinsic and extrinsic ontological structures. The paper aims to capture such intrinsic and extrinsic meanings of words for two major Indian languages, namely Hindi and Telugu. Parts of speech have been rendered into sense-types and sense-classes. Using them, we have developed a gold-standard annotated lexical resource to support semantic understanding of a language. The resource comprises a collection of Hindi and Telugu lexicons, which have been manually annotated by native speakers of the languages following our annotation guidelines. Further, the resource was utilised to derive the adverbial sense-class distribution of verbs and the karaka-verb sense-type distribution. Different corpora (news, novels) were compared using their verb sense-type distributions. Word embeddings were used as an aid for the enrichment of the resource. This is a work in progress that aims at extensive lexical coverage of the languages.

 
