Basic concepts and tools for the Toki Pona minimal and constructed language: description of the language and main issues; analysis of the vocabulary; text synthesis and syntax highlighting; Wordnet synsets

  • 2018-07-04 00:18:33
  • Renato Fabbri
  • 0

Abstract

A minimal constructed language (conlang) is useful for experiments andcomfortable for making tools. The Toki Pona (TP) conlang is minimal both in thevocabulary (with only 14 letters and 124 lemmas) and in the (about) 10 syntaxrules. The language is useful for being a used and somewhat established minimalconlang with at least hundreds of fluent speakers. This article exposes currentconcepts and resources for TP, and makes available Python (and Vim) scriptedroutines for the analysis of the language, synthesis of texts, syntaxhighlighting schemes, and the achievement of a preliminary TP Wordnet. Focus ison the analysis of the basic vocabulary, as corpus analyses were found. Thesynthesis is based on sentence templates, relates to context by keeping trackof used words, and renders larger texts by using a fixed number of phonemes(e.g. for poems) and number of sentences, words and letters (e.g. forparagraphs). Syntax highlighting reflects morphosyntactic classes given in theofficial dictionary and different solutions are described and implemented inthe well-established Vim text editor. The tentative TP Wordnet is madeavailable in three patterns of relations between synsets and word lemmas. Insummary, this text holds potentially novel conceptualizations about, and toolsand results in analyzing, synthesizing and syntax highlighting the TP language.

 

Quick Read (beta)

loading the full paper ...