Abstract
We used a dictionary built from biomedical terminology extracted from various sources, such as DrugBank, MedDRA, MedlinePlus, and TCMGeneDIT, to tag more than 8 million Instagram posts by users who mentioned an epilepsy-relevant drug at least once between 2010 and early 2016. A random sample of 1,771 posts with 2,947 term matches was evaluated by human annotators to identify false positives. OpenAI's GPT series models were compared against human annotation. Frequent terms with a high false-positive rate were removed from the dictionary. Analysis of the estimated false-positive rates of the annotated terms revealed 8 ambiguous terms (plus synonyms) used in Instagram posts, which were removed from the original dictionary. To study the effect of removing those terms, we constructed knowledge networks using the refined and the original dictionaries and performed an eigenvector-centrality analysis on both networks. We show that the refined dictionary leads to a significantly different ranking of important terms, as measured by their eigenvector centrality in the knowledge networks. Furthermore, the most important terms obtained after refinement are of greater medical relevance. In addition, we show that OpenAI's GPT series models fare worse than human annotators in this task.