Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery

  • 2020-02-10 21:02:05
  • Hakime Öztürk, Arzucan Özgür, Philippe Schwaller, Teodoro Laino, Elif Ozkirimli
  • 0

Abstract

Text-based representations of chemicals and proteins can be thought of asunstructured languages codified by humans to describe domain-specificknowledge. Advances in natural language processing (NLP) methodologies in theprocessing of spoken languages accelerated the application of NLP to elucidatehidden knowledge in textual representations of these biochemical entities andthen use it to construct models to predict molecular properties or to designnovel molecules. This review outlines the impact made by these advances on drugdiscovery and aims to further the dialogue between medicinal chemists andcomputer scientists.

 

Quick Read (beta)

loading the full paper ...