Abstract
RNA-binding proteins (RBPs) play crucial roles in many biological processes,e.g. gene regulation. Computational identification of RBP binding sites on RNAsare urgently needed. In particular, RBPs bind to RNAs by recognizing sequencemotifs. Thus, fast locating those motifs on RNA sequences is crucial andtime-efficient for determining whether the RNAs interact with the RBPs or not.In this study, we present an attention based convolutional neural network,iDeepA, to predict RNA-protein binding sites from raw RNA sequences. We firstencode RNA sequences into one-hot encoding. Next, we design a deep learningmodel with a convolutional neural network (CNN) and an attention mechanism,which automatically search for important positions, e.g. binding motifs, tolearn discriminant high-level features for predicting RBP binding sites. Weevaluate iDeepA on publicly gold-standard RBP binding sites derived fromCLIP-seq data. The results demonstrate iDeepA achieves comparable performancewith other state-of-the-art methods.