DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images

  • 2024-09-10 18:55:59
  • Taslim Murad, Prakash Chourasia, Sarwan Ali, Murray Patterson

Abstract

Cancer is a complex disease characterized by uncontrolled cell growth. T cell receptors (TCRs), crucial proteins in the immune system, play a key role in recognizing antigens, including those associated with cancer. Recent advancements in sequencing technologies have facilitated comprehensive profiling of TCR repertoires, uncovering TCRs with potent anti-cancer activity and enabling TCR-based immunotherapies. However, analyzing these intricate biomolecules requires efficient representations that capture their structural and functional information. T cell protein sequences pose unique challenges because they are relatively short compared with other biomolecules. An image-based representation is therefore a preferred choice for efficient embeddings, as it preserves essential details and enables comprehensive analysis of T cell protein sequences. In this paper, we propose generating images from protein sequences by applying the idea of Chaos Game Representation (CGR) through a kaleidoscopic-image approach. This method, Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images (DANCE), provides a unique way to visualize protein sequences by recursively applying chaos game rules around a central seed point. We classify TCR protein sequences by their respective target cancer cells, as TCRs are known for their immune response against cancer. The TCR sequences are converted into images using the DANCE method. We then employ deep-learning vision models to perform the classification and to gain insight into the relationship between the visual patterns observed in the generated kaleidoscopic images and the underlying protein properties. By combining CGR-based image generation with deep-learning classification, this study opens novel possibilities in the protein analysis domain.
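
To make the chaos game idea concrete, the following is a minimal sketch of a generic CGR for protein sequences, not the DANCE procedure itself. It rests on several assumptions that the abstract does not state: the 20 standard amino acids are placed on the vertices of a regular polygon in alphabetical order, the seed point is the polygon's center, each residue moves the current point a fixed fraction (`ratio`, here 0.5) toward its vertex, and the visited points are binned into a small grid. The kaleidoscopic step is omitted because its details are not given here. The function names and the example sequence are hypothetical.

```python
import math

# Assumption: the 20 standard residues, in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def vertex_positions(alphabet=AMINO_ACIDS):
    """Place each symbol on a vertex of a regular polygon on the unit circle."""
    n = len(alphabet)
    return {
        aa: (math.cos(2 * math.pi * i / n), math.sin(2 * math.pi * i / n))
        for i, aa in enumerate(alphabet)
    }


def cgr_points(sequence, ratio=0.5, seed=(0.0, 0.0)):
    """Return the chaos-game trajectory of a sequence, starting at the seed."""
    vertices = vertex_positions()
    x, y = seed
    points = []
    for aa in sequence:
        if aa not in vertices:
            continue  # skip symbols outside the assumed alphabet
        vx, vy = vertices[aa]
        # Move a fixed fraction of the way toward the residue's vertex.
        x += ratio * (vx - x)
        y += ratio * (vy - y)
        points.append((x, y))
    return points


def rasterize(points, size=32):
    """Bin points from [-1, 1] x [-1, 1] into a size x size grid of hit counts."""
    grid = [[0] * size for _ in range(size)]
    for x, y in points:
        col = min(size - 1, int((x + 1) / 2 * size))
        row = min(size - 1, int((1 - y) / 2 * size))
        grid[row][col] += 1
    return grid


# Illustrative, made-up CDR3-like string; not taken from the paper's data.
image = rasterize(cgr_points("CASSLGQAYEQYF"))
```

A grid like `image` could then be saved as a picture and passed to a vision model. Short sequences such as TCRs yield only a handful of points, which is one reason a plain CGR image may be sparse and an enhanced image construction may be worth considering.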

 
