Loading...

Prediction of DNA/RNA Sequence Binding Site to Protein with the Ability to Implement on GPU

Fatemeh Tabatabaei | 2023

64 Viewed
  1. Type of Document: M.Sc. Thesis
  2. Language: Farsi
  3. Document No: 56117 (19)
  4. University: Sharif University of Technology
  5. Department: Computer Engineering
  6. Advisor(s): Koohi, Sommaye
  7. Abstract:
  8. Based on the importance of DNA/RNA binding proteins in different cellular processes, finding binding sites of them play crucial role in many applications, like designing drug/vaccine, designing protein, and cancer control. Many studies target this issue and try to improve the prediction accuracy with three strategies: complex neural-network structures, various types of inputs, and ML methods to extract input features. But due to the growing volume of sequences, these methods face serious processing challenges. So, this paper presents KDeep, based on CNN-LSTM and the primary form of DNA/RNA sequences as input. As the key feature improving the prediction accuracy, we propose a new encoding method, 2Lk, which includes two levels of k-mer encoding. 2Lk not only increases the prediction accuracy of RNA/DNA binding sites, but also, reduces the run time ,power consumption, GPU and CPU process consumption, temp of GPU and improves the number of trainable parameters, and increases the interpretability of KDeep by about 79%, compared to the state-of-the-art methods
  9. Keywords:
  10. Neural Network ; Interpretability ; Graphics Procssing Unit (GPU) ; DNA/RNA Sequence Binding Site to Protein Prediction ; Gene Expression Data ; Convolutional Neural Network

 Digital Object List

 Bookmark

No TOC