Replacing Deep Neural Networks with New Methods to Increase their Interpretability

Rajaei, Pedram; Tefagh, Mojtaba Motahari, Abolfazl

Please enable javascript in your browser.

Replacing Deep Neural Networks with New Methods to Increase their Interpretability

Rajaei, Pedram | 2024

0 Viewed

Type of Document: M.Sc. Thesis
Language: Farsi
Document No: 57255 (19)
University: Sharif University of Technology
Department: Mathematical Sciences
Advisor(s): Tefagh, Mojtaba; Motahari, Abolfazl
Abstract:
Recently, many bioinformatics prediction models have been produced using deep neural networks. However, due to the nature of deep neural network models, the impact of features on the final answer is not fully apparent. Moreover, due to the large number of features in bioinformatics problems, both significant computational resources and a considerable amount of time are required for model learning. Nevertheless, the selection and use of optimal features in bioinformatics is often overlooked. This is because microarray methods often generate a vast number of features, and selecting the optimal features from this large number can be very challenging and time-consuming. Additionally, the features selected by feature selection methods may not be interpretable in deep neural networks. In this thesis, we aim to introduce a novel method for solving the problem of classifying normal and cancer cells using a combination of feature selection methods and neural additive models, which achieves a relatively high accuracy (over 70%) and is highly interpretable. This model has been applied to 10,000 tabular thyroid cancer and normal datasets. Using this model, the features that are most correlated with the target variable are easily selected. Then, classification is performed using these features. Not only has the accuracy of the proposed model on the dataset been more than 70%, but due to the high interpretability of neural additive models, the impact of each feature on the problem's solution is evident. Also, due to the reduction in the dimensionality of the problem, both the required computational resources are reduced, and the model's speed is increased.
Keywords:
Genome Analysis ; Feature Selection ; Interpretability ; Bioinformatics ; Deep Additive Networks ; Deep Neural Networks

Digital Object List

محتواي کتاب
view

Bookmark

f2c704a2ace8e7ad58497b74e331bed4239315117795cc848fd511576ff95b7a.pdf
- مقدمه
  - پیش‌گفتار
  - انگیزه مسئله
  - تعریف مسئله
  - نوآوری در حل مسئله
  - ساختار پایان‌نامه
  - جمع‌بندی
- مفاهیم پایه
  - مقدمه
  - DNA و ژنوم
  - mi-RNA
  - طبقه‌بندی
    - انواع طبقه‌بندی
    - مراحل طبقه‌بندی
    - انتخاب مدل طبقه‌بندی
    - ارزیابی عملکرد یک مدل طبقه‌بندی
    - کاربردهای طبقه‌بندی
  - یادگیری ماشین و شبکه عصبی
  - انتخاب ویژگی
    - دلایل اهمیت انتخاب ویژگی
    - انواع مختلف تکنیک‌های انتخاب ویژگی
    - عوامل موثر در انتخاب ویژگی
    - کاربردهای انتخاب ویژگی
  - روش‌های پرطرفدار در انتخاب ویژگی
    - P-value
    - T-value
    - T-test
    - کسب اطلاعات
    - خی-دو
  - ارزیابی روش‌های ارایه شده
  - جمع‌بندی
- کار‌های مرتبط و پژوهش‌های پیشین
  - مقدمه
  - انتخاب ویژگی در روش‌های سنتی‌تر یادگیری ماشین
  - انتخاب ویژگی در شبکه‌های عصبی عمیق
  - یادگیری عمیق و محدودیت‌های شبکه‌های عصبی
  - مدل افزونی تعمیم‌یافته
    - محدودیت مدل‌های افزودنی تعمیم‌یافته
  - مدل افزودنی عصبی
    - تابع فعال سازی EXU
    - یادگیری چند کار به طور هم زمان در مدل‌های افزودنی عصبی
    - جمع‌بندی مدل‌های افزودنی عصبی
  - مروری بر Deseq2 و PyDeseq2
    - Deseq2
    - PyDeseq2
    - تفاوت‌های اصلی بین PyDeseq2 و DESeq2
  - جمع‌بندی
- روش پیشنهادی
  - مقدمه
  - انتخاب داده
  - انتخاب ویژگی
  - انتخاب شبکه عصبی عمیق جهت دسته‌بندی
  - نحوه ارزیابی
  - استفاده از نمودارهای مدل افزودنی برای بهبود عملکرد
  - جمع‌بندی
- نتایج اجرا
  - مقدمه
  - نتایج استفاده از خی-دو و مدل افزودنی عمیق
    - یادگیری مدل
    - حذف ویژگی‌های بدون تاثیر
    - اضافه کردن چند ویژگی تصادفی
  - نتایج استفاده از کسب اطلاعات و مدل افزودنی عمیق
    - یادگیری مدل
    - حذف ویژگی‌های بدون تاثیر
    - اضافه کردن چند ویژگی تصادفی
  - نتایج استفاده از T-test و مدل افزودنی عمیق
    - یادگیری مدل
    - حذف ویژگی‌های بدون تاثیر
    - اضافه کردن چند ویژگی تصادفی
  - تاثیر ژن‌های شناسایی شده در سرطان تیروئید در پژوهش‌های پیشین
  - معیارهای تفسیرپذیری
  - جمع‌بندی
- جمع بندی، نتیجه‌گیری و پیشنهادات
- مراجع
- واژه‌نامه
cae1bc7ed9d6bcb02002b1f3968f5460462c4e49083ab186b6e8bbeeff109ba2.pdf
f2c704a2ace8e7ad58497b74e331bed4239315117795cc848fd511576ff95b7a.pdf

Friend's email
Your name
Your email
enter code