Efficient Implementation of Compressed Deep Convolutional Neural Networks

Afshar, Mohammad | 2018

  1. Type of Document: M.Sc. Thesis
  2. Language: Farsi
  3. Document No: 50727 (05)
  4. University: Sharif University of Technology
  5. Department: Electrical Engineering
  6. Advisor(s): Hashemi, Matin
  7. Abstract: Many mobile applications running on smartphones, wearable devices, tiny autonomous robots and IoT devices would potentially benefit from the accuracy and scalability of deep CNN-based machine learning algorithms. However, performance and energy consumption limitations make the execution of such computationally intensive algorithms on embedded mobile devices prohibitive. We present a GPU-accelerated engine, dubbed mCNN, for execution of trained deep CNNs on mobile platforms. The proposed solution takes the trained model as input and automatically optimizes its parallel implementation on the target mobile platform for efficient use of hardware resources such as mobile GPU threads and SIMD units. Empirical evaluations show that our solution achieves up to 500X speedup
  8. Keywords: Neural Network ; Convolutional Neural Network ; Increasing Efficiency ; Graphics Processing Unit (GPU) ; Deep Convolutional Neural Networks
