Loading...
- Type of Document: M.Sc. Thesis
- Language: Farsi
- Document No: 54148 (02)
- University: Sharif University of Technology
- Department: Mathematical Sciences
- Advisor(s): Razvan, Mohammad Reza; Kamali Tabrizi, Mostafa
- Abstract:
- Detection text in natural images is a challenging task due to the complex backgrounds in an image. complex backgrounds, changes in ambient light, changing viewing angles, and other factors can make systems difficult to detection text. Hence text detection is always an problem. Since detection and recognizing a text in an image has many uses such as translating texts for tourists, helping the blind, etc., recognizing a text in different languages is important. In this thesis, we first examine the three methods of Reading Text in the Wild with Convolutional Neural Networks and FOTS and CRAFT. Then we prepared two Persian data sets. The first data set contains images to which Persian texts have been artificially added. The second set of natural images includes Persian text taken from streets and passages. We taught the CRAFT model with these two Persian datasets. The experimental results of this model on the Persian evaluation set now have an precision of 74.7 and recall of 64.9.We taught the FOTS model with these two Persian datasets. The experimental results of this model on the Persian evaluation set now have an precision of 78.07 and recall of 60.67
- Keywords:
- Image Processing ; Machine Learning ; Deep Learning ; Text Detection ; Convolutional Neural Network
-
محتواي کتاب
- view