Loading...
Named Entity Recognition in Persian Language Using Deep Learning
Aghajani, Mohammad Mahdi | 2021
861
Viewed
- Type of Document: M.Sc. Thesis
- Language: Farsi
- Document No: 54174 (19)
- University: Sharif University of Technology
- Department: Computer Engineering
- Advisor(s): Beigy, Hamid
- Abstract:
- The use of named entity recognition systems as preprocessing is used in many natural language analysis issues. With the advent of deep learning, the methods of this area were also affected. Today, there is considerable progress in this area due to the development of data resources for English, Chinese, German, and Spanish. They are also good trained models in formal Persian. However, for informal Persian, which contains a large portion of the web content under the Web, the current models do not produce a suitable solution. In this study, we use the same approach to train our models due to achieving state-of-the-art results in pre-trained models. On the other hand, there is a lack of standard datasets for informal Persian in this area. In this study, first, datasets were prepared and produced from Persian Twitter data according to standard and official procedures. Then, Persian models have been tested on the dataset, and it has been determined that they have no acceptable quality. Then, using transfer learning and parallel learning, improve the f-score from 65 to 82. In this study, using a tool that was developed for representation visualization of different layers of the network, it was found that current models for the problem of named entity recognition are more than paying attention to content themselves rather than context, which can be a clue to improve current models in the future
- Keywords:
- Pretrained Models ; Transfer Learning ; Natural Language Processing ; Deep Learning ; Multi-Task Learning ; Named Entity Recognition
- محتواي کتاب
- view
- چکیده
- فهرست شکلها
- فهرست جدولها
- فصل1 مقدمه
- فصل2 پیشزمینه
- فصل3 مرور پژوهشهای پیشین
- فصل4 روش پیشنهادی
- فصل5 آزمایشها و نتایج
- 5–1 محیط آزمایش و ابزارها
- 5–2 مجموعه دادگانهای مورد استفاده
- 5–3 جمعآوری دادگان از شبکه توییتر
- 5–4 معیارهای سنجش کیفیت سامانههای بازشناسی نهادههای نامدار
- 5–5 بررسی کیفیت مدلهای فعلی
- 5–6 ارزیابی مدل آموزش داده شده با استفاده از یادگیری انتقالی ترتیبی
- 5–7 ارزیابی مدل آموزش داده شده با استفاده از یادگیری انتقالی موازی
- 5–8 ارزیابی مدل آموزش دیده با استفاده از تابع هزینه معرفی شده
- 5–9 تحلیل و بررسی نتایج
- 5–10 نتایج ارزیابی مدل با استفاده از دادگان پوشیده
- 5–11 جمعبندی
- فصل6 جمعبندی و پیشنهادها
- مراجع
- واژهنامه فارسی به انگلیسی
- واژهنامه انگلیسی به فارسی