Loading...
Design and Preparation of a Persian Semantic Corpus Using Abstract Meaning Representation
Takhshid, Reza | 2018
748
Viewed
- Type of Document: M.Sc. Thesis
- Language: Farsi
- Document No: 55514 (31)
- University: Sharif University of Technology
- Department: Languages and Linguistics Center
- Advisor(s): Bahrani, Mohammad; Shojaie, Razieh
- Abstract:
- To keep in line with the day to day advancements in the fields of computational linguistics and natural language processing, and the growing attention of researchers to semantic processing, this thesis presents the design and preparation of a Persian semantic corpus using Abstract Meaning Representation (AMR). This semantic representation pairs each sentence with a single rooted, acyclic, directed graph, which is human and computer readable. Moreover, this representation paves the way for the creation of large semantic corpora. In order to bring such benefits to Persian, in this thesis we present solutions for representing Persian sentences in the framework of AMR. Moreover, a corpus of 150 Persian sentences paired with their Persian AMR graphs with the inter-annotator agreement of 81\% is represented, which shows the effectiveness of the prepared Persian AMR specification. With the goal of expanding the size of the corpus, a rule-based convertor has been designed and implemented. Using the converter, over 5000 Persian AMR graphs have been converted from a Persian proposition bank. The accuracy of these sentences compared with the gold graphs is 61\%
- Keywords:
- Abstract Meaning Representation ; Persian Abstract Meaning Representation ; Semantic Graph ; Semantic Representation ; Semantic Proccesing
- محتواي کتاب
- view
- مقدمه
- مفاهیم پایه
- پیشینه پژوهش
- نمایش انتزاعی معنای فارسی
- مقدمه
- یک مثال
- انتزاع: دوری از دستور فارسی
- گرایش به منطق
- کانون
- وجهیت
- نفی
- جملات پرسشی
- حالت امری و احساسی و گفتار رسمی
- حروف تعریف، شخص، شمار، ...
- نقشهای تلویحی
- افعال اسنادی
- افعال مرکب
- افعال تناوبی
- اسمها و صفتهای محمولی
- قابگری
- حروف اضافه
- بندموصولی
- چند رابطهٔ معنایی همنام
- حروف ربط
- زیرمجموعه
- درجه
- متغیرها و هممرجع بودن
- مالکیت
- موجودیتهای نامدار و ویکیسازی
- قابهای خاص برای نقشها
- اعداد
- اعداد ترتیبی
- دادگان
- جمعبندی
- تولید خودکار دادگان نامفا از دیگر دادگان موجود
- بررسی نتایج
- نتیجهگیری و پیشنهاد ادامه کار
- لغتنامه