- Type of Document: M.Sc. Thesis
- Language: Farsi
- Document No: 47626 (19)
- University: Sharif University of Technology
- Department: Computer Engineering
- Advisor(s): Beigi, Hamid
- Abstract:
- With the growth of social networks, link prediction has attracted great attention. Completing partially observed networks, recognizing errors in observed links, predicting a network’s future structure to aid decision making, and recommending relevant links to users are some of the motivations that have made link prediction important for complex networks. In this work, we analyze link prediction in DBLP’s author network and attempt to increase the accuracy of state-of-the-art link prediction techniques by extracting discriminative information from the available metadata. Paper abstracts are an important resource for identifying an author’s field of study. Extracting the concepts an author has worked on and combining them with structural features derived from the network can improve link prediction accuracy significantly. To this end, we use a decision tree to classify author pairs based on similarity-based features. We then post-process the labels assigned by the decision tree using features derived from the available metadata. Experiments on the DBLP author network show that this approach halves the number of false positives with only a small increase in false negatives. To the best of our knowledge, this work is the first to use the above features and to post-process classifier decisions successfully.
- Keywords:
- Topic Modeling ; Classification ; Link Prediction ; Similarity-based Algorithm ; DBLP Dataset
- Contents
- 1 Introduction
- 2 Related Work
- 3 Proposed Method
- 4 Experiments
- 4.1 Introduction
- 4.2 The DBLP Dataset
- 4.3 Experimental Procedure
- 4.3.1 Analyzing the DBLP Data
- 4.3.2 Defining the Data Structures
- 4.3.3 Extracting Training and Test Data
- 4.3.4 Extracting Similarity-Based Features
- 4.3.5 Applying the Online-LDA Algorithm
- 4.3.6 Computing Author Vectors
- 4.3.7 Computing the Final Feature Vector for Author Pairs
- 4.3.8 Implementing the Classifiers and Building the Model
- 4.3.9 Validating and Evaluating the Method's Performance
- 4.4 Challenges in Data Selection
- 4.5 Evaluation Metrics
- 4.6 Evaluation
- 5 Conclusion and Future Work
- References
- Persian-to-English Glossary
- English-to-Persian Glossary
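
The two-stage idea in the abstract (a structural classifier followed by a metadata-based post-processing step) can be illustrated with a minimal Python sketch. This is not the thesis implementation. The toy graph, the topic vectors, the function names, and both thresholds are assumptions made for illustration. A one-rule threshold (a decision stump) stands in for the trained decision tree, and hand-written topic vectors stand in for the Online-LDA output.

```python
import math

# Hypothetical toy co-authorship graph: author -> set of co-authors.
graph = {
    "a": {"b", "c"},
    "b": {"a", "d", "e"},
    "c": {"a", "d"},
    "d": {"b", "c"},
    "e": {"b"},
}

# Hypothetical per-author topic distributions (stand-in for topic-model output).
topics = {
    "a": [0.9, 0.1],
    "b": [0.5, 0.5],
    "c": [0.7, 0.3],
    "d": [0.8, 0.2],
    "e": [0.1, 0.9],
}


def similarity_features(g, u, v):
    """Common structural similarity scores for the author pair (u, v)."""
    common = g[u] & g[v]
    union = g[u] | g[v]
    jaccard = len(common) / len(union) if union else 0.0
    # Adamic-Adar: shared co-authors with few links count more.
    adamic_adar = sum(1.0 / math.log(len(g[z])) for z in common if len(g[z]) > 1)
    return {
        "common_neighbors": len(common),
        "jaccard": jaccard,
        "adamic_adar": adamic_adar,
    }


def cosine(p, q):
    """Cosine similarity between two topic vectors."""
    dot = sum(x * y for x, y in zip(p, q))
    norm = math.sqrt(sum(x * x for x in p)) * math.sqrt(sum(y * y for y in q))
    return dot / norm if norm else 0.0


def structural_vote(features, min_common=1):
    """Stand-in for the trained decision tree: a single threshold rule."""
    return features["common_neighbors"] >= min_common


def predict_link(g, topic_vectors, u, v, min_topic_sim=0.5):
    """Accept a structurally predicted link only if the authors' topics agree."""
    if not structural_vote(similarity_features(g, u, v)):
        return False
    # Post-processing: drop positives whose topic vectors are dissimilar.
    return cosine(topic_vectors[u], topic_vectors[v]) >= min_topic_sim


for pair in [("a", "d"), ("a", "e"), ("c", "e")]:
    print(pair, predict_link(graph, topics, *pair))
```

In this toy data, the pair ("a", "e") shares a co-author and passes the structural rule, but is rejected in post-processing because the two topic vectors differ. This mirrors how a topic-based filter could remove false positives, at the risk of discarding some true links.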