Loading...

Author Identification Using Statistical Methods

Ameri, Reyhaneh | 2016

1666 Viewed
  1. Type of Document: M.Sc. Thesis
  2. Language: Farsi
  3. Document No: 49050 (19)
  4. University: Sharif University of Technology
  5. Department: Computer Engineering
  6. Advisor(s): Beigy, Hamid
  7. Abstract:
  8. With the increasing use of the Internet, we are witnessing the exchange of gigabytes of text in cyberspace. Cyberspace makes it possible for individuals to hide their true identity and enter this space with an spurious one. Abuses that occur in online communities due to the use of unknown identities, reduce confidence in this type of communication and create many challenges in this area. Hence the importance of maintaining the security of the space, controling the user-generated content and identifying the authors of texts increases day by day. In this Research we have presented an approach to author identification. This approach is based on modeling the style of the authors on the basis of the texts available to them. In this way, by providing a ranking algorithm for every text with an unkown author, the authors are ranked on the basis of the probability that the text belongs to them. performance is also improved by applying pre-processings to the text and reducing the feature space by the selection of features with higher separation power. The proposed approach has been evaluated in terms of performance measures in data recovery by designing and conducting experiments on a set of standard texts in Persian and English languages . The results of these experiments have shown that the proposed method has greater efficiency in comparison with the previous methods of author identification
  9. Keywords:
  10. Feature Selection ; Author Identification ; Ranking ; Ranking Author ; Style Writing

 Digital Object List

 Bookmark

...see more