
Implementation of a Statistical Persian-English Translator Prototype

Alizadeh, Yousef | 2013

  1. Type of Document: M.Sc. Thesis
  2. Language: Farsi
  3. Document No: 44438 (19)
  4. University: Sharif University of Technology
  5. Department: Computer Engineering
  6. Advisor(s): Ghasem Sani, Gholamreza
  7. Abstract:
  8. Machine translation has long been an important subject in the field of natural language processing (NLP). In recent years, as essential linguistic data resources have become available, statistical approaches have been deployed in machine translation. Although there have been several attempts to create an English-to-Persian automatic translator, there has not been sufficient effort in the reverse direction. In this project, we reviewed previous work on machine translation for Persian and implemented a statistical machine translator from Persian to English. Building the translator required a bilingual corpus. For this purpose, we used a corpus of PhD and MSc abstracts in Persian and their English translations. The resulting corpus includes more than 140,000 parallel sentences. After training the translator on these sentences, we evaluated the effect of Persian's free word order on translation quality. Finally, to improve translation quality, we used a morphological extraction method and analyzed its effect on the BLEU metric.
  9. Keywords:
  10. Statistical Machine Translation ; Bilingual Parallel Corpus ; Sentence Segmenter ; Sentence Alignment ; Morphological Words Segmenter

