Search for: farsi

    A study to find influential parameters on a Farsi-English statistical machine translation system

    Article 2010 5th International Symposium on Telecommunications, IST 2010, 4 December 2010 through 6 December 2010 ; 2010, Pages 985-991 ; 9781424481835 (ISBN) Bakhshaei, S ; Khadivi, S ; Riahi, N ; Sameti, H ; Sharif University of Technology
    Abstract
    The aim of this paper is to analyze Farsi-English statistical machine translation systems as a useful communication tool. Growing communication between nations increases the need for easier ways of translating between languages than relying on expensive human translators. In this work, a statistical phrase-based system is run on the Farsi-English language pair, and the effect of its parameters on translation quality is studied in depth. Using BLEU as the metric of translation accuracy, the system achieves an improvement of 1.84 BLEU points over the baseline, an increase from 16.97% to 18.81% in the best case.
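The two BLEU scores quoted in the abstract above differ by 1.84 percentage points, which is an absolute gain; the gain relative to the baseline is larger. A minimal sketch of the arithmetic, using only the two figures from the abstract (the variable names are illustrative):

```python
# BLEU scores quoted in the abstract, in percent.
baseline_bleu = 16.97
best_bleu = 18.81

# Absolute gain: difference in BLEU points.
absolute_gain = best_bleu - baseline_bleu

# Relative gain: the same difference expressed as a share of the baseline.
relative_gain = absolute_gain / baseline_bleu * 100

print(f"absolute gain: {absolute_gain:.2f} BLEU points")
print(f"relative gain: {relative_gain:.1f}% of the baseline")
```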

    Categorization of various essential datasets and methods for textual spelling detection and normalization

    Article Iranian Journal of Information Processing Management ; Volume 32, Issue 4, 2017, Pages 1143-1170 ; 22518223 (ISSN) Hosseini Beheshti, M. S ; Abdi Ghavidel, H ; Sharif University of Technology
    Iranian Research Institute for Scientific Information and Documentation  2017
    Abstract
    One of the most basic phases of automatic text processing is spelling error detection and grapheme normalization. Without this phase, stored textual documents run into several problems that disrupt their automatic retrieval. Therefore, specialists in natural language processing and computational linguistics usually attempt to sample various data by presenting ideal methods and algorithms in order to obtain normalized data. Several studies have been conducted on English and some other languages, followed by a certain number of studies on Farsi as well. Sometimes, these studies have remained to be... 

    A pool-based active learning method for improving Farsi-English machine translation system

    Article 2012 6th International Symposium on Telecommunications, IST 2012 ; 978-146732073-3 Bakhshaei, Somayeh ; Sharif University of Technology
    Abstract
    In this paper we try to alleviate the problem of scarce resources for developing a Farsi-English statistical machine translation (SMT) system. This is done by applying the idea of active learning (AL) to choose more informative sentences to be translated by a human and then added to the baseline corpus. While human translation is more valuable than other approaches to corpus gathering (such as automatic approaches), it is also more costly. In this way, we can improve the translation system at lower cost. This is done in interaction with a human translator. Applying the active learning idea to an SMT system turns it into a system that can improve its baseline corpus by asking for the... 
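The pool-based selection loop described in the abstract above can be sketched as follows. The abstract is truncated and does not say how sentences are scored, so the informativeness measure here (the share of words not yet seen in the corpus) is an assumption made purely for illustration, as are all function names and the toy English sentences; `translate` is a hypothetical stand-in for the human translator.

```python
def unseen_ratio(sentence, known_words):
    """Assumed informativeness score: fraction of words missing from the corpus."""
    words = sentence.split()
    if not words:
        return 0.0
    return sum(w not in known_words for w in words) / len(words)


def select_batch(pool, known_words, batch_size):
    """Pick the batch_size highest-scoring sentences from the unlabeled pool."""
    ranked = sorted(pool, key=lambda s: unseen_ratio(s, known_words), reverse=True)
    return ranked[:batch_size]


def active_learning_round(pool, corpus, batch_size, translate):
    """Run one round: select sentences, have them translated, grow the corpus.

    Returns the sentences that remain in the pool.
    """
    known_words = {w for source, _ in corpus for w in source.split()}
    batch = select_batch(pool, known_words, batch_size)
    for sentence in batch:
        corpus.append((sentence, translate(sentence)))
    return [s for s in pool if s not in batch]


# Toy data (illustrative only, not from the paper).
corpus = [("the cat sat", "<translation>")]
pool = ["the cat sat", "a dog barked loudly", "the dog sat"]

remaining = active_learning_round(
    pool, corpus, batch_size=1,
    translate=lambda s: f"<human translation of: {s}>",  # hypothetical human step
)
print(corpus[-1][0])
print(remaining)
```

Each round spends the human translation budget only on the sentences the scoring function ranks highest, which is the cost argument the abstract makes.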

    High accuracy Farsi language character segmentation and recognition

    Article 27th Iranian Conference on Electrical Engineering, ICEE 2019, 30 April 2019 through 2 May 2019 ; 2019, Pages 1692-1698 ; 9781728115085 (ISBN) Kiaei, P ; Javaheripi, M ; Mohammadzade, H ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    Despite many advances in optical character recognition in general, there are still serious challenges remaining in recognizing Farsi text. The main reason is the cursive nature of the letters in written Farsi, i.e., depending on the position of a letter within a word, it might join to its neighboring letters, which consequently changes the shape of the character. As a result, each letter can have up to four different character shapes. In addition to the problem of segmenting the characters, the increased number of characters makes the recognition task even more challenging. This paper introduces a complete framework for character recognition, including a method for segmenting the characters... 
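The "up to four different character shapes" mentioned in the abstract above are visible in Unicode itself, which encodes the positional variants of Arabic-script letters as separate presentation-form code points. The sketch below lists the four forms of the letter BEH using Python's standard `unicodedata` module; it illustrates the script property only and is not the paper's recognition method.

```python
import unicodedata

# The four positional presentation forms of ARABIC LETTER BEH (U+0628).
BEH_FORMS = ["\uFE8F", "\uFE90", "\uFE91", "\uFE92"]

form_names = []
base_letters = []
for ch in BEH_FORMS:
    # NFKC normalization folds a presentation form back to its base letter.
    base = unicodedata.normalize("NFKC", ch)
    form_names.append(unicodedata.name(ch))
    base_letters.append(base)
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} -> base U+{ord(base):04X}")
```

A recognizer that treats each positional form as its own class therefore faces several times as many classes as there are letters, which is the difficulty the abstract points to.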

    SFAVD: Sharif Farsi audio visual database

    Article IKT 2013 - 2013 5th Conference on Information and Knowledge Technology, Shiraz, Iran ; 2013, Pages 417-421 ; 9781467364904 (ISBN) Naraghi, Z ; Jamzad, M ; Sharif University of Technology
    2013
    Abstract
    With the increasing use of computers in everyday life, improved communication between machines and humans is needed. To communicate properly and to understand a human face rendered in a graphical environment, audio-visual projects such as lip reading, audio-visual speech recognition, and lip synthesis are needed. The lack of a complete audio-visual database for this application in the Farsi language led us to provide a new, complete Farsi database for this project, called SFAVD. It is a unique audio-visual database that, in addition to considering Farsi conceptual and speech structure, considers the influence of speech on lip changes. This database is created for... 

    Evaluation of test collection construction methods: A case study

    Article 2008 International Conference on Information and Knowledge Engineering, IKE 2008, Las Vegas, NV, 14 July 2008 through 17 July 2008 ; January 2008, Pages 16-22 ; 1601320752 (ISBN) ; 9781601320759 (ISBN) Sheykh Esmaili, K ; Hosseini, M ; Rostami, A ; Abolhassani, H ; Sharif University of Technology
    2008
    Abstract
    Currently there is no standard test collection for the evaluation of Farsi information retrieval systems. In this paper we introduce Mahak, the first complete test collection generally available for evaluating Farsi information retrieval systems. In addition, we have used different methods for constructing the Mahak qrels and have compared the performance of these methods.

    Design and Hardware Implementation of Optical Character Recognition

    M.Sc. Thesis Sharif University of Technology Dezfuli, Sina (Author) ; Hashemi, Matin (Supervisor)
    Abstract
    The objective of OCR systems is to retrieve machine-encoded text from a raster image. Despite the abundance of powerful OCR algorithms for English, there are not many for Farsi. Our proposed algorithm comprises pre-processing, line detection, sub-word detection and segmentation, feature extraction, and classification. Furthermore, hardware implementation and acceleration of this system on a GPGPU is presented. The algorithm was tested on 5 fonts, namely Titr, Lotus, Yekan, Koodak, and Nazanin, and an average accuracy above 90% was achieved.
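The abstract above names line detection as one stage of the pipeline but does not say how it is done. A common approach, assumed here for illustration and not confirmed to be the thesis's method, is a horizontal projection profile: count the ink pixels in each row of a binarized image and treat each run of non-empty rows as one text line. The function name and the toy image are hypothetical.

```python
def detect_lines(image):
    """Find text lines in a binary image given as rows of 0 (background) / 1 (ink).

    Returns (start_row, end_row) pairs, with end_row exclusive.
    """
    # Horizontal projection profile: number of ink pixels per row.
    profile = [sum(row) for row in image]

    lines = []
    start = None
    for y, ink in enumerate(profile):
        if ink > 0 and start is None:
            start = y                      # a text line begins
        elif ink == 0 and start is not None:
            lines.append((start, y))       # the text line ended on the previous row
            start = None
    if start is not None:                  # line runs to the bottom edge
        lines.append((start, len(profile)))
    return lines


# Toy 5x4 "page" with two text lines separated by one blank row.
page = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 1],
]
text_lines = detect_lines(page)
print(text_lines)
```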

    Speech driven lips animation for the Farsi language

    Article Proceedings of the International Symposium on Artificial Intelligence and Signal Processing, AISP 2015, 3 March 2015 through 5 March 2015 ; 2015, Pages 201-205 ; 9781479988174 (ISBN) Naraghi, Z ; Jamzad, M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2015
    Abstract
    With the growing presence of computers in everyday life, improving communication between humans and machines is inevitable. Talking faces are faces whose movements are synchronized to speech. They play an effective role in many applications. The lips are the most important part of a talking face. The main goal of this project is to implement a natural, human-like lip movement synthesis system for the Farsi language. For this purpose, a comprehensive audio-visual database called SFAVD was designed and used. After extracting sufficient features and designing a parallel Hidden Markov Model, the speech-driven lip movement sequence generator system for Farsi input speech was implemented.... 

    Off-line Arabic/Farsi handwritten word recognition using RBF neural network and genetic algorithm

    Article Proceedings - 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems, ICIS 2010, 29 October 2010 through 31 October 2010, Xiamen ; Volume 3, 2010, Pages 352-357 ; 9781424465835 (ISBN) Bahmani, Z ; Alamdar, F ; Azmi, R ; Haratizadeh, S ; Sharif University of Technology
    2010
    Abstract
    In this paper, an off-line Arabic/Farsi handwritten word recognition algorithm on a subset of Farsi names is proposed. In this system, there is no sub-word segmentation phase. The script database includes 3300 images of 30 common Farsi names. The features are wavelet coefficients extracted from smoothed word image profiles in four standard directions. The centers of the competitive layer of the RBF neural network have been determined by combining GA and the K-means clustering algorithm. The weights of the supervised layer have been trained using the LMS rule, and the distances from the feature vector of each sample to the centers of the RBF network have been computed based on a warping function. Experimental results show advantages of...