Loading...

Persian large vocabulary name recognition system (FarsName)

Hajitabar, A ; Sharif University of Technology

405 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/IranianCEE.2017.7985296
  3. Abstract:
  4. There has been no isolated word recognition database for the Persian language so far. In this paper we introduce FarsName dataset which contains 20 thousands isolated-word Persian utterances spoken by 226 speakers from all regions of the country each saying an average of 88 Persian names. There is a total of 5235 unique names in this dataset. Various cell phone brands have been used to record this dataset. This indicates the high diversity of the utterances in this dataset. We have been able to achieve 10.34% WER on this set using Kaldi. This is a very good performance considering the recording environment have been normal and potentially noisy. © 2017 IEEE
  5. Keywords:
  6. Smartphone ; Mobile phones ; Vocabulary control ; Dataset ; Isolated word recognition ; Isolated words ; Isolatedword ; Large vocabulary ; Name recognition ; Persian languages ; Recording environment ; Speech recognition
  7. Source: 2017 25th Iranian Conference on Electrical Engineering, ICEE 2017, 2 May 2017 through 4 May 2017 ; 2017 , Pages 1580-1583 ; 9781509059638 (ISBN)
  8. URL: https://ieeexplore.ieee.org/document/7985296