Design and Implementation of a Face Model in Video-realistic Speech Animation for Farsi Language

Ghasemi Naraghi, Zeinab | 2013

  1. Type of Document: M.Sc. Thesis
  2. Language: Farsi
  3. Document No: 44362 (19)
  4. University: Sharif University of Technology
  5. Department: Computer Engineering
  6. Advisor(s): Jamzad, Mansour
  7. Abstract: With the increasing use of computers in everyday life, better communication between machines and humans is needed. To communicate properly and to understand a human face rendered in a graphical environment, audio-visual projects such as lip reading, audio-visual speech recognition, and lip modelling need to be implemented. The main goal of this project is a natural representation of sequences of lip movements for the Farsi language. The lack of a complete audio-visual database for this application in Farsi led us to build a new, complete Farsi database for this project, called SFAVD. It is a unique audio-visual database that covers the most commonly used words, all phones, diphones, and common syllables in sentences. After extracting the audio and visual features, a parallel HMM and a coupled HMM are trained. Finally, we developed a system that converts Farsi audio into a sequence of lip images.
  8. Keywords: Hidden Markov Model; Persian Language; Face Animation; Audio Visual Database; Lip Movement Animation
