Automatic Speech Recognition (ASR) System for Isolated Marathi Words: Using HTK
Sunil B. Patil1, Nita V. Patil2, Ajay S. Patil3
1Sunil B. Patil, School of Computer Sciences(SOCS), Kavayitri Bahinabai Chaudhari North Maharashtra University(KBCNMU), Jalgaon, India
2Nita V. Patil, School of Computer Sciences(SOCS), Kavayitri Bahinabai Chaudhari North Maharashtra University(KBCNMU), Jalgaon, India.
3Ajay S. Patil*, School of Computer Sciences(SOCS), Kavayitri Bahinabai Chaudhari North Maharashtra University(KBCNMU), Jalgaon, India.
Manuscript received on September 16, 2019. | Revised Manuscript received on 24 September, 2019. | Manuscript published on October 10, 2019. | PP: 3702-3705 | Volume-8 Issue-12, October 2019. | Retrieval Number: L26511081219/2019©BEIESP | DOI: 10.35940/ijitee.L2651.1081219
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: The present manuscript focuses on building automatic speech recognition (ASR) system for Marathi language (M-ASR) using Hidden Markov Model Toolkit (HTK). The M-ASR system gives the detail about experimentation and implementation using the HTK Toolkit. In this work total 106 speaker independent Marathi isolated words were recognized. These unique Marathi words are used to train and evaluate M-ASR system. The speech corpus (database) is created by us using isolated Marathi words uttered with mixed gender people. The system uses Mel Frequency Cepstral Coefficient (MFCC) for the purpose of extracting features using Gaussian mixture model (GMM). Viterbi algorithm based on token passing is used for decoding to recognize unknown utterances. The proposed M-ASR system is speaker independent. The proposed system has reported 96.23% word level recognition accuracy.
Keywords: Automatic Speech Recognition, Marathi, Hidden Markov Model, HTK, Isolated Words, Mel-Frequency Cepstral Coefficient.
Scope of the Article: Frequency Selective Surface