Image To Speech Synthesizer with Reference to Assamese Numerals
M. Rahman1, P. Sarma2, M.P. Bhuyan3, A. Das4, D. Dutta5
1M. Rahman*, Department of Information Technology, Gauhati University, Guwahati, India.
2P. Sarma, Department of Information Technology, Gauhati University, Guwahati, India.
3M.P. Bhuyan, Department of Information Technology, Gauhati University, Guwahati, India.
4A. Das, Department of Computer Science, NERIM Group of Institutions, Guwahati, India.
5D. Dutta, Department of Computer Science, NERIM Group of Institutions, Guwahati, India.
Manuscript received on October 18, 2019. | Revised Manuscript received on 25 October, 2019. | Manuscript published on November 10, 2019. | PP: 900-905 | Volume-9 Issue-1, November 2019. | Retrieval Number: A4435119119/2019©BEIESP | DOI: 10.35940/ijitee.A4435.119119
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In this research work, we present a methodology for converting printed Assamese numerals into their corresponding utterances. As an initial effort, we have implemented a system that reads only four-digit numerals. In the first phase, we use image processing techniques to convert an image of Assamese numerals into textual (digital) form. In the second phase, the numerals are pronounced as a number by Google speaker. In this system, reference images are stored in a dataset, and the input image is compared with the dataset images using the template matching technique. After recognition of the text, the output is displayed as a speech waveform. This work has many applications in today’s digital world.
Keywords: OCR, Image Processing, Template Matching, Synthesizer, Segmentation, Recognition.
Scope of the Article: Image Processing and Pattern Recognition
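
The template matching step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the 8x8 random templates stand in for the stored digit images, the equal-width split stands in for the segmentation step, normalized cross-correlation is one common similarity score chosen here as an assumption, and all function names are hypothetical. The speech phase is omitted.

```python
import math
import random

SIZE = 8  # assumed glyph width and height in pixels (hypothetical)

def ncc(a, b):
    """Normalized cross-correlation of two equal-length pixel lists."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    da = [x - mean_a for x in a]
    db = [y - mean_b for y in b]
    denom = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    return sum(x * y for x, y in zip(da, db)) / denom if denom > 0 else 0.0

def match_digit(glyph, templates):
    """Return the label of the stored template most similar to the glyph."""
    return max(templates, key=lambda label: ncc(glyph, templates[label]))

def recognize(image, templates, n_digits=4):
    """Cut the image (a list of pixel rows) into equal-width cells, match each."""
    labels = []
    for i in range(n_digits):
        # Flatten cell i in row-major order, the same layout as the templates.
        cell = [px for row in image for px in row[i * SIZE:(i + 1) * SIZE]]
        labels.append(str(match_digit(cell, templates)))
    return "".join(labels)

# Synthetic stand-ins for the dataset images (hypothetical data, not real glyphs).
rng = random.Random(0)
templates = {d: [rng.randint(0, 1) for _ in range(SIZE * SIZE)] for d in range(10)}

# Build a noisy four-digit input image by placing templates side by side.
digits = [2, 0, 1, 9]
image = []
for r in range(SIZE):
    row = []
    for d in digits:
        row += templates[d][r * SIZE:(r + 1) * SIZE]
    image.append([px + rng.gauss(0, 0.1) for px in row])

text = recognize(image, templates)
# Map ASCII digits to the Assamese digit code points (U+09E6 to U+09EF).
assamese = "".join(chr(0x09E6 + int(c)) for c in text)
print(text, assamese)
```

In a real system, the synthetic templates would be replaced by the stored numeral images, the equal-width split by a proper segmentation of the printed numeral, and the recognized text would then be passed to a speech synthesizer.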