GRT: Gurmukhi to Roman Transliteration System using Character Mapping and Handcrafted Rules
Shailendra Kumar Singh1, Manoj Kumar Sachan2
1Shailendra Kumar Singh, Department of Computer science & Engineering, Sant Longowal Institute of Engineering and Technology, Longowal, Sangrur, Punjab, India.
2Manoj Kumar Sachan, Department of Computer science & Engineering, Sant Longowal Institute of Engineering and Technology, Longowal, Sangrur, Punjab, India. Corresponding author
Manuscript received on 30 June 2019 | Revised Manuscript received on 10 July 2019 | Manuscript published on 30 July 2019 | PP: 2758-2763 | Volume-8 Issue-9, July 2019 | Retrieval Number: I8636078919/19©BEIESP | DOI: 10.35940/ijitee.I8636.078919
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In the last two decades, the transliteration system has got significant research attention. It is observed that Punjabi to English transliteration for all type of part-of-speech words is comparably less studied. Currently, some research work in this area is carried out but only for proper nouns and some technical terms. So, there is need to focus on all type of words. The Gurmukhi to Roman transliteration (GRT) system is the first proposed system for transliteration of all kind of part-of-speech words. This system uses the handcrafted-rules and character mapping (CM) approach for transliteration between languages involved. The CM is done for Gurmukhi script with its equivalent to Roman script. It transliterates text written in Punjabi language into English language. It is tested on 65,130 Punjabi words and achieved accuracy of 99.27%, which is better than other state-of-art system results. The developed system can be used in social media text normalization, translation, sentiment analysis of multilingual text, text summarization of multilingual text, etc.
Keywords: Transliteration, Gurmukhi script, Roman script, Character mapping, Handcrafted rule.
Scope of the Article: System Integration