Development and Integration of an Odia Stemmer in Dspace for Odia Search Engine
Gouranga Charan Jena1, Siddharth Swarup Rautaray2
1Gouranga Charan Jena, Anand Pani School of Computer Engineering, KIIT DU, Odisha.
2Siddharth Swarup Rautaray, School of Computer Engineering, KIIT DU, Odisha.
Manuscript received on January 12, 2020. | Revised Manuscript received on January 22, 2020. | Manuscript published on February 10, 2020. | PP: 836-840 | Volume-9 Issue-4, February 2020. | Retrieval Number: D1108029420/2020©BEIESP | DOI: 10.35940/ijitee.D1108.029420
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Stemmer is used for reducing inflectional or derived word to its stem. This technique involves removing the suffix or prefix affixed in a word. It can be used for information retrieval system to refine the overall execution of the retrieval process. This process is not equivalent to morphological analysis. This process only finds the stem of a word. This technique decreases the number of terms in information retrieval system. There are various techniques exists for stemming. Here a new hybrid stemmer has developed named as “Mula” for Odia Language. It is a combination of brute force and enhanced suffix stripping approach for Odia language. The new born stemmer is both computationally inexpensive and domain independent. We have integrated this stemmer in existing Dspace for Odia text retrieval System. The results are commendable and suggest that the new stemmer can be used effectively in Odia Search Engine. The proposed stemmer also handles over-stemming and under stemming effectively.
Keywords: Derivational Suffixes, Inflectional Words, Odia Stemmer, Information Retrieval, Brute force, D Space etc.
Scope of the Article: Information Retrieval