Similarity Based Prediction System using Machine Learning Algorithms in Big Data Analytics
M. Vanitha1, P. Geetha2, E. Ramaraj3

1M. Vanitha, Research Scholar, Alagappa University, Karaikudi.
2Dr. P. Geetha, Associate Professor, Dr. Umayal Ramanatha College for Women, Karaikudi.
3Dr. E. Ramaraj, Professor and Head, Dept. of Computer Science, Alagappa University, Karaikudi.

Manuscript received on September 16, 2019. | Revised Manuscript received on September 24, 2019. | Manuscript published on October 10, 2019. | PP: 4489-4494 | Volume-8 Issue-12, October 2019. | Retrieval Number: L35241081219/2019©BEIESP | DOI: 10.35940/ijitee.L3524.1081219
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Big data environments must accommodate very large volumes of diverse data. Big data analytics uses machine learning algorithms to process large datasets drawn from sources such as histories, weblogs, data repositories, and data warehouses. Most existing data mining approaches cannot handle datasets of this size: they lack compatibility with database systems and analysis tools, and clustering and analyzing large datasets remains a major problem. For this reason, this research work uses machine learning algorithms implemented in Hadoop to collect and process large amounts of structured, semi-structured, or unstructured data in a reasonable amount of time. The approach also yields a more accurate prediction system and more accurate information, and the use of machine learning algorithms minimizes computational cost and complexity. The overall work is implemented in Hadoop using the Python programming language and is compared with some existing algorithms. The proposed work is tested with suitable parameters such as accuracy, Kappa T, and Kappa M.
Keywords: Big Data, Machine Learning, SBPS, PCA-O, PERA.
Scope of the Article: Machine Learning
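
The abstract names accuracy, Kappa T, and Kappa M as evaluation parameters but does not define them. The following is a minimal Python sketch, assuming the definitions commonly used in data stream evaluation: Kappa M measures improvement over a baseline that always predicts the majority class, and Kappa T measures improvement over a baseline that repeats the previous true label. The function names and the toy labels are illustrative assumptions and are not taken from the article.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def kappa_m(y_true, y_pred):
    """Improvement over a baseline that always predicts the majority class.

    Assumption: the majority class is taken over the whole evaluation set.
    """
    p0 = accuracy(y_true, y_pred)
    p_maj = Counter(y_true).most_common(1)[0][1] / len(y_true)
    return (p0 - p_maj) / (1 - p_maj) if p_maj < 1 else 0.0


def kappa_t(y_true, y_pred):
    """Improvement over a baseline that predicts the previous true label.

    The first instance has no predecessor, so both the classifier and the
    baseline are scored from the second instance onward.
    """
    p0 = accuracy(y_true[1:], y_pred[1:])
    p_per = accuracy(y_true[1:], y_true[:-1])
    return (p0 - p_per) / (1 - p_per) if p_per < 1 else 0.0


# Toy labels (hypothetical, for illustration only).
y_true = [0, 0, 1, 1, 0, 1, 1, 1]
y_pred = [0, 0, 1, 0, 0, 1, 1, 1]
print(accuracy(y_true, y_pred), kappa_m(y_true, y_pred), kappa_t(y_true, y_pred))
```

Under these definitions, a value of 0 means the classifier does no better than the corresponding baseline, and a negative value means it does worse, which is why such measures are often reported alongside plain accuracy on imbalanced or temporally correlated data.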