Comparison between the Naïve Bayes and Hierarchical Clustering to Classify The Global Landslide Catalog for the Prediction of the Landslide.
Poonam Verma1, Charu Negi2, Nisha Chandran3, N. S. Bohra4

1Poonam Verma, Graphic Era Hill University, Clement Town, Dehradun (Uttarakhand), India.

2Charu Negi, Graphic Era Hill University, Clement Town, Dehradun (Uttarakhand), India.

3Nisha Chandran, Graphic Era Hill University, Clement Town, Dehradun (Uttarakhand), India.

4N. S. Bohra, Department of Management Studies, Graphic Era Deemed to be University, Dehradun (Uttarakhand), India.

Manuscript received on 15 June 2020 | Revised Manuscript received on 26 June 2020 | Manuscript Published on 04 July 2020 | PP: 34-37 | Volume-8 Issue-12S3 October 2019 | Retrieval Number: L100810812S319/2020©BEIESP | DOI: 10.35940/ijitee.L1008.10812S319

Open Access | Editorial and Publishing Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open-access article under the CC-BY-NC-ND license (

Abstract: Machine Learning has been used since long to identify the features of a given datasets that are important for the prediction. Landslides are complex events taking place in the various regions of the world. It is the movement of the debris, soil or rocks from an upper plane in downward direction. Identification of the features that are used for the Landslide involves consideration of various categories of parameters. Present paper studies about the performance comparison between a supervised algorithm Naïve Bayes and unsupervised algorithm Hierarchical Clustering. Naïve Bayes is a non parametric supervised algorithm that can be used for the forecasting purposes in the field of Agriculture, Economics, Aviation etc, whereas Hierarchical Clustering is used to partition the available instances of a dataset into optimal homogeneous groups on the basis of the similarities between the datapoints. The present paper draws a comparison between the accuracy of the Naïve Bayes and Hierarchical Clustering for the prediction of the Landslide dataset. The dataset used is the Global Landslide Catalog that has important parameters like date, location coordinates, country, trigger of the event, continent etc. Before the implementation of both the algorithms, reduction of the parameters is carried out using subset evaluation of the parameters and considering only the most important.

Keywords: Landslide Prediction, GLC, Hierarchical Clustering, Naïve Bayes, Multinomial Text, Machine Learning.
Scope of the Article: Clustering