Prediction of Black Sigatoka Disease in Banana Plants By Data Mining Classification Techniques using Scikit for Python
Srivalli Devi S1, A Geetha2

1Srivalli Devi S*, PG & Research Department of Computer Science, Chikkanna Government Arts College, Tirupur, India. Email:
2Dr.A.Geetha, PG & Research Department of Computer Science, Chikkanna Government Arts college, Tirupur, India.
Manuscript received on December 13, 2019. | Revised Manuscript received on December 25, 2019. | Manuscript published on January 10, 2020. | PP: 1273-1278 | Volume-9 Issue-3, January 2020. | Retrieval Number: C8714019320/2020©BEIESP | DOI: 10.35940/ijitee.C8714.019320
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (

Abstract: Agriculture has been evolving since humans started cultivating plants for food consumption. As the agriculture field evolves, the disease control measures too have evolved. Now in this modern era, disease in plants can be easily identified using computers. Data mining is the process of obtaining the useful information from the data. Before the electronic era, diseases in plants are identified just by seeing the symptoms of the plants. Similarly, we can identify the diseases in plants using data mining by supplying the disease symptoms data and classify them accordingly. The purpose of this paper is focusing on the prediction of the diseases from images of black sigatoka disease and uses the following methods: Multilayer Perceptrons, SVM, KN eighbors Classifier, K-Neighbors Regressor, Gaussian Process Regressor, Gaussian Process Classifier, GaussianNB, Decision Tree Classifier, Decision Tree Regressor, linear models such as Linear Regression, Ridge CV, Lasso, Elastic Net, Logistic Regression CV, SGD Classifier, Perceptron and Passive Aggressive Classifier and ensemble models of the above classifiers. The results are compared, and multilayer perceptron model is seen to give better results for individual classifiers and ensemble of week classifiers gives better results when ensembled. In future, a new hybrid algorithm would be used from the above algorithms for attaining better accuracy. The scikit is a library used for classification, clustering, regression, dimensionality reduction, model selection and preprocessing. Our paper discusses various classifiers used in scikit-learn library for Python and their ensembling is done. This can be applied to all the classification tasks. Classification is done for classifying the black sigatoka disease in banana from healthy leaves. This disease is the most vulnerable one among banana plants.
Keywords: Agriculture, Black Sigatoka, Classification, Data Mining, Scikit.
Scope of the Article: Classification