Implications of Meta Classifiers for Onset Diabetes Prediction
Md. Ashaf Uddaula1, Md. Al – Amin Hossain2, Md. Khalid Hossen3, Ahmed Al Marouf4

1Md. Ashaf Uddaula, Department of CSE, Daffodil International University, Dhaka, Bangladesh.
2Md. Al-Amin Hossain, Department of CSE, Daffodil International University, Dhaka, Bangladesh.
3Md. Khalid Hossen, Department of CSE, Daffodil International University, Dhaka, Bangladesh.
4Ahmed Al Marouf, Department of CSE, Daffodil International University, Dhaka, Bangladesh.
Manuscript received on February 10, 2020. | Revised Manuscript received on February 24, 2020. | Manuscript published on March 10, 2020. | PP: 266-274 | Volume-9 Issue-5, March 2020. | Retrieval Number: D2070029420/2020©BEIESP | DOI: 10.35940/ijitee.D2070.039520
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: In the data mining area, the prophecy of human diseases initiates a research zone for researchers by applying various machine learning algorithms with various patterns. As a modern community disease, diabetes is becoming one of the fastest-progressive human diseases in the world because of eating heavily sugared foods and lack of proper diet knowledge. In this era, most of the middle age people have confusion about the presence of diabetes in their bodies. That’s why we choose to do research on diabetes. In this paper, we scrutinized the classification performance of six Meta Classifiers named as Multiclass Classifier Updatable, Attribute Selected Classifier, Ada Boost M1, Logit Boost, Bagging, and Filtered Classifier for forecasting diabetes through cross-validation and percentage split techniques using in WEKA whereas as a diabetes dataset we used Pima Indians Database. And finally, according to win-rate from the Win-Draw-Loss table, the highest performance comes from Multiclass Classifier Updatable which has an 80% win-rate. On the other hand, in the measurement of highest individual accuracy, 81.9923% comes from both Attribute Selected Classifier and Filtered Classifier. According to the measurement of the highest average performance, 66% Split as a percentage split technique and Attribute Selected Classifier show the highest performance. 
Keywords: Data Mining, Meta Classifier, WEKA, Percentage Split Technique, Diabetes, Win Rate, Multiclass Classifier Updatable, Attribute Selected Classifier.
Scope of the Article: Data Mining