Data Pre-Processing Algorithm for Neural Network Binary Classification Model in Bank Tele-Marketing
Khairul Nizam Abd Halim1, Abdul Syukor Mohamad Jaya2, Ahmad Firdaus Ahmad Fadzil3

1Khairul Nizam Abd Halim*, Faculty of Information and Communication Technology, University Teknikal Malaysia Melaka, Hang Tuah Jaya, Durian Tunggal, Melaka, Malaysia.
2Abdul Syukor Mohd Jaya, Faculty of Information and Communication Technology, University Teknikal Malaysia Melaka, Hang Tuah Jaya, Durian Tunggal, Melaka, Malaysia.
3Ahmad Firdaus Ahmad Fadzil, Faculty of Computer and Mathematical Sciences, University Technology MARA Cawangan Melaka, Merlimau, Melaka, Malaysia.
Manuscript received on December 17, 2019. | Revised Manuscript received on December 22, 2019. | Manuscript published on January 10, 2020. | PP: 272-277 | Volume-9 Issue-3, January 2020. | Retrieval Number: C8472019320/2020©BEIESP | DOI: 10.35940/ijitee.C8472.019320
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Tele-marketing presents a major challenge in identifying potential customers, and the lack of an effective marketing strategy may lead a company to succumb to problems such as prolonged marketing campaigns. Various attempts have been made to improve the performance of binary classification models for bank tele-marketing data. Previous research indicates that the neural network is the most commonly employed algorithm and that it produces commendable results, with higher accuracy than other algorithms. Although several studies have tried to improve the model through treatment of imbalanced data and feature selection, this research argues that those efforts are incomplete. It therefore proposes a data pre-processing algorithm for a bank tele-marketing binary classification neural network. Three datasets (with 19, 16, and 20 features) were used to evaluate the effect of the algorithm on the classification model. The data pre-processing algorithm is divided into three phases: data cleaning, data imbalance treatment, and data normalization. The results indicate that a binary classification model complemented with data cleaning techniques such as Missing Common (MC) and Tomek Links (TL) performs better than one using Ignore Missing (IM). For data normalization, Max Abs Scaler (MAS) and Min Max Scaler (MMS) consistently performed better than the other normalization techniques. The classification model in this paper uses the MC-TL-MMS combination of the data pre-processing algorithm. With this approach, the model recorded an area under the receiver operating characteristic curve (AUC) of 0.9129 and 0.9464 using 16 features and 20 features, respectively. These are the highest performance figures among the previous studies compared.
Keywords: Classification, Neural Network, Data pre-processing, Tele-Marketing.
Scope of the Article: Classification
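
Illustrative sketch: the three phases named in the abstract (data cleaning, data imbalance treatment, data normalization) could be chained roughly as follows for the MC-TL-MMS combination. This is not the authors' implementation. It assumes that categorical features are already integer-encoded, that missing values are marked with None, that "Missing Common" means replacing a missing value with the most frequent observed value in its column, and that the majority-class member of each Tomek link is the one removed (a common convention). All function names and the toy data are hypothetical.

```python
import math
from collections import Counter


def impute_most_common(X):
    """Phase 1 (assumed MC): fill None with the column's most frequent observed value."""
    n_cols = len(X[0])
    modes = []
    for j in range(n_cols):
        observed = [row[j] for row in X if row[j] is not None]
        modes.append(Counter(observed).most_common(1)[0][0])
    return [[modes[j] if row[j] is None else row[j] for j in range(n_cols)]
            for row in X]


def nearest_neighbor(i, X):
    """Index of the sample closest to X[i] in Euclidean distance (excluding i itself)."""
    best, best_d = None, math.inf
    for k, other in enumerate(X):
        if k == i:
            continue
        d = math.dist(X[i], other)
        if d < best_d:
            best, best_d = k, d
    return best


def remove_tomek_links(X, y):
    """Phase 2 (TL): two samples of different classes that are each other's
    nearest neighbor form a Tomek link; drop the majority-class member."""
    majority = Counter(y).most_common(1)[0][0]
    nn = [nearest_neighbor(i, X) for i in range(len(X))]
    drop = set()
    for i, j in enumerate(nn):
        if nn[j] == i and y[i] != y[j]:
            drop.add(i if y[i] == majority else j)
    keep = [i for i in range(len(X)) if i not in drop]
    return [X[i] for i in keep], [y[i] for i in keep]


def min_max_scale(X):
    """Phase 3 (MMS): rescale every column to the range [0, 1]."""
    cols = list(zip(*X))
    lo = [min(c) for c in cols]
    hi = [max(c) for c in cols]
    return [[(v - l) / (h - l) if h > l else 0.0
             for v, l, h in zip(row, lo, hi)]
            for row in X]


def preprocess_mc_tl_mms(X, y):
    """Chain the phases in the order given in the abstract."""
    X = impute_most_common(X)
    X, y = remove_tomek_links(X, y)
    return min_max_scale(X), y


# Toy data (made up for illustration): class 0 is the majority class.
X_raw = [[1.0, 0], [1.2, None], [0.9, 0], [3.0, 0], [3.1, 1], [6.0, 1]]
y_raw = [0, 0, 0, 0, 1, 1]

X_out, y_out = preprocess_mc_tl_mms(X_raw, y_raw)
print(len(X_out), y_out)
```

In the toy data, the fourth and fifth samples are mutual nearest neighbors with different labels, so the majority-class sample of that pair is removed before scaling. Because Tomek links depend on distances, running the normalization before or after the link removal can change which samples are dropped; the sketch follows the MC-TL-MMS order stated in the abstract.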