Clustering mixed data using an Artificial Bee Colony
José F. Cabrera-Venegas1, Yusbel Chávez-Castilla2

1José F. Cabrera-Venegas*, Computer Science Department, University of Ciego de Ávila, Cuba.
2Yusbel Chávez-Castilla, Computer Science Department, University of Ciego de Ávila, Cuba.

Manuscript received on November 13, 2019. | Revised Manuscript received on 22 November, 2019. | Manuscript published on December 10, 2019. | PP: 65-70 | Volume-9 Issue-2, December 2019. | Retrieval Number: A4861119119/2019©BEIESP | DOI: 10.35940/ijitee.A4861.129219
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: In this paper, we have proposed a clustering technique which optimizes the total compactness and separation (measured using the Silhouette index) of the clusters. The proposed algorithm uses an Artificial Bee Colony (ABC) based optimization method as the underlying optimization criterion. We used similarity based prototypes as cluster centers. The proposed clustering technique is able to suitably handle mixed and incomplete data types in such a way that the original characteristics of the data are preserved. Assignment of points to different clusters is done based on a dissimilarity function rather than the Euclidean distance. Results on real-life data sets show that the proposed technique is well-suited to detect true partitioning from data sets. Results are compared with those obtained by four existing clustering techniques, one genetic algorithm based clustering technique (AGKA), the k-Prototypes (KP) algorithm, well-known based K-means clustering technique for similarity functions (KMSF) and a newly developed algorithm with dissimilarity based clustering technique (AD2011). 
Keywords: Swarm Intelligence, Artificial bee Colony, Clustering, Mixed Data.
Scope of the Article: Clustering