Elastic differential evolution for automatic data clustering

Chen, Jun-Xian ORCID: 0000-0002-1928-5603, Gong, Yue-Jiao ORCID: 0000-0002-5648-1160, Chen, Wei-Neng ORCID: 0000-0003-0843-5802, Li, Mengting and Zhang, Jun ORCID: 0000-0001-7835-9871 (2019) Elastic differential evolution for automatic data clustering. IEEE Transactions on Cybernetics, 51 (8). pp. 4134-4147. ISSN 2168-2267

Abstract

In many practical applications, it is crucial to perform automatic data clustering without knowing the number of clusters in advance. The evolutionary computation paradigm is good at dealing with this task, but the existing algorithms encounter several deficiencies, such as the encoding redundancy and the cross-dimension learning error. In this article, we propose a novel elastic differential evolution algorithm to solve automatic data clustering. Unlike traditional methods, the proposed algorithm considers each clustering layout as a whole and adapts the cluster number and cluster centroids inherently through the variable-length encoding and the evolution operators. The encoding scheme contains no redundancy. To enable the individuals of different lengths to exchange information properly, we develop a subspace crossover and a two-phase mutation operator. The operators employ the basic method of differential evolution and, in addition, they consider the spatial information of cluster layouts to generate offspring solutions. Particularly, each dimension of the parameter vector interacts with its correlated dimensions, which not only adapts the cluster number but also avoids the cross-dimension learning error. The experimental results show that our algorithm outperforms the state-of-the-art algorithms that it is able to identify the correct number of clusters and obtain a good cluster validation value.

Dimensions Badge

Altmetric Badge

Item type Article
URI https://vuir.vu.edu.au/id/eprint/45263
DOI 10.1109/TCYB.2019.2941707
Official URL https://ieeexplore.ieee.org/document/8864092
Subjects Current > FOR (2020) Classification > 4602 Artificial intelligence
Current > Division/Research > Institute for Sustainable Industries and Liveable Cities
Keywords computing, data cluster, data clustering, algorithm, encoding scheme
Citations in Scopus 7 - View on Scopus
Download/View statistics View download statistics for this item

Search Google Scholar

Repository staff login