Fractal feature selection model for enhancing high-dimensional biological problems

Alsaeedi, Ali Hakem, Al-Mahmood, Haider Hameed R, Alnaseri, Zainab Fahad, Aziz, Mohammad R, Al-Shammary, Dhiah, Ibaida, Ayman ORCID: 0000-0003-1581-7219 and Ahmed, Khandakar ORCID: 0000-0003-1043-2029 (2024) Fractal feature selection model for enhancing high-dimensional biological problems. BMC Bioinformatics, 25. ISSN 1471-2105

Abstract

The integration of biology, computer science, and statistics has given rise to the interdisciplinary field of bioinformatics, which aims to decode biological intricacies. It produces extensive and diverse features, presenting an enormous challenge in classifying bioinformatic problems. Therefore, an intelligent bioinformatics classification system must select the most relevant features to enhance machine learning performance. This paper proposes a feature selection model based on the fractal concept to improve the performance of intelligent systems in classifying high-dimensional biological problems. The proposed fractal feature selection (FFS) model divides features into blocks, measures the similarity between blocks using root mean square error (RMSE), and determines the importance of features based on low RMSE. The proposed FFS is tested and evaluated over ten high-dimensional bioinformatics datasets. The experiment results showed that the model significantly improved machine learning accuracy. The average accuracy rate was 79% with full features in machine learning algorithms, while FFS delivered promising results with an accuracy rate of 94%.

Dimensions Badge

Altmetric Badge

Item type Article
URI https://vuir.vu.edu.au/id/eprint/47695
DOI 10.1186/s12859-023-05619-z
Official URL https://bmcbioinformatics.biomedcentral.com/articl...
Subjects Current > FOR (2020) Classification > 3102 Bioinformatics and computational biology
Current > Division/Research > Institute for Sustainable Industries and Liveable Cities
Keywords bioinformatics; feature selection; high-dimensional datasets; fractal; machine learning
Download/View statistics View download statistics for this item

Search Google Scholar

Repository staff login