Feature Selection in Breast Cancer Gene Expression Data Using KAO and AOA with SVM Classification.
Journal:
Journal of medical systems
Published Date:
Mar 26, 2025
Abstract
Breast cancer classification using gene expression data presents significant challenges due to high dimensionality and complexity. This study introduces a novel hybrid framework integrating the Kashmiri Apple Optimization Algorithm (KAO) and the Armadillo Optimization Algorithm (AOA) for effective feature selection, coupled with Support Vector Machines (SVM) for precise classification. The dual-stage approach leverages KAO for global exploration of informative genes and AOA for refining the selection through local optimization, addressing issues of redundancy and premature convergence. Applied to breast cancer datasets, the proposed method achieved a classification accuracy of 98.97%, precision of 98.46%, recall of 100%, and an F1-score of 99.22% using a subset of 15 genes. The robustness of the framework was validated across varying subset sizes, demonstrating consistent high performance. By optimizing feature relevance and redundancy, the KAO-AOA framework provides a promising tool for gene-based cancer prediction with potential applications to other cancer datasets and real-world clinical use.