Evaluating the Performance of Supervised Machine Learning Algorithms in Breast Cancer Datasets
K. Y. Obiwusi, Y. O. Olatunde, G. K. Afolabi, A. Oke, A. M. Oyelakin, A. Salami
Abstract
Breast cancer is the leading cause of mortality globally. Several attempts have been made to use data mining methodology together with machine learning techniques to develop systems that can detect or prevent breast cancer. In line with the reviewed paper; large datasets for illness analysis have been developed. In this study, the results of selected Machine Learning algorithms are compared: Decision Table, J48, SGD, bagging, and Naïve Bayes Updateable on Wisconsin Breast Cancer Original dataset was conducted using weka tools. Exploratory data analysis, pre-processed with supervised attribute selection and class order, was used to identify potential features to aid the performance of the chosen algorithms in classification. The empirical result showed that Decision Table explores greater likelihood (74% correctly classified instances, True Positive Rate of 0.752, False Positive Rate of 0.478, Precision of 0.77, receiver operating characteristic Area of 0.682) in terms of accuracy and efficiency compared with others. This study's comparison technique is thought to aid breast cancer detection.
Keywords
Breast cancer classification; Breast-cancer datasets; Data mining; ML algorithms; Supervised machine learning
Asri, H., Mousannif, H., Al Moatassime, H., and Noel, T. (2016). Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Computer Science, 83, 1064-1069.
Rajput, A., Aharwal, R. P., Dubey, M., Saxena, S., and Raghuvanshi, M. (2011). J48 and JRIP rules for e-governance data. International Journal of Computer Science and Security (IJCSS), 5(2), 201.
Shah, P. J., and Shah, T. (2021). Identification of breast tumor using hybrid approach of independent component analysis and deep neural network. International Journal of Intelligent Systems and Applications in Engineering, 9(4), 209-219.
Wu, J., and Hicks, C. (2021). Breast cancer type classification using machine learning. Journal of Personalized Medicine, 11(2), 61.