AYOOLA, Ifeoluwa Valentina and ARIWAYO, Afolabi Gabriel and OKE, Abayomi Samuel (2024) Analysis of Breast Cancer Data Using KNN Algorithm. Advances in Research, 25 (6). pp. 535-544. ISSN 2348-0394
Ayoola2562024AIR128709.pdf - Published Version
Download (562kB)
Abstract
Comparing the performance of machine learning algorithms over the same dataset is the current research trend. The fact that each algorithm has its underlying assumptions indicate that not all algorithms should be compared with each other on the same dataset. Hence, every algorithm should be compared only when the dataset satisfies the underlying assumptions. Algorithms that are suitable for a certain dataset should only be compared with each other at the optimal level. To pave a way for this, this study investigates the performance of several variations of the -Nearest Neighbors (NN) algorithm on a dataset comprising 569 breast cancer cases from the United States. The research evaluates the impact of three distance metrics, namely; Chebyshev, Manhattan, and Euclidean, across various values of . The analysis reveals that the optimal metric is Euclidean distance metric and the optimal values are and The optimal results obtained is 97.37% accuracy, 97.26% (Benign) and 97.56% (Malignant) precision, 98.61% (Benign) and 95.24% (Malignant) recall, and 97.93% (Benign) and 96.39% (Malignant) F1-scores. Additionally, the two optimal models (for and ) exhibit strong agreement on the feature’s importance except compactness feature. Further analysis is recommended to better understand the role of compactness in breast cancer diagnosis.
Item Type: | Article |
---|---|
Subjects: | STM Digital Press > Multidisciplinary |
Depositing User: | Unnamed user with email support@stmdigipress.com |
Date Deposited: | 10 Jan 2025 06:59 |
Last Modified: | 10 Jan 2025 06:59 |
URI: | http://digitallibrary.eprintscholarlibrary.in/id/eprint/1605 |