Performance-enhanced KNN algorithm-based heart disease prediction with the help of optimum parameters


Creative Commons License

TAKCI H.

JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, cilt.38, sa.1, ss.451-460, 2022 (SCI-Expanded) identifier identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 38 Sayı: 1
  • Basım Tarihi: 2022
  • Doi Numarası: 10.17341/gazimmfd.977127
  • Dergi Adı: JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Art Source, Compendex, TR DİZİN (ULAKBİM)
  • Sayfa Sayıları: ss.451-460
  • Anahtar Kelimeler: Heart disease diagnosis, machine learning, K nearest neighbor classifier, DIAGNOSIS, OPTIMIZATION
  • Sivas Cumhuriyet Üniversitesi Adresli: Evet

Özet

Heart diseases are diseases with a high mortality rate. Clinical methods and machine learning methods have been used frequently in the diagnosis of the disease. In this study, the KNN algorithm was used for diagnosis. In order to increase the performance of the algorithm, parameter adjustment has been made and in this context; Manhattan, Euclidean and Chebyshev distance measurements, Uniform and Distance weighting methods and neighbor numbers between 1...15 have been tested on UCI Statlog and Cleveland datasets. The highest classification accuracy for the Statlog dataset is 67.90%, which is obtained with the number of neighbors = 5, the distance method = Euclidian, and the weighting = Distance. Genetic algorithms were also run on the same data set and 88.88% accuracy was obtained for the number of neighbors = 5, distance method = Euclidean and weighting = Distance. While the maximum classification accuracy obtained for the Cleveland dataset was 71.42% before optimization, it was measured as 90.11% after optimization. The parameters that give the highest classification accuracy for the Cleveland dataset are; number of neighbors = 3, distance method = Manhattan and weighting = Uniform.