DSpace Repository

Prediction of heart disease by classifying with feature selection and machine learning methods

Show simple item record

dc.creator GAZELOĞLU, Cengiz
dc.date 2020-05-31T21:00:00Z
dc.date.accessioned 2021-01-21T08:24:26Z
dc.date.available 2021-01-21T08:24:26Z
dc.identifier aa49478e-73a3-493e-9de9-49ae2f019885
dc.identifier 10.23751/pn.v22i2.9830
dc.identifier https://avesis.sdu.edu.tr/publication/details/aa49478e-73a3-493e-9de9-49ae2f019885/oai
dc.identifier.uri http://acikerisim.sdu.edu.tr/xmlui/handle/123456789/85730
dc.description Study Objectives: Cardiovascular diseases are among the most common diseases experienced by human beings. In addition, these diseases require spending too much money to be treated. According to the World Health Organization report, 56 million death cases occurred in the World in 2012. Methods: The aim to determine the method (s) with the most accurate classification rate of cardiovascular diseases by using machine learning and feature selection methods. To fulfill this aim, 18 machine learning methods divided into 6 different categories, and 3 different feature selection was used in this study. These methods were analyzed via WEKA, Python and MATLAB computer program. Results: According to the results of the analysis, SVM (PolyKernel) with an 85.148% ratio was found to be the most successful machine learning algorithm without feature selection. After the Correlation-based Feature Selection (CFS) feature selection, the most successful algorithm was Naive Bayes and Fuzzy RoughSet with a ratio of 84.818%. However, after using Chi-Square feature selection, the most successful algorithm was found to be the RBF Network algorithm with 81.188% ratio. Conclusion: Consequently, it is recommended that specialist doctors who want to classify heart disease should use the SVM (PolyKernel) algorithm if they are not going to use feature selection whereas they should use should the Naive Bayes algorithm if they are going to use CFS as a feature selection. Additionally, if they are to use Fuzzy Rough Set and Chi-Square as the feature selection, it is recommended that they use the RBFNetwork algorithm.
dc.language eng
dc.rights info:eu-repo/semantics/closedAccess
dc.title Prediction of heart disease by classifying with feature selection and machine learning methods
dc.type info:eu-repo/semantics/article


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account