Professional Documents
Culture Documents
Keywords: Logistic Regression, Support The most important organ in the human
Vector Classifier, K-Nearest Neighbours, body, the heart controls blood flow
Decision Tree, Random Forest and throughout the body. Other body parts may
Gradient Boosting suffer if there is any kind of heart function
2
impairment. Heart disease is currently the compared. The complex task of decision
biggest cause of death among people. making using discrete data is easily handled
According to estimates from the World by machine learning.By identifying hidden
Health Organization, almost 12 million patterns, machine learning (ML) analyzes
people die from heart disease each year the provided data. As a result, a tool is
(WHO). The WHO estimates almost the created that enables medical professionals to
death rate would increase to 23.6 million by diagnose patients quickly, treat them
2030 [6]. effectively, and prevent negative outcomes.
[1] [4] [14].
Dizziness, ankle swelling, shortness of
breath, slow heartbeats, fainting, The field of machine learning is growing at
lightheadedness, pain in the neck, jaw, a fast pace in different industries such as
throat, dullness, weakness, or coldness in healthcare, transportation, finance,
your body parts, and irregular heartbeats are agriculture, cybersecurity, marketing, etc.
all signs of this illness. We can reduce manual error with computer
analysis and also, increase the accuracy and
Heart disease can be prevented if detected efficiency of a system.
earlier. More accurate diagnoses in less time.
Providing the best standard services and The need for near 100% accuracy and
early, correct diagnosis is the industry's key reduction of human error is most in the
problem. The extensive application of healthcare industry.
machine learning, which also produces
favorable results with the highest accuracy
for medical diagnostics, can have a positive II. LITERATURE SURVEY
impact on the healthcare sector. A project is created to predict the
possibility of getting heart disease in
Finding the best algorithm for heart disease patients. It is predicted in terms of
prediction is the study's goal for data percentage using decision Tree as well as
mining. K-nearest Neighbor algorithm, taking into
consideration vital factors which include
Algorithms like Random Forest, Logistic pressure level, gender, age, cholesterol,
Regression, K-Nearest Neighbors Classifier, chest pain, rest blood pressure, fasting
Support Vector Classifiers, Decision Tree blood sugar, chest pain type,
Classifier, and Gradient Boosting Classifier electrocardiographic result [1].
are used in the classification and
construction of a model to diagnose heart A study on the prediction of
disease in patients. [11]. cardiovascular disease compared the
accuracies of two algorithms and their
A dataset is used to apply algorithms, and hybrids. The study came to the conclusion
the accuracy levels of the results are that Decision tree algorithm had a 79%
3
accuracy rate, Random forest algorithm heart disease patients and the Decision
had an 81% accuracy rate, and their Tree Model had an accuracy level of
Hybrid model had an 88% accuracy rate. 91%.[5].
[2]
Mohd Faisal Ansari studied how attributes
Ekta Maini's work on developing a affected the outcomes of a logistic
machine learning model for effective and regression technique model. He used a
early cardiovascular disease prediction. variety of models, including logistic (all
With the aid of different algorithms, India attributes), logistic (most significant
took into account eleven related factors of attributes), logistic (removing the least
a subject and came to the conclusion that significant attribute), SVM, and logistic
the accuracy of logistic regression is (removing the least significant attribute)
90.8%, the specificity of KNN is 87.1%, (with PCA) gave 86% accuracy, recall
and the specificity of the AB model is 68% and specificity 69% with precision
93.1% [3]. 77%, and a f1 score of 72%, the study's
findings demonstrate that Logistic (with
Rati Goel et al gave a brief comparison PCA) performed best. [6].
between the efficiency of six different
algorithms which include Support Vector T Marikani went through various studies
Machine, Random Forest, Naïve Bayes, to find the best suited algorithm of heart
Decision Tree, Logistic regression, disease prediction, the algorithms under
K-nearest Neighbor, for the purpose of scanner here are supervised learning
finding the best suited algorithm to detect algorithms like Decision tree, Naïve
heart disease. The study came to a Bayes, Random Forest Tree, KNN and
conclusion that the accuracies of each of When finished, use a support vector
the algorithms were as follows -Logistic machine. According to the study, the
Regression 77%, KNN 82%, SVM 86%, accuracy of various algorithms varied
Naïve Bayes 68%, Decision Tree 83%, depending on the implementation tools
Random Forest 83%. According to the and attributes used. [7]
analysis, Support Vector Machine is the
best algorithm for heart disease early V.V.Ramalingam carried a comprehensive
prediction [4]. study on the comparison of various
methodologies for heart disease prediction
Santhana Krishnan used Decision Tree which included algorithms and techniques
Classification and Naive Bayes such as Decision Tree, Support Vector
Classification models for classification. Machine, Naïve Bayes, Random Forest, K
After applying these two supervised-data – Nearest Neighbour, Ensemble Model,
mining algorithms to the dataset, it was the study concluded that Each of the
discovered that the Naive Bayes classifier above-mentioned algorithms have
had an accuracy level of 87% in predicting performed well in some cases but poorly
4
Algorithm Accuracy
(%)
V. CONCLUSION
Our research focuses on using various
machine learning techniques to predict heart
disease, and we assess the efficacy of these
algorithms by presenting a variety of signs
that can be used to determine whether a
patient has heart disease or not. The
research demonstrates how several machine
learning algorithms function in the
foretelling of a cardiovascular disease.
Using Python programming, the
classification procedures employed in the
study were carried out. According to the
results above, the Random Forest Classifier
is the best-performing machine learning
After the user has entered the values for the technique out of all the strategies examined.
fields, if the model returns 1 for the values It has an accuracy rate of 83.60 percent.
user provided, then it will show “Possibility The average accuracy predicted is 78.94%.
of Heart Disease”, else, it’ll show “No Heart K-Nearest Neighbors is the least accurate
Disease” algorithm with accuracy 73.77%. In order to
predict cardiac illness earlier and lower the
death rate, machine learning can be utilized
efficiently in this way.
[15]https://www.kaggle.com/johnsmith88/he
art-disease-dataset.