Professional Documents
Culture Documents
ARTICLEINFO ABSTRACT
Article History: Heart disease is a significant global health concern, accounting for a
substantial number of deaths worldwide. Early prediction and detection of
Accepted: 05 May 2023
heart disease play a pivotal role in improving patient outcomes and
Published: 24 May 2023
reducing mortality rates. Machine learning techniques have demonstrated
considerable potential in accurately predicting heart disease based on
patient data. In this research paper, we propose a novel approach to heart
Publication Issue disease prediction using machine learning algorithms, with a particular
Volume 10, Issue 3 focus on creating a user-friendly graphical user interface (GUI) for
May-June-2023 enhanced accessibility and ease of use. The proposed approach leverages a
diverse dataset encompassing demographic information, medical history,
Page Number laboratory results, and diagnostic tests, providing a comprehensive view of a
patient's health status. Multiple state-of-the-art machine learning
398-404
algorithms, including logistic regression, support vector machines, random
forests, and artificial neural networks, are employed to build robust
prediction models. These models are trained, validated, and evaluated using
appropriate performance metrics to ensure accuracy and reliability. To
facilitate practical implementation, a user-friendly GUI is designed to
provide an intuitive interface for healthcare professionals and individuals
without extensive programming or machine learning expertise.
Keywords: Heart disease, Graphical User Interface, Machine Learning,
Logistic Regression, KNN, accuracy
Copyright: © 2023, the author(s), publisher and licensee Technoscience Academy. This is an open-access article distributed 398
under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial
use, distribution, and reproduction in any medium, provided the original work is properly cited
Prof. Sachin Sambhaji Patil et al Int J Sci Res Sci & Technol. May-June-2023, 10 (3) : 398-404
new opportunities for accurate prediction and risk prediction models. These models will be trained,
assessment. validated, and evaluated using appropriate
Machine learning algorithms have demonstrated great performance metrics to ensure accuracy and
potential in the healthcare domain, enabling reliability.
predictive models to extract valuable insights from The developed GUI will provide an interactive
complex and diverse datasets. These algorithms can platform where users can easily input their health
analyze large volumes of patient data, including information, initiate the prediction process, and
demographic information, medical history, laboratory obtain personalized risk assessments for heart disease.
results, and diagnostic tests, to identify patterns and The system will utilize the trained machine learning
relationships that may contribute to heart disease. By models to analyze the input data and present the
leveraging these patterns, machine learning models results in a clear and understandable format, allowing
can effectively predict the likelihood of heart disease users to make informed decisions and take necessary
and help healthcare professionals make informed preventive measures.
decisions.
While machine learning models have shown II. LITERATURE SURVEY
promising results in heart disease prediction, their
adoption and practical implementation can be In recent years, numerous experiments and research
challenging for healthcare professionals and have been conducted in the field of medical science
individuals without specialized knowledge in and machine learning, leading to the publication of
programming or data analysis. Therefore, the significant papers.
development of user-friendly interfaces, such as [1]A paper entitled "Prediction of Heart Disease using
graphical user interfaces (GUIs), can bridge this gap Machine Learning Algorithms" was proposed by
and facilitate the utilization of machine learning Aadar Pandita and Sarita Yadav. They used Several
algorithms in real-world scenarios. Machine Learning algorithms such as Naïve Bayes, K-
In this research paper, we propose a GUI-based Nearest Neighbor, Decision Tree and Random Forest
approach for heart disease prediction using machine are correlated to find the most precise model. In this
learning techniques. Our objective is to create an project the algorithm is given with input variables
intuitive and accessible interface that allows and actual output obtained then algorithm compares
healthcare professionals and individuals to input their between the actual and predicted output to identify
health data easily and obtain reliable predictions errors and modifies the model precisely. The heart
regarding their heart disease risk. The GUI will serve disease database is from the UCI repository. The
as a user-friendly tool that leverages the power of accuracy of their model is 0.9907. And they conclude
machine learning algorithms to provide accurate risk it with the statement that with the help of KNN
assessments and support proactive management algorithm we can know the patients who are suffering
strategies. from heart diseases.
To achieve this, we will employ a diverse dataset [2] M.Snehith Raja, M. Anurag, Ch. Prachetan Reddy
encompassing various clinical features, including age, proposed a paper “Machine learning based heart
gender, blood pressure, cholesterol levels, and other disease prediction system”. The study relies on the
relevant parameters. Several state-of-the-art machine analysis of data collected in SAQ. They used Several
learning algorithms, such as logistic regression, Machine Learning algorithms such as KNN, Random
support vector machines, random forests, and Forest, K-mean clustering, Decision Trees. The
artificial neural networks, will be employed to build algorithm constructs N of Decision trees and outputs
International Journal of Scientific Research in Science and Technology (www.ijsrst.com) | Volume 10 | Issue3 399
Prof. Sachin Sambhaji Patil et al Int J Sci Res Sci & Technol. May-June-2023, 10 (3) : 398-404
the class that is the average of all decision trees combination of predictors to a probability value
output.So, accuracy of prediction at early stages was between 0 and 1.
achieved effectively. kNN- K-nearest neighbors is a non-parametric
[3] Aditi Gavhane, Gouthami Kokkula, Isha Pandya, classification algorithm that determines the class of a
Prof. Kailas Devadkar proposed a paper titled new instance based on its nearest neighbors in the
"Prediction of Heart Disease Using Machine Learning" training dataset.The algorithm assigns a class label to a
with the aim of developing an application capable of test instance by majority voting among its K nearest
predicting heart disease vulnerability based on basic neighbor.The value of K determines the number of
symptoms. They used ANN, multi-layer perceptron neighbors considered for the classification decision.
(MLP) to train and test the dataset. The system was Decision tree- Decision trees are hierarchical
developed using python code using PyCharm IDE. structures that make decisions based on a set of
With the help of python library sci-kit learn, the conditions or rules. The tree is constructed by
system was implemented successfully. recursively partitioning the data based on the most
[4] Proposed by Baban U. Rindhe, Nikita Ahire, informative features. At each node, a feature and a
Rupali Patil, Shweta Gagare, and Manisha Darade, the threshold value are selected to split the data, creating
paper titled "Heart Disease Prediction Using Machine branches.
Learning" was put forward.They used some machine Random forest- Random Forest is an ensemble
learning algorithms such as Artificial Neural Network learning method that combines multiple decision
(ANN), Random Forest, and Support Vector Machine trees. It creates a collection of decision trees by
(SVM). This project aims to know whether the patient randomly sampling the training dataset with
has heart disease or not. The records in the dataset are replacement (bootstrapping) and selecting random
divided into the training set and test sets. After pre- subsets of features. For prediction, the final outcome
processing the data. The data classification technique is determined by aggregating the predictions of all
namely supports vector machine, artificial neural individual decision trees (e.g., voting or averaging).
network, random forest was applied. In the project,
proper data processing was employed for the analysis IV.SYSTEM ARCHITECTURE
of the heart disease patient dataset. The three models
were trained and tested, achieving the following
maximum scores: Support Vector Classifier with
84.0%, Neural Network with 83.5%, and Random
Forest Classifier with 80.0%.
III.METHODOLOGY
International Journal of Scientific Research in Science and Technology (www.ijsrst.com) | Volume 10 | Issue3 400
Prof. Sachin Sambhaji Patil et al Int J Sci Res Sci & Technol. May-June-2023, 10 (3) : 398-404
4. Implementing Algorithms
The next step is to implement machine learning
algorithms to predict whether a patient has heart
disease or not. In this case, decision tree, random
Fig 2.Systemarchitecture of the model forest, logistic regression, and K-nearest neighbor
(KNN) are the algorithms of interest. Each algorithm
1. Disease Dataset Collection is trained using the train dataset and evaluated using
The first step in designing a system for heart disease the test dataset. The accuracy of each algorithm is
diagnosis is to collect a dataset of patients with the compared, and the algorithm with the highest
disease of interest, in this case, heart disease. The accuracy is selected for further evaluation.
dataset contains a variety of patient attributes, such as
age, gender, blood pressure, cholesterol levels, 5. Predicted Output
medical history, and other relevant factors. The The final step is to predict whether a patient has heart
dataset can be collected from medical records, disease or not based on their attributes using the
hospitals, or generated through clinical trials or selected algorithm. The algorithm takes the patient
surveys. The larger the dataset, the better the attributes as input and outputs a probability value or a
accuracy of the algorithms, as the algorithms can binary value indicating the presence or absence of
learn more from a larger dataset. heart disease. The predicted output is further
The dataset used in this system was downloaded from evaluated using metrics such as accuracy, precision,
Kaggle: recall, and F1 score to determine the performance of
https://www.kaggle.com/datasets/johnsmith88/heart- the algorithm.
disease-dataset A graphical user interface (GUI) is designed to detect
heart disease by incorporating various machine
2. Data Preprocessing learning models that analyze patient data. In this
After the dataset has been collected, the next step is to system, GUI was designed using a python module
pre-process the data. Pre-processing involves cleaning “tkinter”. After entering the required medical risk
the data, dealing with missing values, and normalizing factors, it provides the output by showing “Possibility
or standardizing the data to make it easier for the of heart disease” or “No Heart Disease”.
algorithms to process.
V. RESULT
3. Train Dataset and Test Dataset Split
The next step is to split the dataset into a train dataset The accuracies obtained by the algorithms after
and a test dataset. The train dataset is used to train the implementing the model are as follows:
International Journal of Scientific Research in Science and Technology (www.ijsrst.com) | Volume 10 | Issue3 401
Prof. Sachin Sambhaji Patil et al Int J Sci Res Sci & Technol. May-June-2023, 10 (3) : 398-404
VI.DISCUSSION
Fig 2.Accuracies obtained bythealgorithms
The use of machine learning techniques for heart
disease prediction, coupled with a user-friendly
graphical user interface (GUI), has significant
implications for improving healthcare outcomes and
empowering individuals in managing their heart
health. This study proposed a comprehensive
approach to heart disease prediction, involving data
collection, data preparation, model selection, model
training, and model evaluation.
Overall, this study demonstrated the potential of
machine learning techniques in accurate heart disease
prediction. The selection of appropriate models, such
as the Random Forest, Logistic Regression, Decision
Tree, and KNN algorithms, allowed for robust and
reliable predictions. The incorporation of a user-
friendly GUI enabled seamless interaction and
accessibility, promoting individual awareness and
proactive heart health management.These findings
Fig 3. GUI result when no heart disease hold significant implications for healthcare
professionals, individuals, and healthcare systems at
International Journal of Scientific Research in Science and Technology (www.ijsrst.com) | Volume 10 | Issue3 402
Prof. Sachin Sambhaji Patil et al Int J Sci Res Sci & Technol. May-June-2023, 10 (3) : 398-404
International Journal of Scientific Research in Science and Technology (www.ijsrst.com) | Volume 10 | Issue3 403
Prof. Sachin Sambhaji Patil et al Int J Sci Res Sci & Technol. May-June-2023, 10 (3) : 398-404
International Journal of Scientific Research in Science and Technology (www.ijsrst.com) | Volume 10 | Issue3 404