You are on page 1of 12

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/342065406

Prediction of Heart Disease Using Feature Selection and Random Forest


Ensemble Method

Article in International Journal for Pharmaceutical Research Scholars · June 2020


DOI: 10.31838/ijpr/2020.12.04.013

CITATIONS READS

22 2,133

2 authors:

Dhyan Chandra Yadav Saurabh Pal


Veer Bahadur Singh Purvanchal University Veer Bahadur Singh Purvanchal University
21 PUBLICATIONS 118 CITATIONS 98 PUBLICATIONS 2,831 CITATIONS

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Application of Data Mining Techniques with special reference to academic Performance Monitoring and evaluation in higher education View project

Prediction of Presence of Breast Cancer Disease in the Patient using Machine Learning Algorithms and SFS View project

All content following this page was uploaded by Saurabh Pal on 10 June 2020.

The user has requested enhancement of the downloaded file.


ISSN 0975-2366
DOI:https://doi.org/10.31838/ijpr/2020.12.04.013
Research Article

Prediction of Heart Disease Using Feature Selection and


Random Forest Ensemble Method
DHYAN CHANDRA YADAV, SAURABH PAL
VBS Purvanchal University, Jaunpur, India
Email ID: dc9532105114@gmail.com, drsaurabhpal@yahoo.co.in
Received: 12.03.19, Revised: 28.05.20, Accepted: 02.06.20

ABSTRACT
The heart is very soft and sensitive part of body by which brain handles blood related system in body. The
heart disease that greatly affects in body as like: pulmonary artery, atalata, enzaina and birth defects included.
Heart disease is mainly related to contraction or blocked blood vessels in the heart. The symptoms of heart
disease depend on the type of disease. Heart disease occurs not only in adults but also in children. The
infection affecting the tissues is known as percarditis. In this, the tissues closest to the heart are affected.
Infections affecting the lining of the heart muscle are known as myocardium .The study of medical datasets is
made very intuitive by machine learning algorithms. The machine learning algorithms provide techniques to
identify dataset attributes and the relationship between them.
In this research work, we used heart disease related information from UCI repository. The dataset contained
1025 Instances with 14 attributes, sick and nonstick patients in target variable. In this paper, we proposed and
analyzed classification accuracy, precision and sensitivity by four tree based classification algorithms: M5P,
random Tree and Reduced Error Pruning with Random forest ensemble method. All the prediction based
algorithms have applied after the features selection of heart patient’s dataset. In this paper, we used three
features based algorithms: Pearson Correlation, Recursive Features Elimination and Lasso Regularization. The
data table analyzed by different feature selection methods for better prediction. All the analysis is done by
three experimental setup; First experiment applied Pearson Correlation on M5P, random Tree, Reduced Error
Pruning and Random forest ensemble method. In the second experiment we used Recursive Features
Elimination and applied on above four tree based algorithms. In the third experiment we used Lasso
Regularization and applied on as above tree based algorithms. After all the performance we analyzed and
calculated classification accuracy, precision and sensitivity.
With the results, we finally concluded that feature selection methods Pearson correlation and Lasso
Regularization with random forest ensemble method provide better results 99% accuracy. We analyzed and
find the random forest ensemble method predicted better result compare to other algorithms in the previous
year’s works.
Keywords: Data mining Tree based Algorithms, Random Forest Ensemble Method, Features Relevant
Method, Features Elimination Method Lasso Regularization Method and Heart Disease.

INTRODUCTION
Research is going on, in large research Cardiovascular data are available in very large
institutions to ensure factors related to heart quantities in healthcare. Due to the large amount
disease. In some institutions, smoking, age, of data, it becomes very difficult to study it in
high/low blood pressure, obesity, diabetes and general. But with the help of data mining, large
lack of exercise have been included as main collections are easily converted into information.
factors for diseases. According to the instructions Which shows how the condition of heart disease
of the researchers, it is considered helpful to has been in children and adults in the past years
identify the disease related to heart disease. Heart and its study also helps in estimating how to
disease is also revealed due to blockage in the reduce the mortality caused by cardiovascular
blood vessels, which later expresses the possibility diseases in the future. Machine learning
of heart attack, chest pain or stroke. Valve and algorithms can improve the treatment of a person
heart muscles are mainly affected in heart suffering from the disease by comparing its
disease. The level of mortality among the world factors.
population by heart disease is quite large.

56| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

Fig.1: Representation of blockage in heart .https://images.app.goo.gl/sSdy8qxDpni7fFTj6

Some of the symptoms of heart disease are as In this paper, we predict various heart diseases by
follows: variety of feature selection algorithms, applied on
 Heart tightness, pressure and pain. tree based machine learning algorithms. Machine
 Chest arms or neck jaw and back pain. learning algorithms provide correlations between
Heart attacks are as follows: various related attributes.
 Having a dizzy head.
 Face turning brown. RELATED WORKS
 Restlessness. Cai et al., [2020], discussed about heart
 Trouble breathing, etc. arrhythmia and 12 lead electro cardiogram. They
Heart diseases that are not easily understood used one dimensional deep densely connected
like: neural network to detect artial fibrillation. Authors
 Arrhythmias: Heart beat to be irregular. found accuracy, sensitivity and specificity (99.35) ,
 Cardiogenic: Shock in person to properly do (99.19) and (99.44) respectively the results on test
not get the blood that person's blood dataset [2].
pressure suddenly collapsed. Buettner et al., [2020], considered
 Hypoxemia: There is much difficulty in electroencephalography recording of heart
breathing due to lack of oxygen in the blood. patients. Authors explained five granular divisions
 Pulmonary Edema: Pulmonary edema of EEG spectra by machine learning classifiers.
involves the accumulation of fluid in or They used Random Forest algorithm to make a
around the lungs of a heart patient. balance between paranoid schizophrenic and
non- schizophrenic persons with (96.77) percent
 DVT or deep win thrombosis: Due to an
classification accuracy [3].
excess of blood clots in the veins obstructing
Magesh and Swarnalatha [2020] analyzed about
the blood flow.
cardiovascular ailment centers in ruler side. They
 Mycordial rupture: In this, damage the wall
found some risk factors or illness in coronary
of heart, of heart patients, which indicates a
disease by smoking. Authors examined target
major danger.
level distribution from samples and identify
 Ventricular aneurysm: A bulge in the heart
features through entropy. They used Random
chamber of the afflicted person, causing
Forest in the prediction of heart disease and
difficulty in breathing with blood flow [1].
found accuracy (89.30) percent with cluster based

57| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

DT learning and (76.70) percent without cluster algorithms in three EMR dataset and find (77.43)
based DT learning [4]. percent accuracy [8].
Shen et al., [2020], discussed about a trial
fibrillation arrhythmia. They used neural network METHODOLOGY
and manual extraction features on the prediction In this phase, we have described the heart
of a trial fibrillation. Authors used decision tree, patient’s attributes and applied algorithms. We
Random Forest, GBDT, XG Boost, LightGBM and visualized all the attributes measured their
find (99.91) percent accuracy by stacking model distribution and considered applied algorithms
[5] . with experimental setup.
Kar et al., [2020], observed the condition of heart Data Description:
of a patient by electrocardiogram signal. They In this paper, we organized dataset from
analyzed ECG, signal by continues and discrete recorded UCI website. The dataset is related with
wavelet transforms. Authors used time interval, heart patients and measure the distribution of
statistical features and classify irregular heart disease patient attributes. The class
heartbeats. They calculated K-NN, DT-CWT distribution, box whisker plotting and visualizing
features and find (98.92) percent classification of dataset have discussed by Python language. In
accuracy [6]. this dataset, we used 1025 instances and 14
Harimoorthy and Thangavelu [2020], discussed attributes.
about hidden pattern in chronic kidney disease. Class Distribution
They reduced some features from chronic kidney target
disease and improved in SVM Redial biaskernal. 0 499
Authors compared SVMRBK with (SVM-Linear, 1 526
SVM-Polynomial, Random forest and Decision dtype: int64
Tree) and find improvement in accuracy of SVM- The class level distribution of dataset represents
RBK (98.3) percent, (98.7) percent AND (89.9) how much TRUE /FALSE positive in the target
percent [7]. variable.
Miled et al., [2020], analyzed electronic medical Box and Whisker Plots
record of diagnosis, prescriptions and medical Each attributes and their numeric values provide
notes. They used machine learning algorithms to help in disease prediction. By the help of box and
identify dementia and non dementia cases and whisker, we have implemented the heart disease
predict the fact. They developed Random forest attributes in brief and measured each attributes
distribution [9].

Fig.2: Representation of Box and Whisker plotting of heart disease attributes

58| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

Histograms Representation
The histograms are graphical representation of used heart disease 14 attributes. Each attributes
each attributes separately in the graph and represents their valuable representation in whole
measure their visualization [10]. In this paper, we dataset.

Fig.3: Representation of Histogram plotting of heart disease attributes


Algorithms:
M5P algorithm

Fig.4: Representation of M5P algorithms for 40 instances and 14 attributes of heart disease

In this paper, we used this model for numeric node due to prediction [11]. For example we
prediction with the results; at the leaf find the considered some instances for heart disease and
class values of instances. The work of this their performance.
algorithm as an expert to search on node, each

59| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

Random Tree Algorithm

Fig.5: Random Tree algorithms for 40 instances and 14 attributes of heart disease

Random tree algorithm used for randomly performance of class predictions with their
selection of attributes in decision node [12]. The probability and try to improve prediction
main work of this algorithm to measure the performance at each node.

Reduced Error Pruning

Fig.6: Representation of REP algorithms for 40 instances and 14 attributes of heart disease
The performance of reduced error pruning is folds=3 and seed=1 for fast learner on each
based on C4.5 algorithms [13]. In this experiment node. The main objective of algorithm to reduce
we used, batch size=100, max Depth=-1, error pruning on each node of the tree.
minimum variance probability=0.001 num

Formula Representation:
Table 1: Computational Formula for Prediction [14]
S.No. Measure Formula
1. Accuracy (TP+TN)/(TP+TN+FP+FN)
2. Sensitivity (TP)/(TP+TN)
3. Specificity (TN)/(TN+FP)

60| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

Proposed Ensemble Method: Ensemble method. After the features selection


In this research paper, we used random forest as trained on (75%) dataset and the test on (25%)
an ensemble method. The Random Forest is a with tree algorithms with ensemble method. The
powerful decision making tree ensemble method final prediction has measured by average voting
[15]. The main property of this algorithm is to algorithms. In this experiment, we used bag size=
select decision randomly from other tree. In this 100%, batch size==100 and seed = 1 for better
paper, we used M5P tree, Random tree and Error prediction.
Reduced Pruning tree with Random Forest

Fig.7: Proposed Model of Random Forest algorithm as a ensemble model

RESULTS  The Recursive Features selection method


In this paper, we used various features selection applied with Random Forest and calculated
method and applied on various machine learning (94.12%) accuracy.
algorithms for better prediction.  Lasso Regularization by lassoCV() calculated:
 Pearson correlation with output variables, find  Best alpha= 0.0048, Best score =.51
score of some features: cp, exang, oldpeak  In the performance Lasso Model avoid some
and target (.43), (.43), (.43) and 1.00 features: fbs, chol and age
respectively, these are highly correlated  Lasso Regularization with Random Forest
features. ensemble method finds (99.9%) accuracy.
 The Pearson correlation features selection
method with Random Forest Ensemble DISCUSSION
method calculated (99.9%) accuracy. In this section, we discussed about all feature
 The Recursive Features selection method selection performance with machine learning
provide optimal number of features:12 and algorithms:
the score with 12 features: 0.54

61| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

Fig.8: Representation of Pearson Correlation for heart disease attributes


The matrix has an absolute value (0.3) with the Correlation matrix with output variable and select
output variable and gives the results for highly the highly correlated features as:
correlate attributes [16]. We used Pearson

Table 2: Valuable Score with Features of Pearson Correlation


Features Correlation
cp 0.434854
thalach 0.422895
exang 0.438029
oldpeak 0.438441
slope 0.345512
ca 0.382085
thal 0.337838
target 1.000000
Name: target, dtype: float64
Table 3: Measure Prediction Performance for With/ Without PC by Tree Classifiers
Algorithms FSM (Without PC) FSM (With PC)
Specificity Sensitivity Accuracy Specificity Sensitivity Accuracy
M5PT 41.2 94.5 89.3 83.1 91.2 93.4
RT 42.7 95.7 91.2 92.6 95.3 95.2
REPT 50.6 95.8 91.5 89.7 96.6 96.6
RFT 62.3 95.1 93.8 90.3 99.6 99.9

For the table.3, it is clear that PC= Pearson [False True True False False False False False
Correlation feature selection on RFT = Random True True True True True]
Forest calculated highest accuracy and sensitivity. [3 1 1 6 7 4 2 5 1 1 1 1 1]
We Initialized Recursive Features Elimination Recursively remaining heart attributes and
model for fitting the data to model and find the building a model on those heart attributes remain
result as: in table. All the True are most relevant features in
dataset and False are irrelevant features [17] .

62| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

So calculate no of features with variable to store By the experiment find the transforming data
the optimum features as: using RFE and fitting the data to model as :
Optimum number of features: 12; Score with 12 Index(['age', 'sex', 'cp', 'trestbps', 'fbs', 'restecg',
features: 0.541462 'thalach', 'exang', 'oldpeak', 'slope', 'ca', 'thal'],
dtype='object')
Table 4: Measure Prediction Performance for With/ Without RFE by Tree Classifiers
Algorithms FSM (Without RFE) FSM (With RFE)
Specificity Sensitivity Accuracy Specificity Sensitivity Accuracy
M5PT 52.4 83.7 74.8 92.3 82.5 80.6
RT 53.3 84.2 80.3 81.7 84.7 86.3
REPT 41.2 84.3 85.8 78.6 84.8 85.6
RFT 51.1 84.8 87.9 91.6 98.8 98.2

For the table.4, it is clear that RFE= Recursive Best alpha using built-in LassoCV: 0.004860
Features Elimination on RFT = Random Forest Best score using built-in LassoCV: 0.513496
calculated highest accuracy and sensitivity.
Text(0.5, 1.0, 'Feature importance using Lasso
In the Lasso regularization model, we used CV Model') and reduced some less important as: fbs,
based function for better feature importance[18] . chol and age
reg = LassoCV()

Fig.9: Lasso regularization model for features selection


If the features are irrelevant then Lasso penalizes, age are penalized. The last top and bottom
with the results, we find features: fbs, chol and features are highly related with each other.
Table 5: Measure Prediction Performance for with / Without Lasso regularization by Tree
Classifiers
Algorithms Without LRM With LRM
Specificity Sensitivity Accuracy Specificity Sensitivity Accuracy
M5PT 63.5 72.2 83.8 75.2 73.2 89.8
RT 31.7 78.3 83.7 79.5 79.8 91.9
REPT 61.5 79.4 78.7 91.5 63.2 79.4
RFT 73.5 76.8 88.9 91.3 97.1 99.9

63| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

For the table.5, it is clear that LRM= Lasso regularization model on RFT = Random Forest calculated
highest accuracy and sensitivity.

Fig.10: Representation of Accuracy by Tree based Algorithms in heat disease


With the results, table.3, 4 &5 represents the obtained experiments and we find that the highest
comparison of other M5P, RT and REPT accuracy and Sensitivity of Random Forest
algorithms. Fig.10., represents the above all the ensemble method.
Table 6: Representation of Previous Year Paper Accuracy Score
Authors Instances Algorithms Accuracy
Hui et al.,[2012] [19] 9800 SRBC, ICA & RR 98.35
Martis et ai., [2013][20] 110,094 ICA, DWT & PNN 99.28
Ince et al.,[2015][21] 100,389 CNN & BP 98.90
Naomin et al.,[2016][22] 110,094 NN, SVM & PCA 98.90
Hua et al., [2017][23] 90808 SVM & Weighted RR 98.46
Oh et al.,[2017][24] 109949 CNN 94.47
Yildirim et al.,[2018][25] 7376 DBLSTM 99.39
Yildirim et al., [2019][26] 100,022 CAE-LSTM 99.23
Haotien et al.,[2020][27] 100630 CNN 99.06

We have studies near 2012 -2020 and find the the value of classification accuracy, precision,
highest accuracy near about (99%). In the work, sensitivity and ROC. We have used UCI
we have compared different algorithms Repository dataset for 1025 instances and 14
individually but did not cover (100%) accuracy. In attributes. In this research, we Identify whether a
this research work, we have tried to test with person is suffering from heart problem are not in
different features selection method applied on heart disease machine learning algorithms
different machine learning tree classifiers provide various way to implement the medical
algorithms and finally find Random Forest data set. In this research work, the important
ensemble method provide better result (99.9%) features were identified by Pearson correlation,
accuracy. Recursive Features Elimination and Lasso
Regularization with the selected important
CONCLUSION features we examine with improved, Random
In this research paper, we used Pearson Forest, Random Tree, Reduced Error Pruning and
Correlation, Recursive Features Elimination and M5P classifiers algorithms in heart disease. With
Lasso Regularization, features selection methods the results, we find that improved Random Forest
and applied on Machine learning tree based ensemble method with batch size (100) and seed
classifiers algorithms: M5P, Random Tree and (1) provide batter accuracy compare to other.
Reduced Error Pruning with Random Forest Since this work is based on recorded data from
ensemble method. In this analysis, we evaluated UCI repository, for future work planning , we will

64| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

try train and test on huge medical data set with In Machine Learning with the Raspberry Pi (pp. 1-
more than one ensemble method and try to 47). Apress, Berkeley, CA.
improvement in their performance. 10. del Rio, A. A. H., Cuevas, E., & Zaldivar, D.
(2020). Multi-level Image Thresholding
Conflict of Interest Segmentation Using 2D Histogram Non-local
Authors have no conflict of Interest. Means and Metaheuristics Algorithms.
Funding In Applications of Hybrid Metaheuristic Algorithms
This study was not funded. for Image Processing (pp. 121-149). Springer,
Cham.
Acknowledgements 11. Mudali, P., Roopa, J., Raju, M. G., & Yadav, A.
The author is grateful to Veer Bahadur Singh (2020). Analysis of Parallel M5P and Random
Purvanchal University Jaunpur, Uttar Pradesh, for Forest Regression for Visualization of Traffic
Providing financial support to work as Post Behavior. In Computational Intelligence in Pattern
Doctoral Research Fellowship. Recognition (pp. 231-241). Springer, Singapore.
12. Nachmias, A. (2020). Uniform Spanning Trees of
REFERENCES Planar Graphs. In Planar Maps, Random Walks and
1. Lui, C. K., Kerr, W. C., Li, L., Mulia, N., Ye, Y., Circle Packing (pp. 89-103). Springer, Cham.
Williams, E., ... & Lown, E. A. (2020). Lifecourse 13. Thomas, T., Vijayaraghavan, A. P., & Emmanuel,
Drinking Patterns, Hypertension, and Heart S. (2020). Applications of Decision Trees.
Problems Among US Adults. American Journal of In Machine Learning Approaches in Cyber Security
Preventive Medicine. Analytics (pp. 157-184). Springer, Singapore.
2. Cai, W., Chen, Y., Guo, J., Han, B., Shi, Y., Ji, L., 14. Shehab, M. (2019). Artificial Intelligence in Diffusion
... & Luo, J. (2020). Accurate detection of atrial MRI: Enhanced Cuckoo Search Algorithm with
fibrillation from 12-lead ECG using deep neural Metaheuristic Components for Extracting the
network. Computers in biology and medicine, 116, Maxima of the Orientation Distribution
103378. Function (Vol. 877). Springer Nature.
3. Buettner, R., Beil, D., Scholtz, S., & Djemai, A. 15. Sniatala, P., Amini, M. H., & Boroojeni, K. G.
(2020, January). Development of a machine (2020). A Novel Fault Tolerant Random Forest
learning based algorithm to accurately detect Model Using Brooks–Iyengar Fusion.
schizophrenia based on one-minute EEG In Fundamentals of Brooks–Iyengar Distributed
recordings. In Proceedings of the 53rd Hawaii Sensing Algorithm (pp. 159-165). Springer, Cham.
International Conference on System Sciences. 16. Jain, G., Mahara, T., & Tripathi, K. N. (2020). A
4. Magesh, G., & Swarnalatha, P. (2020). Optimal Survey of Similarity Measures for Collaborative
feature selection through a cluster-based DT Filtering-Based Recommender System. In Soft
learning (CDTL) in heart disease Computing: Theories and Applications (pp. 343-
prediction. Evolutionary Intelligence, 1-11. 352). Springer, Singapore.
5. Shen, M., Zhang, L., Luo, X., & Xu, J. (2020, 17. Kumari, P., & Haider, M. T. U. (2020). Sentiment
January). Atrial Fibrillation Detection Algorithm Analysis on Aadhaar for Twitter Data—A Hybrid
Based on Manual Extraction Features and Classification Approach. In Proceeding of
Automatic Extraction Features. In IOP Conference International Conference on Computational Science
Series: Earth and Environmental Science (Vol. 428, and Applications (pp. 309-318). Springer,
No. 1, p. 012050). IOP Publishing. Singapore.
6. Kar, N., Sahu, B., Sabut, S., & Sahoo, S. (2020). 18. Chen, Q., & Huang, L. (2020). Research on
Effective ECG Beat Classification and Decision Prediction Model of Gas Emission Based on
Support System Using Dual-Tree Complex Lasso Penalty Regression Algorithm. In Artificial
Wavelet Transform. In Advances in Intelligent Intelligence in China (pp. 165-172). Springer,
Computing and Communication (pp. 366-374). Singapore.
Springer, Singapore. 19. Huang, H. F., Hu, G. S., & Zhu, L. (2012). Sparse
7. Harimoorthy, K., & Thangavelu, M. (2020). Multi- representation-based heartbeat classification
disease prediction model using improved SVM- using independent component analysis. Journal of
radial bias technique in healthcare monitoring medical systems, 36(3), 1235-1247.
system. Journal of Ambient Intelligence and 20. Martis, R. J., Acharya, U. R., & Min, L. C. (2013).
Humanized Computing, 1-9. ECG beat classification using PCA, LDA, ICA and
8. Miled, Z. B., Haas, K., Black, C. M., Khandker, R. discrete wavelet transform. Biomedical Signal
K., Chandrasekaran, V., Lipton, R., & Boustani, M. Processing and Control, 8(5), 437-448.
A. (2020). Predicting dementia with routine care 21. Kiranyaz, S., Ince, T., & Gabbouj, M. (2015). Real-
EMR data. Artificial Intelligence in Medicine, 102, time patient-specific ECG classification by 1-D
101771. convolutional neural networks. IEEE Transactions
9. Norris, D. J. (2020). Introduction to machine on Biomedical Engineering, 63(3), 664-675.
learning (ML) with the Raspberry Pi (RasPi).

65| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4
Dhyan Chandra Yadav et al / Prediction of Heart Disease Using Feature Selection and Random Forest
Ensemble Method

22. Elhaj, F. A., Salim, N., Harris, A. R., Swee, T. T., &
Ahmed, T. (2016). Arrhythmia recognition and
classification using combined linear and nonlinear
features of ECG signals. Computer methods and
programs in biomedicine, 127, 52-63.
23. Chen, S., Hua, W., Li, Z., Li, J., & Gao, X. (2017).
Heartbeat classification using projected and
dynamic features of ECG signal. Biomedical Signal
Processing and Control, 31, 165-173.
24. Acharya, U. R., Oh, S. L., Hagiwara, Y., Tan, J. H.,
Adam, M., Gertych, A., & San Tan, R. (2017). A
deep convolutional neural network model to
classify heartbeats. Computers in biology and
medicine, 89, 389-396.
25. Yildirim, Ö. (2018). A novel wavelet sequence
based on deep bidirectional LSTM network
model for ECG signal classification. Computers in
biology and medicine, 96, 189-202.
26. Yildirim, O., Baloglu, U. B., Tan, R. S., Ciaccio, E.
J., & Acharya, U. R. (2019). A new approach for
arrhythmia classification using deep coded
features and LSTM networks. Computer methods
and programs in biomedicine, 176, 121-133.
27. Wang, H., Shi, H., Chen, X., Zhao, L., Huang, Y.,
& Liu, C. (2020). An Improved Convolutional
Neural Network Based Approach for Automated
Heartbeat Classification. Journal of Medical
Systems, 44(2), 35.

66| International Journal of Pharmaceutical Research | Oct - Dec 2020 | Vol 12 | Issue 4

View publication stats

You might also like