Professional Documents
Culture Documents
4th Pooja Jha 5th Laxmi Kumari Pathak 6th Mohan Kumar Dehury
Amity Inst. of Information Technology Amity Inst. of Information Technology Amity Inst. of Information Technology
Amity University Jharkhand Amity University Jharkhand Amity University Jharkhand
Ranchi, INDIA Ranchi, INDIA Ranchi, INDIA
pjha@rnc.amity.edu lkpathak@rnc.amity.edu mkdehury@rnc.amity.edu
mohankdehury@gmail.com
Abstract—Parkinson’s disease, which affects 2-3% of people microphonic, singular-chromatic, dysarthria, and dysphonia,
under 65 years of age, is the second most prevalent neurological and that the first symptom discovered in persons with this
ailment. The neuropathological indicators of Parkinson’s disease condition is a lack of voice. There is now no documented
are intracellular inclusions containing clumps of synuclein and
neuronal death in the substantia nigra, which results in striatal therapy for the condition, but a range of pharmaceutical
dopamine insufficiency. The central and peripheral autonomic medications that can greatly lessen symptoms, especially
nerve systems contain numerous more cell types that are also in the initial stages, are available. The analysis of voice
involved, most likely beginning with the first stages of the disease. frequency is brief and not invasive. As an outcome, voice
Although bradykinesia and other essential motor characteristics frequency can be utilized to monitor the course of this
are required for clinical diagnosis, Parkinson’s disease is also ac-
companied by a wide range of non-motor symptoms that increase subjective sickness [1].
overall disability. Numerous routes and mechanisms, including
synuclein proteostasis, mitochondrial function, oxidative stress, Parkinson’s disease causes additional problems, some of
calcium homeostasis, axonal transport, and neuroinflammation, which are manageable with medication or other forms of
are involved in the underlying molecular etiology. This paper therapy. Parkinson’s disease is a major global health concern
aims to implement the SVM algorithm to predict Parkinson’s
disease in a patient. that might be greatly reduced with the aid of machine learning
Index Terms—Parkinson’s Disease, SVM, KNN, ANN, Machine and innovative computer algorithms in the field of medical
Learning. science.
brain neuron degeneration. Only Alzheimer’s disease is more expectancy increases. For decades, researchers have worked to
common than Parkinson’s disease among neurodegenerative understand much about this disease, and hence to find some
diseases. A majority of the neurons that produce dopamine techniques for successfully minimizing its symptoms, which
are affected in the substantia nigra, a specific portion of are often recurrent muscular tremors and/or rigidity. In the
the brain. Parkinson’s disease causes the loss of dopamine- later stages of Parkinson’s syndrome, further symptoms such
producing neurons, resulting in the inability to control any as akinesia, bradykinesia, and speech impairment may arise.
voluntary activities. This disorder can cause both motor and Symptoms usually begin modestly and worsen over time. As
non-motor symptoms, such as tremors, sluggish movement, the disease progresses, people may have difficulty speaking
sleep problems, posture problems, sadness, and others [2]. and walking. They may also encounter behavioral and mental
problems, sleep troubles, depression, memory loss, and fatigue
A number of machine learning models, including SVM, [5].
KNN, ANN, Random Forest technique, etc., are being
presented and have assisted in the early detection of D. Solution
Parkinson’s disease. Given that symptomatic therapy may Parkinson’s disease is a major global health concern that
be ineffective and time-consuming, a neuro-image-based might be greatly reduced with the aid of machine learning and
diagnosis for Parkinson’s disease may also be acceptable [3]. innovative computer algorithms in the field of medical science.
In addition to other medical scans such as Positron Emission We can develop algorithms like a support vector classifier or
Tomography (PET), Magnetic Resonance Imaging (MRI), a machine to detect it at an early stage.
Functional Magnetic Resonance Imaging (fMRI), and others,
Single-photon Emission Computed Tomography (SPECT) III. OBJECTIVE AND GOAL
is the functional imaging technique most commonly used The goal of this work is to compare different existing
in European clinics for the early diagnosis of Parkinson’s machine learning models for detecting Parkinson’s disease and
disease [4]. to implement our model to identify the same.
A. Goal
Clinical decisions are often based on a physician’s intuition
and knowledge, which is supported by hospital-wide data. To detect Parkinson’s disease using vocal features.
The enormous dimensions of information in data sets require
B. Specific Objectives
the discovery of approaches that can be used in therapeutic
choices. This data should be assessed for medical research The objectives are:
and use in health centers. • To compare the different machine-learning models
• To analyze their accuracy score
B. Background of the Study • To implement our SVM model
The brain is the main functioning unit of the body, and any
small incident in any part of the human body will have an IV. LITERATURE REVIEW
immediate impact on the other organs. Parkinson’s disease is Many authors have studied Parkinson’s disease prediction
an inoperable neurological condition that worsens over time. using Machine learning methods. Paper [2] provides a
By 2020, an estimated 9.4 million people worldwide were complete survey of the most recent research papers published
still living with this condition. Only 4% of the cases occur in up to the year 2017. Based on vocal datasets obtained from the
those under the age of 50 years of age. This condition mainly UCI repository, it compares the accuracy of existing classifiers
affects people over the age of 60 years. The symptoms of this and validates the performance of implemented classifiers. In
disease are classified as motor or non-motor. The slowness of [6], authors have reviewed and addressed numerous issues
movement, tremors, rapid eye movement disorder, shivering, while also providing some future recommendations and
gait difficulty, and unstable posture are the most common opportunities. In particular, this review offers significant
motor symptoms. insights and guidance for future advancements regarding
neural networks and associated learning systems.
Currently, there is no documented therapy for the condition,
but a variety of pharmaceutical medications are available that Authors in [7] have characterized this disorder by tremors,
can greatly reduce symptoms, especially in the initial stages. muscle rigidity, and incorrect walking motions. They have
The analysis of voice frequency is brief and not invasive. As studied previous attempts to distinguish Parkinson’s disease
an outcome, voice frequency can be utilized to monitor the from healthy subjects, but this work focused on differentiating
course of this subjective sickness [3]. Parkinson’s disease from other neurological-specific diseases
like Huntington’s Disease and Amyotrophic lateral sclerosis
C. The Problem (ALS) based on gait characteristics. The authors in [4]
The number of patients with Parkinson’s disease is calcu- suggested an ML model for a given DaTSCAN that is
lated to be 120-180 per 100,000 persons, although the propor- precisely classified as Parkinson’s disease or not as well as
tion (and thus the number of affected people) increases as life offers a reasonable basis for the prediction. Visual indicators
created by Local Interpretable Model-Agnostic Explainer system’s data dimensionality influences both the training and
(LIME) are used to get such reasoning. Transfer learning was run-time phases. Meanwhile, excessive dimensionality may
used to train DaTSCANs on a CNN (VGG16) using data result in dimensionality issues.
from the Parkinson’s Progression Markers Initiative.
Fig. 2. Top rows of the dataset. Fig. 4. Information about missing values.
F. Using drop() command into a specific range. Next, we used scaler.fit() which will fit
In this phase, the data pre-processing has started. We will all the data. The scaler.transform() function will convert the
separate features and targets. Here ‘features’ are the other data in the same range. Now the output range of X train is
columns in the dataset and ‘target’ is the status column. Now in between (-1 to +1) whereas, previously our values of the
two variables X and Y are created. X will take all the features dataset used to lie in hundreds. The obtained output is shown
column and Y will take the status column. In X we will drop in Fig. 9.
the ‘name’ column and the ‘status’ column of our dataset and
we will store it in X as shown in Fig. 7.
TABLE I R EFERENCES
C OMPARISION OF ML MODELS
[1] M. Wodzinski, A. Skalski, D. Hemmerling, J. R. Orozco-Arroyave, and
ML model Accuracy Score E. Nöth, “Deep learning approach to parkinson’s disease detection using
KNN 79% voice recordings and convolutional neural network dedicated to image
ANN 83.02% classification,” in 2019 41st Annual International Conference of the
SVM 87.17% IEEE Engineering in Medicine and Biology Society (EMBC). IEEE,
2019, pp. 717–720.
[2] G. Pahuja and T. Nagabhushan, “A comparative study of existing
machine learning approaches for parkinson’s disease detection,” IETE
Journal of Research, vol. 67, no. 1, pp. 4–14, 2021.
B. ANN [3] M. Ricci, G. Di Lazzaro, A. Pisani, N. B. Mercuri, F. Giannini,
and G. Saggio, “Assessment of motor impairments in early untreated
In recent years, Artificial Neural Network or the (ANN)- parkinson’s disease patients: the wearable electronics impact,” IEEE
based diagnosing of clinical disorders has received loads of at- Journal of Biomedical and Health Informatics, vol. 24, no. 1, pp. 120–
tention. during this paper, 2 forms of ANNs are utilized to clas- 130, 2019.
[4] P. R. Magesh, R. D. Myloth, and R. J. Tom, “An explainable machine
sify effective degenerative disorder diagnosis. The Multilayer learning model for early detection of parkinson’s disease using lime
Perceptron with Precedence Propagation Teaching Method on datscan imagery,” Computers in Biology and Medicine, vol. 126, p.
and Radial Basis Operation (RBF). ANNs were trained to 104041, 2020.
[5] Y. Matsumoto, M. Seki, T. Ando, Y. Kobayashi, H. Iijima, M. Nagaoka,
differentiate between medical parameters in samples (total of and M. G. Fujie, “Analysis of emg signals of patients with essential
195) with and without Parkinson’s disease [17]. An ANN is an tremor focusing on the change of tremor frequency,” in 2012 Annual
attributed chaotic system with a connected graph topology that International Conference of the IEEE Engineering in Medicine and
Biology Society. IEEE, 2012, pp. 2244–2250.
accepts data output through a state reaction to input activities. [6] A. Rana, A. Dumka, R. Singh, M. K. Panda, N. Priyadarshi, and
Nodes of the ANN are processor parts and directed channels. B. Twala, “Imperative role of machine learning algorithm for detec-
The accuracy score obtained using ANN is 83.02% [18]. tion of parkinson’s disease: Review, challenges and recommendations,”
Diagnostics, vol. 12, no. 8, p. 2003, 2022.
[7] S. Shetty and Y. Rao, “Svm based machine learning approach to
identify parkinson’s disease using gait analysis,” in 2016 International
C. SVM Conference on Inventive Computation Technologies (ICICT), vol. 2.
IEEE, 2016, pp. 1–5.
Using the SVM model on the dataset as discussed in the [8] O. Asmae, R. Abdelhadi, C. Bouchaib, S. Sara, and K. Tajeddine,
previous section the accuracy score is obtained to be 87.17% “Parkinson’s disease identification using knn and ann algorithms based
on voice disorder,” in 2020 1st International Conference on Innovative
as shown in Fig. 12. The accuracy score of the employed Research in Applied Science, Engineering and Technology (IRASET).
machine learning models in this paper is presented in table I. IEEE, 2020, pp. 1–6.
[9] F. S. Gharehchopogh and P. Mohammadi, “A case study of parkinson’s
disease diagnosis using artificial neural networks,” International Journal
VIII. CONCLUSION of Computer Applications, vol. 73, no. 19, 2013.
[10] H.-L. Chen, C.-C. Huang, X.-G. Yu, X. Xu, X. Sun, G. Wang, and
S.-J. Wang, “An efficient diagnosis system for detection of parkinson’s
In this paper, we have recognized and implemented three disease using fuzzy k-nearest neighbor approach,” Expert systems with
supervised learning machine-learning algorithms in our applications, vol. 40, no. 1, pp. 263–271, 2013.
[11] B. Kumar, S. Roy, A. Sinha, C. Iwendi, and L. Strážovská, “E-commerce
evaluation. Following that, the performance of the three website usability analysis using the association rule mining and machine
classifiers used in the prediction of Parkinson’s disease was learning algorithm,” Mathematics, vol. 11, no. 1, p. 25, 2022.
evaluated using various statistical approaches. In the tests, [12] R. Vunnava, L. Bodla, M. K. Dehury, and B. K. Mohanta, “Perfor-
mance analysis of ml techniques in identification of fake news,” in
we prepared and tested each categorization algorithm on a 2022 International Conference on Sustainable Computing and Data
training set that included both positive and negative datasets. Communication Systems (ICSCDS). IEEE, 2022, pp. 276–281.
The preliminary performance results show that the SVM [13] S. Rissanen, M. Kankaanpää, M. P. Tarvainen, J. Nuutinen, I. M. Tarkka,
O. Airaksinen, and P. A. Karjalainen, “Analysis of surface emg signal
outperformed the other two classifiers on the Parkinson morphology in parkinson’s disease,” Physiological measurement, vol. 28,
datasets. no. 12, p. 1507, 2007.
[14] D. Surangsrirat, C. Thanawattano, R. Pongthornseri, S. Dumnin,
C. Anan, and R. Bhidayasiri, “Support vector machine classification
In the future, this study will help in the development of parkinson’s disease and essential tremor subjects based on temporal
of a project that aims to design an automated program that fluctuation,” in 2016 38th Annual International Conference of the IEEE
can react more precisely to normal occurrences of the said Engineering in Medicine and Biology Society (EMBC). IEEE, 2016,
pp. 6389–6392.
disease and provide more informed choices in complex [15] [Online]. Available: https://www.kaggle.com/datasets/vikasukani/parkinsons-
situations. The program will be able to diagnose Parkinson’s disease-data-set
disease in a matter of minutes and warn users of the [16] P. Hall, B. U. Park, and R. J. Samworth, “Choice of neighbor order in
nearest-neighbor classification,” the Annals of Statistics, vol. 36, no. 5,
possibility of contracting the disease. This can be incredibly pp. 2135–2152, 2008.
beneficial in areas where there is a scarcity of healthcare [17] M. Tsuda, S. Asano, Y. Kato, K. Murai, and M. Miyazaki, “Differential
institutions and physicians. The model can be further scaled diagnosis of multiple system atrophy with predominant parkinsonism
and parkinson’s disease using neural networks,” Journal of the Neuro-
by gathering data from many clinical and medical institutes logical Sciences, vol. 401, pp. 19–26, 2019.
locally. Ensemble methods may be preferred for establishing [18] A. I. Galushkin, Neural networks theory. Springer Science & Business
an accurate model of these Parkinson’s disease predictions Media, 2007.
and performance could be enhanced.