You are on page 1of 6

PDetect-Parkinson’s Disease Detection Using

Speech Features
Puja C Prof.Chinchu.A
M Tech Signal Processing and Embedded Systems Assistant Professor
Government College of Engineering Kannur Department of Electronics and Communication
Government College of Engineering Kannur

Abstract—Parkinson’s syndrome is a degenerative,progressive In senior people,this results in a general loss of motor


neurological condition. The substantia nigra(a part of brain) skills. As a result of the shortage of neurotransmitter receptors
nerve cells in tiny bundles are mostly affected.When everything throughout time, in the nigrostriatal area. Nearly 90% of
is working properly, these cells produce dopamine, a chemical
(neurotransmitter) that carries information between brain areas patients,resting tremor, gait rigidity, postural instability, and
to coordinate smooth and balanced muscular movement. Because vocal dysfunction are all indicators of Parkinson’s disease.
these nerve cells die as a result of Parkinson’s disease,body There are two sorts of symptoms for Parkinson’s disease: range
motions are altered. Early Parkinson’s disease intervention is of motion and movement-unrelated.Depression, loss of sense
crucial for slowing the disease’s course. By allowing patients to of smell, and psychological characteristic impairments are
have access to information Condition modifying therapy is a type
of medication that is used to treat a disease. To ameliorate the examples of nonmotor symptoms.Parkinson’s disease patients
symptoms of Parkinson’s disease, computational algorithms that commonly encounter auditory processing disorder.majority of
utilise a set of data containing medical information about the the individuals have speech abnormalities. Such as extremely
disease. This will aid the number of people who seek to identify quiet and unvoiced speech. Speech signal analysis is regarded
a risk early on. as a major non-invasive tool for diagnosing Parkinson’s dis-
Research conducted on the Parkinson’s disease (PD) detec-
tion voice impediment has been proven to exist, is associated
ease. Health professionals and neuroscientists are interested
with symptoms. It is seen in 90% of people with early-stage in non-invasive PD detection and forecasting techniques. In
Parkinson’s syndrome. Therefore, we are interested in applying addition, recognising vocal changes in Parkinson’s patients
the vowel function to computer-aided Clinical recognition and could lead to an earlier diagnosis. This project aims to show
remote patient monitoring of patients with parkinson. The that a patient’s vocalisation data can be used to determine
prognosis for Parkinson’s disease is poor. However, both genetic
and environmental factors may be involved. For this project,
whether or not they have a heart condition. whether or not
I’ve opted to focus on medicine and use vocalisation data to they have Parkinson’s disease. As a result, it is at first It’s
classify whether or not a person has Parkinson’s disease. For assumed that there’s a link between the two.
background, it is a degenerative brain disorder that causes both
movement and cognitive impairment. As a result, it’s realistic to
infer that a patient’s capacity to, as these abilities deteriorate,
II. L ITERATURE S URVEY
they develop Parkinson’s disease. Compared to older Machine
Learning models, the proposed methodology greatly improves
Until recently, there was no technique to diagnose Parkin-
the accuracy of PD identification. son’s disease (PD) [1]. There exist multiple symptoms and
Index Terms—Parkinson disease, Speech processing, Machine diagnostic tools for the detection. A variety of indicators have
learning, Acoustic features been investigated by scientists in order to assist them diagnose
Parkinson’s disease early and slow it down. All Parkinson’s
I. I NTRODUCTION disease treatments at the time improve symptoms but do not
reduce or stop the disease from advancing.
Parkinson’s disease is a disorder which affects the nervous A articulation transition feature comparison revealed that
system. The symptoms occur gradually, with a tremor in only the number of articulation transitions and the fundamental
one hand frequently being the first. Tremors are common, but frequency are both growing in a study conducted by Quan
they’re also linked to stiffness and decreased mobility. Your Changqin et al [1]. The curves of HC speakers and PD
face will be having little or no expression at all in the previous patients were drastically different. As a result of this finding,
stages of Parkinson’s disease. When you walk, your arms may we propose utilising a bidirectional long-short term memory
not swing.There will be a great chance that your speech is (LSTM) model to capture time-series dynamism. Voice signal
slurred or hushed. Symptoms of Parkinson’s disease worsen as features that can be utilised to detect Parkinson’s disease.
the disease progresses. Although there is no immediate cure Designers built a deep neural network to forecast the sever-
for Parkinson’s disease, drugs may help you feel better for just ity of Parkinson’s illness in Grover et al, [10]. The developed
a short period of time. Your doctor may recommend surgery DNN model outperformed previous strategies, in terms of
to help you feel better by addressing specific areas of your accuracy. It was also revealed that the classification based
brain. on motor UPDRS score is superior to the classification based

Electronic copy available at: https://ssrn.com/abstract=4298373


on total UPDRS score. Implying that it is a better tool for • MDVP:Jitter(%),MDVP:Jitter(Abs),MDVP:RAP, MDVP
predicting severity. :PPQ,Jitter:DDP -implies the Several measures of vari-
Wodzinsk et alresearch et.al,[7] demonstrates how to iden- ation in fundamental frequency
tify vowels with persistent phonation and a ResNet architecture • MDVP:RAP - MDVP relative amplitude perturbation
built for image classification . The audio recordings’ spectrum • MDVP:PPQ -MDVP five-point period perturbation quo-
was calculated and input into the ResNet architecture, which tient
had already been pre-trained with the ImageNet. And the • Jitter:DDP -Average absolute difference of differences
SVD databases. The dataset was extensively extended in the between jitter cycles
time domain to avoid overfitting. In the Parkinson’s disease • MDVP:Shimmer,MDVP:Shimmer(dB):Several measures
sample (from the PC-GITA database), there are 100 patients: of variation in amplitude
50 healthy and 50 with Parkinson’s disease. • MDVP:Shimmer -MDVP local shimmer
The accuracy of diagnosis was as high as 90%. The publica- • Shimmer:APQ3 - amplitude perturbation quotient(Three-
tion [5] discusses many classifiers for identifying PD. Includ- point)
ing Neural Networks, Decision Trees, Regression and the DM • Shimmer:APQ5 - amplitude perturbation quotient(Five-
neural method. Neural Network has the highest classification point)
rate of 92.9 percent. [8] describes how correlation-based fea- • Shimmer:DDA -Average absolute differences between the
ture selection was used to lower feature dimensionality (from amplitudes of consecutive periods
23 to 11) and how it affected the Rotation Forest classifier. The • NHR :Noise-to-harmonics ratio,HNR :Harmonics-to-
authors report a 2.7 percent improvement as a result of using noise ratio- measures of ratio of noise to tonal compo-
11 characteristics. The developers of [2] devised a method nents
for classifying and grading Parkinson’s disease severity. For • RPDE (Recurrence period density entropy measure)
further speech disorders that are much easier to diagnose from • D2 (Correlation dimension)- nonlinear dynamical com-
a healthy person’s voice, automatic feature extraction analysis plexity measures
[1] was performed. • DFA - Signal fractal scaling exponent
• spread1,spread2,PPE (Pitch period entropy)- nonlinear
III. DATASET measures of fundamental frequency variation [1]
The database is turned out by Max Little of the University
of Oxford. It in the collaboration with the National Center
for Voice and Voice in Denver, Colorado. They recorded the
speech signals. In the original study, the feature extraction
approaches for broad voice difficulties were provided. The
biological voice measurements from 31 persons and 23 per-
sons whom having parkinson disease have been collected and
it is added to the dataset. Thus the dataset is applied for
the modelling and processing. The column consisting of the
specific voice measure. Similarly the row consisting of one
of the 195 voice recordings made by the given persons. The
main aim of the data is the distinguishning of healthy persons
and the ones with parkinsons. In the dataset taken the status Fig. 1. Data sample of both a healthy patient and one with Parkinsons
values corresponds to the persons with or without parkinsons.
The status value corresponds to ’1’ indicates the person with B. Data visualization
parkinson and ’0’ indicates the one without parkinson. The
information is stored in ASCII CSV format. A single voice
recording occurrence is represented in each row in the format
of CSV file . Every patient has roughly six recordings, the first
of which is labelled with the patient’s name. This is shown in
fig.1.

A. Features
• MDVP:Fo(Hz) - implies the Average vocal fundamental
frequency
• MDVP:Fhi(Hz) - implies the Maximum vocal fundamen-
tal frequency
• MDVP:Flo(Hz) - implies the Minimum vocal fundamen- Fig. 2. Line chart for the dataset
tal frequency

Electronic copy available at: https://ssrn.com/abstract=4298373


IV. PD ETECT B. Support Vector Machines
A. Features A classifier that uses an ideal hyper plane between data to
PDetect’s major goal is to produce reliable findings, which categorise the data set. This classifier was chosen because it
gives rise to its specialisation.PDetect uses two steps to has a large variety of various kernelling functions that can be
diagnose Parkinson’s disease: The ’Vocal Frequency Test’ and used. This model, I hope, can produce a high prediction rate
the ’Spiral and Wave Test.’ when modified.
• Vocal Frequency Test (Stage 1) C. Stochastic Gradient Descent
– Parkinson’s disease (PD) has numerous effects on A classifier that uses the gradient descent idea to achieve
speech. Many patients with PD speak calmly and the best prediction rate. This classifier was chosen since it
in one tone, with little emotion conveyed. Speech contains a number of parameters that might be tweaked if it
can seem breathy or hoarse at times. Patients with provides good results with my data set. My biggest concern is
Parkinson’s disease may slur words, grumble, or that it performs well with sparse data but suffers from larger
stop at the end of a sentence. Because patients training sets.
with Parkinson’s disease have these distinct vocal
characteristics, the disease can be recognised using D. K Nearest Neighbor
a person’s vocal frequencies. The K-NN algorithm thinks that the new case/data and
• Spiral and Wave Test (Stage 2) precedents are comparable. And it places the new case in the
– A test that entails drawing a spiral and a wave on classification, that is closest to the previous categories. This
a piece of paper could be used to detect Parkin- method keeps track of all available data . Categorises sets of
son’s syndrome early on. One of the symptoms of data, based on their similarity to existing data. This implies
Parkinson’s syndrome is tremors and muscle rigidity, that the method, new data can be quickly categorised into
which makes drawing smooth spirals and waves the relevant place. Although the technique can be useful for
challenging. Draw a spiral and a wave on different both regression and classification purpose. The classification
sheets of paper, then upload an image of each to the application is the most prevalent. It does not make any
website. assumptions about the data.
VI. N EURAL N ETWORK
Artificial neural networks (ANNs) and simulated neural net-
works (SNNs), both types of neural network models, are used
in deep learning techniques. Their naming and construction
were inspired by the human brain, although they function
similarly to genuine neurons.

Fig. 3. PD detection procedure

V. A LGORITHMS AND TECHNIQUES


To analyse the data collection, I used many supervised
learning techniques. My ambition is to discover the best
classifier for this problem. After that, I’ll be able to fine-tune
it to maximise its efficiency. The ultimate goal is to develop
a model based on one of these classifiers to perform as good
as or better than the study’s prediction rate. As follows, I’ve
picked three distinct supervised classifiers:
A. Naive Bayes
Fig. 4. Deep Neural Network
A classifier that attempts to classify data using Baye’s
theorem. Baye’s equation is used to calculate the likelihood In the Artificial neural network it mainly consisting of the
of a certain classification based on probabilities received from nodal layer which consists of an input layer and multiple
data features. This classifier was chosen since it is known to hidden layers and also an output layer. There will be having
require a minimal training set to estimate parameters and is there own weight for every artificial neural network and also
unaffected by irrelevant features. the threshold value. And is linked to the others also. When

Electronic copy available at: https://ssrn.com/abstract=4298373


a specific level is reached by the output of a node, it gets The NB classifier calculates class membership probability
activated and the data is forwarded to the next layer of the using a probabilistic technique. It is based on the conditional
network. If this isn’t the case, no data is transferred to the independence principle. During the learning phase, the NB
networks next level . is straightly to use because it just requires one iteration to
generate probabilities. In the theory, the classes are assigned
to the training data using a probability density function.
Following that, the objects are allocated to the most likely
class. It attributes a new set of features to the most probable
target class (c) according to:

c =arg max Prob c | f 1,f2 ,...,fn ))


 
1 ,f2 ,...,f |c),Prob(c)
=arg max Prob(f
Prob(f1 ,f2 ,...,f )

Fig. 5. KNN classifier


using the chain rule, the most probable target class (c) can be
expressed as follows:
VII. I MPLEMENTATION n
!
Y
By using the concept of similarity the KNN algorithm c = arg max Prob(c) Prob (fi c)
determines the closest neighbours . It aggregates a group i=1
of data points into categories . And categorize new data as With the given data set, the supervised learning algorithms
shown in fig.5. A metric for similarity (such as the Euclidean performed admirably. I had no problems with the measure-
distance). The key benefit is that it does not take the shape of a ments and strategies I used. Some effort was put into tweaking
model that will fit. It is based entirely on data-driven learning. the scatter matrix I created so that the labels could be seen
It locates the k nearest neighbours of I, given a value k and properly. I should mention that I’ve written a number of
a feature, in order to categorise I. the sample set and uses functions to aid in the training and prediction of my classifiers.
neighbour categories to define I’s class. The structure imposes This makes it quite simple for numerous classifiers like myself.
a lower standard computation burden. As a result, the k-NN
algorithm is extremely quick. It produces paistaking findings A. Streamlit app
in general. Streamlit is an open source app. Its a framework built on
The following is a common k-NN algorithm: Python. It allows us to swiftly build data science. And als
• Calculate the Euclidean distance between a new object o the machine learning web apps. It supports scikit-learn, Keras
and the rest of the learning sets other objects. etc. among other Python libraries. The obtained streamlit app
• Select the k items fron the learning set that are more has shown in fig.6 and fig.7. Widgets are treated as variables,
similar to o. therefore no callbacks are necessary. Data caching speeds up .
• Assign o to the group that contains the most k items. It simplifies the use of computation pipelines. The application
Following the discovery of the k closest neighbours, the is automatically deployed using the link while shared while
new object’s class is determined. Using a voting technique doing it. Following after Streamlit monitors changes to the
such as majority voting or weighted-sum voting. The formal associated Git repository for updates.
k-NN classifier algorithm is as follows in summary: On this app we will provide the values of the voice features
of an individual by examining that the app wil give the result
on detecting whether the individual has parkinson disease or
argmin(de (t, o, k)) → identif yP not.
where t represents the training data, o represents the object to While giving the values as shown in Fig.5 while pressing
be classified, P represents the new object’s given class, and k the detect button as it will show the results.Streamlit is more
is the number of nearest neighbours to be considered and de organised and emphasises simplicity. It only supports Python-
is the Euclidean distance given by: based data analysis and has a restricted range of widgets to
pick from (for example, sliders).
v
u L
uX B. Performance Metrics
2
de = t (ti,k − oi,k ) If a PD patient is accurately recognised during testing of
i=1
the method for PD detection, this case is a TRUE POS-
where L is length of each of data vector. In this work, the ITIVE (TP); otherwise, this case is a FALSE NEGATIVE
parameter k is set to one. (FN). TRUE NEGATIVES (TN) relate to accurately identified

Electronic copy available at: https://ssrn.com/abstract=4298373


evaluate the suggested method’s performance.
TP + TN
Accuracy = (1)
TP + FP + TN + FN
2 × Specif ity × Sensitivity
F − score = (2)
Specif ity + Sensitivity
TN
Specif ity = (3)
TN + FP
TP
Sensitivity = (4)
TP + FN

VIII. R ESULT
The final results are not as impressive as the study’s goal of
Fig. 6. PDetect app [Local host]
91.4 %. The final rate for my model was 88.46 %. However,
my model is within a 5% margin of their research, which
verifies the findings. I believe that by paying even more
attention to the model’s tweaking, I will be able to obtain the
same result as the study. It may be possible to achieve a higher
rate, but this would need considerable parameter adjustments
and data cleaning to exclude small outliers.

IX. C ONCLUSION
My experiment examined a variety of classifiers in the hopes
of identifying one that would confirm the study’s classification
using vocalisation data. I found the data distribution for
individuals with Parkinson’s disease versus those who were
healthy to be quite fascinating. This backs up the theory
that vocalisation data is linked to whether or not someone
has Parkinson’s disease. Coding and analysing the scatter
matrix to find this association was the most difficult task I
faced.However, it was quite beneficial in helping me compre-
hend the data set. This model met my expectations because
it supported the findings of the investigation. Unfortunately,
Fig. 7. PDetect app
because it does not meet the 95 percent confidence interval,
I do not feel it can be used for diagnosis in the general
healthcare environment.
To improve the prediction rate, the model can be fine-tuned
further. I was able to use all of the techniques I created, but
I am aware that there are other supervised learning classifiers
that may be used to classify this data. In terms of my final
solution, I believe that a better answer exists if more work is
spent tweaking classifier parameters and studying additional
supervised classifiers. A classifier designed expressly for this
type of situation might be the most effective.

R EFERENCES
[1] Quan Changqin,Kang Ren and Zhiwei luo.”A Deep Learning Based
Method for Parkinson Disease Detection using dynamic speech features”
IEEE Access 9(2021) pp.10239-10252.
Fig. 8. PDetect app [2] Wang Wu,Junho Lee,Fouzi Harrou and Ying sun.”Early Detection of
parkinsons disease using deep learning and machine learning”.IEEE
Access 8 (2020) pp. 147635-147646.
[3] Palancios alanso,Daniel, Guillermo melendez morales,Agustin lopez ar-
healthy subjects, whereas FALSE POSITIVES (FP) belong ribas ”Monparloc:a speech based system for parkinson disease analaysis
and monitoring”.IEEE Access 8(2020) pp. 188243-188255
to misclassified healthy patients (FP). Accuracy, sensitivity, [4] Gunduz hakan”Deep learning based parkinson disease classification
specificity, and precision are utilised as figures of merit to using vocal feature set”.IEEE Access 7 (2019) pp.115540-115551

Electronic copy available at: https://ssrn.com/abstract=4298373


[5] Johri Anubhav,Ashish tripati ”Parkinson disease detection using deep
neural network ”. 2019 Twelth International conference on Contempo-
rary Computing (IC3)IEEE 2019 pp.1-4
[6] Nonavinakere prabhakera , N Schuller b alku ”The detection of parkin-
sons disease from speech using voice source information” IEEE/ACM
Transactions on audio,speech and language processing vol.29, pp.1925-
1936
[7] Wodzinski Marek, Andrzej Skalski, Daria Hemmerling ”Deep learning
approach to parkinsons disease dtection using voice recordings and
convolutional neural network dedicated to image classification ” In
2019 41st annual international on IEEE Engineering and Medicine and
Biology Soceity (EMBC) IEEE 2019 pp. 717-720.
[8] Saloni and A. K. Gupta. ”Detection of Parkinson disease using clinical
voice data mining.” International Journal of Circuits, Systems and Signal
Processing 9 (2015).
[9] Solana-Lavalle, Gabriel, Juan-Carlos Galan-hernandez, and Roberto
rosas-romero. ”Automatic Parkinson disease detection at early stages
as a pre-diagnosis tool by using classifiers and a small set of vocal
features.” Biocybernetics and Biomedical Engineering 40, no. 1 (2020):
505-516.
[10] Grover, Saloni Bhartia, Abhilasha Yadav, and K. R. Seeja. ”Predicting
severity of Parkinson’s disease using deep learning.” Procedia computer
science 132 (2018): 1788-1794.

Electronic copy available at: https://ssrn.com/abstract=4298373

You might also like