Professional Documents
Culture Documents
Nhat Tan Le* 1 , Khuong Cong Duy Nguyen* 2 , Nhat Duy Vo 3 , Tan Thi
Pham 4
1234
Ho Chi Minh City University of Technology (HCMUT), 268 Ly Thuong
Kiet, Ward 14, District 10, Ho Chi Minh City, Vietnam,
1234
Vietnam National University Ho Chi Minh City (VNUHCM), Quarter 6,
Linh Trung Ward, Thu Duc City, Ho Chi Minh City, Vietnam
∗
Authors equally contribute to this work
Abstract
Wearing-off forecasting plays an important role in Parkinson’s symptom moni-
toring, which could reduce the risk due to the expiration of drug effectiveness.
Taking advantage of the development of technology, this detrimental phe-
nomenon could be accurately and remotely forecasted by analyzing data from
wearable sensors with machine learning techniques. In the ABC Challenge
2023, the dataset of 10 Parkinson’s patients, which was collected by a wear-
able fitness device, is utilized for the wearing-off forecasting application. In
this work, the KYSAI team has proposed a processing pipeline for the heart
rate, stress, and step dataset for the forecasting purpose. A total of 31 features
are extracted, which include several time windows before the forecasting pe-
riod. XGBoost model is utilized to perform the intra-person forecasting with
the focal loss to handle the imbalance class dataset issue. The remarkable
performance presented, the overall average weighted F1-score, Precision, and
1 lenhattan@hcmut.edu.vn
2 duy.nguyenkhuongcong@gmail.com
3 duy.vonhatduy@hcmut.edu.vn
4 ptthi@hcmut.edu.vn
1
2
recall are 0.94, 0.94, and 0.96 respectively. However, further analysis is needed
to address this highly imbalanced data problem.
1 Introduction
Parkinson’s disease (PD) is a neurodegenerative disorder characterized by the
loss of dopamine-producing brain cells [7], which significantly impacts the
quality of a patient’s life. Specifically, PD affects the patient’s motor abilities,
leading to symptoms such as tremors, muscle stiffness, and difficulties with
movement and balance [18]. Additionally, PD can cause non-motor symptoms
such as disturbances in sleep, constipation, and urinary [18]. Therefore, early
diagnosis and appropriate treatment plans for Parkinson’s disease are cru-
cial for effective management and improved patient outcomes [10]. However,
conventional methods, such as clinical assessments and neuro-imaging, often
rely on subjective observations and are not accessible to all individuals. This
poses a challenge in timely diagnosis and treatment initiation for PD patients,
delaying the potential benefits of early intervention [7].
Wearing-off phenomenon management plays an important role in PD
symptom controlling and improving the patient’s overall well-being. This phe-
nomenon significantly impacts their daily activities, where the effectiveness of
the medication gradually diminishes over time, and the patient’s symptoms
reappear before the next dose of medication [5]. The wearing-off phenomenon
is associated with the fluctuation of motor and non-motor symptoms, includ-
ing tremors, rigidity, bradykinesia, and fluctuations in mood and cognition [5].
Understanding the characteristics of PD medication, including the wearing-off
phenomenon, is essential for optimizing treatment strategies and improving
patient outcomes.
By leveraging advanced technologies, such as Artificial Intelligence (AI)
and the Internet of Things, wearing-off periods would possibly be effectively
monitored and accurately forecasted. By analyzing wearable sensor data vari-
ables [20, 1, 4, 15, 12] (such as heart activity, mobility, physical exercise, sleep
patterns, gait, and other relevant factors), questionnaires [16] or clinical mea-
surements [14] (such as medical imaging, drug intake), AI-powered were devel-
oped to anticipate the occurrence of wearing off in Parkinson’s disease patients
and had achieved many considerable results [20, 1, 3]. This approach offers
the potential to enhance the understanding of Parkinson’s disease treatment
and support clinicians in optimizing patient care strategies such as tailoring
medication regimens and providing timely interventions.
In the ABC Challenge 2023, the dataset [19] from a wrist-worn device is
utilized to forecast the wearing-off period in the next 15 minutes. In this paper,
team KYSAI aims to explain the proposed pipeline including pre-processing
method (Section 3.2), the feature extraction and analysis process based on
3
tracker data (Section 3.3 and 3.6) and model training (Section 3.4). The model
presents positive results in forecasting performance (Section 4.1). Moreover,
features insight has been shown in Section 3.6)
3 Methodology
The processing pipeline includes 5 main stages (Figure 1). Firstly, raw data is
cleaned during pre-processing 1 stage (Section 3.2). After cleaning, the data
are re-sampled to a specific time window, and a total of 31 features are ex-
tracted through this process (Section 3.3). After matching data from several
sources, a secondary pre-processing stage is proposed to handle additional
missing values (Section 3.2). Afterward, an ensemble learning model is uti-
lized with an additional technique to solve the imbalance-class data challenge
4
(Section 3.4). Finally, several evaluation metrics are applied to check the val-
idation performance (Section 3.5). Moreover, an feature analysis has been
conducted and presented in Section 3.6.
3.1 Dataset
The dataset utilized in this challenge was collected from the Garmin viosmart4
fitness tracker [20], including heart rate, steps, stress score, and sleep pattern.
In this work, 3 sources of data (heart rate, stress, steps) were analyzed to
perform the wearing-off prediction.
The wearing-off period was recorded by using Fonlog [9], a smartphone ap-
plication for data collection. The wearing-off period is annotated by Wearing-
Off Questionnaire (WoQ-9) using the Japanese translation [6]. In which, the
wearing-off phenomenon occurs when at least 1 symptom appears (pain,
tremors, anxiety, rigidity, slow down, slow thoughts, impaired hands, mood
change, muscle spasm)
3.2 Pre-processing
The main task in the 2 pre-processing stages is missing data handling.
In the raw dataset, missing data occurs as a short blank sequence. For the
heart rate and stress data, the nearest values are utilized to fill in the missing
series. For the step data, zero values are filled.
In the second pre-processing stage, after re-sampling and merging data
from several sources. Missing data occurs as a longer blank sequence due to
the device dropping out. The median value is filled into heart rate and stress
data, while the zero value is filled into step data.
tree (GBDT) machine learning module that provides parallel tree boosting
and is the leading machine learning library for regression, classification, and
ranking problems. Along with the implementations of Weighted Loss and Fo-
cal Loss to address the problem of label-imbalanced data.
Additionally, to achieve the optimal result, a hyper-parameter tuning tech-
nique, Grid-search Cross Validation, has been applied in this work. Specifi-
cally, the tuning hyper-parameters are focal gamma and imbalance alpha.
3.5 Evaluation
After the model training stage, the evaluation process is conducted on the
validation split of 10 participants. Due to the huge imbalance in the quantity
of data among classes, 3 weighted evaluation metrics (Precision, Recall, and
F1-score) were performed in this work.
TABLE 1.1: Table of extracted features. In which, the time windows t1, t2,
and t3 are from -45 minutes to -30 minutes, from -30 minutes to -15
minutes, and from -15 minutes to present respectively.
5 Discussion
The model performance through evaluation metrics presents optimistic re-
sults. Therefore, the effectiveness of the processing pipeline has been proved.
However, due to a high imbalance of wearing-off labels, cause the amount of
the sequence labeled ”wearing-off” is extremely few in the validation split.
8
Therefore, despite the model producing high performance with the average
weighted F1-score of 0.94, several factors are needed to be noted:
1. Insufficient data for intra-person forecasting: The current training
dataset is not nearly enough for most models to learn since it usu-
ally takes thousands or more samples. Specifically in our case after
resampling, there are only 865 segments of training data for par-
ticipants 8 and 9, and only 288 segments of training data for par-
ticipants 7. Which in both case are not enough for data to train or
generalized efficiently.
2. Imbalance class dataset: After assigning wearing-off status labels to
processed data, it is clear that the training set is heavily skewered
towards predicting control status due to wearing-off only happen-
ing on several occasions. This caused the model to tend to predict
”normal” in most cases. Hence despite even not wearing off exist,
the model can still have a high level of accuracy in the extreme case
of Participants 2 and 7 due to having no ”wearing off” labels, which
has an accuracy of 100%.
The inter-person forecasting model could be considered in further work.
However, there are several challenges due to the difference in bio-signal and be-
havior among patients. More specifically, some parameters such as heart rate,
and mental health status depend a lot on individual factors such as physique,
demographics, work, and living conditions. However, the performance of the
inter-person forecasting model could achieve considerable results if the dataset
is suitable and large amounts.
9
6 Conclusion
In this paper, a processing pipeline has been demonstrated for Parkinson’s
disease wearing-off forecasting purposes by analyzing a dataset from a fitness
tracker. 31 features are extracted from 3 different time windows: from -45
minutes to -30 minutes, from -30 minutes to -15 minutes, and from -15 minutes
to present. The XGBoost model with focal loss technique has been utilized
and presented a considerable performance on intra-patient forecasting tasks
on the imbalance-class dataset. The feature analysis results show that high
importance value in the extracted features, especially in timestamp-based,
heart-rate-based, and stress-score-based features. Although there are several
challenges including highly imbalanced classes are still considered.
10
Bibliography
[1] Aich, S., Youn, J., Chakraborty, S., Pradhan, P.M., Park, J.B., Park,
S.H.: A supervised machine learning approach to detect the on/off
state in parkinson’s disease using wearable based gait signals. Diag-
nostics 10(6), 421 (2020). DOI 10.3390/diagnostics10060421. URL
https://doi.org/10.3390/diagnostics10060421
[2] Antonini, A., Martinez-Martin, P., Chaudhuri, R.K., Merello, M., Hauser,
R.A., Katzenschlager, R., Odin, P., Stacy, M., Stocchi, F., Poewe,
W., Rascol, O., Sampaio, C., Schrag, A., Stebbins, G.T., Goetz, C.G.:
Wearing-off scales in parkinson’s disease: Critique and recommendations.
Movement Disorders 26(12), 2169–2175 (2011). DOI 10.1002/mds.23875.
URL https://doi.org/10.1002/mds.23875
[3] Camps, J., Sam, A., Martn, M., Rodrguez-Martn, D., Prez-Lpez, C.,
Arostegui, J.M.M., Cabestany, J., Catal, A., Alcaine, S., Mestre, B.,
Prats, A., Crespo-Maraver, M.C., Counihan, T.J., Browne, P.J., Quin-
lan, L.R., Laighin, G., Sweeney, D., Lewy, H., Vainstein, G., Costa, A.C.,
Annicchiarico, R., Bays, N., Rodrguez-Molinero, A.: Deep learning for
freezing of gait detection in parkinson’s disease patients in their homes
using a waist-worn inertial measurement unit. Knowledge Based Sys-
tems 139, 119–131 (2018). DOI 10.1016/j.knosys.2017.10.017. URL
https://doi.org/10.1016/j.knosys.2017.10.017
[4] De Lima, A.R., Evers, L.J., Hahn, T., Bataille, L., Hamilton, J.L., Little,
M.A., Okuma, Y., Bloem, B.R., Faber, M.J.: Freezing of gait and fall de-
tection in parkinson’s disease using wearable sensors: a systematic review.
Journal of Neurology 264(8), 1642–1654 (2017). DOI 10.1007/s00415-
017-8424-0. URL https://doi.org/10.1007/s00415-017-8424-0
[5] DeMaagd, G., Philip, A.: Parkinson’s disease and its management: Part
4: Treatment of motor complications. PubMed 40(11), 747–73 (2015).
URL https://pubmed.ncbi.nlm.nih.gov/26609209
[6] Fukae, J., Higuchi, M.A., Yanamoto, S., Fukuhara, K., Tsugawa,
J., Ouma, S., Hatano, T., Yoritaka, A., Okuma, Y., Kashihara, K.,
Hattori, N., Tsuboi, Y.: Utility of the japanese version of the 9-
item wearing-off questionnaire. Clinical Neurology and Neurosurgery
134, 110–115 (2015). DOI 10.1016/j.clineuro.2015.04.021. URL
https://doi.org/10.1016/j.clineuro.2015.04.021
[7] Kalia, L.V., Lang, A.E.: Parkinson’s disease. The Lancet 386(9996),
896–912 (2015). DOI 10.1016/s0140-6736(14)61393-3. URL
https://doi.org/10.1016/s0140-6736(14)61393-3
11
[8] Lu, R., Xu, Y., Zhang, H., Fan, Y., Zeng, W., Tan, Y., Ren, K.F.,
Chen, W., Cao, X.: Evaluation of wearable sensor devices in parkin-
son’s disease: A review of current status and future prospects. Parkin-
son’s Disease 2020, 1–8 (2020). DOI 10.1155/2020/4693019. URL
https://doi.org/10.1155/2020/4693019
[9] Mairittha, N., Mairittha, T., Inoue, S.: A mobile app for nursing
activity recognition (2018). DOI 10.1145/3267305.3267633. URL
https://doi.org/10.1145/3267305.3267633
[10] Murman, D.L.: Early treatment of parkinson’s disease: opportunities
for managed care. PubMed 18(7 Suppl), S183–8 (2012). URL
https://pubmed.ncbi.nlm.nih.gov/23039867
[11] Pursiainen, V., Korpelainen, J.T., Haapaniemi, T.H., Sotaniemi, K.A.,
Myllylä, V.V.: Blood pressure and heart rate in parkinsonian pa-
tients with and without wearing-off. European Journal of Neurology
14(4), 373–378 (2007). DOI 10.1111/j.1468-1331.2007.01672.x. URL
https://doi.org/10.1111/j.1468-1331.2007.01672.x
[12] Rigas, G., Tzallas, A.T., Tsipouras, M.G., Bougia, P., Tripoliti,
E.E., Baga, D., Fotiadis, D.I., Tsouli, S., Konitsiotis, S.: Assess-
ment of tremor activity in the parkinson’s disease using a set of
wearable sensors. IEEE Transactions on Information Technology in
Biomedicine 16(3), 478–487 (2012). DOI 10.1109/titb.2011.2182616.
URL https://doi.org/10.1109/titb.2011.2182616
[13] Ruonala, V., Tarvainen, M.P., Karjalainen, P.P., Pekkonen, E., Rissanen,
S.M.: Autonomic nervous system response to L-dopa in patients with
advanced Parkinson’s disease (2015). DOI 10.1109/embc.2015.7319799.
URL https://doi.org/10.1109/embc.2015.7319799
[14] Salmanpour, M.R., Shamsaei, M., Saberi, A., Setayeshi, S.,
Klyuzhin, I.S., Sossi, V., Rahmim, A.: Optimized machine learn-
ing methods for prediction of cognitive outcome in parkin-
son’s disease. Computers in Biology and Medicine 111,
103347 (2019). DOI 10.1016/j.compbiomed.2019.103347. URL
https://doi.org/10.1016/j.compbiomed.2019.103347
[15] Shima, K., Tsuji, T., Kan, E., Kandori, A., Yokoe, M., Sakoda,
S.: Measurement and evaluation of finger tapping movements using
magnetic sensors (2008). DOI 10.1109/iembs.2008.4650490. URL
https://doi.org/10.1109/iembs.2008.4650490
[16] Stacy, M.: The wearing-off phenomenon and the use of questionnaires
to facilitate its recognition in parkinson’s disease. Journal of Neural
Transmission 117(7), 837–846 (2010). DOI 10.1007/s00702-010-0424-5.
URL https://doi.org/10.1007/s00702-010-0424-5
12
[17] Stacy, M., Hauser, R.A., Oertel, W.H., Schapira, A.H., Sethi,
K.D., Stocchi, F., Tolosa, E.: End-of-dose wearing off in parkin-
son disease. Clinical Neuropharmacology 29(6), 312–321
(2006). DOI 10.1097/01.wnf.0000232277.68501.08. URL
https://doi.org/10.1097/01.wnf.0000232277.68501.08
[18] Sveinbjörnsdóttir, S.: The clinical symptoms of parkinson’s disease. Jour-
nal of Neurochemistry 139, 318–324 (2016). DOI 10.1111/jnc.13691.
URL https://doi.org/10.1111/jnc.13691
[19] Victorino, J.N.C., Kaneko, H., Fikry, M., Nahid, N., Hosain, T.,
Shibata, T., Inoue, S.: 5th abc challenge: Forecasting parkinson’s
disease patients’ wearing-off phenomenon datasets (2023). URL
https://ieee-dataport.org/competitions/5th-abc-challenge-forecasting-
parkinsons-disease-patients-wearing-phenomenon-datasets
[20] Victorino, J.N.C., Shibata, Y., Inoue, S., Shibata, T.: Predicting wearing-
off of parkinson’s disease patients using a wrist-worn fitness tracker and
a smartphone: a case study. Applied Sciences 11(16), 7354 (2021). DOI
10.3390/app11167354. URL https://doi.org/10.3390/app11167354
[21] Wang, C., Deng, C., Wang, S.: Imbalance-xgboost: Leveraging weighted
and focal losses for binary label-imbalanced classification with xgboost
(2019)
7 Appendix