Professional Documents
Culture Documents
Keywords: Background and purpose: Low adherence to medication in chronic disease patients leads to increased morbidity,
Medication refill adherence mortality, and healthcare costs. The widespread adoption of electronic prescription and dispensation records
Electronic health records allows a more comprehensive overview of medication utilization. In combination with electronic health records
Simulation (EHR), such data provides new opportunities for identifying patients at risk of nonadherence and provide more
Prediction
targeted and effective interventions. The purpose of this article is to study the predictability of medication
Refill patterns
adherence for a cohort of hypertensive patients, focusing on healthcare utilization factors under various pre-
dictive scenarios. Furthermore, we discover common proportion of days covered patterns (PDC-patterns) for
patients with index prescriptions and simulate medication-taking behaviours that might explain observed pat-
terns.
Procedures: We predict refill adherence focusing on factors of healthcare utilization, such as visits, prescription
information and demographics of patient and prescriber. We train models with machine learning algorithms,
using four different data splits: stratified random, patient, temporal forward prediction with and without index
patients. We extract frequent, two-year long PDC-patterns using K-means clustering and investigate five simple
models of medication-taking that can generate such PDC-patterns.
Findings: Model performance varies between data splits (AUC test set: 0.77–0.89). Including historical in-
formation increases the performance slightly in most cases (approx. 1–2% absolute AUC uplift). Models show
low predictive performance (AUC test set: 0.56–0.66) on index-prescriptions and patients with sudden drops in
PDC (Recall: 0.58–0.63). We find 21 distinct two-year PDC-patterns, ranging from good adherence to inter-
mittent gaps and early discontinuation in the first or second year. Simulations show that observed PDC-patterns
can only be explained by specific medication consumption behaviours.
Conclusions: Prediction models developed using EHR exhibit bias towards patients with high healthcare utili-
zation. Even though actual medication-taking is not observable, consumption patterns may not be as arbitrary,
provided that medication refilling and consumption is linked.
⁎
Corresponding author.
E-mail address: alexander.galozy@hh.se (A. Galozy).
URL: http://www.islab.hh.se/mediawiki/Alexander_Galozy (A. Galozy).
https://doi.org/10.1016/j.yjbinx.2020.100075
drug, provide the highest accuracy but place a high burden on patients. 2. Methods
Less accurate and rigorous measures include the assessment of the
clinician via patient interviews and self-report through patient-kept 2.1. Data source
diaries [21]. Indirect measures approximate medication adherence
through medication refill adherence by utilizing proxy information We analyzed data from the Region Halland (RH) data warehouse
such as pharmacy dispensation data or insurance claims data, providing containing detailed patient records from public primary, secondary,
a low cost, low burden solution, with the caveat that actually medica- and specialist care facilities in Halland, Sweden. Data from private care
tion consumption remains unknown. Nevertheless, new tools created by providers through a data agreement with RH are available, providing a
the wealth of information through the advancing digitization of comprehensive view of the patients’ encounters with the regions’ health
healthcare, provide fertile ground for valid and cost-effective analysis care system. Clinical data include lab results, radiology diagnostics,
of refilling behaviour as well as timing of drug exposure [3,9]. diagnoses, waveform data, doctors’ notes, and performed medical
Many different ways of computing medication refill adherence, procedures. Through data integration efforts with Swedish pharmacies,
hence forth called refill adherence, using secondary databases have pharmacy dispensation data is available. As of October 2018, records
been developed, with Proportion of Days Covered (PDC) and from 545,652 patients (male 49.2%/ female 50.8%), totaling 7,548,301
Medication Possession Ratio (MPR) being the most popular ones. Those visits, from 29 care centers (24 primary care, 3 hospitals, 2 emergency
measures expresses the acquired number of doses over a set period as a care) are included. The data is pseudonymized with each patient’s 10-
percentage, ordinarily in the range between 0% and 100%. Even though digit Swedish Personal ID being replaced with a unique randomized ID
many measures exists, PDC is being recommended by the Pharmacy that allows tracking through the healthcare system without exposing
Quality Alliance (PQA) as the defacto standard measure.1 the identity of the patient. All identifying information such as: name,
Previous studies on predicting refill adherence utilizing socio- address, date of birth and telephone numbers are removed. The plat-
demographic factors, clinical factors, or purchasing information had form has been designed to conform to Swedish and EU laws and reg-
limited success, with models exhibiting only low discriminability, often ulations on data privacy and confidentiality [4].
with AUCs < 0.7 [30,18,13,19]. Although these efforts have had only
marginal success in developing accurate predictive models, they pro- 2.2. Cohort
vided valuable insights into the importance of analyzing patients’ re-
filling behaviours and health care utilization patterns. Several studies The cohort consists of patients between the ages of 18 and 90 with
find that initial and past patterns of medication refilling can sig- an essential hypertension diagnosis (ICD10 code i10-) at one point
nificantly improve model performance, providing the majority of the between the dates 1st November 2014 and 31st July 2018. We assume
models’ predictive accuracy [12,13,22,20]. the full medical history of the patients is known. Therefore we did not
As mentioned, a limiting factor of refill adherence analysis is that consider further exclusion criteria on patient-level, such as specific
patients that refill their medication do not necessarily take them as comorbidities or excluding patients with hospital visits as has been
prescribed, but there exists some evidence to suggest a link between common in previous studies that lack the same completeness of the
refilling and medication-taking [14]. Analyzing medication refill pat- EHR. There is a possibility that patients visit hospitals or primary care
terns might provide insight into the possible consumption patterns of centres outside RH and may receive prescriptions that are subsequently
medication, especially early on in pharmacological treatment where missing in the records, but we do not expect this to alter results sig-
long-term effects of, for example, hypertension can be mitigated. nificantly. Dispensation records are available from all dispensaries in
This study was part of a project providing personalized interven- Sweden; this mitigates the possibility of underestimation of adherence.
tions and support for patients struggling with adherence to pharma- We predict refill adherence for one-year prescriptions of medication
cological treatment.2 Therefore, we first analyze refill adherence of a commonly prescribed for antihypertensive treatment. Brand names and
large and diverse cohort of patients diagnosed with and treated for generic names of the prescribed and dispensed medication are avail-
hypertension (ICD i10-) under various predictive scenarios to get a able. However, due to missing joining keys between prescription and
general idea about the predictability of refill adherence and potential dispensation systems and the possibility that patients might change
model or data bias, as we will show, using data from a comprehensive brand names within a prescription, we opted to use the Anatomical
EHR system. We focus specifically on healthcare utilization factors, Therapeutic Chemical Classification System (ATC) code to allow for
such as visit frequency/type, past prescription information, past PDC, simpler matching of dispensations and prescriptions. As part of the
and a limited number of demographic variables of both patient and project mentioned above, we primarily consider medication with ATC
prescriber. codes that are commonly prescribed for patients with high blood
Secondly, we discover common PDC-patterns via unsupervised pressure: C02- (antihypertensives), C03- (diuretics), C07- (beta-
clustering of a sub-cohort of patients that have been prescribed anti- blocking agents), C08- (calcium channel blockers) and C09- (Agents
hypertensive medication for the first time. This sub-cohort constitutes a acting on the renin-angiotensin system).
more typical group of patients that might be target by interventions in
the project mentioned above, where early intervention can have argu- 2.3. Data preprocessing
ably the highest impact on long term positive health outcomes.
Furthermore, under the assumption of a link between refilling and Before matching dispensations to prescriptions, we removed dupli-
medication-taking behaviour, we construct simple simulation models, cations and adjust prescription periods. The exact procedure is detailed
generating diverse PDC-patterns and medication consumption patterns in an earlier published article but briefly summarized here for com-
that could explain refill behaviour we observe in the patient sub-cohort. pleteness [2].
In all cases, adherence is evaluated in the prescription period, the
time frame between the prescription start and end date. Dispensation
data is available between 28th May 2012 up until 31st July 2019, while
prescription data is available from 1st January 2010. To avoid missing
1
https://www.pqaalliance.org/adherence-measures - last accessed: 2020-04- dispensations, which would lead to underestimation of adherence, we
13. exclude patients with index prescriptions before the 28th May 2012.
2
https://www.hh.se/english/research/research-environments/embedded- Average dispensation rates approx. take until 1st November 2013 to
and-intelligent-systems-eis/technology-area-aware-intelligent-systems/imeda- fully settle. We exclude patients that have index prescriptions until this
digital-support-for-high-blood-pressure.html - last accessed 2020-04-13. date to reduce the chance of missed dispensations further. In case
2
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
3
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
4
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
2.6.4. Simulation model for medication-taking With c yinterval defining the amplitude between motivation “highs” and
We employ a simple model to simulate medication-taking. The “lows”, e.g. for c yinterval = 0.5 motivation oscillates between 1 and 0.
primary mechanism determining medication-taking in this model is As mentioned earlier, real patients can consume medication arbi-
“forgetting” and the concept of “motivation”, i.e., the desire of the trarily and there does not need to be any link between refill patterns
patient to abide by their prescribed treatment regimen. We model and actual medication use. In the absence of any direct information
motivation by simple mathematical equations and investigate five basic about how a particular patient is consuming medication, it is reason-
models. able to assume that patients that are motivated to take their medication
The general model of medication consumption is governed by are also more likely to refill their medication regularly. Provided that
combining the outcomes of two iid Bernoulli trials: this assumption does mirror reality, we can reduce the number of all
possible medication consumption patterns given particular refilling
P (Xc = 1) = P (X1 = 1) P (X2 = 1) (4)
behaviour.
With Xc being a binary random variable indicating medication con- Therefore, for our simulated patients, we assume that there exists a
sumption. The binary random variables X1 and X2 are distributed ac- link between available medication supply and refilling patterns. For
cording to: simplicity, the probability of refilling is only dependent on the supply
X1 ~B (1, M ), X2 ~B (1, 1 cfprob) currently available to the patient, i.e., the patient delays refilling if
supply is still available. The probability of pickup Xp is computed as
With M (0, 1) being the level of motivation expressed as a probability follows:
and cfrob (0, 1) being the probability of forgetting to take medication.
e s
This model of medication-taking applies to all motivation models. P (Xp = 1|s , t ) = cdisp (t )
1+e s (10)
For the constant model, motivation does not change over time:
M = cconst (5) where cdisp (t ) is the indicator function defined as:
where 0 cconst 1. The constant model can generate a variety of PDC- 1 if t cdisp
cdisp (t )
patterns associated with patients having low conscientiousness in 0 if t < cdisp
medication-taking, i.e., patients are somewhat willing to take mediation
but forget at random times, intentionally or unintentionally. Due to limits in dispensation frequency at Swedish pharmacies, (real)
In the seasonal model, motivation does decrease over time(ex- patients are not allowed to dispense all packages of medication at one
ponentially) but increases(suddenly) when the time for a follow-up time, needing to wait before receiving a new package. We model this
prescriptions arises, i.e., patients might feel pressure to fulfill their re- for our simulated patients with the dispensation threshold cdisp that
gimen before visiting their doctor. limits the dispensation frequency. The probability of pickup is strongly
coupled with available supply, that is to say, patients pick up medica-
(t mod cseasonality)·cdecline
tion if they nearly exhausted their current supply.
M (t ) = e cseasonality (6)
Simulated patients pick up 100 days of supply per dispensation,
where t is the current simulation iteration and cseasonality defining the constituting a standard supply amount for antihypertensive medication
time (in days) between sudden changes of motivation. in Sweden. We take into account the possibility that patients might
For the sigmoidal model, motivation can change somewhat have 100 days supply available at the beginning of the prescription
abruptly, either increasing, decreasing or having a short period of low without a registered dispensation. Fig. 2 illustrates simulated medica-
or high motivation. Patients might discontinue medication on their own tion-taking, refilling and PDC-patterns.
for some time with a re-uptake of regimen later on. We define a base
model: 2.6.5. Pattern generation & matching
t + coffset + cgap We run a Monte Carlo Simulation [15] for 800 days (approx. two
m (t , coffset , cgap) = 1/(1 + e cdecline ) (7) years) of medication-taking for each of the motivation models. We run
With coffset defining the time at which patient motivation changes. cgap each simulation with a variety of different parameters to ensure the
defines the time between lowering or increasing motivation levels. We diversity of PDC-patterns. The list of parameters for the simulation
then construct several motivational patterns: models is shown in Table 2. As a result of convergence analysis, see
Fig. 3, each model is run 101 times for each parameter combination and
m (t , coffset , 0), for n = 1 the median PDC-pattern computed. We match the simulated PDC-pat-
1 m (t , coffset , 0), for n = 0 terns and PDC-patterns found by cluster analysis in a step by step
M (t , n) =
[1 m (t , coffset , 0)]+ fashion: First, instead of matching individual simulation runs to cluster
m (t , coffset , cgap), for n = 1 centroids directly, we chose one simulation pattern that is closest to the
median of the 101 runs. Given the stochastic nature of each model,
For the exponential model motivation decreases exponentially over single runs may not accurately reflect expected model behavior for a set
time. Refills influence the rate of decrease, i.e., refills reset the rate of of parameters. To mitigate the impact of output variability, we chose a
motivation loss. The exponential model is defined recursively. representative pattern that is close to the median output of the model.
t cdisp We do this by finding the simulation pattern that minimizes the sum of
M (t ) = M (t 1)· e cdecline·citer (8) squared difference between individual PDC-pattern and the median
PDC-pattern:
where cdisp measures the number of days since the last pickup and citer
defines the total number of iteration, i.e., days (here 800). citer
5
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
Fig. 2. Medication-taking (Drug taken), refill (Dispensation) and PDC-patterns (Day covered) generated by the proposed simulation models. The models are capable
of generating a large variety of medication consumption and corresponding PDC-patterns solely based on current motivation levels and forgetting.
Table 2
Model parameters, their description and ranges for the simulation experiments.
Parameter Description Range
6
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
Table 3
Descriptive statistics of predictors between training, validation and test sets of different splitting strategies. Long predictor labels are shown for easier reading. Mean
and standard deviating shown.
Splitting strategy Random Patient
Predictors mean std mean std mean std mean std mean std mean std
Patient Age at prescription 67.1 12.3 67.1 12.4 66.9 12.4 67.1 12.3 66.9 12.4 67.3 12.6
Patient Gender∗ 0.514 0.5 0.523 0.5 0.512 0.5 0.508 0.5 0.534 0.499 0.529 0.499
Prescriber Age 41.7 11.0 41.8 11.0 41.8 11.1 41.6 11.0 42.2 11.3 41.8 10.9
Prescription quantity 1.06 0.329 1.06 0.307 1.06 0.31 1.06 0.309 1.06 0.361 1.06 0.343
Num. concurrent drugs 1.81 0.911 1.83 0.929 1.81 0.913 1.82 0.917 1.82 0.91 1.82 0.902
Num. prescriptions in year 1.02 1.48 1.02 1.48 0.994 1.48 1.02 1.49 0.995 1.45 1.01 1.47
Num. different drug classes 1.75 1.5 1.76 1.53 1.75 1.51 1.76 1.52 1.74 1.49 1.74 1.47
Years since index date of drug 1.72 0.917 1.73 0.927 1.71 0.912 1.73 0.921 1.72 0.92 1.7 0.9
Oversupply 48.9 108.0 50.8 113.0 50.4 116.0 50.1 109.0 46.1 110.0 49.3 112.0
Num. primary care visits 43.1 53.7 42.4 54.2 42.6 51.7 43.1 54.3 40.6 50.3 44.3 52.7
Num. hospital Visits 1.04 1.84 1.05 1.94 1.01 1.79 1.02 1.85 1.05 1.81 1.13 1.85
Num. emergency care Visits 1.58 2.72 1.59 2.78 1.58 2.7 1.58 2.82 1.52 2.41 1.68 2.57
Primary care visits since last pres. 4.67 7.71 4.47 7.71 4.66 8.12 4.58 7.74 4.6 7.45 4.94 8.22
Hospital visits since last pres. 0.0909 0.338 0.0874 0.318 0.0844 0.328 0.0877 0.331 0.0889 0.324 0.0982 0.353
Emergency visits since last pres. 0.128 0.488 0.123 0.43 0.122 0.435 0.126 0.481 0.125 0.444 0.131 0.456
PDC 0.786 0.41 0.786 0.41 0.786 0.41 0.785 0.411 0.782 0.413 0.791 0.407
Patient age at prescription 66.9 12.4 67.3 12.3 67.4 12.4 66.9 12.3 67.5 12.1 67.6 12.6
Patient Gender∗ 0.513 0.5 0.517 0.5 0.515 0.5 0.517 0.5 0.509 0.5 0.509 0.5
Prescriber Age 41.6 11.1 42.2 10.9 42.2 10.7 41.5 11.2 42.4 10.6 42.0 10.4
Prescription quantity 1.06 0.319 1.06 0.319 1.06 0.325 1.06 0.31 1.06 0.326 1.08 0.372
Num. concurrent drugs 1.83 0.936 1.86 0.9 1.82 0.882 1.81 0.905 1.85 0.914 1.83 0.952
Num. prescriptions in year 1.0 1.5 1.07 1.45 1.01 1.43 0.997 1.46 1.0 1.5 1.1 1.55
Num. different drug classes 1.54 1.53 2.3 1.31 2.32 1.33 1.57 1.44 2.08 1.49 2.29 1.65
Years since index date of drug 1.47 0.756 2.23 0.958 2.43 1.02 1.48 0.696 2.16 1.03 2.39 1.18
Oversupply 41.0 96.5 66.5 133.0 68.0 128.0 41.9 89.0 62.3 127.0 71.1 162.0
Num. primary care visits 38.1 50.1 49.2 56.8 52.7 59.8 38.8 48.5 50.9 59.8 53.8 65.3
Num. hospital Visits 0.973 1.75 1.1 1.99 1.15 2.05 1.0 1.77 1.08 1.95 1.16 2.07
Num. emergency care visits 1.46 2.53 1.7 3.0 1.82 3.26 1.49 2.5 1.73 2.95 1.88 3.39
Primary care visits since last pres. 4.02 7.35 6.2 8.83 5.92 8.9 4.5 7.57 5.59 8.39 4.34 7.99
Hospital visits since last pres. 0.0896 0.327 0.0942 0.351 0.0856 0.351 0.0963 0.343 0.0868 0.341 0.0599 0.275
Emergency visits since last pres. 0.117 0.462 0.147 0.484 0.155 0.504 0.126 0.474 0.142 0.483 0.113 0.451
PDC 0.757 0.429 0.826 0.379 0.819 0.385 0.769 0.422 0.819 0.385 0.83 0.376
∗
Ratio of male to female patients.
3. Results performance on the validation set and test set is comparable. Predictive
performance was highest on the temporal split scenario excluding pa-
3.1. Descriptive statistics tients with single prescriptions from the test set with the best model
(RF) having AUCs of approximately 0.90 and 0.91 using the baseline
The training, validation and test sets include 53302, 11410, 11410 predictors and baseline + history, respectively. In comparison, the
unique prescriptions of 59085 prescribing events4 from 18581 patients lowest performance is observed for the temporal split having the most
(50,6% Male, 49,4% Female), for all splitting strategies. The mean and recent prescription in the test set with AUCs of the best model (GB) of
standard deviation of predictors and outcome variable between sets are approximately 0.77 and 0.80 with baseline predictors and base-
similar for random and patients splits. Both temporal patient and line + history, respectively.
temporal recent splits show expected differences in means of predictors In general, including historical information by adding features of
expected to increase with time, such as the average number of visits, past prescriptions (including PDC) increases prediction performance of
age at prescription, years since first prescription or oversupply available future adherence.
at the start of a new prescription. Table 3 shows mean and standard
deviation of all included predictors and PDC for each splitting scenario. 3.2.1. Predictive performance on index prescriptions
For the random, patient and temporal recent split, we evaluate the
models’ performances on all patients in the test sets that only received
3.2. Refill adherence predictions
one prescription, i.e., new patients and patients that discontinued
treatment after the first prescription. Given that such patients are new
Table 4 shows the predictive performances of each optimized model
to treatment, we expect higher variability in medication refill rates at
for all splitting scenarios. In all scenarios, The Random Forest(RF) and
treatment initiation and, therefore, lower model performance on pre-
Gradient Boosting Trees(GB) algorithm perform the best on both vali-
dicting adherence. Table 5 shows the predictive performance for pa-
dation and test set using the baseline predictors and including history.
tients with a single prescription only. We observe a significant decrease
The models do not seem to overfit on the validation set and
in model performance compared to previous experiments, for all
models and data splits, indicating low informativeness of used pre-
4
Patients can have multi prescriptions per prescribing event, for example dictors, mostly due to missing previous adherence and oversupply in-
three prescriptions on one day constitute one prescribing event. formation.
7
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
Table 4
Predictive performance under investigated prediction scenarios. Bold figures indicate the highest model performance (AUC) per scenario. RF,Random Forest, LR,
Logistic Regression, GB, Gradient Boosting Trees, KNN, k-nearest neighbor.
Data Set Validation Test
Random AUC Precision Recall AUC Precision Recall AUC Precision Recall AUC Precision Recall
RF 0.788 0.833 0.952 0.801 0.840 0.947 0.796 0.838 0.954 0.803 0.846 0.947
LR 0.751 0.931 0.528 0.783 0.919 0.637 0.755 0.934 0.534 0.786 0.925 0.630
GB 0.790 0.828 0.966 0.806 0.835 0.959 0.793 0.826 0.966 0.807 0.835 0.956
KNN 0.747 0.799 0.972 0.717 0.799 0.978 0.747 0.805 0.977 0.723 0.802 0.978
Patient
RF 0.766 0.922 0.588 0.779 0.898 0.701 0.772 0.934 0.602 0.790 0.906 0.715
LR 0.747 0.927 0.509 0.773 0.913 0.620 0.754 0.935 0.528 0.784 0.926 0.625
GB 0.766 0.807 0.975 0.788 0.815 0.969 0.779 0.812 0.980 0.803 0.823 0.971
KNN 0.730 0.799 0.966 0.699 0.795 0.972 0.736 0.802 0.971 0.702 0.800 0.974
Temporal patient
RF 0.878 0.894 0.985 0.896 0.899 0.987 0.899 0.898 0.988 0.914 0.899 0.987
LR 0.770 0.919 0.711 0.818 0.908 0.858 0.769 0.916 0.706 0.817 0.906 0.842
GB 0.857 0.899 0.971 0.883 0.903 0.979 0.876 0.901 0.973 0.897 0.907 0.979
KNN 0.775 0.845 0.972 0.741 0.834 0.991 0.786 0.844 0.971 0.754 0.829 0.987
Temporal recent
RF 0.786 0.924 0.707 0.808 0.898 0.830 0.768 0.932 0.664 0.783 0.903 0.810
LR 0.755 0.914 0.685 0.800 0.919 0.750 0.757 0.920 0.650 0.788 0.924 0.713
GB 0.787 0.835 0.982 0.817 0.854 0.975 0.773 0.847 0.979 0.801 0.862 0.966
KNN 0.746 0.830 0.979 0.704 0.828 0.984 0.731 0.840 0.966 0.694 0.835 0.974
Table 5 Table 6
Predictive performance of trained models on patients with a single prescribing Recall of trained models on patients with a sudden drop off in PDC on their
event. The history column refers to the models trained using baseline predictors recent prescription.
of the five most recent prescriptions. Bold figures indicate the highest model
Predictors Baseline History
performance (AUC).
Predictors Baseline History Temporal patient
Temporal recent
3.3. Predictor influence on model output
RF 0.562 0.880 0.238 0.560 0.882 0.234
LR 0.630 0.903 0.194 0.590 0.888 0.186
We use the SHapley Additive exPlanations (SHAP)5 framework to
GB 0.581 0.767 0.967 0.586 0.764 0.976
KNN 0.550 0.763 0.905 0.548 0.761 0.890 generate a summary of local model explanations on the test set pre-
dictions to investigate feature importance [24,25]. In essence, shapely
values are computed by introducing each feature in the model step by
3.2.2. Sudden refill adherence drop-off step, attributing the change in prediction to the feature introduced and
For temporal patient and temporal recent splits, we investigate the averaging over all feature orderings [23]. The resulting explanations for
accuracy of the models on patients that refilled their recent prescription the overall highest performing models (GB) is shown in Fig. 4.
insufficiently given past high refill adherence. Predicting a sudden drop The number of available supply at the beginning of the prescription
in PDC would allow care providers to intervene early. had the most substantial influence on model output with higher values
Table 6 shows the predictive performance of the trained models on of available supply, increasing the likelihood of positive (adherent)
the test set for patients that had a sudden drop in PDC after exhibiting output. Other important features are past adherence to medication with
high refill adherence in previous prescriptions. All models tend to higher values increasing positive model output, while lower previous
predict continuing good PDC, resulting in low model performance with PDC does not necessarily result in negative model output shown by
the LR model showing the highest Recall if history is not included.
Adding history lifts Recall of all models.
5
Github: https://github.com/slundberg/shap.
8
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
Fig. 4. SHAP values of the test set for all splitting scenarios using the GB models. Positive values correspond to higher portability outputs of the model.
(t 1), (t 2) , as so forth indicate predictors from previous prescriptions.
significant overlap between positive and negative labels for low pre-
vious adherence.
9
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
Fig. 6. Best fit simulated medication-taking patterns for the top six PDC-patterns for each simulation model. The overall best fit is achieved using the sigmoidal model
of medication-taking, with early “discontinue/nonadherence” and “adherence/adherence” patterns being easily reproduced by most models.
with the proposed models. Given that pickup probability is directly tied does contribute to PDC directly and, secondly, patients that are regular
to available supply, a “late pickup/adherence” pattern can only be refillers (adherent patients), do tend to accumulate a significant supply
explained in two ways: Either the patient has medication available at as they are progressing in their treatment, due to extra supply dispensed
the beginning of the prescription, or low motivation to fill their first (100 days supply every 90 days).
prescription. An “Adherence/early discontinue” pattern without gaps in The highest performance on the test set is achieved in the temporal
the first year is reproduced my models exhibiting consistent medica- splitting strategy, where new patients are excluded. It has to be noted
tion-taking with a drop in motivation shortly after the last refill. In that predicting adherence for a self-selected group of patients, that are
constant, seasonal and exponential models cannot reproduce such already several years into their treatment, are expected to be somewhat
patterns given the simulated medication-taking behaviour. adherent to treatment. This selection bias is also shown when pre-
The sigmoidal model most faithfully reproduces the “Adherence/ dicting adherence using the “temporal recent strategy” seeing a drop in
single gap” pattern. After consistent medication-taking behaviour, we AUC to 0.78 when new patients are introduced that do not have a prior
see a motivation drop-off close the supply exhaustion after the first refill treatment history.
in the second year with eventual re-uptake of treatment. Predictive performance is modest for patients with a single pre-
scription for all splitting scenarios with AUCs between 0.56 and 0.65,
4. Discussion illustrating again that clinical predictors are only weakly correlated
with future medication adherence. Most recent work that investigated
Overall model performance for medication adherence prediction is the predictive performance of clinical predictors come to similar con-
surprisingly high, with AUCs between 0.78–0.91 on the test set for all clusions, showing that initial medication filling behaviour predicts refill
investigated splits. The available oversupply has the highest impact on adherence more accurately for patients with index prescription [13]. A
model decisions, with higher oversupply values correlating with in- similar low performance is observed for patients with a sudden drop-off
creased PDC. This fact is not surprising, given that, firstly, oversupply in PDC, where previous PDC has been high, with model Recall ranging
10
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
from under 1% (KNN) to 44%, not including history and 6–63% when behaviour. Predicting refill adherence on a prescription basis remains
adding historical information. The models are unable to perform well challenging when initial filling behaviour is not considered, pointing
on patients where previous information is missing or predict adherence towards the necessity to monitor patients adherence continuously and
for prescription with a sudden drop in PDC for otherwise adherent providing interventions just in time when patterns of nonadherence
patients. This model behaviour shows a clear bias towards patients that start manifesting.
remain in treatment with regular follow-ups, i.e., patients that remain Our simulations show that particular patterns of medication con-
refill adherent also continue to utilize healthcare resources, generating sumption are incompatible with observed PDC-patterns of patients with
a data bias towards adherence. This observation might partially explain index prescription. Provided there exists a link between refill and real
why adding historical information from past prescriptions, and ad- adherence, patterns of medication-taking might not be as arbitrary, and
herence improves prediction performance on the test set for all testing knowing how patients refill their medication can provide insights into
scenarios. The fact that patients had regular checkups in the past pro- possible medication-taking habits. Such habits could then be analyzed
vides increased confidence that they remain refill adherent, as evi- in terms of medication effectiveness, their effects on clinical outcomes
denced by past adherence being among the strongest predictors of fu- and finding appropriate interventions.
ture refill adherence.
Given these results and results of previous studies, it is unlikely that 6. Limitations
EHRs, in their current state, provide information relevant for refill
adherence that allows the development of high-performance models Our study has several limitations. We analyzed data from patients
that can predict refill adherence at the time when the prescription is that are located in a specific region in Sweden with healthcare and
issued. It might be more useful to switch the focus from prediction to pharmacy policies that might not reflect systems in other countries. We
monitoring, analyzing refill patterns and provide patients with ade- expect refill patterns to change accordingly.
quate interventions. We do not investigate the frequency and timeliness of dispensations,
All of the common PDC-patterns can be reproduced by the simula- but show the number of covered days given the dispensed supply and
tion models investigated, coupling mediation refilling with medication- prescribed daily dose. As such, simulations do not reflect dispensation
taking behaviour. Early discontinuation in the first year exhibits the patterns per se, but dispensations that would lead to similar PDC-pat-
most diverse of all consumption patterns. Any consumption model that terns. Furthermore, our simulation models assume patients either forget
does not exhaust the available supply can reproduce such patterns, to take their mediation or consume their daily dose as prescribed. We
pointing towards high variability in possible consumption patterns with did not simulate over-consumption of medication, i.e., patients might
significant differences in levels of drug exposure. Good or near-perfect exhaust their supply before the next official refill.
refill adherence, given our simulation models, is only possible if con- For the refill pattern analysis, we mainly focused on patients with
sumption patterns are consistent as well, i.e., to avoid significant gaps monotherapy. Although we include patients with combination drugs,
covered days, patients are not permitted to have significant gaps in patients treated for hypertension are often prescribed multiple medi-
medication-taking. cations and may exhibit different patterns not investigated in this study.
Other patterns are more difficult to model, restricting the set of We do not validate our simulation results with actual consumption
possible medication consumption patterns. In all cases, the sigmoidal patterns of the cohort under investigation. Given that such data is not
model does fit the patterns the best, reproducing the PDC-patterns available, we focused on simple patterns, but real consumption beha-
somewhat faithfully, indicating medication consumption that exhibits viours might be more complex.
periods of nonadherence instead of constant are regular forgetting of
medication. The exponential and constant models do not reproduce the Authors’ statement
most common PDC-patterns beyond early discontinuation. These
models seem to provide the least plausible consumption patterns for the All authors certify that they have seen and approved the final ver-
majority of cases. sion of the manuscript. The article is the authors’ original work, was not
Even though the real pattern of mediation use is not uniquely re- published prior or is under consideration for publication elsewhere. All
trievable from dispensation information, the realm of possibilities is authors have made a substantial contribution to this study. Galozy:
somewhat restricted given a link between refilling and actual mediation Study concept and design, data analysis, data interpretation and
use does exist when examining the phenomenon of adherence on po- drafting of the manuscript. Nowaczyk: Study concept and design, data
pulation level. interpretation and critical revision of the manuscript for important in-
tellectual content.
5. Conclusion
11
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
All prediction models underwent hyper-parameter grid-search on the validation set to find the optimal combination of parameters yielding the
best AUC. The parameters considered for optimization of each algorithm are shown in Table A.1. The optimal parameter combinations for each
algorithm and splitting strategy is shown in Table A.2. For all algorithm implementations, training and evaluation we used the scikit-learn package
implemented in the programming language python [28].
Table A.1
Grid-search hyper-parameters for each algorithm.
Algorithm Hyper-parameters
Table A.2
Optimal parameters for each algorithm, predictor set and splitting strategy.
Baseline History
Splitting strategy
Random n estim.: 500 C:0.8 n estim.: 100 n neighbors: 25 n estim.: 400 C:0.1 n estim.: 100 n neighbors: 25
max depth: None penalty: l2 max depth: 15 p: 1 max depth: None penalty: l1 max depth: 15 p: 1
criterion: entropy criterion: entropy
Patient n estim.: 500 C:0.1 n estim.: 300 n neighbors: 25 n estim.: 300 C:0.1 n estim.: 150 n neighbors: 25
max depth: 10 penalty: l2 max depth: 3 p: 1 max depth: 10 penalty: l1 max depth: 3 p: 3
criterion: entropy criterion: entropy
Temporal patient n estim.: 500 C:0.1 n estim.: 500 n neighbors: 20 n estim.: 500 C:0.1 n estim.: 250 n neighbors: 25
max depth: None penalty: l1 max depth: 15 p: 1 max depth: None penalty: l1 max depth: 15 p: 1
criterion: gini criterion: entropy
Temporal recent n estim.: 400 C:0.1 n estim.: 400 n neighbors: 25 n estim.: 500 C:0.1 n estim.: 200 n neighbors: 25
max depth: 12 penalty: l1 max depth: 3 p: 1 max depth: 10 penalty: l1 max depth: 3 p: 1
criterion: entropy criterion: entropy
References S.M. Czajkowski, J.G. Daniel, I. Ekman, M. Ho, M. Johnson, S.E. Kimmel, L.Z. Liu,
J. Musaus, W.H. Shrank, E.W. Buono, K. Weiss, C.B. Granger, Medication ad-
herence: a call for action, Am. Heart J. 162 (2011) 412–424, https://doi.org/10.
[1] T.M. Abegaz, Y.G. Tefera, B.A. T., Target organ damage and the long term effect of 1016/j.ahj.2011.06.007.
nonadherence to clinical practice guidelines in patients with hypertension: a ret- [7] M. Burnier, Is there a threshold for medication adherence? Lessons learnt from
rospective cohort study, Int. J. Hypertens. 2090-0384 (2017) 749, https://doi.org/ electronic monitoring of drug adherence, Front. Pharmacol. 9 (2019), https://doi.
10.1155/2017/2637051. org/10.3389/fphar.2018.01540.
[2] A. Galozy, S. Nowaczyk, A. Sant’Anna, M. Ohlsson, M. Lingman, Pitfalls of medi- [8] M. Burnier, B.M. Egan, Adherence in hypertension, Circ. Res. 124 (2019)
cation adherence approximation through ehr and pharmacy records: Definitions, 1124–1140, https://doi.org/10.1161/CIRCRESAHA.118.313220 https://www.
data and computation, Int. J. Med. Informatics 136 (2020) 104092, https://doi.org/ ahajournals.org/doi/abs/10.1161/CIRCRESAHA.118.313220.
10.1016/j.ijmedinf.2020.104092 http://www.sciencedirect.com/science/article/ [9] S.M. Cadarette, L. Wong, An introduction to health care administrative data, Can. J.
pii/S1386505619308974. Hospital Pharmacy 68 (2015) 232–237, https://doi.org/10.4212/cjhp.v68i3.1457.
[3] S.E. Andrade, K.H. Kahler, F. Frech, K.A. Chan, Methods for evaluation of medi- [10] R.L. Cutler, F. Fernandez-Llimos, M. Frommer, C. Benrimoj, V. Garcia-Cardenas,
cation adherence and persistence using automated databases, Pharmacoepidemiol. Economic impact of medication non-adherence by disease groups: a systematic
Drug Saf. 15 (2006) 565–574, https://doi.org/10.1002/pds.1230. review, BMJ Open 8 (2018) e016982, https://doi.org/10.1136/bmjopen-2017-
[4] A. Ashfaq, S. Lönn, H. Nilsson, J.A. Eriksson, J. Kwatra, Z.M. Yasin, J.E. Slutzman, 016982.
T. Wallenfeldt, Z. Obermeyer, P.D. Anderson, M. Lingman, Data resource profile: [11] Ester, M., peter Kriegel, H., Sander, J., Xu, X., 1996. A density-based algorithm for
regional healthcare information platform in Halland, Sweden, a dedicated en- discovering clusters in large spatial databases with noise, AAAI Press. pp. 226–231.
vironment for healthcare research, Int. J. Epidemiol. (2020), https://doi.org/10. doi:10.5555/3001460.3001507. URL https://dl.acm.org/doi/10.5555/3001460.
1093/ije/dyz262. 3001507.
[5] P.C. Baumgartner, R.B. Haynes, K.E. Hersberger, I. Arnet, A systematic review of [12] J.M. Franklin, A.A. Krumme, W.H. Shrank, O.S. Matlin, T.A. Brennan,
medication adherence thresholds dependent of clinical outcomes, Front. Pharmacol. N.K. Choudhry, Predicting adherence trajectory using initial patterns of medication
9 (2018), https://doi.org/10.3389/fphar.2018.01540. filling, Am. J. Manag. Care 21 (2015) e537–544.
[6] H.B. Bosworth, B.B. Granger, P. Mendys, R. Brindis, R. Burkholder, [13] J.M. Franklin, W.H. Shrank, J. Lii, A.K. Krumme, O.S. Matlin, T.A. Brennan,
12
A. Galozy and S. Nowaczyk Journal of Biomedical Informatics: X 6–7 (2020) 100075
N.K. Choudhry, Observing versus predicting: initial patterns of filling predict long- medications in patients with long-term initial medication fills using indicators of
term adherence more accurately than high-dimensional modeling techniques, clinical events and health behaviors, J. Managed Care Specialty Pharmacy 24
Health Serv. Res. 51 (2016) 220–239. (2018) 469–477, https://doi.org/10.18553/jmcp.2018.24.5.469 pMID: 29694288.
[14] R. Grymonpre, M. Cheang, M. Fraser, C. Metge, D.S. Sitar, Validity of a prescription https://doi.org/10.18553/jmcp.2018.24.5.469.
claims database to estimate medication adherence in older persons, Med. Care 44 [23] S.M. Lundberg, G. Erion, H. Chen, A. DeGrave, J.M. Prutkin, B. Nair, R. Katz,
(2006). J. Himmelfarb, N. Bansal, S.I. Lee, From local explanations to global understanding
[15] R.L. Harrison, Introduction to Monte Carlo simulation, AIP Conf. Proc. 1204 (2010) with explainable ai for trees, Nat. Mach. Intell. 2 (2020) 2522–5839.
17–21. [24] S.M. Lundberg, S.I. Lee, A unified approach to interpreting model predictions, in:
[16] F.D.R. Hobbs, Cardiovascular disease: different strategies for primary and sec- I. Guyon, U.V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan,
ondary prevention? Heart 90 (2004) 1217–1223. doi:10.1136/hrt.2003.027680, R. Garnett (Eds.), Advances in Neural Information Processing Systems 30, Curran
arXiv:https://heart.bmj.com/content/90/10/1217.full.pdf. https://heart.bmj.com/ Associates, Inc., 2017, pp. 4765–4774.
content/90/10/1217. [25] S.M. Lundberg, B. Nair, M.S. Vavilala, M. Horibe, M.J. Eisses, T. Adams, D.E. Liston,
[17] V. Kini, P.M. Ho, Interventions to improve medication adherence: a review, JAMA D.K.W. Low, S.F. Newman, J. Kim, et al., Explainable machine-learning predictions
320 (2018) 2461–2473, https://doi.org/10.1001/jama.2018.19271. for the prevention of hypoxaemia during surgery, Nat. Biomed. Eng. 2 (2018) 749.
[18] A.A. Krumme, J.M. Franklin, D.L. Isaman, O.S. Matlin, A.Y. Tong, C.M. Spettell, [26] J. MacQueen, Some methods for classification and analysis of multivariate ob-
T.A. Brennan, W.H. Shrank, N.K. Choudhry, Predicting 1-year statin adherence servations, in: Proceedings of the Fifth Berkeley Symposium on Mathematical
among prevalent users: a retrospective cohort study, J. Manag. Care Specialty Statistics and Probability, Statistics, vol. 1, University of California Press, Berkeley,
Pharmacy 23 (2017) 494–502, https://doi.org/10.18553/jmcp.2017.23.4.494. Calif, 1967, pp. 281–297. https://projecteuclid.org/euclid.bsmsp/1200512992.
[19] A.A. Krumme, G. Sanfélix-Gimeno, J.M. Franklin, D.L. Isaman, M. Mahesri, [27] J. Paparrizos, L. Gravano, k-shape: Efficient and accurate clustering of time series,
O.S. Matlin, W.H. Shrank, T.A. Brennan, G. Brill, N.K. Choudhry, Can purchasing ACM SIGMOD Record 45 (2016) 69–76, https://doi.org/10.1145/2949741.
information be used to predict adherence to cardiovascular medications? An ana- 2949758.
lysis of linked retail pharmacy and insurance claims data, BMJ Open 6 (2016) [28] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel,
e011015, https://doi.org/10.1136/bmjopen-2015-011015 http://bmjopen.bmj. M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos,
com/content/6/11/e011015.abstract. D. Cournapeau, M. Brucher, M. Perrot, E. Duchesnay, Scikit-learn: machine learning
[20] H. Kumamaru, M.P. Lee, N.K. Choudhry, Y.H. Dong, A.A. Krumme, N. Khan, in Python, J. Mach. Learn. Res. 12 (2011) 2825–2830.
G. Brill, S. Kohsaka, H. Miyata, S. Schneeweiss, J.J. Gagne, Using previous medi- [29] D. Prakash, Target organ damage in newly detected hypertensive patients, J. Family
cation adherence to predict future adherence, J. Managed Care Specialty Pharmacy Med. Prim. Care 8 (2019) 2042–2046.
24 (2018) 1146–1155, https://doi.org/10.18553/jmcp.2018.24.11.1146. [30] J.F. Steiner, P.M. Ho, B.L. Beaty, L.M. Dickinson, R. Hanratty, C. Zeng, H.M. Tavel,
[21] W.Y. Lam, Paula Fresco, Medication adherence measures: an overview, BioMed Res. E.P. Havranek, A.J. Davidson, D.J. Magid, R.O. Estacio, Sociodemographic and
Int. 1–12 (2015), https://doi.org/10.1155/2015/217047. clinical characteristics are not clinically useful predictors of refill adherence in
[22] J.C. Lauffenburger, J.M. Franklin, A.A. Krumme, W.H. Shrank, O.S. Matlin, patients with hypertension, Circ. Cardiovasc. Qual. Outcomes 2 (2009) 451–457,
C.M. Spettell, G. Brill, N.K. Choudhry, Predicting adherence to chronic disease https://doi.org/10.1161/CIRCOUTCOMES.108.841635.
13