Professional Documents
Culture Documents
155]
ORIGINAL ARTICLE
ABSTRACT
Background and Objective: Portable and readily accessible wellness devices can aid vital sign measuring for those interested
in tracking their health. In this diagnostic accuracy study, we evaluated the performance of the VitalWellness device (VW), a
wireless, compact, noninvasive device that measures four vital signs (VS) – blood pressure (BP), heart rate (HR), respiratory
rate (RR), and body temperature (BT) – using the index finger and forehead. Methods: Adult volunteers with VS that fell both
within and outside of the normal physiological range were enrolled to provide BP, HR, RR, and BT measurements using both
the VW and Food and Drug Administration-approved reference devices. A subgroup of participants underwent an additional
test to analyze the VW’s performance on HR and RR outside of normal physiological ranges. Statistical measurements were
plotted on scatter and Bland–Altman plots. Sensitivity analyses to evaluate the VW’s performance by gender, skin color, finger
size, and auxiliary activities were performed. Results: A total of 263 participants completed the study. On an average, systolic
BP measured using the VW was 10 mmHg lower than that of the reference device (correlation coefficient r = 0.7), whereas
diastolic BP was 3 mmHg lower (r = 0.6), and RR was 2 bpm lower (r = 0.7). VW HR and BT measurements were, on average,
1 bpm and 0.3°F higher than the corresponding reference measurements (r = 0.9 and r = 0.3), respectively. Conclusion: The
VW device is well-suited for home-based, nonmedical monitoring of HR, RR, and BP. Further improvement in measurement
accuracy is required to enable applications for medical use.
Keywords: Cuff-less blood pressure monitor, monitoring, vital signs, Wellness device
DOI:
How to cite this article: Polanco N, Odametey S, Derakhshani SN,
10.4103/digm.digm_22_19 Khachaturian M, Devoe C, Jethwani K, et al. Evaluating the accuracy of the
VitalWellness device. Digit Med 2019;5:109-18.
their consent during the study; we estimated the final Study protocol
sample size at 284. The study participants were instructed not to consume
food, alcohol, or caffeine in the 30-min period before the
Recruitment study visit.
Study volunteers were recruited from the public and clinical
practices within the Partners HealthCare System (PHS). Participants were enrolled in a quiet, temperature-
The study was advertised on three online portals: The PHS controlled room. To enable the assessment of variability
clinical trials website, Craigslist.com and the NIH clinical in measurements due to differences in finger size, a
trials website. Study investigators also sought patients from Mudder Ring Gauge was used to measure participants’
the Wellesley Family Care Associates clinic, the Massachusetts left index finger size. Skin color was assessed using
General Hospital Revere Health Center, and the Brigham and the NIS Skin Color Scale,[16] which required trained
Women’s Hospital Division of Renal Medicine. In addition, study staff to make a subjective judgment of skin
study staff sent out letters and made recruitment phone calls color and assign a score based on the scale. VS were
to study volunteers from previous studies at PHS who had measured first with the VW, then with the respective
consented to being contacted for future research. reference device. Different staff members obtained
measurements using the VW and the reference devices,
Eligibility criteria respectively, and each was blinded to measurements
Volunteers age ≥18 years with enough understanding taken by the other.
of the English language to understand and consent to
study procedures were eligible to participate. The study Study staff recorded any VW device failure as a “failed
measurement” while collecting VS measurements. This
participants with physical disabilities that precluded
included connection loss between device and cellphone,
safe and adequate testing, and those with implanted or
device freezing or turning off during testing, and errors in
body-worn electronic medical devices were excluded. For
participants’ finger position.
measurements using exercise or aerobic step testing, the
following study volunteers were excluded: VS outside of Blood pressure measurement
normal physiological range during initial measurements; Participants placed their left index finger inside of
age ≥65 years; self-report of food, alcohol or caffeine the VW finger cuff, held the device in their palm, and
consumption within 3 h of testing; pregnant; self- placed it on their chest at heart level for 3 min. Two
reported history of fall or fracture in the past year; those consecutive measurements were taken. Thereafter, two
with arthritis, dizziness, neurological illness, chest pain, reference BP measurements were obtained by two staff
myocardial infarction, heart disease, stroke, transient simultaneously. The ISO 2009 guidelines were followed
ischemic attack; self-report of being under active treatment using the aforementioned sphygmomanometer and
for anemia, electrolyte imbalance or hyperthyroidism; stethoscope.[15]
and mental impairment with limited ability to cooperate.
Participants with nail polish or artificial nail on the left Per VW user guidelines, if two consecutive VW
index finger were excluded from HR measurements. The measurements for a given participant differed by
following exclusions applied for the paced breathing test: ≥12 mmHg for SBP or by ≥8 mmHg for DBP,
VS outside of normal physiological range; history of the complete set of measurements (including the
seizures or epilepsy; panic attacks; history of respiratory reference device measurements) was discarded.
or heart disease; hyperthyroidism; and/or pregnancy. Otherwise, the first measurement was recorded as
the VW value for SBP and DBP, and the second measured in accordance with the procedures outlined
one was discarded. Similarly, if the two reference above.
measurements differed by ≥4 mmHg, the set of
measurements was discarded, and a fresh set was Statistical analysis
obtained after a 10-min wait. [15] We calculated correlation coefficients, mean, standard
deviation, mean difference, standard deviation of mean
Heart rate measurement difference, and standard error of the mean difference with
Using the VW and the Welch Allyn Connex Spot Monitor 95% confidence intervals for measurements obtained by the
one after the other, HR was administered three times on VW and respective reference devices. The measurements
the participant’s index finger for 1 min each. were plotted on a scatter plot and a Bland–Altman plot.
To obtain HR values >130 beats/min, eligible participants We conducted exploratory data analysis and identified
underwent either a cycle ergometer test or an aerobic step data points that might bias the estimated correlation
test. between the VW and reference devices. For BT
• Exercise physiologists used the Borg Scale to assess measurements, implausible data points (<96°F
the participant’s perceived exertion on a Scifit 100R or >104°F) were identified and excluded. For BP
Cycle Ergometer every 3 min. The resistance level of measurements, a complete set of measurements
the ergometer was increased by 10 watts every 3 min was excluded if the reference device measurements
until the participant reached a perceived exertion level were ≥12 mmHg for SBP or ≥8 mmHg for DBP.[15]
of “somewhat hard” Thereafter, data points having the greatest influence
• The Harvard step test protocol was used for aerobic on the correlation coefficient were identified using
step testing.[17] The reference device was placed on the Cook’s distance and excluded from the final set of
participant’s finger to enable real-time monitoring of measurements, using a cutoff value of 0.05.
HR. Participants were asked to step up and down in
synchrony with a metronome set at 96 beats/min. The For sensitivity analyses, mean difference with 95%
procedure lasted for 5 min, or until the participant’s confidence intervals was calculated for subgroups based
HR reached 140 beats/min. on gender, finger size, skin color, and the presence of
auxiliary activities. We compared the mean differences
Respiratory rate measurement
between subgroups using an independent sample t-test.
RR was measured using the Capnostream™ Monitor and
the VW simultaneously. The participant breathed into a RESULTS
nasal prong capnography sensor that captured exhaled
air. The EtCO2 waveform was then printed and read by a Study population
physician to confirm the RR value. The study enrolled 265 participants, with two withdrawing
before the study completion [Figure 2]. The total sample
To obtain RR values >30 breaths/min, eligible study
participants underwent a paced breathing test. They Screened
matched their breathing to a metronome set variably (n = 375) Excluded
-Ineligible (n = 1)
between 60 and 80 beats/min (that is, 30–40 breaths/min) -Decline (n = 6)
for up to 100 s. -Did not schedule appointment (n = 21)
-Canceled appointment (n = 73)
-No show (n = 9)
Temperature measurement Eligible
(n = 265)
To measure temperature, study staff placed the VW 0.5–1”
away from a participant’s forehead and held the power
button for 1 s. Staff then placed the temperature probe
Enrolled
in the posterior pocket under the participant’s tongue to (n = 265)
obtain BT measurements.
Withdrew Consent
(n = 2)
Artifact testing
To test the variability of VS measurements, a sub- Completed
(n = 265)
sample of participants (n = 15) were asked to cough
(n = 5), drink water (n = 5) or talk (n = 5) while VS were Figure 2: Recruitment and enrollment flow chart
(n = 263) were aged 18–97, with a median of 38 years old. Table 2: Study population characteristics (n=263)
Nearly two-thirds of participants were female, most were Study population characteristics Subject population
Caucasian, and two-thirds self-reported their educational (n=263)
status as “college or post college.” More than half of the Age distribution, median (range) 38 (18-97)
participants self-reported their occupation as “employed,” Gender, n (%)
Female 164 (62)
followed by “student” (14%) and “unemployed” (14%). Race, n (%)
About three-fourths of the study participants were White 163 (63)
subjectively assigned an NIS skin color score of 1–3 by Black or African American 46 (18)
study staff, 20% of them were given skin color scores of Asian 26 (10)
Other or >1 race 23 (9)
4–6, and <4% were determined to have a skin color score
Unknown (excluded in calculations) 5
>6 [Table 2]. Max level of education, n (%)
Less than high school 8 (3)
Vital signs High school diploma or GED 29 (11)
Some college/vocational program/ 51 (19)
Statistical measurements were plotted on scatter and technology program
Bland–Altman plots to show correlation between VW and College graduate or professional degree 175 (67)
reference device measurements [Figures 3 and 4]. Table 3 Employment statusa, n (%)
and Figure 5 show which VS measurements were included Employed 155 (59)
Unemployed 38 (14)
in the final analysis. Table 4 shows the mean values and Homemaker 15 (6)
differences for VS measured by reference device and VW. Student 38 (14)
The sensitivity analysis did not suggest any differences in the Retired 28 (11)
performance of the VW by gender, skin color, or auxiliary Disabled 13 (5)
Ring sizeb, n (%)
activities [Tables 5 and 6]. However, the mean difference of
First tertile 79 (30)
the RR – but not the other VS – between the reference and Second tertile 88 (33)
VW was lower for larger finger sizes [Table 6]. Third tertile 96 (37)
NIS skin color scale, n (%)
1 54 (21)
DISCUSSION 2 105 (41)
3 40 (15)
Principal findings 4 28 (11)
We compared measurements for four VS from the VW 5 14 (5)
in comparison with the FDA-approved reference devices 6 10 (4)
and conclude that the measurements correlate well for 7 5 (2)
8 3 (1)
HR, RR, and BP. 9 0
10 0
HR obtained from the VW are similar in accuracy to those Unknown (excluded in calculations) 4
reported for commercially-available wrist-worn devices a
Study participants were allowed to self-report more than one employment.
Therefore, the totals in this section of the table are greater than the sample
and superior to those reported for smartphone-based size, and the percentages sum to greater than 100%, bSubgroups are
applications that use optical input from the phone’s adjusted for gender. The first tertile includes ring size <7.5 for women, and
<9.5 for men. The second tertile includes ring size 7.5-9 for women, and
camera to determine HR.[18] The accuracy of the VW for 9.5-11.5 for men. The third tertile includes ring size ≥9 for women, and
SBP and DBP is similar to that of smartphone-based ≥11.5 for men. NIS: New immigrant survey
SBP: Systolic BP, DBP: Diastolic BP, VS: Vital sign, HR: Heart rate,
reports on wellness devices that measure RR. RR: Respiratory rate
The estimated correlation of the VW for BT was the mean difference in temperature measurements
lower than that observed for other VS. However, is quite low compared to the reference device and
Figure 3: Correlation between vital sign measured using the reference device and that measured with VitalWellness. r = Pearson’s correlation
coefficient
Table 4: Mean values and difference for vital signs measured by reference device and VitalWellness
VS Reference, VW, mean Mean differencea SD of SEM Correlation
mean (SD) (SD) (95% CI) difference difference coefficient (95% CI)
SBP (n=96) 111 (15) 101 (13) 10 (8-12) 11 1 0.7 (0.6-0.8)
DBP (n=93) 69 (11) 66 (8) 3 (1-5) 9 0.9 0.6 (0.4-0.7)
HR (n=448) 73 (13) 74 (13) −1 (−2-−0.6) 6 0.3 0.9 (0.88-0.92)
Temperature (n=144) 98 (0.3) 99 (0.6) −0.3 (−0.4-−0.2) 0.6 0.1 0.3 (0.1-0.4)
RR (n=97) 21 (8) 19 (4) 2 (0.3-3) 6 0.6 0.7 (0.6-0.8)
a
Difference has been calculated as reference minus VW. VW: VitalWellness, VS: Vital sign, CI: Confidence interval, SBP: Systolic blood pressure, DBP: Diastolic
blood pressur, HR: Heart rate, RR: Respiratory rate, SD: Standard deviation, SEM: Standard error of the mean
Table 5: Differences in vital sign measurements between the reference device and VitalWellness in the
study participant subgroups
VS Gender Skin color (NIS Scale)
Male Female 1-2 3-4 ≥5
n Mean difference n Mean difference n Mean difference n Mean difference n Mean difference
(95% CI) (95% CI) (95% CI) (95% CI) (95% CI)
SBP 33 9 (6-13) 63 10 (7-13) 56 11 (8-14) 28 8 (3-12) 11 10 (1-19)
DBP 31 3 (−0.6-6) 62 3 (1-5) 52 3 (0.9-6) 29 3 (−0.1-6) 11 4 (−3-11)
HR 156 −0.7 (−2-0.1) 292 −1 (−2-−0.7) 267 −0.9 (−2-−0.3) 120 −1 (−3-−0.2) 55 −1 (−3-−0.1)
Temperature 49 −0.4 (−0.5-−0.2) 95 −0.3 (−0.4-−0.1) 83 −0.3 (−0.4-−0.1) 39 −0.5 (−0.7-−0.3) 20 −0.1 (−0.4-0.2)
RR 48 2 (−0.2-3) 49 2 (−0.2-3) 65 2 (0.3-3) 18 0.4 (−2-3) 10 2 (−4-9)
CI: Confidence interval, SBP: Systolic blood pressure, DBP: Diastolic blood pressure, VS: Vital sign, HR: Heart rate, RR: Respiratory rate, NIS: New
immigrant survey
suggests that the correlation coefficient may have measurement devices is higher than that of the VW.[27]
been impacted by low variability within obtained However, the mean difference in temperature for
temperature measurements.[26] The reported accuracy the VW is substantially lower than that reported for
of commercially-available wireless dermal temperature temporal artery thermometers.[28]
Figure 4: Bland-Altman plots of the difference in vital sign measurements between the reference device and VitalWellness. Solid lines indicate
mean difference and dotted lines indicate limits of agreement
Table 6: Differences in vital sign measurements between the reference device and the VitalWellness in
the study participant subgroup
VS Finger size Presence of artifactsa
First tertile Second tertile Third tertile
n Mean difference n Mean difference n Mean difference n Mean difference
(95% CI) (95% CI) (95% CI) (95% CI)
SBP 27 9 (5-14) 36 8 (4-12) 33 12 (9-16) 14 5 (−5-15)
DBP 24 4 (0.8-7) 35 1 (−2-4) 34 5 (2-8) 13 5 (−2-12)
HR 168 −1 (−2-−0.2) 131 −0.5 (−1-0.4) 149 −2 (−3-−0.8) 14 1 (−3-5)
Temperature 48 −0.3 (−0.5-−0.2) 44 −0.4 (−0.6-−0.2) 52 −0.2 (−0.4-0.01) 14 0.04 (−1-1)
RR 28 4 (2-6) 40 2 (−0.6-4) 29 −0.6** (−3-1) 13 2 (−0.2-4)
**P<0.01. aIncluding coughing, drinking water, or talking. CI: Confidence interval, SBP: Systolic blood pressure, DBP: Diastolic blood pressure, VS: Vital
sign, HR: Heart rate, RR: Respiratory rate
Previous clinic-based studies have found accuracy wrist-worn sensors can vary based on skin tone.[18]
(difference of >5 mmHg) of SBP measured using The accuracy of the VW for all VS did not seem to
home BP devices to differ systematically by age, be influenced by gender, skin tone, finger size, or
gender, and size of cuff used.[19] Similarly, HR with auxiliary activities.
guidelines recommended by the FDA for evaluation of 2. Centers for Disease Control and Prevention. Chronic Disease
Prevention and Health Promotion. Available from: http://www.
medical devices. cdc.gov/fmo/topic/budget%20information/factsheets/CHRONIC_
Factsheet.pdf. [Last accessed on 2019 Sep 23].
CONCLUSION 3. Kutz D, Shankar K, Connelly K. Making sense of mobile – And web-
based wellness information technology: Cross-generational study.
Current methods of measuring VS require multiple J Med Internet Res 2013;15:e83.
devices and clinical staff to record these measurements. In 4. Paré G, Jaana M, Sicotte C. Systematic review of home telemonitoring
for chronic diseases: The evidence base. J Am Med Inform Assoc
contrast, the VW allows for fast and accurate measurement 2007;14:269-77.
of key VS with minimal app-guided training. It can serve 5. Yilmaz T, Foster R, Hao Y. Detecting vital signs with wearable
as a convenient tool for users who monitor and track their wireless sensors. Sensors (Basel) 2010;10:10837-62.
VS measurements remotely. 6. Kellett J, Sebat F. Make vital signs great again – A call for action. Eur
J Intern Med 2017;45:13-9.
7. Appelboom G, Camacho E, Abraham ME, Bruce SS, Dumont EL,
The VW device is well-suited for home-based, nonmedical Zacharia BE, et al. Smart wearable body sensors for patient self-
monitoring of HR, RR, and BP. Further studies with assessment and monitoring. Arch Public Health 2014;72:28.
more data variability are needed to conclusively assess 8. Weenk M, van Goor H, Frietman B, Engelen LJ, van Laarhoven CJ,
the accuracy of BT readings from the device, as well as Smit J, et al. continuous monitoring of vital signs using wearable
devices on the general ward: Pilot study. JMIR Mhealth Uhealth
to evaluate usability and acceptability of the device by 2017;5:e91.
end-users. 9. Ludikhuize J, Smorenburg SM, de Rooij SE, de Jonge E. Identification
of deteriorating patients on general wards; measurement of vital
parameters and potential effectiveness of the modified early warning
Acknowledgments
score. J Crit Care 2012;27:424.e7-13.
We would like to thank the following staff for helping 10. Fanucci L, Saponara S, Bacchillone T, Donati M, Barba P, Sanchez-
recruit and enroll participants into the study: Stephen Tato I, et al. Sensing devices and sensor signal processing for remote
Agboola, M. D, M. P. H, Mursal Atif, Amanda J. Centi, monitoring of vital signs in CHF patients. EEE Trans Instrum Meas
2013,62:553-69.
P. h. D, Odeta Dyrmishi, Felicia Estrada, Harriett
11. Dias D, Paulo Silva Cunha J. Wearable health devices-vital sign
Gabbidon, Margaret Gorini, R. N, Saafia Masoom, Jessica monitoring, systems and technologies. Sensors (Basel) 2018;18:2414.
Odotei, R. N, Ramya Palacholla, M. D, M. P. H. Emily 12. Kaur A, Student ME. A review of remote patient monitoring system:
de Redon and Nina Schussler. The following staff Potentials, challenges and current issues. International Journal of
Trend in Research and Development 2016;3:2394-933.352.
provided financial and material support by tracking
13. George J, MacDonald T. Home blood pressure monitoring. Eur
and requesting sponsor payments: Colin Gorman. The Cardiol 2015;10:95-101. [Doi: 10.15420/ecr.2015.10.2.95].
following staff provided support during the editing phase 14. International Organization for Standardization. Medical Electrical
of the manuscript draft: Julie R. Brown. The following Equipment-Particular Requirements for Basic Safety and Essential
staff provided their clinics and patients for enrollment in Performance of Clinical Thermometers for body Temperature
Measurement. (ISO 80601-2-56). International Organization for
the study: Mallika L. Mendu, M. D, M. B. A, Jeffrey W. Standardization; 2017.
Phillips, M. D, Sushrut S. Waikar, M. D and Amy Wheeler, 15. International Organization for Standardization. Non-Invasive
M. D. The following department and staff provided the Sphygmomanometers – Part 2: Clinical Investigation of Automated
Measurement Type (ISO 81060-2:2013). Geneva: International
equipment and their expertise necessary to administer Organization for Standardization; 2013.
the exercise test for the study: The MGH Cardiovascular 16. Massey DS, Martin JA. 2003. The NIS Skin Color Scale. Switzerland:
Disease Prevention Center. International Organization for Standardization; 2013.
17. Harvey S. The No Sweat Exercise Plan: Lose Weight, Get Health, and
Financial support and sponsorship Live Longer (Harvard Medical School Guides). Blacklick: McGraw-
Hill; 2006.
Vital USA financially supported this research study.
18. Chandrasekhar A, Kim CS, Naji M, Natarajan K, Hahn JO,
Mukkamala R. Smartphone-based blood pressure monitoring via the
Conflicts of interest oscillometric fi nger-pressing method. Sci Transl Med 2018;10:431.
The only conflict of interest to declare within the 19. Ruzicka M, Akbari A, Bruketa E, Kayibanda JF, Baril C,
Hiremath S. How accurate are home blood pressure devices in use?
manuscript is that the author, Mark Khachaturian, Ph.D., A cross-sectional study. PLoS One 2016;11:e0155677.
is employed by Vital USA, the company who financially 20. Hodgkinson JA, Sheppard JP, Heneghan C, Martin U, Mant J,
sponsored the study. Roberts N, et al. Accuracy of ambulatory blood pressure monitors:
A systematic review of validation studies. J Hypertens 2013;31:239-50.
21. Shcherbina A, Mattsson CM, Waggott D, Salisbury H, Christle JW,
REFERENCES Hastie T, et al. Accuracy in wrist-worn, sensor-based measurements
1. Report Tracking. Washington, D.C: Pew Research Center; 2013. of heart rate and energy expenditure in a diverse cohort. J Pers Med
Available from: https://www.pewinternet.org/2013/01/28/tracking- 2017; 7:7-9.
for-health/. [Last accessed on 2019 Sep 23]. 22. Subbe CP, Kinsella S. Continuous monitoring of respiratory rate in