Professional Documents
Culture Documents
Supplement to: Jacobs DR Jr, Woo JG, Sinaiko AR, et al. Childhood cardiovascular risk factors and adult cardio-
vascular events. N Engl J Med 2022;386:1877-88. DOI: 10.1056/NEJMoa2109191
This appendix has been provided by the authors to give readers additional information about the work.
This supplement contains the following items:
Contents
Original SAP (from the grant proposal)................................................................................................... 2
Final SAP ..................................................................................................................................................... 6
Summary of Amendments to the SAP .................................................................................................. 10
References ................................................................................................................................................ 13
Original SAP (from the grant proposal)
1. Introduction/Aims
This study is an observational follow-up of 7 epidemiologic cohorts begun in children in the
1970s to 1990s in the US, Finland and Australia, with the goal of identifying the relationships
between measured childhood cardiovascular risk factors and adjudicated adult cardiovascular
disease. This SAP is the verbatim wording from the grant application (except for reordering for
clarity), written prior to a complete understanding of the data structure.
Aim (Childhood Analyses): To evaluate relationships between childhood CV risk factors and
adult CV endpoints.
Hypothesis 1: Adverse childhood/adolescent (age 3‐19) CV risk factor levels (BMI, lipids
[total cholesterol, LDL cholesterol, HDL cholesterol, and triglycerides], BP) are related to
increased incidence of CV endpoints in adulthood.
Hypothesis 2: A CV risk score weighting childhood/adolescent risk factors (BMI, lipids,
BP, age, sex and race/ethnicity) is a stronger predictor of adult CV endpoints than any
individual risk factor.
Hypothesis 3: The relationship of individual risk factors or CV risk score with adult CV
endpoints becomes stronger with increasing age from childhood (age 3‐11) to
adolescence (age 11‐19).
Aim (Adulthood and Childhood Analyses): To evaluate the association of CV risk score
trajectories on adult CV endpoints.
Hypothesis 4: Trajectories in the CV risk score from childhood (age 3–11) through
adolescence (age 11–19) (i.e., high-high, high-low, low-high, low-low) are better
predictors of adult CV endpoints than risk scores in either childhood (age 3–11) or
adolescence (age 11–19) alone.
Hypothesis 5: Extension of trajectories in the CV risk score from childhood/adolescence
(age 3–19) through adulthood (age > 25) increases prediction of adult CV endpoints
beyond the childhood/ adolescence (age 3–19) risk scores or trajectories alone.
2. Population to be Analyzed
The analytic sample will consist of all participants who were located and consented for this
questionnaire-based follow-up, plus decedents identified by National Death Index (NDI) search.
3. Baseline Characteristics
We will initially study age, sex, and cohort distributions of all predictor variables, as well as
observed adjudicated event rates by cohort.
5. Outcome Assessment
The major outcome is presence/absence of fatal or non‐fatal CV events, with all event types
considered as a composite variable. Outcomes to be adjudicated include fatal or non-fatal
coronary heart disease, myocardial infarction, peripheral artery disease, stroke, aneurysm,
angina pectoris, transient ischemic attack, and heart failure. We recognize the latter three
may be more subjective and difficult to diagnose and, therefore all events will require hospital
record verification. Heart failure due to non-CV disease (e.g., cardiomyopathy, cocaine use)
will be excluded. Hypertension and type 2 diabetes will not be considered as outcomes.
The hospital records required to adjudicate the event will depend on the endpoint reported,
following guidelines similar to the CARDIA Study’s protocol. Data requested may include
discharge summary, ICD codes and specific CV event related documents such as ECG,
cardiac enzymes, echocardiogram, catheterization and surgical reports, as appropriate.
Medical records will be abstracted at each site and entered into a de-identified web portal for
centralized adjudication. Adjudicators will remain blinded with regard to the case’s cohort of
origin during the review process. Because identification of cases from Finland and Australia
will partially or fully rely on health care registries, adjudication algorithms using registry data
will be developed and tested for consistency with the medical records-based adjudication.
6. Statistical Methods
Prediction of events will employ Cox‐proportional hazards (CoxPH) regression using time to
event or censoring (i.e., date of last CV disease-free contact or death). For hypothesis 1, in
primary analyses each of the variables in the predictor set will be analyzed as a continuous
variable both separately and jointly. The sample at risk varies according to predictor variable
(Table 4). Age, sex, and race/ethnicity will be included as covariates. Goodness of fit will be
assessed by visually examining splines and/or representing the predictor variable in categories,
such as quartiles.
For hypothesis 2, we will use information about shape of the prediction curve from hypothesis 1
to construct and test risk scores that combine BMI, blood lipid, and BP z‐scores, as well as age,
sex, and race/ethnicity. Given available sample numbers (Table 4), we will represent blood
lipids with total cholesterol and triglycerides in one risk score (RS1) and with LDL‐C, HDL‐C,
and triglycerides in a second (RS2). First, we will run a CoxPH regression with all risk-factor z-
scores entered simultaneously to calculate a risk score from the regression function. Second,
we will calculate a weighted average of the risk factor z-scores (defined so a higher score
indicates higher risk). We will construct a receiver operating characteristic (ROC) curve by
mapping observed event counts against predicted ones. The prediction results will be compared
by the area under the curve (AUC) statistics. AUC for prediction from these risk scores will be
compared to prediction obtained from demographics or demographics plus individual risk factors
alone. We will use the Net Reclassification Improvement 3 statistic to examine extent of
improvement of clinical classification compared to more basic models of demographics or single
risk factors. Depending on the sample size in a given analysis, we will alternately plan to
conduct a split‐sample cross‐validation of the risk scores. Our expectation is that the risk scores
have substantially greater predictive capability than does any single CV risk factor.
For hypothesis 3 we will evaluate whether and how risk prediction varies by age of childhood
measurement. We expect strength of prediction to be greater in adolescence than in childhood,
so we will test age interaction both with a continuous risk factor*age term in the regression
models and in age strata (age 3‐11, age 11‐19 years). Expected numbers of participants with at
least 1 observation within the two age windows are given in Table 4 (participants seen only
once would enter only 1 age window). For each outcome variable, a single, repeated measures
Poisson regression will describe the prediction in two age strata. For a given risk factor or risk
score (R) and an indicator for age 3‐11 or 11‐19 (IA), and using the youngest measurement in
each age window, this will have terms Age, R, and IA*R. The coefficient for IA*R describes and
provides a test for the difference in prediction for measurements in the two age windows, in a
model that pools all available predictor information from each participant. If there is no age
interaction, the regression coefficients from individual risk factors and from the combined risk
score will indicate which childhood/adolescent risk factor levels predict future CV disease
regardless of age at first CV risk factor measurement.
The studies included in this proposal were designed with differing sets of measurements and
ages of follow-up. Thus, missing predictor values are not always related to differential follow‐up,
but may represent noninformative missing where missing data can be imputed without
introducing bias. We will use correlations across time within an individual, across risk factors, or
among demographically similar cohorts. The imputed data are not observed so standard errors
of resulting estimates will be adjusted to incorporate the increased uncertainty. 4 Finally, we will
conduct a sensitivity analysis to confirm that no bias is introduced via imputation.
7. Sensitivity Analyses
By cohort: The event rate of the major outcome or the distribution of the predictors (childhood
risk factors) may differ across the cohorts, so cohort-specific baseline hazards will be estimated
to account for these discrepancies. For every regression analysis, we will conduct a random-
effects meta-analysis1, 2 to address heterogeneity of statistical effects across the cohorts, with a
forest plot drawn to help visualize and evaluate differences between cohort-specific estimates
and the pooled estimate of the relative hazard for each predictor.
Fatal vs. Fatal/nonfatal events: Fatal CV events will be known in almost all of the 41,006
participants with any childhood measurements, including non‐responders to our health
questionnaire. Expected numbers of fatal CV events will be inadequate for detecting expected
relations to the individual risk factors or risk score. However, these data are still useful for
studying potential bias in the sample recruited. If the responders are not a biased sample, there
will be general consistency of findings using CV death as the outcome in the whole sample with
the findings using fatal or non‐fatal CV events in the responders. We will also explore total
mortality in sensitivity analyses to examine whether there are any unintended adverse
associations.
Bias from loss to follow-up: Potential loss‐related bias will be assessed by testing for differences
in childhood CV risk factor levels between participants and non‐participants. Analytically, we will
compute the probability (propensity) of the adult follow‐up as a function of demographics and
childhood risk factors and will weight each subject case with the inverse of the computed
probability in the proposed analyses.
Selection of childhood measurements: For hypothesis 3, for participants with multiple measures
within an age range, we will conduct sensitivity analyses where the oldest measurement within
an age stratum is used, or when the average of all measurements within an age stratum is
used. These sensitivity analyses supplement and describe the goodness of fit of the model with
a continuous age interaction term.
Secondary Analyses for the Aim: Other variables available in smaller subsets will be examined
in exploratory analyses, including adolescent smoking, blood glucose and insulin, diet, physical
activity, and socioeconomic status (SES). These factors are important for understanding
potential leverage points for changing CV risk factors in childhood, but have insufficient power to
include in primary analyses.
Bias in adjudication: Adjudication of all true CV endpoints will depend on two main factors: 1)
likelihood that the event was diagnosed or treated in a medical facility, and 2) likelihood that
medical records are available and complete. While events requiring hospital admission likely will
be captured for all participants, less severe conditions not requiring hospitalization might remain
unidentified. Reduced access to medical care may result in some misclassification of true
events as non-events; however, past studies have shown that false negative reporting of
diagnosed CV events is uncommon. Discrepancies in underreporting across cohorts will be
addressed by cohort-specific baseline hazards.
Final SAP
1. Introduction/Aims
This study is an observational follow-up of 7 epidemiologic cohorts begun in children in the
1970s to 1990s in the US, Finland and Australia, with the goal of identifying the relationships
between measured childhood cardiovascular risk factors and adjudicated adult cardiovascular
disease.
Aim (Childhood Analyses): To evaluate relationships between childhood CV risk factors and
adult CV endpoints.
Hypothesis 1: Adverse childhood/adolescent (age 3‐19) CV risk factor levels (BMI, total
cholesterol, triglycerides, systolic BP and youth smoking) are related to increased
incidence of CV endpoints in adulthood.
Hypothesis 2: A CV risk score equally weighting childhood/adolescent risk factor age-
sex standardized z-scores (BMI, total cholesterol, triglycerides, systolic BP and youth
smoking) is a stronger predictor of adult CV endpoints than any individual risk factor.
Hypothesis 3: The relationship of individual risk factors or CV risk score with adult CV
endpoints becomes stronger with increasing age from childhood (age 3‐11) to
adolescence (age 12‐19).
Aim (Adulthood and Childhood Analyses): To evaluate the association of CV risk score
trajectories on adult CV endpoints.
Hypothesis 4 was deleted because only about 50% of participants had repeated
measures within age 3-19.
Hypothesis 5: Changes in the CV risk score from childhood/adolescence (age 3–19)
through adulthood (age > 20) increases prediction of adult CV endpoints beyond the
childhood/ adolescence (age 3–19) risk scores alone.
2. Sample to be Analyzed
The analytic sample will consist of all participants with sufficient information available for follow-
up or searching using the National Death Index (NDI) and with at least one risk factor (BMI, total
cholesterol, triglycerides, systolic BP and youth smoking) measured prior to age 20 (n=38,589).
Participants who had an event, died or were followed up prior to age 25 were excluded from
analysis based on expected rarity of CVD before that age, a decision at the initiation of the grant
period. Inclusion in the analytic sample consisted of three potential groups: those who were
located and consented for this questionnaire-based follow-up; decedents identified by NDI
search; and those not located in the NDI searches and thus presumed alive.
The analytic sample for the analysis of Hypothesis 5, incorporating adult measured risk factors,
will include the subsample of childhood participants who had any measured adult risk factor
(BMI, total cholesterol, ln(triglycerides), or systolic BP) at or after age 20 but prior to any known
cardiovascular event (n=13,401).
In each of these analysis streams, bias will be removed and the sample size will be maintained
by multiple imputation, under the plausible assumption of Missing At Random (MAR).
3. Baseline Characteristics
We will initially study age, sex, and cohort distributions of all predictor variables, as well as
observed adjudicated event rates by cohort. We will also compare distributions of these factors
as well as race, parental education and own education across outcome classifications.
Section 7: Sensitivity
Analyses
Modification 1 Exclusion of meta-analysis of cohorts
Rationale We did not do the formal meta-analysis, because two features of the
data became obvious after the data were collected. First, across all 7
cohorts there were almost no CVD events before age 40. Second,
only 4 cohorts for the fatal/nonfatal analysis and 2 cohorts for the
fatal analysis had sufficient events to contribute to the differential
risk observed at ages above 40. Therefore, the added variance due to
between cohort heterogeneity could not be estimated reliably, and
we opted instead to present findings stratified by cohort.
Modification 2 Fatal CV events included as primary outcome
Rationale As indicated above, this analysis was feasible as a primary analysis
due to higher-than-expected mortality due to cardiovascular causes
and strength of the risk factor associations.
Modification 3 Exclusion of consideration of some childhood risk factors
Rationale We did not conduct analyses including glucose, insulin, diet or
physical activity in childhood. Blood glucose and insulin were studied
in relation to diabetes6, but sample sizes were too small for the main
Outcomes analysis. We continue to work on harmonizing diet and
physical activity variables, but these are not currently available.
Modification 4 Addition of sensitivity analyses: interactions by sex and race
Rationale To complement analyses of age group interactions, we were
concerned about differences in relationships by sex and race.
References
1. DerSimonian, R and Laird, N, Meta-analysis in clinical trials. Control Clin Trials, 1986. 7(3): p.
177-88.
2. Thompson, S, Kaptoge, S, White, I, Wood, A, Perry, P, Danesh, J, and Emerging Risk
Factors, C, Statistical methods for the time-to-event analysis of individual participant data from
multiple epidemiological studies. Int J Epidemiol, 2010. 39(5): p. 1345-59.
3. Pencina, MJ, D'Agostino, RB, Sr., and Steyerberg, EW, Extensions of net reclassification
improvement calculations to measure usefulness of new biomarkers. Stat Med, 2011. 30(1): p.
11-21.
4. Little, RJA and Rubin, DB, Statistical analysis with missing data. 2002, New York: John Wiley
and Sons.
5. Hu T, Gall SL, Widome R, Bazzano LA, Burns TL, Daniels SR, Dwyer T, Ikonen J, Juonala M,
Kähönen M, Prineas RJ, Raitakari O, Sinaiko AR, Steinberger J, Urbina EM, Venn A, Viikari J,
Woo JG, Jacobs DR Jr. Childhood/Adolescent Smoking and Adult Smoking and Cessation: The
International Childhood Cardiovascular Cohort (i3C) Consortium. J Am Heart Assoc. 2020 Apr
7;9(7):e014381.
6. Hu T, Jacobs DR Jr, Sinaiko AR, Bazzano LA, Burns TL, Daniels SR, Dwyer T, Hutri-
Kähönen N, Juonala M, Murdy KA, Prineas RJ, Raitakari OT, Urbina EM, Venn A, Woo JG,
Steinberger J. Childhood BMI and Fasting Glucose and Insulin Predict Adult Type 2 Diabetes:
The International Childhood Cardiovascular Cohort (i3C) Consortium. Diabetes Care. 2020
Nov;43(11):2821-2829.
7. De Stavola BL, Nitsch D, dos Santos Silva I, et al. Statistical issues in life course
epidemiology. Am J Epidemiol 2006;163(1):84-96. DOI: 10.1093/aje/kwj003.