Professional Documents
Culture Documents
12
Copyright © 2001 by the Johns Hopkins University Bloomberg School of Public Health Printed in U.S.A.
All rights reserved
Edmond K. Kabagambe,1 Ana Baylin,1 Damon A. Allan,1 Xinia Siles,2 Donna Spiegelman,3,4 and Hannia
Campos1,2
biological markers; diet; epidemiologic methods; Hispanic Americans; methods; questionnaires; recall;
reproducibility of results
High intake of certain nutrients and total energy is a rec- (11–13), and biomarkers may not be better than traditional
ognized risk factor for conditions such as diabetes, stroke, methods because they can be affected by factors other than
and cardiovascular disease (1, 2), whereas intake of others diet and are not available for all nutrients (10, 14).
(e.g., tocopherols, carotenoids, folic acid, vitamin C, and Regardless of the technique used, actual food intake and its
dietary fiber) is thought to be protective (3–5). Assessment reporting are affected by factors such as culture, education,
of long-term dietary intake, the essential exposure factor for type of food consumed, age, gender, and body weight (11,
disease, is complicated by a lack of accurate methods (6, 7). 12, 15–17). Furthermore, menus and recipes tend to vary
Traditionally, intake of micro- and macronutrients has been across sociocultural groups.
estimated by the use of semiquantitative food frequency Although a questionnaire is an excellent tool for assessing
questionnaires (FFQs), dietary records, and dietary recalls long-term dietary intake, an FFQ valid for one population
(7, 8). Use of biomarkers of dietary intake is a recent addi- may be invalid when applied to another. For this reason, a
tion to these techniques (3, 9, 10). Assessment of true long- validation study is necessary whenever an FFQ is used. The
term dietary intake is fraught with a number of problems performance of FFQs in describing food consumption pat-
terns (18) and intake of individual nutrients has been evalu-
ated extensively in European and US Caucasian populations
(8, 14, 19). Little is known about the performance of the
Received for publication February 13, 2001, and accepted for
publication September 13, 2001. FFQ in other ethnic groups, particularly in developing coun-
Abbreviations: FFQ, food frequency questionnaire; USDA, US tries and in US minority populations. In addition, FFQs have
Department of Agriculture; VC, validity coefficient. been validated mainly against one dietary assessment
1
Department of Nutrition, Harvard School of Public Health, method such as dietary recalls or dietary records. The prob-
Harvard University, Boston, MA.
2 lem attending this approach is that the same factors that
Salud Coronaria, Instituto de Investigaciones en Salud,
Universidad de Costa Rica, San Pedro, Costa Rica. affect the reference method may also affect the FFQ. This
3
Department of Epidemiology, Harvard School of Public Health, problem would make it impossible to assume independent
Harvard University, Boston, MA.
4
random errors in the two methods, which could lead to over-
Department of Biostatistics, Harvard School of Public Health, estimation of the correlation between the reference method
Harvard University, Boston, MA.
Correspondence to Dr. Hannia Campos, Department of Nutrition, and the FFQ (20).
Room 353A, Harvard School of Public Health, Harvard University, Even if this error did not exist, a bias leading to underes-
Boston, MA 02115 (e-mail: hcampos@hsph.harvard.edu). timation of this correlation could occur as a result of corre-
1126
Validation of FFQs by Using the Method of Triads 1127
lated random errors in repeated measurements when the ref- infarction and gene-diet interaction in Costa Rica. We also
erence method is used (14, 20). Furthermore, in most FFQ report the reproducibility of the FFQ in assessing dietary
evaluation studies, additional information on biomarkers of intake of selected nutrients in Costa Rica.
intake is usually not available to augment data from the ref-
erence method. When such information is available, Kaaks
MATERIALS AND METHODS
(6) has recommended that the correlation of the estimate by
using the dietary assessment method and a person’s “true” Study population and design
long-term intake (referred to as the validity coefficient
(VC)) be estimated from three pairwise correlations The subjects in this 1995–1998 validation study were
between the FFQ, the reference method, and the biomarker recruited from the control group of an ongoing case-control
(figure 1). The technique for this estimation, named the study of myocardial infarction in Costa Rica. The 78 men and
“method of triads,” is an application of a factor analysis 42 women in this study are of Mestizo background and cul-
model and does correct for bias due to correlated errors in turally are Hispanic Americans. All subjects had no history of
the repeated measurements from the reference method (6, myocardial infarction or physical or mental disabilities prior
FIGURE 1. Diagrammatic representation of the method of triads used to estimate the correlation (?) between true long-term nutrient intake
and intake estimated using dietary assessment methods. Modified from Ocké and Kaaks (20).
biopsy (for assessment of tocopherol, carotenoid, and fatty L-4200 detector was set at a wavelength of 445 nm. Lutein
acid intake). and zeaxanthin were grouped because they coelute on the
Intake of energy and nutrients was computed by multi- chromatogram. The between-run coefficients of variation
plying the consumption frequency of each food by the nutri- for α-carotene, β-carotene, β-cryptoxanthin, lycopene, and
ent content of the specific portion; food composition values zeaxanthin + lutein were 7.2, 7.5, 12.6, 8.1, and 8.6 percent,
from the US Department of Agriculture (USDA) database respectively, in plasma samples and 18.6, 21.8, 26.4, 16.7,
(19, 21) and data from manufacturers and published reports and 19.6 percent in adipose tissue samples.
were used. Carotenoid intake (including α-carotene, β- The proportion of total fat in adipose tissue contributed
carotene, β-cryptoxanthin, lycopene, and zeaxanthin by each fatty acid was determined by gas-liquid chromatog-
lutein) was calculated by using the new carotenoid database raphy (27). In brief, about 2 mg of adipose tissue was added
that includes more than 2,400 fruits, vegetables, and to a hexane:isopropanol (3:2) mixture. To 200 µl of this
selected multicomponent foods (22, 23). The carotenoid mixture was added methanol and acetyl chloride, which
content of tomato-based products was updated with values formed methyl esters. After esterification, the adipose tissue
from the USDA, which were derived recently by reverse- was evaporated and the fatty acid methyl esters were redis-
Because there were high correlations between total energy der, and county of residence in Costa Rica) as numeric
intake and individual nutrient intakes estimated by dietary regressors and used them as partial variables in the CORR
recalls and the FFQs, variables were adjusted for total energy procedure of SAS (32). Because of the established negative
intake by regressing the nutrient against total energy intake, relation between smoking and amounts of carotenoids and
as described previously (7, 30). Briefly, a nutrient was loge or tocopherols in the plasma and diets of smokers (30, 33), and
square-root transformed to improve normality. The trans- the observed associations between these nutrients, smoking
formed nutrient was then regressed on total energy intake status, and body mass index in this study, we used regression
obtained from the same dietary assessment method. The methods to compute Pearson’s partial correlation coeffi-
regression coefficient, the intercept, and residuals were cients for these nutrients (32).
obtained. Although by definition the residuals are indepen- Random within-subject error in estimating the intake of a
dent of the explanatory variable, some are negative and dif- nutrient tends to attenuate correlations toward zero (7).
ficult to interpret. Therefore, we multiplied the regression Therefore, we performed analysis of variance on the data
coefficient by the mean of energy intake for the whole study from dietary recalls to estimate within- and between-subject
population; to this product we added the intercept and the variation and used the ratio of the two variances to correct
TABLE 1. Characteristics of the 120 controls interviewed to validate the semiquantitative food
frequency questionnaire used in a study of myocardial infarction, Costa Rica, 1995–1998
% or mean (standard deviation)
Variable (unit) Validation study population Total control population
(n = 120) (n = 503)
TABLE 2. Comparison of mean daily energy and absolute nutrient intakes estimated by using a
semiquantitative food frequency questionnaire and seven 24-hour dietary recalls for 120 controls
in a study of myocardial infarction, Costa Rica, 1995–1998
Mean (standard deviation)
Variable (unit) Mean of seven
FFQ1* FFQ2 Average FFQ
24-hour recalls
Total energy (kcal) 1,872 (585) 2,370 (728) 2,328 (724) 2,349 (611)
Protein (g) 65 (22) 76 (24) 75 (25) 76 (21)
Total fat (g) 59 (24) 86 (30) 92 (40) 89 (29)
Saturated fat (g) 20 (9) 30 (10) 32 (13) 31 (10)
Monounsaturated fatty acids (g) 22 (10) 32 (14) 34 (17) 34 (15)
Polyunsaturated fatty acids (g) 10 (4) 15 (6) 15 (6) 15 (5)
Total carbohydrate (g) 275 (88) 332 (118) 312 (101) 322 (92)
Sucrose (g) 101 (43) 69 (34) 61 (31) 65 (28)
Total fiber (g) 20 (8) 26 (11) 24 (9) 25 (8)
Cholesterol (mg) 231 (111) 272 (164) 279 (194) 276 (152)
Vitamin B6 (mg) 1.5 (0.6) 2.4 (1.0) 2.3 (0.9) 2.3 (0.8)
Vitamin B12 (mg) 3.6 (3.6) 5.3 (3.9) 4.6 (3.2) 4.9 (3.1)
Vitamin C (mg) 143 (80) 260 (144) 213 (116) 237 (114)
Vitamin K (mg) 119 (115) 104 (77) 93 (69) 99 (66)†
Folic acid (mg) 265 (100) 377 (139) 345 (131) 361 (113)
Calcium (mg) 636 (258) 891 (384) 820 (368) 856 (319)
Iron (mg) 15 (5) 15 (4) 15 (5) 15 (4)†
Sodium (mg) 2,267 (911) 2,054 (619) 1,892 (673) 1,973 (544)
Zinc (mg) 8.5 (3.0) 9.5 (3.2) 9.6 (3.5) 9.6 (2.9)
Caffeine (mg) 303 (173) 342 (182) 295 (167) 319 (162)†
Myristic (14:0) acid (g) 1.4 (0.8) 1.9 (1.0) 1.9 (1.2) 1.9 (1.0)
Palmitic (16:0) acid (g) 11 (6) 20 (6) 21 (7) 20 (6)
Stearic (18:0) acid (g) 3.9 (1.9) 6.0 (2.3) 6.3 (2.9) 6.1 (2.3)
Palmitoleic (16:1) acid (g) 0.95 (0.52) 1.3 (0.6) 1.3 (0.7) 1.3 (0.6)
Oleic (18:1) acid (g) 17 (8) 30 (13) 32 (16) 31 (13)
Linoleic (18:2) acid (g) 7.5 (3.0) 13 (5) 13 (6) 13 (5)
Linolenic (18:3) acid (g) 0.88 (0.40) 1.2 (0.5) 1.3 (0.5) 1.2 (0.4)
a-Tocopherol (mg) 3.0 (1.2) 9.0 (4.1) 9.5 (5.6) 9.3 (4.2)
g-Tocopherol (mg) 3.0 (1.1) 15 (8) 15 (8) 15 (7)
a-Carotene (mg) 118 (70) 640 (792) 459 (442) 550 (547)
b-Carotene (mg) 596 (351) 4,239 (3,344) 3,604 (2,283) 3,921 (2,442)
b-Cryptoxanthin (mg) 262 (172) 610 (603) 457 (498) 533 (482)
Lycopene (mg) 916 (798) 5,180 (3,709) 5,249 (3,637) 5,215 (3,108)
Zeaxanthin + lutein (mg) 599 (292) 2,922 (2,892) 2,423 (2,265) 2,672 (2,308)
TABLE 3. Pearson’s partial correlation coefficients* for nutrients estimated from two food frequency
questionnaires and seven 24-hour dietary recalls for 120 subjects from Costa Rica, 1995–1998
TABLE 4. Pearson’s partial correlation coefficients* for carotenoids, tocopherols, and fatty acids
estimated from plasma, adipose tissue, seven 24-hour dietary recalls, and two semiquantitative food
frequency questionnaires for 120 subjects from Costa Rica, 1995–1998
Plasma samples
a-Tocopherol‡ 0.18§ 0.21 0.03§ 0.04§ 0.06§ 0.41
g-Tocopherol‡ 0.24 0.30 0.28 0.30 0.32 0.45
a-Carotene‡ 0.25 0.33 0.32 0.28 0.36 0.01§
b-Carotene‡ 0.26 0.35 0.33 0.36 0.38 0.50
b-Cryptoxanthin‡ 0.25 0.43 0.53 0.55 0.58 0.49
Lycopene‡ 0.30 0.45 0.24 0.18§ 0.26 0.43
Zeaxanthin + lutein‡ 0.32 0.43 0.28 0.26 0.27 0.36
Adipose tissue samples
* All correlations, except those between plasma and adipose tissue, are adjusted for total energy intake,
gender, age, and county of residence.
† FFQ, food frequency questionnaire; r, Pearson’s partial correlation coefficient; Cr, r corrected for day-to-day
variation and used to compute validity coefficients.
‡ Correlation also adjusted for smoking and body mass index.
§ Not significant at p = 0.05; the rest of the noncorrected correlations are significant.
Validity coefficients linoleic acid, were similar to those from dietary recalls,
again suggesting that the three methods are comparable.
The VCs for each dietary assessment method are pre-
sented in tables 5 and 6. Although a few Heywood cases
DISCUSSION
were evident, especially when adipose tissue was used as a
biomarker, most VCs were within the range of 0–1. For Unlike most validation studies, in which data from only
γ-tocopherol and carotenoids in plasma and for adipose tis- one FFQ interview and one reference method are available,
sue polyunsaturated fatty acids, at least two of the three our study estimated nutrient intake by using three methods
VCs were similar, suggesting that at least two of the meth- (FFQ, dietary recalls, and biomarkers). The sample size of
ods yielded similar measurements of the nutrient. When 120 subjects, coupled with seven dietary recall replicates per
the average FFQ was used together with the mean of subject, allowed for reasonable precision in estimating the
dietary recalls, plasma (VC, 0.16–0.72; median, 0.52) was correlations (7). In addition, potential seasonal variation in
found to be a better biomarker for carotenoids and γ- food intake, especially of fruits and vegetables, was consid-
tocopherol than adipose tissue (VC, 0.05–0.51; median, ered in the study design. The analysis also produced a better
0.30). VCs for the average FFQ were similar to or higher estimation of long-term intake because the two FFQs were
than those for plasma and adipose tissue biomarkers and averaged.
were in some cases (e.g., β-cryptoxanthin) higher than In this study, the correlations between estimates obtained
those for dietary recall. by the FFQ and the dietary recall are similar to those
VCs for adipose tissue palmitic acid could not be esti- observed between FFQs and dietary records in other studies
mated because of a negative correlation (tables 4 and 6). (8, 19). The median of the correlations between the dietary
However, adipose tissue was a good biomarker for polyun- recalls and average FFQ measurements was 0.55, again
saturated fatty acids; VCs ranged from 0.45 to 1.01. For indicating good validity of the FFQ. Relative to the dietary
polyunsaturated fatty acids, the VCs from the FFQ were recalls, the FFQ also performed well in estimating intake of
always higher than those from adipose tissue and, for fatty acids, irrespective of whether they were saturated or
TABLE 5. Validity coefficients (95% bootstrap confidence intervals) for carotenoids and tocopherols estimated from plasma,
adipose tissue, seven 24-hour dietary recalls, and two semiquantitative food frequency questionnaires for 120 subjects from
Costa Rica, 1995–1998
* Validity coefficients and confidence limits >1 (Heywood cases) were set to 1.00.
† Q1, food frequency questionnaire (FFQ) 1; DR, 24-hour dietary recalls; T, “true” dietary intake; PL, plasma; Q2, FFQ2; Q3, average FFQ;
AD, adipose tissue; NA, not applicable because of a negative correlation.
unsaturated. The highest correlations were found for linoleic biomarker for β-cryptoxanthin (r 0.43) and zeaxanthin
(r 0.73) and myristic (r 0.70) acids. The FFQ was lutein (r 0.43), and adipose tissue was a good biomarker
highly reproducible, as shown by Pearson’s partial correla- for β-carotene (r 0.43). The correlation between plasma
tion coefficients ranging from 0.33 to 0.77. Intraclass corre- and adipose tissue α-tocopherol measurements (r 0.41)
lations were also high (data not shown). was similar to the one (r 0.39) reported in the European
In comparison with dietary recall estimates, lycopene Community Multicenter Study on Antioxidants, Myocardial
seemed to be equally well estimated from both plasma (r Infarction and Cancer (EURAMIC) of the Breast (34).
0.45) and adipose tissue (r 0.42). Plasma was also a good However, our correlations between plasma and adipose tis-
TABLE 6. Validity coefficients (95% bootstrap confidence intervals) for selected fatty acids estimated from adipose tissue, seven
24-hour dietary recalls, and two semiquantitative food frequency questionnaires for 120 subjects from Costa Rica, 1995–1998
* DR, 24-hour dietary recalls; Q1, food frequency questionnaire (FFQ) 1; Q2, FFQ2; Q3, average FFQ; T, “true” dietary intake; NA, not applic-
able because of a negative correlation; AD, adipose tissue.
† Validity coefficients and confidence limits >1 (Heywood cases) were set to 1.00.
‡ 38.8% of the bootstrap samples produced negative correlations leading to inaccurate confidence intervals, which therefore were omitted.
sue β-carotene (r 0.50) and lycopene (r 0.43) were Hispanics in Costa Rica. Furthermore, the VCs suggest that
higher than those reported in the EURAMIC study (β- plasma could be used as a biomarker for γ-tocopherol, β-
carotene, r 0.39; lycopene, r 0.24). carotene, β-cryptoxanthin, lycopene, and zeaxanthin lutein,
Measurements of saturated fatty acids and monounsatu- while adipose tissue could be used as a biomarker for
rated fatty acids from adipose tissue correlated poorly with lycopene and polyunsaturated fatty acids. This study also
those estimated by the FFQ. This poor correlation may shows that biomarkers did not perform better than the FFQ
reflect the fact that other than dietary sources, saturated fatty and that they should be used to complement the FFQ rather
acids can be synthesized endogenously; furthermore, other than substitute for it.
physiologic processes may influence adipose tissue concen-
trations. Compared with plasma, higher coefficients of vari-
ation were observed for nutrients measured from adipose
tissue. We determined that this high variation is due to ACKNOWLEDGMENTS
nonuniform distribution of the nutrient in adipose tissue and
could in part explain the poor correlations between adipose This study was supported by National Institutes of Health