Professional Documents
Culture Documents
CITATION
Levant, R. F., McDermott, R., Parent, M. C., Alshabani, N., Mahalik, J. R., & Hammer, J. H. (2020,
February 3). Development and Evaluation of a New Short Form of the Conformity to Masculine
Norms Inventory (CMNI-30). Journal of Counseling Psychology. Advance online publication.
http://dx.doi.org/10.1037/cou0000414
Journal of Counseling Psychology
© 2020 American Psychological Association 2020, Vol. 1, No. 999, 000
ISSN: 0022-0167 http://dx.doi.org/10.1037/cou0000414
The Conformity to Masculine Norms Inventory (CMNI) has been an important tool in researching
masculinity. With the original measure at 94 items (Mahalik et al., 2003), there have been several
abbreviated forms developed from 11 to 55 items. However, in confirmatory factor analyses (CFA’s)
testing 13 common factors, bifactor, hierarchical, and unidimensional models, only 4 models demon-
strated adequate fit to the data, and most of these were for the still quite long 46-item version. As a result,
there was no psychometrically strong truly short form of the CMNI. In the present study, data from 1561
community and university men were used to develop a short form. First an exploratory factor analysis
using a portion of the data was conducted, which resulted in a 10-subscale dimensionality, followed by
CFA estimating a common factors model. The results of the CFA were used to create two candidate
models for a 30-item short form of the CMNI, based on Classical test theory (CTT) and optimized CTT.
The best-fitting candidate model for the CMNI-30 was CTT. Next, the fit of the 29, 46, and 94 item
models were compared to the 30-item version, which had the superior fit. Then, measurement invariance
between White men and men of color was assessed, choosing this comparison because hegemonic
masculinity is theorized to marginalize men of color. Evidence was found for full configural and metric,
and partial scalar and residuals invariance. Finally, significant relationships between CMNI-30 scores
and indicators of depression and anxiety provides preliminary concurrent evidence for its validity.
Over the last several decades, counseling psychologists have ditional masculinity ideology and gender role conflict (e.g., Le-
made great strides in the development of measures assessing vant, Hall, & Rankin, 2013; Mahalik et al., 2003; O’Neill, 2008).
conformity to masculine norms and the related constructs of tra- Studies using these measures have brought important insights to
1
2 LEVANT ET AL.
the field regarding the development of client presenting issues, heterosexual men because “gender role norms from the most
barriers to help-seeking, and factors affecting psychological treat- dominant or powerful group in a society affect the experiences of
ment. The present article focuses on the Conformity to Masculine persons in that group, as well as persons in all other groups” (p. 5).
Norms Inventory (CMNI; Mahalik et al., 2003) and undertakes a The original 94-item measure assessed an individual’s level of
major refinement to develop a new short form of the CMNI that conformity to 11 masculine norms: winning (focus on success and
captures most of the dimensions of the original 94-item version winning competitive contests), emotional control (endorsing con-
with clarified wording and strong psychometric properties. trol of emotional expression); primacy of work (endorsing work as
Conformity to masculine norms, or the degree to which indi- a primary focus of life), risk-taking (voluntary exposure to risky
viduals endorse personally enacting the requirements of masculine situations), violence (endorse violence as acceptable response to
norms, is an important conceptual framework for investigating some situations), heterosexual self-presentation (the importance of
masculinity and its correlates. Previous research has found that not appearing to be gay to others), playboy (endorsing casual
conformity to masculine norms can be either adaptive or maladap- sexual activity), self-reliance (reluctance to seek help but rather
tive depending on the subscale and the focal outcome (Gerdes & rely on oneself), power over women (general control of women),
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Levant, 2018; Hammer & Good, 2010; Levant, Wimer, & Wil- dominance (desire to be in charge and have control over situa-
This document is copyrighted by the American Psychological Association or one of its allied publishers.
liams, 2011; Owen, 2011; Parent & Moradi, 2011; Wong, Ho, tions), and pursuit of status (wanting to be seen as an important
Wang, & Miller, 2017). For example, in a meta-analysis of 78 person; Mahalik et al., 2003). These original 11 norms were
samples (Wong et al., 2017), conformity to masculine norms was derived from a literature review and focus groups regarding mas-
correlated with maladaptive mental health outcomes and psycho- culine norms for men in the United States (Mahalik et al., 2003).
logical help-seeking attitudes and behaviors. The authors also Data from a pool of 144 items were subjected to an exploratory
identified three subscales—playboy, power over women, and self- factor analysis (EFA) using principle axis factoring and oblimin
reliance—that were consistently and robustly related to maladap- rotation to develop the measure, but a confirmatory factor analysis
tive mental health outcomes. In another recent study, Gerdes and (CFA) was not conducted at the time. Although the CMNI has
Levant (2018) demonstrated that focusing on the CMNI-94 total been highly used in research, efforts have been made to shorten the
score obscured the complex relationships between conforming to original 94-item scale, resulting in the development of five abbre-
specific masculine norms and their adaptive or maladaptive out- viations of the CMNI: CMNI-55 (Owen, 2011), CMNI-46 (Parent
comes, which were broader than the mental health outcomes & Moradi, 2009), CMNI-29 (Hsu & Iwamoto, 2014), CMNI-22
investigated by Wong et al. (2017). Whereas the total score was (Owen, 2011), and CMNI-11 (Mahalik, Burns, & Syzdek, 2007).
overwhelmingly associated with maladaptive outcomes, 30% of The CMNI-55 was created to reduce the length of the original
the findings of the subscales reflected healthy outcomes. Further- measure but keep the full dimensional structure of the original
more, subscales differed in their associated outcomes. The sub- 94-item measure. This abbreviation used classical test theory
scale primacy of work was predominantly correlated with healthy (CTT) by selecting the five highest loading items for the original
outcomes, such as motivation and health promotion, and four 11 factors (Owen, 2011). Three models were tested in the CFAs of
others had a nearly equal balance of healthy and harmful outcomes the CMNI-55: a correlated subscales (common factors) model, an
(pursuit of status, risk-taking, winning, and heterosexual self- uncorrelated subscales model, and a hierarchical model. None fit
presentation). However, six subscales were mostly associated with the data at acceptable levels (Owen, 2011).
harmful outcomes, three of which overlapped with Wong et al.’s The CMNI-46 was derived by conducting a CFA on the original
(2017) problematic scales (Emotional Control, Violence, Power 94-item CMNI, which resulted in a nine-factor measure. Items that
over Women, Dominance, Playboy, and Self-Reliance; Gerdes & had a factor loading less than .60 were deleted. This nine-factor
Levant, 2018). Maladaptive correlates of masculinity scales have model dropped the dominance and pursuit of status subscales and
been theorized to be due to those men who score in the very high kept the rest of the original factors (Parent & Moradi, 2009). The
end of the scales (Levant & Pryor, in press), but at present there is abbreviated scale development used a majority White sample
very little in the way of theory that would explain why some (59%) but included 23% Asian American participants.
subscales are more often associated with maladaptive outcomes The CMNI-29 was developed by Hsu and Iwamoto (2014) with
and others with adaptive ones, which is a task for future research. a sample of White and Asian men. They first conducted two CFAs
But based on the empirical evidence, masculinity norm conformity on the CMNI-46, both of which resulted in poor fit. They then
is a multifaceted construct that is useful in addressing social, conducted two EFAs and a CFA using the same data set, a flawed
intrapersonal, and health outcomes for men (Mahalik et al., 2003; practice which Worthington and Whittaker (2006) recommended
Parent & Moradi, 2011). Counseling psychologists may benefit against (and which tends to inflate fit statistics), yielding two
from utilizing information regarding clients’ conformity to mas- 8-factor scales and dropping the primacy of work factor. The first
culine norms in terms of help-seeking behavior, social functioning, scale had 35 items and demonstrated poor fit to the data, and the
strengths, and areas of distress. second scale had 29 items version and demonstrated acceptable fit.
The CMNI-22 was developed using CTT, selecting the two
highest loading of the original 11 factors from the 94-item CMNI
Assessing Masculine Norm Conformity
(Owen, 2011). The 22-item measure reduces the CMNI to a
Mahalik et al. (2003) developed the CMNI “to be able to unidimensional scale, with poor fit statistics. Although this version
examine the great variability in how men enact masculinity, as of the scale is notably shorter, its unidimensionality deviates from
well as understand the causes of that variability and the resultant the original purpose of the CMNI, which was to assess the multiple
benefits and costs to the individual and others” (p. 4). Their focus dimensions of conformity to masculine norm (Mahalik et al.,
was on the masculine norms reflected in White, cisgender, and 2003). Finally, an 11-item version was created using CTT by
CONFORMITY TO MASCULINE NORMS INVENTORY-30 3
selecting the highest loading item on each of the 11 subscales factors model, is the preferred way to measure conformity to
(Mahalik et al., 2007). No CFA results are reported for this masculine norms. It is not clear theoretically why conformity to
version. masculine norms might be best modeled as completely multidi-
mensional without some unifying or overarching factor, especially
Assessment of the Variance Composition of the CMNI since a related measure, the Male Role Norms Inventory–Short
Form (MRNI-SF) has been almost always been modeled as a
The variance composition of most versions of the CMNI has bifactor structure, with a general traditional masculinity ideology
been investigated. Correlated (or common) factors, bifactor, hier- factor and seven group factors corresponding to the subscales
archical (or second-order), and unidimensional models have been (McDermott et al., 2017). It may have something to do with the
estimated and compared (Hammer, Heath, & Vogel, 2018; Heath, way that beliefs are measured as compared with self-concept, but
Brenner, Vogel, Lannin, & Strass, 2017; Hsu & Iwamoto, 2014; this is a task for future research. On the other hand, the multidi-
Levant, Hall, Weigold, & McCurdy, 2015; Mahalik et al., 2007; mensional measure has considerable clinical utility for practitio-
Owen, 2011; Parent & Moradi, 2009). In the unidimensional ners, largely because certain subscales have been associated with
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
model, items load only onto a general conformity to masculine maladaptive outcomes, and others with adaptive ones, as discussed
This document is copyrighted by the American Psychological Association or one of its allied publishers.
norms (CMN) factor. In the correlated factors model, items load above (Owen, 2011).
only onto their group factors (i.e., subscales). In the bifactor
model, items load both onto their group factors and a general CMN
Rationale for the Present Study
factor; the general CMN factor is uncorrelated with (orthogonal to)
the specific factors, as they are to each other. In the second order As noted in the preceding text, the present study developed a new
model, items load onto their respective first order factor (i.e., the short form of the CMNI that captures most of the dimensions of the
subscales), which in turn load onto the second-order factor. The fit original 94-item version with clarified wording and strong psycho-
statistics for all CMNI versions (except the 11-item, which has not metric properties. The chief advantage of a short form is that it
been provided) are presented in Table 1. Only the common factors reduces participant fatigue. There are some potential drawbacks to a
model of the 46 and 29, and the bifactor model of the 46 have at short form, such as a possible loss of bandwidth in measuring each
times demonstrated acceptable fit. subscale and reduced reliability. On the other hand, developing a new
Levant et al. (2015) found that the bifactor model provided form with strong psychometric properties is clearly indicated. First,
better fit than both the hierarchical and correlated factors models. societal masculine norms have continued to change in the nearly 20
However, looking at explained common variance, only 22% of years since the development of the inventory. Second, four models
variance was explained by the general CMN factor whereas 78% (with 94, 55, 22, and 11 items) have never demonstrated good fit to
of variance was explained by the group factors. Furthermore, the data on CFA, and thus do not have adequate psychometric
Heath et al. (2017) found that bifactor model had inadequate fit to properties. Only the models with 29 and 46 items have at times
the data. Moreover, Hammer et al. (2018) used CFA and ancillary demonstrated adequate fit. Second, the 46-item model is still quite
bifactor indices in two samples of community and college men to long, and it is an empirical question as to whether it has better fit than
assess how well the CMNI-46 measured a general CMN factor and our 30-item model. Although it is true that the 29-item CMNI ad-
found that the general factor explained only 18% to 27% of item vanced the field by providing an abbreviated form, there are ways that
variance, indicating that the CMNI-46 can measure conformity to it falls short, making room for the 30-item model. First, it was
specific masculine norms but not overall CMN. These results developed using the nonrecommended practice of conducting the
indicate that a multidimensional measure, based on the correlated EFA and CFA with the same sample (cf. DeVellis, 2017; Worthing-
Table 1
Model Fit Statistics and Comparisons of CMNI Models
RMSEA estimate,
CMNI Model, Source 2(df), p CFI, TLI [90% CI] SRMR
94-item CF, Parent & Moradi, 2009 7089.3 (4,222), p ⬍ .001 .71 .055 [.052, .057] .077
46-item CF, Parent & Moradi, 2009 1414.3 (953), p ⬍ .001 .90 .046 [.041, .051] .059
46-item CF, Levant, Hall, Weigold, & McCurdy, 2015 1728.20 (953), p ⬍ .001 .900, 891 .043 [.040, .046] .060
46-item H, Levant et al., 2015 1901.20 (980), p ⬍ .001 .881, .875 .046 [.043, .049] .081
46-item BF, Levant et al., 2015 1596.08 (942), p ⬍ .001 .916, .918 .039 [.036, .043] .074
46-item CF, Hammer, Heath, & Vogel, 2018 2140.08 (953), p ⬍ .001 .935, .929 .039 [.037, .041] .049
46-item BF, Hammer et al., 2018 1657.20 (943), p ⬍ .001 .931, .924 .035 [.032, .037] .062
46-item BF, Heath, Brenner, Vogel, Lannin, & Strass, 2017 1652.27 (943), p ⬍ .001 .86 .05 [.05, .06] .11
46-item CF, Hsu & Iwamoto, 2014 3768.81 (909), p ⬍ .001 .82 .05 [.05, .06] .07
55-item CF, Owen, 2011 2805.70 (1,485), p ⬍ .001 .88 .045 .06
55-item H, Owen, 2011 3192.50 (1,419), p ⬍ .001 .85 .049 .08
29-item CF, Hsu & Iwamoto, 2014 965.26 (893), p ⬍ .001 .93 .040 [.041, .048] .04
22-item Uni, Owen, 2011 .26 .13 [.12, .14] .13
Note. CMNI ⫽ Conformity to Masculine Norms Inventory; CFI ⫽ comparative fit index; TLI ⫽ Tucker–Lewis index; RMSEA ⫽ root mean square error
of approximation; SRMR ⫽ standardized root mean square residual; CF ⫽ common factors model; H ⫽ hierarchical model; BF ⫽ bifactor model; Uni ⫽
unidimensional model.
4 LEVANT ET AL.
ton & Whittaker, 2006), which tends to inflate fit statistics. In addi- residual (SRMR). Furthermore, McDermott et al. (2017) assessed
tion, although the present study started from the 94-item version, Hsu ME/I of a related instrument, the MRNI-SF, comparing men with
and Iwamoto (2014) started from the CMNI-46, which had already women, White men with Black and Asian men, and gay men with
dropped two factors, accounting for why only eight of the original 11 heterosexual men. They found that the MRNI-SF demonstrated
factors were retained. configural invariance and at least partial metric invariance (i.e.,
measured similar constructs), but that scalar and residuals invari-
ance were only supported for Asian men compared with White
Present Study
men. Based on these studies, it appears that ME/I is more likely to
To create a new short form that captures most of the dimensions be demonstrated in comparisons of White and Asian men, but there
of the original 94-item version with improved wording and strong is no literature directly bearing on our research question, and thus
psychometric properties, we first revised the wording of most we consider the ME/I of White men versus men of color to be an
items to improve their clarity and consistency with the construct of exploratory question.
norm conformity. We also expanded the scale from four to six Finally, we conducted a preliminary assessment of criterion
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
points to allow for greater variability and precision. Then an EFA evidence for validity. Given the myriad evidence associating
This document is copyrighted by the American Psychological Association or one of its allied publishers.
was conducted on the 94 items using a portion of the data. Because scores on various versions of the CMNI with measures of psycho-
the literature supports a common factors model, we next estimated logical well-being, we assessed correlations between CMNI-30
one using CFA with the balance of the data and used the resultant scores and high-bandwidth, well-established measures of mood
factor loadings to create two candidate model short forms. We disturbance. We predicted that the three subscales (playboy, power
planned a priori to generate a three items per subscale version of over women, and self-reliance) identified by Wong et al. (2017) as
the CMNI-30 because construction of latent variables in structural consistently related to maladaptive mental health outcomes would
equation modeling (SEM) requires use of at least three manifest be moderately positively associated with diagnostic measures of
variables to indicate a latent factor without causing local identifi- depression and anxiety.
cation problems (Kline, 2016; Little, Cunningham, Shahar, &
Widaman, 2002). The first model was developed by applying CTT, Method
selecting the three highest loading items per factor. The second
model optimized CTT (CTT-Opt) by selecting three items for each
factor with higher factor loadings but also ensuring that the content Participants
reflected all aspects of the subscale, while avoiding having too Data from 1,561 college (n ⫽ 113) and community (n ⫽ 1,448)
similar items, and preserving content validity (Goetz et al., 2013). men were used in the study.1 A recent study has found that
As there is no literature directly bearing on this question, we supplementing traditional college convenience samples with com-
consider it to be exploratory. munity samples recruited from the Internet may help researchers
Next, the best-fitting candidate model of the CMNI-30 was assemble samples that are more representative of the U.S. popu-
compared with the 29-, 46-, and 94-item models using CFA to lation (Alto, McCullough, & Levant, 2018).
assess fit with the data. Admittedly this was an imperfect compar- Ages ranged from 18 to 76 years, with a mean of 36.27 (SD ⫽
ison because, as noted above, we had changed the wording of 12.18). The median age was 33.0, and the mode was 30.0. Most
many items to improve their clarity and congruence with the participants self-identified as White/European American (73%), 6.2%
construct of conformity. Then the CMNI-30 was utilized to test identified as Black/African American, 8.9% as Asian/Asian Ameri-
measurement equivalence/invariance (ME/I) by racial group, com- can, 6.9% as Latino, and 2.7% as multiracial. The remaining 2.3% of
paring White participants to men of color. We were limited by low participants identified as American Indian/Alaska Native, Pacific Is-
sample sizes in the various groups of people of color, and thus lander/Native Hawaiian, other, or did not respond. In terms of highest
could not test specific identities. We recognize the limitations both educational degree, 0.4% did not complete high school, 9.4% had a
of aggregating these identities and of using a proxy variable like high school diploma or graduate equivalency diploma, 10% had an
racial/ethnic identification (Priem, Lyon, & Dess, 1999), and en- associate’s degree, 23.5 completed some college, 40.8% had a bach-
courage investigators to examine specific identities using broader elor’s degree, 11.5% had a master’s or specialist degree, and 3.5% had
measures of racial/ethnic identity in future studies. However, we a doctoral degree. The remainder (0.9%) preferred not to answer or
were ultimately guided by Connell and Messerschmidt’s (2005) did not respond to this question. Reported family/household in-
theory of hegemonic masculinity that posits that the masculine come levels were as follows: under $20,000, 11.5%; $20,001–
norms function not only to make men dominant over women but $40,000, 21.3%; $40,001–$60,000, 23.3%; $60,001–$80,000,
also to marginalize men of color relative to White men. This has 17.7%; $80,001–$100,000, 9.6%; and over $100,000, 15.3%. The
to do with the relative privilege and power of White men and the remainder (1.3%) did not respond to this question.
systems of oppression that marginalize men of color in the United In terms of self-identified socioeconomic status, of those who
States (Alexander, 2012; Connell & Messerschmidt, 2005). Hsu responded to this question, the largest percentage (47%) identified
and Iwamoto (2014) reported ME/I on the CMNI-29, comparing
White with Asian American participants, finding evidence for
1
configural, metric, and scalar invariance. However, their findings The two samples were compared on their scores on the 11 CMNI-94
were based only on the change in comparative fit index (CFI) and subscales, finding that the community sample was significantly different
(lower) from the college sample on only the Risk-Taking subscale, which
root mean square error of approximation (RMSEA) in going from makes sense intuitively. The two samples differed on both the validity
a less restrictive to a more restrictive model; they did not report the variables and their demographic characteristics as would be expected.
change in scaled chi square or in standardized root mean square These comparisons are in Table 1 in the online supplemental materials.
CONFORMITY TO MASCULINE NORMS INVENTORY-30 5
as middle class, although 29.3% identified as lower middle class, ity, indicating that CMNI subscales measure what they are in-
9.9% as lower class, 12.1% as upper middle class, and 0.6% as tended to measure (Mahalik et al., 2003).
upper class. In terms of sexual orientation, most participants who Generalized Anxiety Disorder–7-Item Scale (GAD-7). Part-
responded to this question (89%) identified as heterosexual/ icipants completed the GAD-7 (Löwe et al., 2008) to assess
straight, 3.9% identified as gay/lesbian, 5.8% as bisexual, and anxiety. The instructions stated, “over the last 2 weeks how often
0.7% as other. Most participants (87.3%) indicated that their you have been bothered by the following.” A sample item is “not
preferred sexual partner was always female, 4.7% indicated always being able to stop or control worrying.” Items are rated on a
male, and 7.4% indicated either female or male. The remainder four-point scale from 0 (not at all) to (3) nearly every day. The
(0.6%) preferred not to answer or did not respond to this question. GAD-7 has been used as an assessment of symptoms and anxiety
In terms of relationship status, 30.4% participants indicated they across myriad samples. The measure has demonstrated strong
were single; 42.2% indicated they were married, partnered, or psychometric properties including good model-data fit, strong
engaged; 13.7% indicated they were dating one person exclu- convergent validity with other measures of anxiety, measurement
sively; 7.2% were currently partnered without legal recognition; invariance across gender, good test–retest reliability with .83 in-
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
and 6.1% were dating casually. The remaining participants (0.3%) traclass correlation, and good internal reliability (e.g., Rutter &
This document is copyrighted by the American Psychological Association or one of its allied publishers.
preferred not to answer this question. Regarding religious affilia- Brown, 2017; Spitzer, Kroenke, Williams, & Löwe, 2006).
tion, the largest percentage (46.4%) identified as Christian; 32.7% Patient Health Questionnaire–2-Item Scale (PHQ-2). Parti-
identified as either agnostic or atheist; 9.5 identified as none; and cipants completed the PHQ-2 (Kroenke, Spitzer, & Williams,
11.4% identified as either Jewish, Buddhist, Hindu, Muslim, Pa- 2003). Items asked participants to indicate “over the last 2 weeks
gan, other, or did not respond to the question. how often they have been bothered by the following.” A sample
item is “little interest or pleasure in doing things.” Items are rated
on a four-point scale, which ranged from 0 (not at all) to 3 (nearly
Measures every day). The PHQ-2 is a well-established screener for depres-
sion. The measure has demonstrated strong psychometric proper-
Demographic questionnaire. The demographic question-
ties including strong convergent evidence for validity with other
naire asked participants to report their gender, race/ethnicity, age,
measures of depression, good reliability, and measurement invari-
relationship status, preferred gender identity of sexual partner,
ance across gender (Manea et al., 2016; Richardson et al., 2010).
sexual orientation, level of education, family household income,
socioeconomic status, and religious identity.
CMNI. Participants completed a revision of the original (94- Recruitment and Survey Procedures
item) CMNI (Mahalik et al., 2003), designed to measure partici- The study was approved by the University of Akron Institutional
pants’ conformity to 11 masculine norms. Of the 94 items, 54 had Review Board. All participants were provided with a link to a Qual-
wording changes. Some items were changed to improve clarity trics website, which hosted the study. After completing the informed
(e.g., changed from “In general I will do anything to win” to “I will consent page, participants filled out the questionnaires and then were
do anything to win”), and some items were changed that did not provided with an educational debriefing. Community-dwelling par-
clearly reflect conformity (“I do . . .”) but rather reflected ideology ticipants were recruited using Amazon Mechanical Turk (MTurk)
(“men should . . .”; e.g., changed from “It is best to keep your service. Data obtained from MTurk has been demonstrated to be valid
emotions hidden” to “I try to keep my emotions hidden”). The and reliable when appropriate selection criteria and attention checks
number of items and a sample item for each subscale are as are used (Casler, Bickel, & Hackett, 2013; Peer, Vosgerau, & Ac-
follows: Winning (10),“Winning is not my first priority”; Primacy quisti, 2014). The survey contained three validity check items (e.g.,
of Work (8), “I feel good when work is my first priority”; Emo- “Thank you for taking our survey. Please click strongly agree”).
tional Control (11), “I try to keep my emotions hidden”; Pursuit of Failing a validity check resulted in being exited from the study.
Status (6), “It feels good to me to be important”; Heterosexual Following completion of the study, payment was granted through an
Self-Presentation (10), “I would be furious if someone thought I automated link between the Qualtrics survey and MTurk. Undergrad-
was gay”; Playboy (12), “I would frequently change sexual part- uate students were recruited from the voluntary research participant
ners if I could”; Violence (8), “I like getting into fist fights”; pools at the University of Akron and University of Texas at Austin
Self-Reliance (6), “I hate asking for help”; Risk-Taking (10), and received course credit for completing the study. At the conclusion
“Taking risks help me prove myself”; Power Over Women (9), “I of the survey, they were directed to a different Qualtrics site where
control the women in my life”; Dominance (4), “In general, I must they could enter their contact information confidentially for their
get my way.” Mahalik et al. (2003) reported Cronbach’s alphas for course credit.
subscales ranging from .72 and .91, and high test–retest reliability
over a 2- to 3-week period, using data from mostly White college
Sample Size Considerations
men and women. Items were rated on a six-point Likert scale
(which is a change from the original four-point scale, designed to For the EFA, 470 cases were randomly selected from the full
capture greater variability and allow for more precision) and data set of 1,561. This sample size allowed for five participants per
ranged from 0 (strongly disagree) to 5 (strongly agree). Several item, which is within the range of current practice (Costello &
CMNI subscales related positively to their respective scales in the Osborne, 2005). For the single-group CFAs, multigroup CFAs,
Gender Role Conflict Scale, and to psychological distress, social and validity analyses, we used the remaining 1,091 cases. This
dominance, aggression, and seeking muscularity, while others sample size is adequate using the MacCallum, Browne, & Sug-
related negatively psychological help-seeking and social desirabil- awara (1996, Table 4) criteria. The largest number of degrees of
6 LEVANT ET AL.
freedom of the CFAs was 286, indicating that the minimum sample represent the magnitude of metric and scalar measurement noninvari-
size was ⬍178. ance. Based on simulation data, in which researchers manipulated the
level of noninvariance in a model from small (i.e., not practically
meaningful) to large (Nye et al., 2018), DMACS were interpreted using
Data Analytic Procedures
the following effect size cutoffs: .40 (small), .60 (medium), and .80
Overview. EFA of the 94 CMNI items was conducted, using (large).
principle axis factoring with oblimin rotation to reassess the di-
mensionality of the scale. Next, a common factors model was
Results
confirmed using CFAs. The common factors model was then used
as a basis for specifying the two candidate models of the CMNI-30
(CTT, CTT-Opt), for assessing measurement invariance, and for Preliminary Results
examining validity.
Data cleaning, missing values, normality. The complete
Statistical analyses. The EFA and descriptive statistics were
data set, consisting of 2,253 participants, was screened before
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Mplus (Version 8; Muthén & Muthén, 1998 –2017) was used. The
and 606 from the first and second author’s universities; both data
overall fit of all CFA models was assessed with the scaled chi-
sets included all measures. One participant was removed due to
square goodness-of-fit test. However, because this statistic is de-
age less than 18, 285 were removed for stopping immediately after
pendent on sample size, it is overly sensitive to trivial sources of
the consent question or for missing more than 30% of the data, and
model misfit when sample sizes are large, as in the current study
282 were removed for failing one or more of the three attention
(Cheung & Rensvold, 2002). Thus, a set of alternative fit indices
checks. Additionally, 124 participants were removed for identify-
was consulted to determine whether a model demonstrates ade-
ing as women. The final dataset was N ⫽ 1,561. A low level of
quate fit (Kahn, 2006). These indices and the criteria used to assess
missing data was observed, ranging from 3 to 18 missing responses
their values (see Kline, 2016) were (1) the CFI; (2) the Tucker–
per item. Most items were missing less than 1% of all responses
Lewis Index (TLI), for which both indices values of ⬎.90 indicate
with two items missing 1.1% of responses. The total percentage of
reasonable fit, and values of ⱖ.95 indicate good fit; (3) the
missing responses for 94 items was ⬍1%. Furthermore, no other
RMSEA, where good model fit is indicated by values of ⱕ.05 and
major complicating concerns such as small sample size or poor
values between .05 and .08 suggest reasonable fit; and (4) SRMR,
internal reliability of scales were present. Thus, data analysis
for which values of ⱕ.05 suggest good model fit, and values
proceeded following the simplest path; no missing values were
of ⬍.10 are considered acceptable.
imputed and all available responses for each item were used in the
The relative fits of nested single-group or multigroup CFA
analysis (Parent, 2013). The data was mildly non-normally distrib-
models were compared using a scaled chi-square difference tests
uted, with values for scales of skew ranging from ⫺1.28 to 1.36
(adjusted to take into consideration the use of the maximum
and of kurtosis ranging from ⫺1.28 to 1.55. To accommodate this
likelihood estimation with robust standard errors (MLR) estimator
nonnormality, the Mplus MLR option was used, which calculates
(see Satorra & Bentler, 1999 and instructions at the Mplus web site
the scaled chi-square test statistic.
by accessing http://www.statmodel.com/chidiff.shtml)). The chi-
Outliers. The sample yielded a large number of multivariate
square difference test allows investigators to determine if a dec-
outliers (183; 10.86%), as evidenced by Mahalanobis distance
rement in fit associated with a nested or more constrained model is
procedures. The multivariate outlier participants were compared
statistically significant. However, the chi-square difference test,
with the nonoutlier participants. Groups differed significantly on
like the chi-square goodness-of-fit test, is dependent on sample
seven of the subscales: winning, risk taking, dominance, power
size (Cheung & Lau, 2012; Cheung & Rensvold, 2002). Because
over women, heterosexual self-presentation, primacy of work, and
the ⌬2 is expected to be statistically significant in samples larger
pursuit of status with the multivariate outlier group scoring higher
than 300 (Kline, 2016), we also consulted the ⌬CFI. Althogh Chen
than the other group on all seven subscales. Findings indicate that
(2007) suggested ⌬CFI of ⬍.01, Meade, Johnson, and Braddy
the multivariate outlier groups represent participants with higher
(2008) suggested that a ⌬CFI of ⬍.002 would provide evidence for
but still valid masculine norm conformity, and thus multivariate
a more parsimonious equality-constrained model. If the fit of a
outliers were not deleted.
model is not degraded by imposing equality constraints on the
factor loadings or a similar parameter such as the intercepts, then the
more constrained model is preferred because it is more parsimonious. EFA, CFA, and Creation and Comparison of the
A more parsimonious model would be one in which, for example, a Candidate Models
single set of factor loadings suffices for both groups, rather than a
separate set of loadings for each. Last, because measurement invari- The EFA of the 94 items extracting 11 factors resulted in the
ance testing is limited by yielding only a go (invariance) or no-go elimination of nine items, seven for low loadings (⬍.35), and two
(noninvariance) decision, in cases where the evidence for a particular for cross loadings (⬎.32; cf. Tabachnick & Fidell, 2007). The
level of invariance was equivocal, we first performed a series of post Dominance subscale did not survive but the remaining 10 sub-
hoc bootstrap confidence interval tests to examine the differences scales did. This was not surprising because the Dominance sub-
between the groups on the parameter (e.g., item intercepts), and then scale has not demonstrated acceptable psychometric properties in
calculated the effect size for measurement noninvariance, the DMACS
(Nye & Drasgow, 2011; Nye, Bradburn, Olenick, Bialko, & Drasgow, 2
DMACS software is available at https://psychology.psy.msu.edu/
2018).2 DMACS effect sizes are provided for each item and generally pers_nye/.
CONFORMITY TO MASCULINE NORMS INVENTORY-30 7
prior research. In Mahalik et al. (2003), most of the initial 12 For the CMNI-46, whereas the chi-square was significant,
Dominance items did not load onto any CMNI subscale or loaded 2(953) ⫽ 3643.49, p ⬍ .001, RMSEA and SRMR were reason-
onto a different subscale, and as a result their final Dominance able; but CFI and TLI were inadequate (CFI ⫽ .859, TLI ⫽ .847,
subscale had only four items, the lowest number for any CMNI-94 RMSEA ⫽ .051; 90% CI [.049, .053]; SRMR ⫽ .063). Finally, for
subscale. Furthermore, in Parent and Moradi (2009), the Domi- the CMNI-94, whereas the chi-square was significant, 2(3,034) ⫽
nance subscale did not survive the factor analysis. In sum, the 10,694.66, p ⬍ .001, the RMSEA was good (.048, 90% CI [047,
Dominance items appear to be too vague or broad to support .049]), the SRMR was reasonable (.074), but the CFI (.786) and
continued use of the Dominance subscale. However, to ensure that TLI (.778) were poor. Hence, using the present data, the CMNI-30
these findings were not merely artifacts of prior study character- fit better than the 29-, 46-, and 94-item versions.
istics, we included the Dominance items in our EFA analysis. The
EFA results are shown in Table 2. Assessment of Measurement Invariance
Next, the measurement model was confirmed by using CFA to
estimate a common factors model. We then used the factor load- We next used multigroup CFAs of CMNI-30 responses to assess
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
ings from the common factors model to develop the two candidate configural, metric, scalar and residuals invariance. A series of
This document is copyrighted by the American Psychological Association or one of its allied publishers.
short-form models. The CTT model was developed by simply nested models was estimated, treating men of color and White men
selecting the three highest loading items for each factor, whereas as two separate subsamples in simultaneous estimations. All mod-
CTT-opt model was developed by selecting three items for each els used the common factors model as their dimensional structure.
factor with higher factor loadings but also ensuring that the content Although the chi-square was statistically significant for all models,
reflected all aspects of the subscale, while avoiding having too CFI, TLI, RMSEA, and SRMR, were at acceptable levels. Fol-
similar items, and preserving content validity (Goetz et al., 2013). lowing the recommended practices for testing invariance (cf.
This resulted in replacing 6 items of the CTT model. Then we Kline, 2016), models with increasingly stringent cross-group
compared them using CFA. The CTT model fit well: 2(360) ⫽ equality constraints were examined for differences in the scaled
786.46, p ⬍ .001, CFI ⫽ .961, TLI ⫽ .953, RMSEA ⫽ .033 (90% chi-square and CFI. A nonsignificant increase in chi-square or a
CI [.030, .036]), SRMR ⫽ .037. The CTT-Opt model fit reason- change in CFI (⌬CFI) less than established cut-offs between
ably well: 2(360) ⫽ 987.16, p ⬍ .001, CFI ⫽ .935, TLI ⫽ .921, models with and without cross-group equality constraints indicated
RMSEA ⫽ .040 (90% CI [.037, .043]), SRMR ⫽ .046. Comparing that the model with cross-group equality constraints did not sig-
the CTT versus the CTT OPT, ⌬CFI ⫽ ⫺.026 favoring CTT, nificantly degrade fit to the data, and the invariance/equivalence
above even the more liberal cutoff of .01. Hence, the best fitting between the two groups was supported.
candidate model for the CMNI-30 was CTT. The factors, items The first model imposed no equality constraints across
and standardized factor loadings of the CMNI-30 are shown in groups on any parameters, and thus provided a test of configural
Table 3. invariance—meaning that the same pattern of factor loadings
As a final analysis we assessed the variance composition of the held across groups. This least parsimonious model showed
CMNI-30, estimating in addition to the common factors model good fit to the data, 2(720) ⫽ 1,411.41, p ⬍ .001; CFI ⫽ .958,
discussed above, bifactor, hierarchical and unidimensional models. TLI ⫽ .949, RMSEA ⫽ .035, 90% CI [.033, .038], SRMR ⫽
Only the unidimensional model converged, and it showed poor fit .040. All standardized factor loadings were significant and
statistics: 2(405) ⫽ 10, 136, p ⬍ .001, CFI ⫽ .107, TLI ⫽ .041, ranged from .37 to .89 for men of color ranged and .53 to .92
for White men. Therefore, configural invariance was demon-
RMSEA ⫽ .148 (90% CI [.146, .151]), SRMR ⫽ .198. Hence, as
strated in this data set. Following recommended procedures, we
was found for the CMNI-94, the CMNI-30 is best modeled as a
next turned to tests of stronger factorial invariance (Vandenberg
common factors structure.
& Lance, 2000).
The second model constrained factor loadings to be equal for
CFA Comparisons With the CMNI-29, CMNI-46 and both groups of men, and thus provided a test of full metric
CMNI-94 invariance. Except for the statistically significant chi-square sta-
tistic, overall the fit of this model was good, 2(740) ⫽ 1,437.76,
We ran CFAs (common factors model) for the 29-, 46-, and p ⬍ .001; CFI ⫽ .957, TLI ⫽ .950, RMSEA ⫽ .035, 90% CI [.032,
94-item versions using our data. Of course, this was not an exact .038], SRMR ⫽ .041. All standardized factor loadings were sig-
comparison because we had changed the wording of many of the nificant and ranged from .42 to .90 for men of color and .52 to .92
items to improve item clarity and consistency as noted above, for White men. When the more parsimonious metric invariance
which would likely improve fit statistics. For the CMNI-29: model was compared with the configural invariance model, we
2(349) ⫽ 1,102.26; p ⬍ .001; CFI ⫽ .930; TLI ⫽ .919; found that the chi-square difference test was not statistically sig-
RMSEA ⫽ .044 (90% CI [.041, .047]); SRMR ⫽ .049. The nificant, ⌬2(20) ⫽ 25.55, ns, indicating that the more constrained
CMNI-30 had better fit statistics than the CMNI-29, notably with metric model did not degrade fit. Furthermore, the ⌬CFI was only
the CFI at .961 and TLI at .953 meeting the criterion for good fit .001 smaller, meeting the most restrictive criterion. Hence, the data
(⬎.95), whereas the CFI and TLI for the CMNI-29 was .930 and support full metric invariance.
.919, which is only considered reasonable fit. Also, the RMSEA Given that metric invariance was established with the data, a
and SRMR were lower for the CMNI-30. Comparing this analysis model specifying full scalar invariance (i.e., invariance of the item
to Hsu and Iwamoto’s (2014) results, we found a comparable CFI intercepts for the latent factors) was estimated. This model fit well
but the RMSEA and SRMR were a little worse in the present study in an absolute sense, 2(760) ⫽ 1,487.93, p ⬍ .001; CFI ⫽ .955,
but still less than .05. TLI ⫽ .949, RMSEA ⫽ .035, 90% CI [.033, .038], SRMR ⫽ .041.
8 LEVANT ET AL.
Table 2
Standardized Factor Pattern Loadings From Exploratory Factor Analysis of the CMNI-94 Items
F1: Winning
48. It is important for me to win .77
67. Winning is not important to me (R) .69
22. Winning is not my first priority (R) .65
75. The best feeling in the world comes from winning .59 .21
54. More often than not, losing does not bother me (R) .59
89. I work hard to win .58 .27
39. I don’t mind losing (R) .54 .22 .22
2. I will do anything to win .53
69. I like to always get my way (Dom) .49
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
F4: Playboy
3. I would change sexual partners often if I could .83
72. I would find it enjoyable to date more than one person at a time .79
47. I would feel good if I had many sexual partners .78
28. If I could, I would date a lot of different people .72
58. For me, committed relationships are better than casual sex (R) .72
38. I date only one person at a time (R) .67
83. I don’t want to get tied down to dating just one person .67
33. I don’t have sex unless I am dating the person (R) .50
78. I like emotional involvement in a romantic relationship (R) .45 .21
13. For me the best part about sex is feeling close to the person (R) .24 .38
F6: Risk-taking
40. I take risks .83
24. I enjoy taking risks .82
92. I hate any kind of risk (R) .71
55. It would be foolish for me to take risks (R) .21 .70
17. I do not like risky situations (R) .68
(table continues)
CONFORMITY TO MASCULINE NORMS INVENTORY-30 9
Table 2 (continued)
20. I feel bad when work takes up all my attention (R) .29 .39
F9: Violence
68. It’s never ok for me to be violent (R) ⫺.76
44. I think that violence is sometimes necessary ⫺.73
62. I am willing to get into a physical fight if it is necessary ⫺.70
25. I dislike any kind of violence (R) ⫺.69
79. Even if a person made me very angry, I would not use violence (R) ⫺.55
4. If there is going to be violence, I find a way to avoid it (R) ⫺.42
F10: Self-reliance
53. I never ask for help .75
85. It bothers me when I have to ask for help .68
74. I am not ashamed to ask for help .64
10. I hate asking for help .63
45. If I asked for help it would be a sign of failure. .61
Note. Item numbers refer to the number of the item in the original Conformity to Masculine Norms Inventory (CMNI). Factor (F) loadings ⬍ .2 were
suppressed. Dom refers to the original scale (Dominance) for these items. R ⫽ reverse scored.
All standardized factor loadings were significant and ranged from and potentially inconsequential (Nye et al., 2018), indicating
.42 to .90 for men of color and .52 to .90 for White men. However, partial scalar invariance.
the results were equivocal when comparing the scalar with the Given these results, which support a conclusion of partial
metric model, ⌬2(20) ⫽ 52.51, p ⬍ .001, ⌬CFI ⫽ .002. strong invariance (Kline, 2016), we proceeded to assess strict
Given the strength of the fit statistics and factor loadings we invariance, namely that of residuals, which (if found) would
evaluated scalar invariance further, following the recommenda- mean that the indicators measure the factors in each group
tions of Cheung and Lau (2012) and performed a series of post with the same degree of precision. The model had reasonable
hoc bootstrap confidence interval tests to examine the differ- fit: 2(810) ⫽ 1,836.64, p ⬍ .001; CFI ⫽ .937, TLI ⫽ .932,
ences between the groups on item intercepts. These tests deter- RMSEA ⫽ .041, 90% CI [.038, .043], SRMR ⫽ .054. All
mined that out of 30 comparisons, 11 were invariant; however, standardized factor loadings were significant and ranged from
in four comparisons, White men had larger intercepts, and in 15 .49 to .88 for men of color and .49 to .89 for White men.
comparisons, men of color had larger intercepts. Although this However, this model degraded fit when comparing the residuals
test revealed measurement noninvariance between White men with the scalar model, ⌬2(50) ⫽ 304.49, p ⬍ .001, ⌬CFI ⫽
and men of color, DMACS effect sizes ranged from .03 to .27. .018, exceeding even the more liberal criterion. However, given
Thus, although most intercepts were noninvariant, the magni- the strength of the fit statistics and factor loadings and docu-
tude of any measurement noninvariance was considered small mented Type I error problems associated with chi-square dif-
10 LEVANT ET AL.
Table 3
Standardized Factor Loading From Single-Group Confirmatory Factor Analysis of the CMNI-30
F2: Winning
75. For me, the best feeling in the world comes from winning .77
2. I will do anything to win .67
6. In general I must get my way .66
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
F3: Playboy
This document is copyrighted by the American Psychological Association or one of its allied publishers.
F4: Violence
68. It’s never ok for me to be violent (R) .77
44. I think that violence is sometimes necessary .71
25. I dislike any kind of violence (R) .67
F10: Risk-taking
24. I enjoy taking risks .86
40. I take risks .77
60. I put myself in risky situations .72
Note. Item numbers refer to the number of the item in the original Conformity to Masculine Norms Inventory (CMNI). F ⫽ factor; R ⫽ reverse scored.
ference tests and ⌬CFI (Cheung & Lau, 2012; Kline, 2016), we in the two racial/ethnic groups for the 10 latent factors of the
further evaluated residual invariance and performed a series of CMNI-30.
post hoc bootstrap confidence interval tests to examine the
differences between the groups on item residuals. These tests Descriptive Statistics
determined that out of 30 comparisons, 21 were invariant, Given the support for the common factors structure of the
suggesting partial residuals invariance (i.e., a minority of items CMNI-30 in the total sample, and for partial scalar and resid-
were noninvariant; Kline, 2016). In sum, the results support full uals invariance across groups, we provide raw score-based
configural and metric and partial scalar and residuals invariance subscale scores for the CMNI-30 separately for White men and
CONFORMITY TO MASCULINE NORMS INVENTORY-30 11
men of color, for purposes of comparison with future studies materials. The model with GAD-7 scores as the dependent variable
that use this instrument. Means, standard deviations and alpha was still significant, F(10, 1550) ⫽ 15.256, p ⬍ .001, R2 ⫽ 0.090.
coefficients for these subscales (as well as the other study The results for playboy, power over women and self-reliance were
variables) are displayed in Table 4, and correlation coefficients very similar to those with the CMNI-30, as were those for emo-
are shown in Table 5. It should be pointed out that the alpha tional control and primacy of work, which is no surprise since they
coefficients for two subscales (violence, pursuit of status) were the same items. The results for Winning was now associated
were ⬍.70 for men of color. with GAD-7 scores, negatively. As a preoccupation with success is
conceptually related to anxiety, this change is in the opposite
direction from what would be conceptually anticipated. Risk-
Validity Analyses
taking also became significantly associated with GAD-7 scores,
We conducted a preliminary assessment of the validity of the likely due to alterations in the covariance matrix among the sub-
CMNI-30 using diagnostic scores on two often-used assessments scales. For PHQ-2 scores the model was still significant, F(10,
of depression (GAD-7) and anxiety (PHQ-2). There were three sets 1548) ⫽ 15.992. p ⬍ .001, R2 ⫽ 0.090. Again, the results for
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
of analyses. First, scores on the 10 CMNI-30 subscales were playboy, power over women and self-reliance were similar to those
This document is copyrighted by the American Psychological Association or one of its allied publishers.
regressed on GAD-7 and PHQ-2 scores. The association between with the CMNI-30, as were those for emotional control and pri-
CMNI-30 subscales and GAD-7 scores was significant, F(10, macy of work, but not for pursuit of status. Winning was also
1550) ⫽ 14.88, p ⬍ .001, R2 ⫽ 0.09, as was their association with associated with PHQ-2 scores, again in the opposite direction from
PHQ-2, F(10, 1548) ⫽ 14.83, p ⬍ .001, R2 ⫽ 0.09. Results are what would be conceptually anticipated.
presented in Table 6. We had predicted that playboy, power over Third, we assessed for differences by majority or minority racial
women, and self-reliance scales would be moderately and posi- status in the relationship between CMNI-30 subscales and mea-
tively associated with diagnostic measures of depression and anx- sures of depression and anxiety by conducting moderation analy-
iety, which was partially supported. For anxiety the associations ses. Mean-centered transformations of the CMNI-30 subscales and
were significant and positive for playboy ( ⫽ .064), and self- race (dichotomously coded into White men and men of color) were
reliance ( ⫽ .278); however, they were significant and negative entered in the first block of a regression. The interaction terms
for power over women ( ⫽ ⫺.066). In addition, they were between race and each CMNI-30 subscale were entered in the
significant and negative for emotional control ( ⫽ ⫺.070), and second block. The interaction terms did not improve the regression
primacy of work ( ⫽ –.087). With regard to anxiety, the associ- in predicting PHQ-2 scores. Block 1, F(11, 1529) ⫽ 13.45, p ⬍
ations were significant and positive for playboy ( ⫽ .087), and .001, R2 ⫽ 0.09; Block 1 and 2, F(10, 1519) ⫽ 7.57, p ⬍ .001,
self-reliance ( ⫽ .269), but they were not significant for power R2 ⫽ 0.10; ⌬F ⫽ 1.09, p ⫽ .364. The interaction terms did add to
over women ( ⫽ .030); they were also significant and negative the prediction of GAD-7 scores, Block 1, F(11, 1531) ⫽ 13.39,
for emotional control ( ⫽ ⫺.064), pursuit of status ( ⫽ ⫺.053) p ⬍ .001, R2 ⫽ 0.08; Block 1 and 2, F(10, 1521) ⫽ 8.13, p ⬍ .001,
and primacy of work ( ⫽ ⫺.076). Ferguson (2009) recommends R2 ⫽ 0.10; ⌬F ⫽ 2.22, p ⫽ .015. Significant interactions were
a minimum effect size of .2, and for both depression and anxiety observed for race by the violence, playboy, and primacy of work
only self-reliance rises to this level. subscales. The simple slopes for the interactions are presented in
Second, we conducted the same analysis using the CMNI-29, in Figures 1 through 3 in the online supplemental materials. For the
which only the items loading on the winning, violence, and het- interaction between violence and race, though the interaction was
erosexual self-presentation differed from those in the CMNI-30. significant the slopes were not significant for either men of color
The results are shown in Table 2 in the online supplemental (t ⫽ ⫺1.00, p ⫽ .316) or White men (t ⫽ 0.23, p ⫽ .815). For the
Table 4
Means, Standard Deviations, and Alpha Coefficients for Study Variables Separately for Men of
Color and White Men
Table 5
Correlation Coefficients for Study Variables Separately for Men of Color and White Men
12. PHQ-2 .05 .03 .13ⴱⴱ .00 .00 ⫺.07ⴱ ⫺.03 .04 .25ⴱⴱ .01 .75ⴱⴱ —
This document is copyrighted by the American Psychological Association or one of its allied publishers.
Note. Values for men of color appear above the diagonal, and those for White men appear below the diagonal. F1 through F10 are the 10 factors of the
Conformity to Masculine Norms Inventory (CMNI)-30. GAD-7 ⫽ Generalized Anxiety Disorder–7-Item Scale; PHQ-2 ⫽ Patient Health Questionnaire–
2-Item Scale.
ⴱ
p ⬍ .05. ⴱⴱ p ⬍ .01.
interaction between playboy and race, though the interaction was crease the risk for Type I error. The observed effect indicates that the
significant the slopes were not significant for either men of color (t ⫽ association between primacy of work and GAD-7 scores is driven by
1.33, p ⫽ .184) or White men (t ⫽ 1.79, p ⫽ .858). Finally, for the the presence of a relationship between these variables for men of color
interaction between primacy of work and race, the slope was signif- only.
icant for people of color (t ⫽ ⫺3.25, p ⬍ .001) but was not significant
and positive for White men (t ⫽ ⫺.91, p ⫽ .365). Thus, though three
Discussion
significant interactions emerged, in only one was there a significant
effect among the two groups. This single association should be The purpose of this study was to refine the CMNI (Mahalik et
interpreted with caution as the multiple comparisons conducted in- al., 2003) for several reasons. First, societal masculine norms have
Table 6
Results Comparing Mood Disorder Diagnostic Assessments With CMNI-30 Scores
90% CI
Item B SE  p Upper bounds Lower bounds
GAD-7
Emotional control ⫺.345 .134 ⫺.070 .010 ⫺.608 ⫺.081
Winning .222 .182 .040 .223 ⫺.135 .579
Playboy .277 .113 .064 .014 .056 .498
Violence ⫺.011 .137 ⫺.002 .937 ⫺.279 .257
Heterosexual self-presentation .059 .126 .014 .639 ⫺.188 .305
Pursuit of status ⫺.225 .152 ⫺.040 .140 ⫺.522 .073
Primacy of work ⫺.881 .261 ⫺.087 .001 ⫺1.393 ⫺.370
Power over women ⫺.339 .161 ⫺.066 .036 ⫺.655 ⫺.023
Self-reliance 1.513 .150 .278 ⬍.001 1.219 1.806
Risk-taking .237 .149 .043 .114 ⫺.057 .530
PHQ-2
Emotional control ⫺.088 .038 ⫺.064 .020 ⫺.162 ⫺.014
Winning .018 .051 .012 .718 ⫺.082 .119
Playboy .105 .032 .087 .001 .043 .167
Violence ⫺.038 .038 ⫺.026 .326 ⫺.113 .038
Heterosexual self-presentation ⫺.044 .035 ⫺.036 .217 ⫺.113 .026
Pursuit of status ⫺.084 .043 ⫺.053 .049 ⫺.168 .000
Primacy of work ⫺.216 .073 ⫺.076 .003 ⫺.359 ⫺.072
Power over women .042 .045 .030 .348 ⫺.046 .131
Self-reliance .411 .042 .269 ⬍.001 .329 .494
Risk-taking .040 .042 .026 .336 ⫺.042 .123
Note. For Generalized Anxiety Disorder–7-Item Scale (GAD-7), 276 participants scored ⱖ10, and 1,284
participants scored ⬍10. For Patient Health Questionnaire–2-Item Scale (PHQ-2), 291 participants scored ⱖ3,
and 1,270 scored ⬍2. CMNI ⫽ Conformity to Masculine Norms Inventory.
CONFORMITY TO MASCULINE NORMS INVENTORY-30 13
continued to change in the nearly 20 years since the development were small per the DMACS analyses. Indeed, the size of noninvari-
of the inventory. Using slightly revised items to improve clarity ance fell in a range consistent with full invariance in simulated
and consistency with the construct of conformity and expanding data (Nye et al., 2018). Thus, while there likely are unidentified
the scale from four to six points to allow for greater variability and cultural components influencing the zero-point, the actual effect of
precision, we examined whether the revised items and factor these components may be inconsequential, though additional re-
structures are psychometrically sound, an important question to search will be needed to support this assertion. In this light, the
address given the changing constructions of masculinity. We also results indicated that the 10 factors of the CMNI-30 have the same
note that counseling psychology scale development practices have meaning and very close to the same zero points for White men and
markedly improved during that time. As such, examining the short men of color (Vandenberg & Lance, 2000; Xu & Tracey, 2017).
form of the inventory with these tools can provide greater insight Finally, we found evidence for the partial invariance of the resid-
into the validity of the measure as related to variance composition uals, with 70% being invariant, suggesting that for the most part
and measurement invariance. the factors in each group were measured with the same degree of
The refinement of the 94-item, 11-factor CMNI resulted in a precision. Such findings provide initial evidence that the CMNI-30
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
30-item, 10-factor CMNI-30. The fact that a 10-factor structure can be used across White men and men of color with relative
This document is copyrighted by the American Psychological Association or one of its allied publishers.
original measure, confirms that the measure is best used when Hsu, K., & Iwamoto, D. K. (2014). Testing for measurement invariance in
examining specific masculinity norms, provides evidence of the the Conformity to Masculine Norms-46 across White and Asian Amer-
measure’s full configural and metric invariance and partial scalar ican college men: Development and validity of the CMNI-29. Psychol-
and residual invariance between two racial/ethnic groups for the 10 ogy of Men & Masculinity, 15, 397– 406. http://dx.doi.org/10.1037/
latent factors, as well as preliminary validity evidence. As such, a0034548
IBM. (1989). IBM SPSS statistics (Version 25) [Computer software].
the current refinement demonstrates significant psychometric
Retrieved from www.ibm.com/SPSS-Statistics
strengths and we encourage its adoption for investigating confor- IBM. (2017). IBM SPSS statistics (Version 25) [Computer software].
mity to masculinity norms and their correlates. Retrieved from www.ibm.com/SPSS-Statistics
Kahn, J. H. (2006). Factor analysis in counseling psychology research,
References training, and practice: Principles, advances, and applications. The Coun-
seling Psychologist, 34, 684 –718. http://dx.doi.org/10.1177/
Alexander, M. (2012). The new Jim Crow: Mass incarceration in the age 0011000006286347
of colorblindness. New York, NY: The New Press. Kline, R. B. (2016). Principles and practice of structural equation mod-
Alto, K., McCullough, K., & Levant, R. F. (2018). Who is on Craigslist?
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Kroenke, K., Spitzer, R. L., & Williams, J. B. (2003). The Patient Health
ship. Psychology of Men & Masculinity, 19, 319 –324. http://dx.doi.org/ Questionnaire-2: Validity of a two-item depression screener. Medical
10.1037/men0000092 Care, 41, 1284 –1292. http://dx.doi.org/10.1097/01.MLR.0000093487
Casler, K., Bickel, L., & Hackett, E. (2013). Separate but equal? A .78664.3C
comparison of participants and data gathered via Amazon’s MTurk,
Levant, R. F., Hall, R. J., & Rankin, T. J. (2013). Male Role Norms
social media, and face-to-face behavioral testing. Computers in Human
Inventory-Short Form (MRNI-SF): Development, confirmatory factor
Behavior, 29, 2156 –2160. http://dx.doi.org/10.1016/j.chb.2013.05.009
analytic investigation of structure, and measurement invariance across
Chen, F. F. (2007). Sensitivity of goodness-of-fit indexes to lack of
gender. Journal of Counseling Psychology, 60, 228 –238. http://dx.doi
measurement invariance. Structural Equation Modeling, 14, 464 –504.
.org/10.1037/a0031545
http://dx.doi.org/10.1080/10705510701301834
Levant, R. F., Hall, R. J., Weigold, I. K., & McCurdy, E. R. (2015).
Cheung, G. W., & Lau, R. S. (2012). A direct comparison approach for
Construct distinctiveness and variance composition of multi-
testing measurement invariance. Organizational Research Methods, 15,
dimensional instruments: Three short-form masculinity measures. Jour-
167–198. http://dx.doi.org/10.1177/1094428111421987
nal of Counseling Psychology, 62, 488 –502. http://dx.doi.org/10.1037/
Cheung, G. W., & Rensvold, R. B. (2002). Evaluating goodness-of-fit
cou0000092
indexes for testing measurement invariance. Structural Equation Mod-
Levant, R. F., & Pryor, S. (in press). The tough standard: The hard truths
eling, 9, 233–255. http://dx.doi.org/10.1207/S15328007SEM0902_5
about masculinity. New York, NY: Oxford University Press.
Connell, R. W., & Messerschmidt, J. W. (2005). Hegemonic masculinity:
Levant, R. F., Wimer, D. J., & Williams, C. M. (2011). An evaluation of
Rethinking the concept. Gender & Society, 19, 829 – 859. http://dx.doi
the Health Behavior Inventory-20 (HBI-20) and its relationships to
.org/10.1177/0891243205278639
masculinity and attitudes toward seeking psychological help among
Costello, A. B., & Osborne, J. W. (2005). Best practices in exploratory
factor analysis: Four recommendations for getting the most from your college men. Psychology of Men & Masculinity, 12, 26 – 41. http://dx
analysis. Practical Assessment, Research & Evaluation, 10, 86 –99. .doi.org/10.1037/a0021014
http://dx.doi.org/10.4135/9781412995627.d8 Little, T. D., Cunningham, W. A., Shahar, G., & Widaman, K. F. (2002).
DeVellis, R. F. (2017). Scale development: Theory and applications (4th To parcel or not to parcel: Exploring the question, weighing the merits.
ed.). Thousand Oaks, CA: SAGE. Structural Equation Modeling, 9, 151–173. http://dx.doi.org/10.1207/
Ferguson, C. J. (2009). An effect size primer: A guide for clinicians and S15328007SEM0902_1
researchers. Professional Psychology, Research and Practice, 40, 532– Löwe, B., Decker, O., Müller, S., Brähler, E., Schellberg, D., Herzog, W.,
538. http://dx.doi.org/10.1037/a0015808 & Herzberg, P. Y. (2008). Validation and standardization of the Gen-
Gerdes, Z. T., & Levant, R. F. (2018). Complex relationships among eralized Anxiety Disorder Screener (GAD-7) in the general population.
masculine norms and health/well-being outcomes: Correlation patterns Medical Care, 46, 266 –274. http://dx.doi.org/10.1097/MLR
of the Conformity to Masculine Norms Inventory subscales. American .0b013e318160d093
Journal of Men’s Health, 12, 229 –240. http://dx.doi.org/10.1177/ MacCallum, R. C., Browne, M. W., & Sugawara, H. M. (1996). Power
1557988317745910 analysis and determination of sample size for covariance structure mod-
Goetz, C., Coste, J., Lemetayer, F., Rat, A. C., Montel, S., Recchia, S., . . . eling. Psychological Methods, 1, 130 –149. http://dx.doi.org/10.1037/
Guillemin, F. (2013). Item reduction based on rigorous methodological 1082-989X.1.2.130
guidelines is necessary to maintain validity when shortening composite Mahalik, J. R., Burns, S. M., & Syzdek, M. (2007). Masculinity and
measurement scales. Journal of Clinical Epidemiology, 66, 710 –718. perceived normative health behaviors as predictors of men’s health
http://dx.doi.org/10.1016/j.jclinepi.2012.12.015 behaviors. Social Science & Medicine, 64, 2201–2209. http://dx.doi.org/
Hammer, J. H., & Good, G. E. (2010). Positive psychology: An empirical 10.1016/j.socscimed.2007.02.035
examination of beneficial aspects of endorsement of masculine norms. Mahalik, J. R., Locke, B. D., Ludlow, L. H., Diemer, M. A., Scott, R. P.,
Psychology of Men & Masculinity, 11, 303–318. http://dx.doi.org/10 Gottfried, M., & Freitas, G. (2003). Development of the Conformity to
.1037/a0019056 Masculine Norms Inventory. Psychology of Men & Masculinity, 4, 3–25.
Hammer, J. H., Heath, P. J., & Vogel, D. L. (2018). Fate of the total score: http://dx.doi.org/10.1037/1524-9220.4.1.3
Dimensionality of the Conformity to Masculine Norms Inventory-46 Manea, L., Gilbody, S., Hewitt, C., North, A., Plummer, F., Richardson, R.,
(CMNI-46). Psychology of Men & Masculinity, 19, 645– 651. http://dx . . . McMillan, D. (2016). Identifying depression with the PHQ-2: A
.doi.org/10.1037/men0000147 diagnostic meta-analysis. Journal of Affective Disorders, 203, 382–395.
Heath, P. J., Brenner, R. E., Vogel, D. L., Lannin, D. G., & Strass, H. A. http://dx.doi.org/10.1016/j.jad.2016.06.003
(2017). Masculinity and barriers to seeking counseling: The buffering McDermott, R. C., Levant, R. F., Hammer, J. H., Hall, R. J., McKelvey,
role of self-compassion. Journal of Counseling Psychology, 64, 94 –103. D. K., & Jones, Z. (2017). Further examination of the factor structure of
http://dx.doi.org/10.1037/cou0000185 the Male Role Norms Inventory-Short Form (MRNI-SF): Measurement
CONFORMITY TO MASCULINE NORMS INVENTORY-30 15
considerations for women, men of color, and gay men. Journal of Journal of Management, 25, 935–953. http://dx.doi.org/10.1177/
Counseling Psychology, 64, 724 –738. http://dx.doi.org/10.1037/ 014920639902500607
cou0000225 Richardson, L. P., Rockhill, C., Russo, J. E., Grossman, D. C., Richards, J.,
Meade, A. W., Johnson, E. C., & Braddy, P. W. (2008). Power and McCarty, C., . . . Katon, W. (2010). Evaluation of the PHQ-2 as a brief
sensitivity of alternative fit indices in tests of measurement invariance. screen for detecting major depression among adolescents. Pediatrics,
Journal of Applied Psychology, 93, 568 –592. http://dx.doi.org/10.1037/ 125, e1097– e1103. http://dx.doi.org/10.1542/peds.2009-2712
0021-9010.93.3.568 Rutter, L. A., & Brown, T. A. (2017). Psychometric properties of the
Muthén, L. K., & Muthén, B. O. (1998 –2017). Mplus user’s guide (7th Generalized Anxiety Disorder Scale-7 (GAD-7) in outpatients with
ed.). Los Angeles, CA: Author. anxiety and mood disorders. Journal of Psychopathology and Behav-
Nye, C. D., Bradburn, J., Olenick, J., Bialko, C., & Drasgow, F. (2018). ioral Assessment, 39, 140 –146. http://dx.doi.org/10.1007/s10862-016-
How Big Are My Effects? Examining the Magnitude of Effect Sizes in 9571-9
Studies of Measurement Equivalence. Organizational Research Meth- Satorra, A., & Bentler, P. M. (1999). A scaled difference chi-square test
ods. Advance online publication. http://dx.doi.org/10.1177/1094 statistic for moment structure analysis. Psychometrika, 66, 507–514.
428118761122 http://dx.doi.org/10.1007/BF02296192
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Nye, C. D., & Drasgow, F. (2011). Effect size indices for analyses of Spitzer, R. L., Kroenke, K., Williams, J. B., & Löwe, B. (2006). A brief
measure for assessing generalized anxiety disorder: The GAD-7. Ar-
This document is copyrighted by the American Psychological Association or one of its allied publishers.