Professional Documents
Culture Documents
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission
PHYSICS EDUCATION RESEARCH SECTION
The Physics Education Research Section 共PERS兲 publishes articles describing important results from
the field of physics education research. Manuscripts should be submitted using the web-based system that
can be accessed via the American Journal of Physics home page, http://www.kzoo.edu/ajp/, and will be
forwarded to the PERS editor for consideration.
133 Am. J. Phys. 79 共1兲, January 2011 http://aapt.org/ajp © 2011 American Association of Physics Teachers 133
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission
Write an equation using the variables S and P to
represent the following statement: “There are six
times as many students as professors at this univer-
sity.” Use S for the number of students and P for
the number of professors.
134 Am. J. Phys., Vol. 79, No. 1, January 2011 E. T. Torigoe and G. E. Gladding 134
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission
Table I. Average and standard error for numeric and symbolic versions of each question in the study. The p-values were calculated using a two-tailed t-test.
The p-value represents the likelihood that such a difference could be observed under the assumption that the null hypothesis is true.
Question 1 Question 2 Question 3 Question 4 Question 5 Question 6 Question 7 Question 8 Question 9 Question 10
Numeric 91.5⫾ 1.4 93.3⫾ 1.3 79.6⫾ 2.1 90.5⫾ 1.5 44.9⫾ 2.5 61.2⫾ 2.4 76.0⫾ 2.1 33.2⫾ 2.3 78.6⫾ 2.0 48.8⫾ 2.3
Symbolic 70.4⫾ 2.3 56.8⫾ 2.5 63.4⫾ 2.4 82.3⫾ 1.9 31.9⫾ 2.3 52.9⫾ 2.5 75.6⫾ 2.1 29.8⫾ 2.2 54.5⫾ 2.5 52.8⫾ 2.4
Difference 21.1⫾ 2.7 36.5⫾ 2.8 16.2⫾ 3.1 8.1⫾ 2.4 13.0⫾ 3.4 8.3⫾ 3.4 0.4⫾ 2.9 3.4⫾ 3.1 24.1⫾ 3.2 −3.9⫾ 3.3
p-value ⬍0.001 ⬍0.001 ⬍0.001 ⬍0.001 ⬍0.001 0.01⬍ p ⬍ 0.05 0.5⬍ p 0.2⬍ p ⬍ 0.4 ⬍0.001 0.2⬍ p ⬍ 0.4
score and standard error for final 1 was 78.9⫾ 0.5 and for 1. Physics difficulty
final 2 was 78.6⫾ 0.5, and we conclude that the two groups
The score for the top 1/4 on the numeric version was used
were equivalent.
as a measure of each question’s physics difficulty. Questions
A sample of some question pairs used in the study is found
5 and 8 were significantly more difficult for the top 1/4 stu-
in Appendix A.12 All but one of the ten paired questions
dents than the other questions, and the difficulty of the phys-
contained analogous choices for each the numeric and sym-
ics content overwhelmingly dominated the mathematical dif-
bolic versions of the question.13 To discourage cheating,
ficulty of the questions. Consequently, we removed them
many of the choices were in a different order in the two
from further analysis in this study. This removal is supported
versions.
by the fact that the average scores by the bottom 1/4 for the
numeric versions of these questions was 23%, consistent
with the random guessing rate of 20%.
A. Results
Table I shows the score, the standard error, the difference, 2. Multiple equations
and the p-value of the difference for the numeric and sym- This property distinguishes whether the problem is com-
bolic versions of each question.14 Although there exist large monly solved with one equation or with multiple equations.
differences between numeric and symbolic scores for some The symbolic questions tend to be more difficult because
questions, which is consistent with our earlier findings, other symbol confusion often occurs when equations are com-
questions show very little difference in score between ver- bined. The presence of multiple equations allows for the pos-
sions. In the following, we describe properties of the ques- sibility of confusion of the same type of quantity 共for ex-
tions that influenced the difference in score between ver- ample, confusing two different velocities兲. Questions 1–3, 6,
sions. and 9 require multiple equations for their solution. Questions
An analysis of each of the ten questions, including the 4 and 7 are the only questions in which the solution can be
popularity of the correct choices and incorrect choices, and found with a single equation.
an analysis of students’ written work were conducted to un-
derstand why some questions showed differences between 3. Simultaneous equations
the versions, while others did not. Table II shows the differ-
ence in scores between versions and the associated question The solution is simultaneous if, for example, there are two
properties. Many of the question properties we identified re- equations and two unknowns and both unknowns are present
late to the meaningful use of symbolic equations. The fol- in both equations. In contrast, a sequential set of equations
lowing is a list of question properties with explanations. has an equation containing only a single unknown. Although
numeric values cannot be obtained until the equations are
combined in the first case, a numeric value can be obtained
immediately from one of the sequential equations. Question
Table II. Questions ranked by the difference in the score between numeric 10 is the only question in our sample that requires simulta-
and symbolic versions of each question with the associated question prop- neous equations for the solution. The numeric and symbolic
erties. The abbreviations are MultEq 共multiple equations in the solution兲, versions were equally difficult because students are forced to
MoGE 共manipulation of general equations兲, CompExp 共use of a compound
use the same procedure to solve both the numeric and the
expression兲, SingEq 共single equation in solution兲, SimulEq 共simultaneous
equations兲, ManipErr 共manipulation error兲, Diff 共difficult question兲.
symbolic versions.
135 Am. J. Phys., Vol. 79, No. 1, January 2011 E. T. Torigoe and G. E. Gladding 135
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission
any distinction between the numeric and symbolic versions.
For the top 1/4 only, one question had a ratio lower than
0.93.
A two-tailed t-test was performed to compare the score for
the bottom 1/4 on the numeric and symbolic versions of each
question. Questions 4, 7, 8, and 10 were the only questions
for which p ⬎ 0.05, and thus the hypothesis that the perfor-
mance by the bottom 1/4 on the numeric and symbolic ver-
sions of these questions were equivalent remains tenable.15
The high ratio for questions 8 and 10 is due to an equally
poor performance on both versions of those questions. Ques-
Fig. 2. Ratio of the symbolic version score to the numeric version score for tion 4 and 7 were the only questions that could be solved
different subgroups of the class for all ten questions. with single equations, and the error for question 4 was re-
lated to a manipulation error. The lack of a correlation indi-
cates that those types of question properties do not correlate
5. Use of a compound expression well with the overall success in the course. The low ratios for
the bottom 1/4 for the remaining questions, which contain
This property signifies that to reach the correct symbolic properties that stress meaningful representation, suggests that
solution students must replace a variable in a general equa- the bottom 1/4’s algebraic difficulties may be related to their
tion with a compound expression 共for example, replacing the poor performance in physics.
variable v by a more specific compound expression v / 2兲.
Questions 2, 6, and 9 require that students specify a vari-
III. CODING QUESTIONS BY MATHEMATICAL
able as a compound expression. For question 2, the second
STRUCTURE
most popular incorrect choice corresponded to using v in-
stead of v / 2 or t instead of t / 2. For question 6, the most Many of the question properties described in Sec. II A
popular incorrect choice corresponded to a failure to specify were discovered after the research was performed and were
the variable m as 2M. Question 9 required that students not independently validated. In this study, we attempt to vali-
specify both the variable m as 3m and the variable v as v / 3. date those question properties by analyzing an entire semes-
ter of exam questions. Specifically, we are interested in de-
termining if question properties that stress meaningful
6. Manipulation error algebraic representation can be used to identify discriminat-
This property signifies that a common error on this ques- ing exam questions.
tion is related to an incorrect manipulation of a symbolic We examined the set of exam questions administered in
equation. The mean score for the symbolic version of ques- Spring 2006 in Physics 211 and categorized them by the
tion 4 was 8.2% higher than the numeric version because of mathematical properties of their solutions. The exam ques-
this effect. An analysis of student work found that the differ- tions were made by faculty members with no knowledge of
ence in score was in large part due to students who setup the the question properties we intended to study. There were 169
energy conservation equation correctly, but who choose an unique16 multiple-choice questions administered 共three mid-
option corresponding to an incorrect manipulation of that terms and two versions of the final exam兲 and 870 students
equation. who completed the course without any missing exam grades.
We analyzed the mathematical structure of the solutions to
all of the questions in our sample and coded for the follow-
B. Connection to failure in physics ing types of questions: 共1兲 Multiple-equation symbolic ques-
tions; 共2兲 simultaneous-equation numeric questions; and 共3兲
To study the relation between the specific questions and single-equation numeric questions where the target unknown
the overall course performance, we divided the class into appears on opposite sides of the equal sign.
three subgroups based on the total course points. The groups This last property is equivalent to the form Ax + B = Cx
are the bottom 1/4, the middle half, and the top 1/4. For each + D, which is a structure that requires students to use alge-
group and for each question in the study, we calculated the braic methods. An example of a question with a solution of
ratio of the average score on the symbolic version to the this form can be found in Appendix B. We referred to ques-
average score on the numeric version 共see Fig. 2兲. This ratio tions with any of these three properties as equation priority
represents the fraction of the students who solved the nu- questions. As the name suggests, the solutions to these ques-
meric version correctly and who would also be able to solve tions emphasize the meaningful representation of symbolic
the symbolic version. Even though no student was given equations. The three question properties require students to
both versions of a single question, this interpretation is jus- formally represent the equations and prevent simple one
tified because of the equivalence of the midterm exam aver- equation at a time numeric sequential solutions. Specifically
ages of the students given each version of the final exam 共see omitted were single-equation and sequential-equation nu-
Sec. II兲. meric questions, both of which were shown in Sec. II B to be
In Sec. I we showed that an analysis of data from our poorly correlated with the overall performance in the course.
previous study1 revealed that the lowest ratio of symbolic to Two additional properties identified in Sec. II A were not
the numeric scores was observed for the bottom 1/4 of the included in this coding scheme: General equation manipula-
class. Our present results are consistent with that result. For tion and the use of compound expressions. Even though we
almost all questions, the smallest ratio was observed for the have some evidence that these properties tend to make sym-
bottom 1/4 of the class. In contrast, the top 1/4 rarely showed bolic problems more difficult, such a distinction is not im-
136 Am. J. Phys., Vol. 79, No. 1, January 2011 E. T. Torigoe and G. E. Gladding 136
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission
Table III. Comparison of equation priority and other questions. The error on
the mean difference was calculated using the distribution of question score
differences between groups 共Ref. 18兲.
Number 40 129
Bot. 1/4 score 33.9% ⫾ 2.5% 56.9% ⫾ 1.8%
Top 3/4 score 62.0% ⫾ 2.9% 77.5% ⫾ 1.5%
Mean difference 28.1% ⫾ 1.7% 20.6% ⫾ 0.9%
Pearson r 0.38 0.29
137 Am. J. Phys., Vol. 79, No. 1, January 2011 E. T. Torigoe and G. E. Gladding 137
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission
Table IV. Comparison between the bottom 1/4 and the rest of the class on Some research suggests that the structure of equations can
groupings of questions based on fixing the score for the top 3/4 of the class. assist the acquisition of physical concepts. Schwartz et al.19
Other difficult non-equation priority questions were found by omitting easy
found that equations help children develop an understanding
questions until the other question score was the same as the equation priority
question score for the top 3/4 of the class. Other matched questions were
of balance problems. They hypothesize that the structure of
created by matching each equation priority question to other non-equation the equations supports the precision of conceptual ideas, al-
priority questions by difficulty for the top 3/4 of the class. leviates working memory load, and allows for the organiza-
tion of multiple parameters. Sloutsky et al.20 found that
Equation priority Other difficult Other matched learning and transfer can be facilitated when knowledge is
expressed in an abstract generic form. In introductory phys-
Number 40 57 40 ics the ability to meaningfully construct and interpret sym-
Bot. 1/4 score 33.9% ⫾ 2.5% 40.7% ⫾ 1.9% 41.7% ⫾ 2.9% bolic equations may be beneficial to learning in all aspects of
Top 3/4 score 62.0% ⫾ 2.9% 61.9% ⫾ 1.8% 62.3% ⫾ 2.8% the course. Conversely, students who are unable to under-
Mean difference 28.1% ⫾ 1.7% 21.2% ⫾ 1.3% 20.6% ⫾ 1.6% stand symbolic equations may learn less than those that do.
Pearson r 0.38 0.29 0.29 The question properties we have identified penalize stu-
dents who attempt to solve problems without understanding
the meaning of the symbolic equations they use. Tuminaro21
observed an activity that he called “recursive plug-and-chug”
effect of question difficulty, we used two methods of omit- while studying students working on homework questions in
ting the other questions so that the difficulty for the top 3/4 groups. In this activity students try to match variables to a
of the class on other questions was the same. This procedure list of equations. If they find an equation where the target
allowed us to compare the performance of the bottom 1/4 on quantity is the only unknown, they solve for the target quan-
two sets of questions that were equally difficult for the top tity; if it is not, they replay the game until an appropriate
3/4. The first method equalized the difficulty of both ques- equation is found. Although this procedure might result in
tions for the top 3/4 of the class by omitting the easiest other the correct answer for some questions, it does not require an
non-equation priority questions. The second method understanding of the underlying physics, and it is not very
achieved the same result by matching the difficulty of each effective for simultaneous-equation and symbolic questions.
equation priority question to another non-equation priority Symbolic questions inhibit strategies like recursive plug-
question for the top 3/4 of the class. Table IV shows that a and-chug by allowing for a greater amount of confusion of
gap in the mean difference remains almost unchanged after meaning. Questions that contain incorrect options corre-
these two corrections for question difficulty. sponding to the manipulation of general equations and the
The discriminatory ability between equation priority and failure to specify variables as compound expressions penal-
other questions is unchanged even when different methods ize students who use strategies that do not understand the
are employed to correct for question difficulty. Hence, the meaning of the symbols and symbolic expressions.
discriminatory ability of equation priority questions is not an These confusions of meaning might be alleviated by the
effect of the difficulty of the questions. use of subscripts on symbols, but few students use them.
Although seemingly trivial, subscripts allow experts to dis-
IV. DISCUSSION tinguish variables from specific quantities belonging to spe-
cific objects or individuals. A common error on symbolic
Our studies show the importance of mathematical struc- questions was the confusion of two quantities of the same
tures in the difficulty and discrimination of questions in in- type. For example, combining two equations by inappropri-
troductory physics. Questions whose structure force alge- ately equating object A’s velocity with object B’s velocity. If
braic representation tend to be more difficult and more students were in the practice of immediately writing sub-
discriminating than questions that can be solved by a series scripts after choosing a general equation, they would be
of calculations. Even though these properties are not neces- forced to consider the meaning of the symbols that they were
sarily related to the physics content of the questions, perfor- using.
mance on questions with these properties is correlated with Conceptual exams like the force concept inventory 共FCI兲
the overall performance in the course. have demonstrated that the ability to solve quantitative prob-
This relation may exist for a variety of reasons. One rea- lems does not always indicate conceptual understanding.
son might be the teaching methods employed in introductory Mazur22 demonstrated that high average scores for complex
physics. Because symbolic algebra is vital to the understand- quantitative questions can often be associated with low av-
ing of the expert physicists who teach introductory physics, erage scores on analogous conceptual questions with the
it is often a prominent component of instruction. At the Uni- same population of students. We suggest that the question
versity of Illinois symbolic derivations are often used to in- properties we have identified can bridge the gap between
troduce new concepts, and symbolic example questions are quantitative and conceptual questions. The question proper-
often shown to demonstrate general problem solving proce- ties we have described can be used by instructors to produce
dures. However, students with algebraic difficulties may be quantitative questions that emphasize meaningful symbolic
easily confused by such methods. representation.
The correlations might also reflect that the ability of stu- The use of these question properties may have the effect of
dents to solve questions symbolically may increase the effec- reorganizing the reward structure in our physics courses. Stu-
tiveness of practice problems by allowing for a more general dents who were rewarded on exams and homework assign-
understanding of the solution. The ability to see the symbolic ments for using strategies that did not require an understand-
structure of a problem might improve the probability that ing of the equations they used would not be rewarded with
students with this ability will be able to solve similarly struc- the types of questions we have described.
tured questions on later exams. Improving students’ ability to symbolically represent rela-
138 Am. J. Phys., Vol. 79, No. 1, January 2011 E. T. Torigoe and G. E. Gladding 138
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission
Fig. 5. Diagram for question 2.
ACKNOWLEDGMENTS
The authors would like to thank the reviewers for their
helpful comments. The authors would also like to thank the
members of the physics education research group at the Uni-
versity of Illinois at Urbana-Champaign. This material is Fig. 7. Diagram for question 10.
based upon work supported by NSF DUE 0088734 and NSF
DUE 0341261.
139 Am. J. Phys., Vol. 79, No. 1, January 2011 E. T. Torigoe and G. E. Gladding 139
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission
7
J. H. Larkin, J. McDermott, D. P. Simon, and H. A. Simon, “Models of
competence in solving physics problems,” Cogn. Sci. 4 共4兲, 317–345
共1980兲.
8
J. Clement, “Algebra word problem solutions: Thought processes under-
lying a common misconception,” J. Res. Math. Educ. 13 共1兲, 16–30
共1982兲.
9
E. Cohen and S. E. Kanim, “Factors influencing the algebra ‘reversal
error’,” Am. J. Phys. 73 共11兲, 1072–1078 共2005兲.
10
E. Soloway, J. Lochhead, and J. Clement, “Does computer programming
enhance problem solving ability? Some positive evidence on algebra
word problems,” in Computer Literacy, edited by R. J. Seidel, R. E.
Anderson, and B. Hunter 共Academic, Burlington, 1982兲, pp. 171–201.
11
M. Scott, T. Stelzer, and G. Gladding, “Evaluating multiple-choice exams
in large introductory physics courses,” Phys. Rev. ST Phys. Educ. Res. 2
共2兲, 020102 共2006兲.
12
Fig. 8. An example of a question requiring a single numeric equation where See supplementary material at http://dx.doi.org/10.1119/1.3487941 for all
the target unknown appears on opposite sides of the equal sign. ten numeric and symbolic pairs of questions used in this study.
13
Question 4 was created by modifying an existing symbolic question.
When numbers were introduced to create the numeric version, one of the
APPENDIX B: SAMPLE EQUATION PRIORITY symbolic options corresponded to an imaginary quantity. To ensure the
similarity of all of the options, only the magnitude of this quantity was
QUESTION displayed in the numeric version. Two of the other five options for this
We give an example of a question that requires a single question do not agree between the versions, but each of these options was
chosen by 2% or less of the students.
numeric equation, where the unknown target appears on op- 14
The p-value represents the likelihood that such a difference can be ob-
posite sides of the equal sign. 共Physics 211 Spring 2006 served under the assumption that the null hypothesis is true 共see Ref. 15兲.
exam 2, question 10.兲 15
G. V. Glass and K. D. Hopkins, Statistical Methods in Education and
Three boxes are arranged as shown. The middle box has a Psychology, 2nd ed. 共Prentice-Hall, Englewood Cliffs, NJ, 1984兲, pp.
mass of 2 kg and accelerates to the right at 5 m / s2 on a 229–235.
16
horizontal frictionless table. The boxes to the left and right Some questions were common between the two versions of the final
hang freely, suspended by strings over massless, frictionless exam.
17
The discrimination of multiple-choice questions is most commonly mea-
pulleys. The tension in the left string is T1 = 10 N. What is sured using the point biserial coefficient of correlation because the result
the mass of the box M 1 on the left? 共See Fig. 8.兲 of a multiple-choice question is most commonly dichotomous. The
multiple-choice questions in this study were analyzed using the Pearson
共a兲 M 1 = 0.675 kg
correlation coefficient r, because students were given partial credit for
共b兲 M 1 = 1.91 kg multiple selections. As a result a student could receive a score of 0, 0.33,
共c兲 M 1 = 2.13 kg 0.5, or 1 on each question.
共d兲 M 1 = 3.75 kg 18
The error of the mean difference shown in Table III is less than what one
共e兲 M 1 = 4.16 kg would calculate if the errors for the top and bottom groups were com-
bined in quadrature. To calculate the error shown, we took advantage of
the fact that the difference in score between the top and bottom groups
a兲
Electronic mail: etorigoe@gmail.com could be determined for each question. The error in the mean difference
1
E. Torigoe and G. Gladding, “Same to us, different to them: Numeric for the equation priority questions was determined by calculating the
computation versus symbolic representation,” in 2006 Physics Education variance of the distribution of differences for the 40 equation priority
Research Conference, edited by L. McCullough et al. 共AIP, New York, questions. This process of pairing data is analogous to how one would
2007兲, pp. 153–156. calculate gains on the FCI by pairing the each precourse and postcourse
2
E. Torigoe, “What kind of math matters? A study of the relationship score by student, rather than finding the mean difference between the
between mathematical ability and success in physics,” Ph.D. dissertation, average precourse score and the postcourse score for the class as a whole.
19
University of Illinois at Urbana-Champaign, 2008. D. L. Schwartz, T. Martin, and J. Pfaffman, “How mathematics propels
3
C. Kieran, “Cognitive processes involved in learning school algebra,” in the development of physical knowledge,” Cognit. Dev. 6 共1兲, 65–88
Mathematics and Cognition: A Research Synthesis by the International 共2005兲.
20
Group for the Psychology of Mathematics Education, edited by P. Nesher V. M. Sloutsky, J. A. Kaminski, and A. F. Heckler, “The advantage of
and K. Kilpatrick 共Cambridge U. P., Cambridge, 1990兲, pp. 96–112. simple symbols for learning and transfer,” Psychon. Bull. Rev. 12 共3兲,
4
C. Kieran, “The learning and teaching of school algebra,” in Handbook of 508–513 共2005兲.
21
Research on Mathematics Learning and Teaching, edited by D. Grouws J. Tuminaro and E. F. Redish, “Elements of a cognitive model of physics
共Macmillan, New York, 1992兲, pp. 390–419. problem solving: Epistemic games,” Phys. Rev. ST Phys. Educ. Res. 3
5
E. Filloy and T. Rojano, “Solving equations: The transition from arith- 共2兲, 020101 共2007兲.
metic to algebra,” For the Learning of Mathematics 9 共2兲, 19–25 共1989兲. 22
E. Mazur, Peer Instruction: A User’s Manual 共Prentice-Hall, Upper
6
The examples shown are from Ref. 4. Saddle River, NJ, 1997兲, pp. 5–7.
140 Am. J. Phys., Vol. 79, No. 1, January 2011 E. T. Torigoe and G. E. Gladding 140
Downloaded 03 Jun 2013 to 216.47.136.20. Redistribution subject to AAPT license or copyright; see http://ajp.aapt.org/authors/copyright_permission