Professional Documents
Culture Documents
To cite this article: Ying Xie, Yan Chen & Lan Hui Ryder (2019): Effects of using mobile-based
virtual reality on Chinese L2 students’ oral proficiency, Computer Assisted Language Learning,
DOI: 10.1080/09588221.2019.1604551
ABSTRACT KEYWORDS
This article reports a mixed-methods study about using vir- VR tools; second language
tual reality (VR) tools (Google Cardboard and Expeditions) learning; oral proficiency;
for developing students’ oral proficiency in learning role play in VR
Chinese as a second language. Twelve students role-played
as tour guides for six locations throughout a semester: four
of them with VR tools and two without. Data collection
included oral presentations recorded at four different times
of the semester (for Presentations 1, 3, 4, and 6), class
observations, participants’ reflection papers, and individual
interviews at the end of the semester. Results revealed that
the content and vocabulary of participants’ oral presenta-
tions when using VR tools scored statistically significantly
higher than when not using VR tools. Qualitative data
showed that participants felt that VR tools facilitated prep-
aration and encouraged active learning.
Introduction
The Communicative Language Teaching (CLT) approach emphasizes
that language learning occurs through intensive interactions in and max-
imum use of the target language (Larsen-Freeman, 2000; Turnbull, 2001).
Specifically to achieve fluent and accurate oral proficiency, a sufficient
amount of practice is needed. Some of the recent research attempts to
promote L2 oral proficiency examined the effects of synchronous com-
puter-based chats (Payne & Ross, 2005), web-based voice virtual environ-
ments (Ockey, Gu, & Keehner, 2017), flipped classroom (Wang, An, &
Wright, 2018), digital storytelling (Kim, 2014), and CALL system with
speech technology (Strik, Cornillie, Colpaert, van Doremalen, &
Cucchiarini, 2009). Other researchers (Cheng, Zhan, & Tsai, 2010;
Peterson, 2001) advocate the use of authentic contexts for creating social
Theoretical framework
In the attempt to explain the learning processes, Dalgarno and Lee
(2010) created a model of learning in three-dimensional (3D) virtual
environments (VE), recognizing two unique features as representational
fidelity and high degrees of learner interactions.
Most recently, based on the Information Processing theory (Atkinson
& Shiffrin, 1968) and the Embodied Cognition theory (Varela,
Thompson, & Rosch, 2017), Ladendorf, Schneider, and Xie (2019) estab-
lished the Hypothetical Model of Immersive Cognition (HMIC). The key
principles of the hypothetical model denote that the VR tools create an
immersive sense of presence by activating visual and motor channels and
tricking the brain into believing physical stimuli is present. Since objects
in the space immediately surrounding a person’s body are processed in a
more thorough and complex manner (Holmes & Spence, 2004), the
COMPUTER ASSISTED LANGUAGE LEARNING 3
VR tools in research
The field has seen a rise of research studies about VR tools recently. Due
to the versatile nature of the technology, VR has been used as training
tools (Desai, Desai, Ajmera, & Mehta, 2014; Le, Pedro, & Park, 2015), a
treatment tool for learning in affective domains, such as empathy, pub-
lic-speaking anxiety, and posttraumatic stress disorder (Ahn et al., 2016;
Anderson, Zimand, Hodges, & Rothbaum, 2005; Parsons & Rizzo, 2008;
Passig, Eden, & Heled, 2007; Rizzo et al., 2010), and on a number of sci-
ence and engineering subjects (Bassanino et al., 2010; Lisichenko, 2015;
Merchant et al., 2012).
Among these newly available tools, Google Cardboard turns any
Android or Apple mobile phone into a head-mounted virtual reality
viewer. The Cardboard device is made of curved lenses that split the
phone’s screen and give viewers the feeling of physically being in a spe-
cific location. Google Expeditions, the newly released app that works
with Google Cardboard, takes viewers to multiple locations around the
world. Viewers use the Cardboard and mobile device to transport them-
selves to different environments. The viewer controls movements in a
360-degree pattern. The viewer provides an up-close and personal view
of the specific landmark, city or environment.
The latest studies conducted by Lee, Sergueeva, Catangui, and
Kandaurova (2017) and Rupp et al. (2016) only found that students’
interest in VR did not lead to improved learning. Both of the studies
used the VR tools for a relatively short period of time. Studies using VR
tools for an extended duration or for oral language learning are rare.
Research questions
The present study aimed to explore the potential of VR tools for a
semester-long L2 course by answering the following questions:
Participants
Twelve students (four females and eight males) enrolled in the advanced
Chinese course at a state university in the United States participated in
the study (10 English-native speakers, 2 Chinese heritage learners). They
all majored in non-language disciplines, such as Computer Science,
International Relations, Psychology, Nursing, French, and Business. Eight
participants had briefly studied in China before. The majority of the par-
ticipants had taken 3-4 Chinese courses at the university level.
Participants reported that on average, they spent 1.29 hours every day on
Chinese learning, mainly for pleasure, reading, writing, Internet brows-
ing, TV watching, and music.
Procedure
During the first week of the class, the instructor and the research assist-
ant demonstrated the use of Google Cardboard and Google Expedition.
The instructor provided a list of 12 topics (famous attractions in China)
for students to vote on. Students’ selected places included a university
campus, the Great Wall, Xi’an, Guilin, Beijing, Jiuzhaigou. Two topics
were randomly chosen to be presented by using traditional meth-
ods (PowerPoint).
Starting from Week 2, participants were randomly paired for each of
the six presentation topics so that no two participants worked together
on more than one topic. For every two weeks, all dyads were asked to
thoroughly study the same location and present to the rest of the class in
Chinese as ‘tour guides’. Although the overall location or topic was the
same for all participants in any given week, they were free to choose any
points of interests of the same location for their presentations. The infor-
mation about the attractions on Google Expedition was in English.
COMPUTER ASSISTED LANGUAGE LEARNING 5
Participants were encouraged to look for and use any additional informa-
tion in either English or Chinese for their presentations.
The VR tools were used for Presentations 2, 3, 4, and 5 while
PowerPoint was used for Presentations 1 and 6 for comparison purposes.
When the VR tools were employed, the presenters had complete control
and directed the other students to look toward specific areas they were
presenting on. As a requirement of serving as ‘tour guides’, participants
were asked to only point to or use pictures without textual information
when presenting. To make sure that the variety of the topics was not a
confounding variable, participants were asked to rate the difficulty of dif-
ferent topics on a scale of 1 (easiest) to 5 (most difficult). Comparison of
repeated measures using a nonparametric Friedman’s test showed that
there was no statistical difference among participants’ average ratings of
the difficulty levels for the four topics (v2 (3)¼3.56, p ¼ 0.35 > 0.05).
Data collection included audio-recorded oral presentations, class obser-
vations of all six presentations, two reflection papers (after Presentations
1 and 5), and individual interviews at the end of the semester.
Data analysis
An Oral Proficiency Rubric adapted from two separate scales was used
to code participants’ presentations (Padilla & Sung, 1999; Wu, Marek, &
Chen, 2013; see Appendix A). This adapted rubric, independent of the
language under examination, evaluates oral proficiency into dimensions
of content, fluency, vocabulary, pronunciation, and grammar. Two
experts in Chinese language, also the co-authors of this paper, first dis-
cussed the coding scheme and reached a common understanding of the
rubric. Then they graded one set of recorded presentations separately (12
oral presentations). Even though the presentation was conducted in pairs,
each participant was graded individually and received a unique score
based on his or her own part of the presentations. The inter-rater reli-
ability (Cohen’s Kappa) were 0.79 (content), 0.88 (fluency), 0.83 (vocabu-
lary), 0.88 (pronunciation), 1.00 (grammar), and 0.91 (ICC for total
score). Then both experts graded the rest of the presentations separately
and the average ratings were used for data analysis.
To answer the first research question about the effect of time on par-
ticipants’ oral proficiency, a within-subjects experimental design and cor-
responding analysis was employed to increase statistical power and
reduce the impact of between subject variability. Five nonparametric
Friedman’s ANOVAs on the five dimensions were performed with
Wilcoxon signed-rank tests as post-hoc analyses when necessary. A
6 Y. XIE ET AL.
repeated-measure ANOVA was performed for the total scores for their
oral presentations.
For the second research question about the effect of VR tools on partici-
pants’ oral proficiency, a dependent-sample t-test on six pairs (for five
dimensions and the total score) was performed between (1) the averages of
Presentations 1 and 6 and (2) the averages of Presentations 3 and 4. These
presentations were selected to eliminate the possible threats to internal val-
idity introduced by variables such as time, order and participants’ maturity.
Participants’ reflection papers, and individual interviews with all 12
participants were transcribed and analyzed by using an interpretive quali-
tative data analysis approach to provide further and possible explanations
to the findings of the quantitative analysis (Creswell, 2007). According to
Elliot and Timulak (2005), an interpretive analysis method is appropriate
when the research focus is to explain why the phenomenon could pos-
sibly come about and how it evolves over time.
Results
Change of participants’ oral presentations over time
Table 1 shows the descriptive statistics and results from the repeated
measure analyses of participants’ oral presentations over time. Figure 1
shows the changes of the five dimensions over time.
Friedman’s tests showed two statistically significant differences in par-
ticipants’ content and vocabulary of their oral presentations over time.
Post-hoc analyses with Wilcoxon signed-rank tests with Bonferroni cor-
rections were conducted for the content and vocabulary dimensions to
identify statistically significant differences over time (see Table 2).
The mean score for participants’ content of their presentations (±SD)
was 3.25 (0.62) for the first PPT-based presentation as the baseline, 4.08
(0.67) for Presentation 3, 3.8 (0.57) for Presentation 4, and 3.33 (0.49)
for Presentation 6. Four statistically significant differences were found:
(a) between Presentation 3 and the baseline (Z¼ 2.43, p ¼ 0.02); (b)
between Presentation 4 and the baseline (Z¼ 2.65, p ¼ 0.01); (c)
between Presentations 6 and 3 (Z¼ 2.46, p ¼ 0.01); and (d) between
Presentations 6 and 4 (Z¼ 2.36, p ¼ 0.01).
The mean for participants’ vocabulary score of their presentations
(±SD) was 2.83 (0.83) for the first PPT-based presentation as the baseline,
3.50 (0.52) for Presentation 3, 3.41 (0.67) for Presentation 4, and 3.17
(0.72) for Presentation 6. Three statistically significant differences were
found: (a) between Presentation 3 and the baseline (Z¼ 2.31, p ¼ 0.02);
(b) between Presentation 4 and the baseline (Z¼ 2.65, p ¼ 0.01); and
(c) between Presentation 6 and the baseline (Z¼ 2.00, p ¼ 0.05).
Table 1. Descriptive statistics, Friedman’s ANOVAs, and repeated measure analyses of participants’ oral presentations over time.
Presentation 1 M Presentation 3 M Presentation 4 M Presentation 6 M
Outcome (SD) (No VR) (SD) (With VR) (SD) (With VR) (SD) (No VR) N F or v2 df p
Content 3.25 (0.62) 4.08 (0.67) 3.83 (0.57) 3.33 (0.49) 12 16.97 (v2) 3 0.001
Fluency 3.17 (0.71) 3.25 (0.75) 3.41 (1.16) 3.41 (1.16) 12 3.86 (v2) 3 0.28
Vocabulary 2.83 (0.83) 3.50 (0.52) 3.41 (0.67) 3.17 (.72) 12 13.03 (v2) 3 0.01
Pronunciation 3.25 (1.05) 3.25 (0.87) 3.25 (1.05) 3.17 (1.11) 12 1.00 (v2) 3 0.80
Grammar 3.33 (0.77) 3.17 (0.71) 3.17 (0.83) 3.17 (0.83) 12 2.00 (v2) 3 0.57
Total 15.83 (3.61) 17.25 (3.07) 17.08 (3.91) 16.25 (3.91) 12 3.36 (F) 1.57 0.07
p < 0.05.
Mauchly’s test of sphericity was statistically significant, indicating that the assumption of sphericity had been violated, v2(5)¼17.580, p ¼ 0.004. As a result, the Greenhouse–Geisser
correction is reported here.
COMPUTER ASSISTED LANGUAGE LEARNING
7
8 Y. XIE ET AL.
Table 2. Pair-wise post-hoc analysis of the content and vocabulary dimensions of partici-
pants’ oral presentation.
Outcome dimension (I) Presentation (J) Presentation Z (J I) p
Content 1 3 2.43a 0.02
4 2.65a 0.01
6 0.58a 0.56
3 4 1.34b 0.18
6 2.46b 0.01
4 6 2.34b 0.01
Vocabulary 1 3 2.31a 0.02
4 2.65a 0.01
6 2.00a 0.04
3 4 0.58b 0.56
6 1.41b 0.16
4 6 1.73b 0.08
a
Based on negative ranks.
b
Based on positive ranks.
p < 0.05.
Effects of VR tools
A dependent-sample t test was conducted on six pairs (for five dimen-
sions and the total score) between (1) the averages of Presentations 1
and 6 and (2) the averages of Presentations 3 and 4. Three statistically
significant differences were found, as shown in Table 3.
In general, participants’ content, vocabulary and total scores when
using VR tools scored statistically significantly higher than when not using
VR tools whereas the other dimensions including fluency, pronunciation
and grammar scored similarly with or without these tools. In addition, the
effect sizes (Cohen’s d) for all three statistically significant differences were
relatively large (for content: 1.24; vocabulary: 0.85; and total: 0.79). As a
result, the null hypothesis for the second research question was rejected.
COMPUTER ASSISTED LANGUAGE LEARNING 9
Table 3. Descriptive statistics, t-test, and effect size for participants’ oral presentation.
Without VR With VR
(Presentations 1 (Presentations 3
and 6) and 4)
Outcome M SD M SD N t Cohen’s d p
Content 3.29 0.49 3.96 0.54 12 4.30 1.24 0.001
Fluency 3.29 0.91 3.33 0.93 12 0.56 0.16 0.59
Vocabulary 3.00 0.74 3.46 0.54 12 2.93 0.85 0.014
Pronunciation 3.21 1.07 3.24 0.94 12 0.56 0.16 0.59
Grammar 3.25 0.75 3.17 0.75 12 0.80 0.23 0.44
Total 16.04 3.72 17.17 3.36 12 2.72 0.79 0.02
p < 0.05.
online and translated their findings into Chinese. Comparing the need to
translate the materials they found on search engines, they expressed
more preference to the VR tool. Gladys explained,
(when preparing for PPT), if you are not familiar with the place, you Google it,
and pick up the pics popped up, it makes me more nervous, because I never
know if it is accurate. I had to spend lots of time just to make sure …
Preparing the oral presentations with the VR apps offered more reli-
able information-enriched learning opportunities for these participants.
As a result, it seemed that preparing for PPT presentations could be
more challenging and time-demanding than for VR-supported
presentations.
The chosen topics were real-life places in China. They were not only
learning the language itself but also learning about the culture, and the
latter inevitably brought out newer content and vocabulary. As
David mentioned,
From these projects, I learned about places I did not know existed, such as Guilin
or Jiuzhaigou. I learned about their rich histories. In cases like Beijing, I learned
about the modernity in those cities, such as the CCTV headquarters
Second, VR tools brought real-life objects into close proximity and encour-
aged active learning
One advantage that most participants agreed was that the VR tools cre-
ated an immersive sense of being present in the scene. For example,
Larry stated,
Google Expedition interested me in these projects because I get to see a 360 view
of the area that I am presenting, whereas if you Google image the place. I might
just be seeing a far-away screenshot of it instead of looking directly. Almost as if I
am there.
Gladys stated, ‘It was easier because the tools make it more fun to learn
it. Easier to because we had to actively look for, find, and point to a spot’.
According to Holmes and Spence (2004), objects within the space
immediately surrounding the body are processed in more modalities of
sensory information whereas objects in extrapersonal spaces (far away
for the body to directly interact with) can only be perceived through a
limited number of senses. Further, the HMIC proposed by Ladendorf
et al. (2019) posits that the life-like visual stimuli offered by the VR tools
could possibly create an immersive sense of presence, so as to trigger the
brain to activate schema to process information in multisensory repre-
sentations. As a result, participants paid more attention to the details to
the objects virtually brought to their close proximity. David corrobo-
rated, ‘Using these tools did spark an interest in learning about these pla-
ces, seeing that, one could explain in detail about a location while seeing
it right before you’.
Her reflection not only showed that these VR tools encouraged active
learning but, to a great extent, promoted in-depth and extensive scrutiny
about the topic of interest.
Similarly, Amy commented, ‘The topics themselves were interesting. The
areas such as Guilin did get difficult only because I was not familiar with
nature-like words. It was interesting learning key facts in each section and
discovering a bit of history’. Generally, the immersive capability of the
VR tools might have allowed the participants to relatively easily identify
their knowledge gaps. The attractive, vivid, and real-life presentations of
the scenes through these tools inspired further explorations. These
12 Y. XIE ET AL.
In addition, using these tools provided them extra time and space
to relax during the presentations as Jason noticed, ‘It [using these
tools] gives you time to think when you are looking for the next spot
you are talking about’. Similarly, Henry commented, ‘I was not ner-
vous. The tour gave a slight buffer or pause so presenters could find
their words’. Since participants’ anxiety level decreased, their oral flu-
ency was not negatively affected by expanding the lexicon of their
presentations.
A third factor that could have explained why participants managed to
maintain their oral fluency when using newer vocabulary was the avail-
ability of both visual and verbal information on the VR tool during their
talk. For example, Jason mentioned,
The tool gave us a visual assistance to what we were talking about. When I was
talking about the archery hole on the great wall. I simply pointed out them on the
picture and said, ‘look, those are the archery holes’ instead of using a different
PowerPoint for your presentation.
The visual and verbal information not only contextualized their talking
points (Hadley & Reiken, 1993) but also served mental cues for their
14 Y. XIE ET AL.
Participants’ grammar
Surprisingly, the level of participants’ grammar use slightly dropped after
the first presentation and remained equable for the other three presenta-
tions. The average score of participants’ grammar of their presentations
was roughly three out of five. According to the rubric, most presenta-
tions revealed ‘a limited ability to utilize a few complex constructions,
though not always successfully. Other incorrect uses parts of speech and
noticeable grammatical errors persist which may make meaning ambigu-
ous’ (see Appendix A). It showed that although the participants were in
this relatively advanced Chinese class, their grammar skills were still rela-
tively elementary. The grammar rules still need to be taught to them.
In the book, Logical Relations in Chinese and the Theory of Grammar,
Huang (1998) discussed the generative theory of Universal Grammar and
linguistic typology and analyzed the grammatical constructions of a var-
iety of Chinese sentences. Although the sub-theories of grammar is
beyond the discussions of this paper, the underlying principle is that
grammar consists of descriptive or interpretive rule components that
interact with one another to form the complicated meaning in language.
Likewise, Canale and Swain (1980) view grammatical use as one compo-
nent of communicative competence, which was defined as an amalgam-
ation of students’ underlying knowledge structure and communication
skills. Celce-Murcia (1991) argued that grammar is not a stand-alone
skill but interact with other aspects of a language, including meaning,
social function, or discourse. Since grammar cannot be taught de-contex-
tualized, feedback for correction and discourse were among the most
effective ways of improving grammar (Celce-Murcia, 1991). In this study,
no discourse writing was involved since participants’ presentations were
more colloquial than formal. In addition, they were not required to turn
in essays after their presentations, so the instructor could only provide
limited feedback during or immediately after the oral presentations.
Future research should make essay writing a requirement of the class to
fully examine the effect of VR tools on their grammar skills.
Conclusions
The present study integrated Google Cardboard and Google Expeditions
in an advanced Chinese Language class, where students role-played in
their target situations. According to Situated Cognition Theory, a nurtur-
ing learning experience should help construct and link activity, context
and culture, leading to rich implicit understanding, learning and acting.
The VR tools in this study improved participants’ oral presentations by
enabling a process consisting of contextualized authentic learning,
instructor coaching, peer collaboration, articulation of the gathered infor-
mation, and reflection.
16 Y. XIE ET AL.
The study was significant to the field in several ways: First, it provided
empirical evidence that these VR tools did improve participants’ oral
proficiency in terms of the content and vocabulary. Second, the follow-
up qualitative analyses offered insights about why learning improved.
Since mobile-based VR tools are increasingly adopted in various class-
rooms, the in-depth analyses about how learners interact with these tools
are instrumental for future VR design and research. As a new and poten-
tially powerful technology, understanding the parameters and consequen-
ces of these tools’ usage within higher education has significant meaning
for teachers, practitioners, and instructional designers for the classroom.
The study also has the potential of informing the best practice about
how to invigorate language classrooms and outreach with the increas-
ingly ubiquitous mobile-based, head-mounted VR technology for mean-
ingful language learning and growth.
Disclosure statement
There was no potential conflict of interest with the work we reported.
Data
All research related data, including raw data coded with pseudonyms and SPSS output
files can be requested by emailing the principle researcher of the study. After receiving
the request, access will be granted to a Dropbox folder where the files are stored.
ORCID
Ying Xie http://orcid.org/0000-0002-9189-6046
References
Ahn, S. J., Bostick, J., Ogle, E., Nowak, K. L., McGillicuddy, K. T., & Bailenson, J. N.
(2016). Experiencing nature: Embodying animals in immersive virtual environments
increases inclusion of nature in self and involvement with nature. Journal of
Computer-Mediated Communication, 21(6), 399–419. doi:10.1111/jcc4.12173
Anderson, P. L., Zimand, E., Hodges, L. F., & Rothbaum, B. O. (2005). Cognitive behav-
ioral therapy for public speaking anxiety using virtual reality for exposure. Depression
and Anxiety, 22(3), 156–158. doi:10.1002/da.20090
Atkinson, R. C., & Shiffrin, R. M. (1968). Human memory: A proposed system and its
control processes1. In Psychology of learning and motivation (Vol. 2, pp. 89–195).
New York: Academic Press.
Bassanino, M., Wu, K. C., Yao, J., Khosrowshahi, F., Fernando, T., & Skjaerbaek, J.
(2010). The impact of immersive virtual reality on visualisation for a design review in
construction. In Information Visualisation (IV), 2010 14th International Conference
(pp. 585–589). IEEE.
COMPUTER ASSISTED LANGUAGE LEARNING 17
Bellemare, C., Bissonnette, L., & Kr€ oger, S. (2014). Statistical power of within and
between-subjects designs in economic experiments. Cahier de Recherche/Working
Paper, 14, 25.
Bongaerts, T., Van Summeren, C., Planken, B., & Schils, E. (1997). Age and ultimate
attainment in the pronunciation of a foreign language. Studies in Second Language
Acquisition, 19(4), 447–465.
Canale, M., & Swain, M. (1980). Theoretical bases of communicative approaches to
second language teaching and testing. Applied Linguistics, 1(1), 1–47. doi:10.1093/
applin/1.1.1
Celce-Murcia, M. (1991). Grammar pedagogy in second and foreign language teaching.
TESOL Quarterly, 25(3), 459–480. doi:10.2307/3586980
Cheng, H. J., Zhan, H., & Tsai, A. (2010). Integrating second life into a Chinese lan-
guage teacher training program: A pilot study. Journal of Technology and Chinese
Language Teaching, 1(1), 31–58.
Creswell, J. W. (2007). Qualitative inquiry and research design: Choosing among five
approaches. Thousand Oaks, CA: SAGE Publications.
Dalgarno, B., & Lee, M. (2010). What are the learning affordances of 3-D virtual envi-
ronments? British Journal of Educational Technology, 41(1), 10–32. doi:10.1111/j.1467-
8535.2009.01038.x
Desai, P. R., Desai, P. N., Ajmera, K. D., & Mehta, K. (2014). A review paper on oculus
rift-a virtual reality headset. International Journal of Engineering Trends and
Technology, 13, 175–179. doi:10.14445/22315381/IJETT-V13P237
Dickey, M. D. (2005). Brave new (interactive) worlds: A review of the design affordances
and constraints of two 3D virtual worlds as interactive learning environments.
Interactive Learning Environments, 13(1–2), 121–137. doi:10.1080/10494820500173714
Elliott, R., & Timulak, L. (2005). Descriptive and interpretive approaches to qualitative
research. A Handbook of Research Methods for Clinical and Health Psychology, 1(7),
147–159.
Engwall, O., & B€alter, O. (2007). Pronunciation feedback from real and virtual language
teachers. Computer Assisted Language Learning, 20(3), 235–262. doi:10.1080/
09588220701489507
Hadley, A. O., & Reiken, E. (1993). Teaching language in context, and teaching language
in context–workbook. Florence, KY: Heinle & Heinle Publishers, International
Thomson Publishing Book Distribution Center.
Ho, C. M. L., Rappa, N. A., & Chee, Y. S. (2009). Designing and implementing virtual
enactive role-play and structured argumentation: Promises and pitfalls. Computer
Assisted Language Learning, 22(5), 381–408. doi:10.1080/09588220903184732
Holmes, N. P., & Spence, C. (2004). The body schema and multisensory representation (s)
of peripersonal space. Cognitive Processing, 5(2), 94–105. doi:10.1007/s10339-004-0013-3
Huang, C. T. J. (1998). Logical relations in Chinese and the theory of grammar. New
York: Taylor & Francis.
Jauregi, K., Canto, S., De Graaff, R., Koenraad, T., & Moonen, M. (2011). Verbal inter-
action in second life: Towards a pedagogic framework for task design. Computer
Assisted Language Learning, 24(1), 77–101. doi:10.1080/09588221.2010.538699
Kim, S. (2014). Developing autonomous learning for oral proficiency using digital story-
telling. Language Learning & Technology, 18(2), 20–35.
Kormos, J., & Prefontaine, Y. (2017). Affective factors influencing fluent performance:
French learners’ appraisals of second language speech tasks. Language Teaching
Research, 21(6), 699–716. doi:10.1177/1362168816683562
18 Y. XIE ET AL.
Ladendorf, K., Schneider, D., & Xie, Y. (2019). Mobile-based virtual reality: Why and
how does it support learning? In A. Zhang and D. Cristol (Eds), Handbook of mobile
teaching and learning (2nd ed.), New York: Springer.
Larsen-Freeman, D. (2000). Techniques and principles in language teaching. Oxford:
Oxford University Press.
Le, Q. T., Pedro, A., & Park, C. S. (2015). A social virtual reality based construction
safety education system for experiential learning. Journal of Intelligent & Robotic
Systems, 79, 487–506. doi:10.1007/s10846-014-0112-z
Lee, S. H., Sergueeva, K., Catangui, M., & Kandaurova, M. (2017). Assessing Google
Cardboard virtual reality as a content delivery system in business classrooms. Journal
of Education for Business, 92(4), 153–160. doi:10.1080/08832323.2017.1308308
Liang, M. Y. (2012). Foreign ludicity in online role-playing games. Computer Assisted
Language Learning, 25(5), 455–473. doi:10.1080/09588221.2011.619988
Lin, T. J., & Lan, Y. J. (2015). Language learning in virtual reality environments: Past,
present, and future. Educational Technology & Society, 18(4), 486–497.
Liou, H. C. (2012). The roles of second life in a college computer-assisted language
learning (CALL) course in Taiwan, ROC. Computer Assisted Language Learning,
25(4), 365–382. doi:10.1080/09588221.2011.597766
Lisichenko, R. (2015). Issues surrounding the use of virtual reality in geographic educa-
tion. The Geography Teacher, 12(4), 159–166. doi:10.1080/19338341.2015.1133441
Melchor-Couto, S. (2017). Foreign language anxiety levels in second life oral interaction.
ReCALL, 29(1), 99–119. doi:10.1017/S0958344016000185
Merchant, Z., Goetz, E. T., Keeney-Kennicutt, W., Kwok, O. M., Cifuentes, L., & Davis,
T. J. (2012). The learner characteristics, features of desktop 3D virtual reality environ-
ments, and college chemistry instruction: A structural equation modeling analysis.
Computers & Education, 59, 551–568. doi:10.1016/j.compedu.2012.02.004
Ockey, G. J., Gu, L., & Keehner, M. (2017). Web-based virtual environments for facilitat-
ing assessment of L2 oral communication ability. Language Assessment Quarterly,
14(4), 346–359. doi:10.1080/15434303.2017.1400036
Padilla, A. M., & Sung, H. (1999). The Stanford foreign language oral skills evaluation
matrix (FLOSEM): A rating scale for assessing communicative proficiency.
Unpublished manuscript, Stanford University.
Parsons, T. D., & Rizzo, A. A. (2008). Affective outcomes of virtual reality exposure
therapy for anxiety and specific phobias: A meta-analysis. Journal of Behavior Therapy
and Experimental Psychiatry, 39(3), 250–261. doi:10.1016/j.jbtep.2007.07.007
Passig, D., Eden, S., & Heled, M. (2007). The impact of virtual reality on the awareness
of teenagers to social and emotional experiences of immigrant classmates. Education
and Information Technologies, 12(4), 267–280. doi:10.1007/s10639-007-9031-y
Peterson, M. (2001). MOOs and second language acquisition: Towards a rationale for
MOO-based learning. Computer Assisted Language Learning, 14(5), 443–459. doi:
10.1076/call.14.5.443.5773
Peterson, M. (2010). Massively multiplayer online role-playing games as arenas for
second language learning. Computer Assisted Language Learning, 23(5), 429–439. doi:
10.1080/09588221.2010.520673
Payne, S., & Ross, B. (2005). Synchronous CMC, working memory, and L2 oral profi-
ciency development. Language Learning & Technology, 9(3), 35–54.
Rizzo, A., Difede, J., Rothbaum, B. O., Reger, G., Spitalnick, J., Cukor, J., & McLay, R.
(2010). Development and early evaluation of the Virtual Iraq/Afghanistan exposure
COMPUTER ASSISTED LANGUAGE LEARNING 19
therapy system for combat-related PTSD. Annals of the New York Academy of
Sciences, 1208(1), 114–125. doi:10.1111/j.1749-6632.2010.05755.x
Roed, J. (2003). Language learner behaviour in a virtual environment. Computer Assisted
Language Learning, 16(2–3), 155–172. doi:10.1076/call.16.2.155.15880
Rupp, M. A., Kozachuk, J., Michaelis, J. R., Odette, K. L., Smither, J. A., & McConnell,
D. S. (2016). The effects of immersiveness and future VR expectations on subjec-tive-
experiences during an educational 360 video. In Proceedings of the Human Factors
and Ergonomics Society Annual Meeting (Vol. 60, No. 1, pp. 2108–2112). Los
Angeles, CA: SAGE Publications. doi:10.1177/1541931213601477
Strik, H., Cornillie, F., Colpaert, J., van Doremalen, J. J. H. C., & Cucchiarini, C. (2009).
Developing a CALL system for practicing oral proficiency: How to design for speech
technology, pedagogy and learners. Proceedings of SLATE.
Turnbull, M. (2001). There is a role for the L1 in second and foreign language teaching,
but … . Canadian Modern Language Review, 57(4), 531–540. doi:10.3138/cmlr.
57.4.531
Varela, F. J., Thompson, E., & Rosch, E. (2017). The embodied mind: Cognitive science
and human experience. Cambridge, MA: MIT Press.
van Doremalen, J., Boves, L., Colpaert, J., Cucchiarini, C., & Strik, H. (2016). Evaluating
automatic speech recognition-based language learning systems: A case study.
Computer Assisted Language Learning, 29(4), 833–851. doi:10.1080/
09588221.2016.1167090
Von der Emde, S., Schneider, J., & Kotter, M. (2001). Technically speaking:
Transforming language learning through virtual learning environments (MOOs). The
Modern Language Journal, 85(2), 210–225. doi:10.1111/0026-7902.00105
Wang, J., An, N., & Wright, C. (2018). Enhancing beginner learners’ oral proficiency in
a flipped Chinese foreign language classroom. Computer Assisted Language Learning,
31(5–6), 490–521.
Wehner, A. K., Gump, A. W., & Downey, S. (2011). The effects of second life on the
motivation of undergraduate students learning a foreign language. Computer Assisted
Language Learning, 24(3), 277–289. doi:10.1080/09588221.2010.551757
Wu, W. C. V., Marek, M., & Chen, N. S. (2013). Assessing cultural awareness and lin-
guistic competency of EFL learners in a CMC-based active learning context. System,
41(3), 515–528. doi:10.1016/j.system.2013.05.004
Xie, T. (2010). Tools for teaching Chinese in the virtual world. Journal of Technology
and Chinese Language Teaching, 1(1), 59–70.
Yoshimura, Y., & MacWhinney, B. (2007). The effect of oral repetition on L2 speech flu-
ency: An experimental tool and language tutor. In Workshop on Speech and
Language Technology in Education.
20 Y. XIE ET AL.
Criteria Level 1 (1 point) Level 2 (2 points) Level 3 (3 points) Level 4 (4 points) Level 5 (5 points)
Content The speaker The speaker is The speaker is The speaker is The speaker is
seems to have aware of the aware of the knowledgeable knowledgeable
little or no subject and subject and about the sub- about the sub-
understanding attempts to attempts to ject and pro- ject and pro-
of the subject. provide rele- provide rele- vides adequate vides a
Statement are vant ideas vant ideas detail, given significant level
superficial or about it with- about it. the of detail with
not relevant. out support- Provides time available. supporting
ing details. some details. materials,
given the
time available.
Fluency The speaker fails The speaker has The speaker hesi- The speaker sel- The speaker
to present a many hesita- tates several dom hesitates speaks confi-
topic and a tions and great times, but gen- and generally dently and nat-
focus that are difficulty erally seems to seems to know urally with no
inappropriate remembering know the the distracting hes-
for selecting desired words, desired words. itations. Ideas
the purpose. words. even if it is flow smoothly.
necessary to
think about
them a bit.
Vocabulary The speaker’s The speaker has The speaker has The speaker has The speaker pos-
vocabulary is enough an adequate enough sesses an
limited to: (a) vocabulary working vocabulary to extensive
high frequency (including high vocabulary. understand native-like
words for com- frequency The speaker is and participate vocabulary to
mon everyday idiomatic at a beginning in conversa- participate in
items and expressions) to stage of show- tions which more extended
actions, and make simple ing knowledge include discussions on
(b) some con- statements and of synonyms abstract ideas. a large number
versational for- ask questions and alternative The speaker is of concrete
mulaic or about concrete ways of aware of word and
idiomatic things in a expressing sim- connotations abstract topics.
expressions. simplified ple ideas. and nuances
conversation. in meanings.
Pronunciation Even at the level The speaker is The speaker is The speaker’s The speaker’s pro-
of isolated beginning to beginning to speech is nunciation and
words and for- master some demonstrate always intelli- intonation
mulaic expres- sounds and control over a gible, though a approach a
sions, The sound patterns; larger number definite accent native-
speaker exhib- however, she of sounds and and/or occa- like ability.
its difficulty still has diffi- sound patterns. sional inappro-
inaccurately culty with Some repeti- priate inton-
reproducing many other tion may be ation pattern
the target lan- sounds, mak- necessary to is apparent.
guage sounds ing mean- make mean-
and ing unclear. ing clear.
sound
patterns.
Grammar The speaker’s pro- The speaker can The speaker’s The speaker’s The speaker artic-
ductive skills produce utter- speech shows speech exhibits ulates with no
are limited to ances which a limited abil- a good com- more incorrect
high frequency show an ity to utilize a mand over a grammar than
words and understanding few complex large range of a native
short formulaic of basic sen- constructions, more complex speaker would.
conversational tence and though not patterns with
expressions. question pat- always success- few incorrectly
The speaker terns, but fully. Other uses parts of
fails to use incorrectly incorrectly uses speech or
(continued)
COMPUTER ASSISTED LANGUAGE LEARNING 21
Appendix A. Continued.
Criteria Level 1 (1 point) Level 2 (2 points) Level 3 (3 points) Level 4 (4 points) Level 5 (5 points)
correctly con- constructs sen- parts of speech noticeable
structs senten- tences or uses and noticeable grammatical
ces or uses parts of speech grammatical errors.
parts of are present errors persist
speech. It is which which may
difficult or obscure make mean-
impossible to meaning. ing ambiguous.
assess her
knowledge
of grammar.
Adapted from
Wu, W-C. V., Marek, M. W. & Huang, H. W. (2012). Using Skype and Facebook
as social media to enhance the speaker communicative competency and cultural
awareness in an EFL advanced conversation class. In J. Colpaert, A. Aerts, W.C. V
Wu, & Y.C.J. Chao(Eds.), Proceedings of the 2012 CALL Conference (pp. 681-684).
Antwerp: CALL (Taichung, Taiwan).
Padilla, A. M., & Sung, H. (1999). The Stanford Foreign Language Oral Skills
Evaluation Matrix (FLOSEM): A rating scale for assessing communicative profi-
ciency. Retrieved from http://files.eric.ed.gov/fulltext/ED445538.pdf