Professional Documents
Culture Documents
Presented by:
Student of the group FL-35
Snizhana Salamakha
Supervised by:
Olesya Saban
Lviv – 2022
2
ABSTRACT
Our research is devoted to study and analysis of synonymous adjectives
CHARMING and STUNNING using corpus-based method of analysis. Corpus
linguistics is viewed by some linguists as a research tool or methodology and by
others as a discipline or theory in its own right.
The research aims to determine the special features of adjectives
CHARMING and STUNNING, to gather new insights into the semantic differences
of these adjectives by examining their most frequent collocations and their stylistic
variation.
The object of this research is the adjectives CHARMING and STUNNING in
diachrony.
The subject of this research is a corpus-based diachronic study of the
adjectives CHARMING and STUNNING.
Research material is compiled from various sources and supplemented by my
results, such as COCA (The Corpus of Contemporary American English), Oxford
Dictionary of Phrase (2010), Oxford Advanced Learner's Dictionary (2010), Collins,
Longman's Dictionary of Modern English (2014), Longman's Advanced American
Dictionary (2013), Macmillan's Dictionary of Phrases (2010).
Linguistic data from the Corpus of Contemporary American English
consists of 825 collocations with the adjective STUNNING, and 974 collocations
with the adjective CHARMING.
The research paper considers the concept of adjective phrase, corpus and
collocation. This research paper presents how to study, research and analyze the
synonymous adjectives with methods of corpus linguistics. Collocations with
adjectives were classified into collocation models, like CHARMING/STUNNING +
noun, CHARMING/STUNNING+verb, CHARMING/STUNNING +ADVERB,
CHARMING/STUNNING+adjective and other.
TABLE OF CONTENTS
INTRODUCTION…………………………………………………………………….3
CHAPTER 1. THEORETICAL AND METHODOLOGICAL BACKGROUND…...6
1.1. The concept of synonyms………………………………………………………6
1.2. Corpus linguistics and corpus-based studies…………………………………...7
1.3. Literature overview…………………………………………………………….9
1.4. Adjective as a part of speech………………………………………………….10
1.5. Data and methodology………………………………………………………..12
CHAPTER 2. PRESENTATION AND ANALYSIS OF DATA……………………14
2.1. The general information about the adjectives CHARMING and
STUNNING…………………………………………………………………………14
2.2. The comparison of the concept of the adjectives CHARMING and STUNNING
in the period from the year 1990-1994 to the year 2015-2019………………………15
CONCLUSIONS………………………………………………………………….....27
REFERENCES……………………………………………………………..………..30
4
INTRODUCTION
Corpus-based diachronic studies have been thriving over the last three decades
to such an extent that the validity of corpora in the enrichment of historical linguistic
research is now undeniable. Traditionally a corpus is a collection of language
examples: written or spoken examples of words, sentences, phrases, or texts.
Nowadays a corpus can be any collection of examples.
A corpus is collected to learn from it, i.e. to extract domain-specific
information. Examples can be analyzed and the rules and models behind them can be
discovered. Machine learning algorithms are used to extract relationships between
samples. Manual structuring (annotation) of such data allows the integration of
human preferences and knowledge into machine learning algorithms. One of the
focuses of modern applied language research is corpus linguistics.
Corpus linguistics is the study of language based on large collections of "real
life" language use stored in corpora (or corpora)—computerized databases created for
linguistic research. It is also known as corpus-based studies. Corpus-based research is
also useful for features that cannot be examined with automated computer programs.
Furthermore, this research makes it easier to examine multiple features of the
function, allowing us to gain a more complete picture of its purpose [1, p.12].
Research in the field of corpus linguistics concerns the creation of the first
electronic corpora and their gradual spread as an effective means of language
research, as well as the formation of corpus linguistics as an independent discipline.
Among the researchers who made a significant contribution to the development of
corpus linguistics, it is worth mentioning the names of H. Kucera (1979), D. Biber
(1990, 1992), J. Sinclair (1994), G. Kennedy (1998), N. Ide (2000), M. Basko (1996;
2003), T. Erjavec (2001), T. McEnery and A. Wilson (2001), and others.
Corpus linguistics is viewed by some linguists as a research tool or
methodology and by others as a discipline or theory in its own right. This research
analyzes adjectives CHARMING and STUNNING. It is a corpus-based and
diachronic study.
5
The topicality of the research is the interest in the popularity of the word in
various sources of information, in the use of a word at this or that time, how the use
of the word has changed in certain years, and what it means. Furthermore, corpus-
based research is a new and interesting trend that can provide reliable results.
The object of this research is the adjectives CHARMING and STUNNING in
diachrony.
The subject of this research is a corpus-based diachronic study of the
adjectives CHARMING and STUNNING.
The research aims to determine the special features of adjectives
CHARMING and STUNNING, to gather new insights into the semantic differences
of these adjectives by examining their most frequent collocations and their stylistic
variation.
The research tasks include the following:
1. to study the concept of a diachronic corpus-based approach;
2. to research corpus-based and corpus-linguistic studies;
3. to reveal the etymology and definition of words CHARMING and
STUNNING;
4. to study and analyze the main information about the corpora COCA (The
Corpus of Contemporary American English);
5. to compare the adjective CHARMING in the specific period, to analyze
the obtained result;
6. to compare the adjective STUNNING in the specific periods, to analyze
the obtained result.
Research methods are:
1. analysis of definitions – to clarify the interpretation of terms;
2. distributional analysis – to identify the contextual environment, connecting
potential and properties of the studied units;
3. concordance analysis – for the analysis of a set of terminological units and
connections between language units;
4. comparative analysis of adjectives as synonyms;
6
5. corpus method.
Research material is compiled from various sources and supplemented by my
results, such as COCA (The Corpus of Contemporary American English), Oxford
Dictionary of Phrase (2010), Oxford Advanced Learner's Dictionary (2010), Collins,
Longman's Dictionary of Modern English (2014), Longman's Advanced American
Dictionary (2013), Macmillan's Dictionary of Phrases (2010).
Linguistic data from the Corpus of Contemporary American English
consists of 825 collocations with the adjective STUNNING, and 974 collocations
with the adjective CHARMING.
7
D. A. Cruse [9, p. 268] establishes a scale of synonymy. On the one hand, there
is absolute synonymy, on the other, non-synonymy. Absolute synonyms can be
defined as lexical elements that can be used interchangeably in all contexts. However,
it is impossible to test all of these contextual relationships [9, 268), and therefore “if
they exist, they are extremely rare” [9, p. 270]. As mentioned above, the existence of
two lexical units with the same meaning is useless. Consequently, one of the words
becomes obsolete or takes on a different semantic function [9, p. 270] as the semantic
meaning of words is constantly being updated [9, p. 107].
Further, on the synonymy scale, D. A. Cruse [9, p. 270] reinforces cognitive
synonymy. He defines cognitive synonyms as lexical elements that have “certain
common semantic properties” [9, p. 270]. He calls these semantic properties a
semantic mode, which is divided into propositional and expressive [9, p. 271]. The
propositional way is expressed by the form of the sentence, namely, if the sentence
expresses a "statement, question, command, exclamation, etc." [9, p. 271]. While the
meaning of a propositional meaning is determined by the truth condition pronounced
in the sentence, the truth condition does not play a role in the expressive meaning of
the lexical element. As a consequence, words are defined as cognitive synonyms if
they share propositional meanings but differ in their way of expression [9, p. 273].
course, at the initial stage, the teacher needs to teach students directly the principles
of work with the corpora, and in the future to control their independent research
activities on the detection of grammatical rules or linguistic laws, and each time in
advance to check the effectiveness of one or another corpus in solving the task [3,
p.7]. Among the most well-known cases of English in free access, it should be noted:
British National Corpus (BNC);
Corpus of Contemporary American English (COCA).
Work with the corpus is carried out with the help of special search engines -
corpus managers – which represent the results of the search in the form of
concordance, that is, the list in which the search unit is presented in the context of
fragments of various texts (horizontal lines with a search word in the middle are
called KWIC – Key Word In Context) and in the form of statistical data such as the
frequency of word-forms, the frequency of lexemes, grammatical categories, the
feature of compatibility, management, etc.
The corpus analysis procedure includes three steps: the identification of
language data by categorical analysis, the correlation of speech data with statistical
methods, and the intellectual interpretation of results [2, p. 20]. The lexicological
studies of the corpus allow us to determine the frequency of the use of lexical units of
the language, the relationship between the meanings of words such as polysemy,
homonymy, synonymy, and antonymy, changes in the vocabulary of the language, as
well as reflection in the language of social, territorial, and professional
characteristics.
The main concept of corpus linguistics is the corpus, that is, the selection of
speech material, which is formed by certain requirements and topics. The corpora can
vary according to styles and genres, or be separated by a specific discursive area. The
corpus of texts is not just a collection of texts selected by a certain method and
presented in electronic form in certain areas of language use (many similar
collections are now known), but a collection that is categorized by the integral
characteristics of each text (e.g. genre) and from the specific characteristics of
different units of its organization (tokens, word forms, morphemes, etc.).
10
Adjective clauses and adjective phrases largely play the same role: to describe
a noun. But while an adjective phrase is simply a few additional words that “bulk up”
an adjective, an adjective clause reiterates the noun. Often, an adjective clause does
this with a pronoun.
genres: oral, fiction, popular magazines, newspapers, academic writing, and (updated
in March 2020): television and films. subtitles, blogs, and other web pages [33].
The definitions of the near-synonyms as employed in the qualitative analysis
rely on The Macmillan Dictionary (2014), The Collins American Dictionary (2014),
and The Merriam-Webster Dictionary (2014). All three dictionaries are online
available. Whereas The Macmillan Dictionary includes the British and American
varieties, The Collins American Dictionary and The Merriam-Webster Dictionary
specialized in the American variety.
The corpus used for the research was the Corpus of Contemporary American
English (COCA) provided by Mark Davies of the Birmingham Young University.
The corpus was chosen due to its large size since it contained approximately 450
million words when the study was conducted in 2014. Moreover, COCA compiles
contemporary data and its interface and search options are easily operated.
Furthermore, a corpus based on American text samples was selected due to the use of
American-based dictionaries for the qualitative analysis of the near-synonyms.
According to M. Stubbs, we used such steps in our corpus analysis of
adjectives CHARMING vs STUNNING:
1. Lexical level: learn a word by frequency, context, and adjacent phrase.
2. Syntactic level: quantitative analysis of various lexical combinations and
patterns of sentences.
Frequency is an indispensable theoretical concept in the study of lexical
phrases in linguistics. Word frequency statistics, both manual and computerized,
determine and analyze the occurrence of words to derive lexical rules. It is often used
to study lexical patterns, frequency of word use, learning, and so on.
15
2.1. The general information about the adjectives CHARMING and STUNNING
There were two sources used for the collection of data in this study, the
Longman dictionary of contemporary English, 6 th edition (2014) and COCA. This
study aimed to understand the differences and similarities between the two selected
synonyms, and grammatical structure. Table 1. Meanings are the initial part of words
in a language. To get meaning Longman dictionary of contemporary English 6th
edition (2014) was used (see Table 2).
Table 2
1) extremely beautiful or attracti 1)They make a stunning couple.
ve: 2) a stunning victory
STUNNING - collocation with: a beautiful person,
painting, sight, etc.
2) shocking or very impressive;
1) pleasant and attractive: 1) We had dinner with our director and
2) used to describe people who his charming wife.
use their attractiveness to influe 2)He's very charming but I
nce people or to make wouldn't trust him.
CHARMING other people like them: 3)"Shut up, will you, I'm trying to
3) used to show that you do watch TV!" "Oh, charming!"
not approve of what someone
has said or done:
16
The CHART option shows the total frequency in each section. (in COCA, the
genres and five-year blocks). It is also possible to see the frequency of words and
phrases by sub-genre and/or year.
In figures 3-4 we can see the frequency of adjectives by the year and make a
comparative analysis of them.
As we see from figure 5, the most frequent of use the adjective CHARMING in
the period of 1990-1995 is in sections of FICTION and NEWS.
Fig. 6 Adjective CHARMING in the period from the year 1990-1994 to the year
2015-2019 in COCA
Fig. 8 Adjective STUNNING in the period from the year 1990-1994 to the year
2015-2019 in COCA
You can find a wealth of information for the top 60,000 words in the corpus.
As the following examples with bread show, you can see:
an overview of all of the information below
related topics (words that co-occur anywhere on the web page)
collocates (automatically grouped by part of speech)
clusters (the most frequent 2, 3, and 4-word strings)
a reportable Keyword in Context (concordance) display
related words (synonyms and WordNet entries), and
websites use that word the most (can use these to create Virtual Corpora).
In figures 9 and 10 you can see the information about adjectives –
CHARMING and STUNNING. We can see the use of adjectives in different types of
19
texts, the list of synonyms for researched adjectives, collocations, and clusters with
them.
F. R. Palmer (1997) stated that “synonyms, in their core meaning, may be same
but different in their collation”. In this study, the results manifest that beautiful,
handsome, and pretty co-occur with the same and different noun and adverb
collocates. These collocates were taken from COCA.
over the years. For example, in 1990-1994 the most frequent word collocates are
VICTORY, DEFEAT, SUCCESS, BEAUTY, VIEWS, WOMAN, and others. But in
2010-2014, the most frequent word collocates are VIEWS, VICTORY, WOMAN,
NEWS, DRESS, BEAUTY, and SUCCESS. We can conclude that the adjective
STUNNING has the same word collocates in 1990-1994 – 2010-2014.
years. For example, in 1990-1994 the most frequent word collocates are PRINCE,
MAN, WAY, WOMAN, GUY, PEOPLE, TOWN, CHILD, PLACE, and others.
But in 2010-2014, the most frequent word collocates are PRINCE, MAN, WAY,
WOMAN, GUY, PEOPLE, TOWN, TIME, and RESTAURANT. We can conclude
that the adjective STUNNING has the same word collocates in 1990-1994 – 2010-
2014.
The concept of collocation is one of the most essential in corpus linguistics.
The British linguist J.R. Firth discussed it as early as 1951 and first coined the term in
its modern linguistic sense along with the famous explanatory slogan: ‘you shall
judge a word by the company it keeps’ (1957).
The next step of the analysis concerns collocation. J.R. Firth (1957) states ‘you
shall know a word by the company it keeps, so the working hypothesis here is that
overlap in collocation may reveal which words have meanings or senses that are
closer to each other than others. On the other hand differences in collocation may also
indicate divergence among synonyms.
F. R. Palmer (1997) stated that “synonyms, in their core meaning, may be same
but different in their collocation”. In this study, the results manifest that beautiful,
handsome, and pretty co-occur with the same and different noun and adverb
collocates. These collocates were taken from COCA.
Linguistic data from the Corpus of Contemporary American English
consists of 825 collocations with the adjective STUNNING, and 974 collocations
with the adjective CHARMING.
Collocations of the adjective CHARMING have such collocation models in
COCA:
- CHARMING+NOUN: 401 collocates, the most frequent are following
+ NOUN
956 prince
164 smile
120 town
86 lady
62 personality
51 village
47 hotel
23
41 accent
38 restaurant
34 tale
33 inn
32 fellow
31 cottage
30 self
27 manner
26 host
26 gentleman
25 shop
25 princess
- CHARMING+ADJECTIVE: 401 collocates, the most frequent are
following
+ ADJ
212 funny
181 little
161 handsome
160 young
113 beautiful
100 witty
88 sweet
81 smart
81 intelligent
48 attractive
47 lovely
46 cute
43 warm
43 charismatic
41 romantic
39 sexy
38 wonderful
38 friendly
36 quaint
35 delightful
5 staff
4 dash
4 furnish
4 swoop
3 reassure
3 hint
3 situate
3 behold
3 dine
3 amaze
3 cherish
- VERB+CHARMING:
1) I think that's a charming idea.
2) Well, I think you're pretty charming.
3) Whoa, for the first time in my life, women find me charming
-ADVERB+CHARMING:
1) He's a very charming man and very gentlemanly and extremely observant
and smart, " Bloch said. "
2) Your cousin is a very charming young lady.
3) Denzel is so effortlessly charming and handsome
4) Hey, who knew Oneonta had the most charming bed and breakfast in the
state?
5) I find autumn one the most charming season in Prague.
6) I mean there was nobody bigger or more charming and those blue eyes and
that smile.
7) And by the way, if your goal is to prove how charming and awesome you
are, as far as I'm concerned, you've already…
8) Today, stories about dabbling in drug use are now offered
up as charming personal anecdotes.
9) ... almost like an actress or a model, except she
is just naturally charming without a script…
10) …you could very easily give playwright Jenelle
Riley's quite charming romantic comedy…
Collocations of the adjective STUNNING have such collocation models in
COCA:
- STUNNING +NOUN: 401 collocates, the most frequent are following
26
+ NOUN
299 View
203 victory
129 success
125 image
116 result
112 beauty
105 photo
95 performance
94 development
87 loss
86 example
83 photograph
78 piece
77 display
77 revelation
72 dress
71 defeat
68 collection
65 color
60 video
59 reversal
2 intellectually
- VERB+STUNNING:
1) Clary, you look absolutely stunning.
2) In the tasting area, giant windows look out on a stunning backdrop of
rolling hills, stone farmhouses and wide-mouthed barns.
3) I think it was very stunning when the second dress started walking on the
catwalk because the letters float.
4) i think it's stunning citizens can hold any leader in such high esteem
without ever considering the many possibilities…
5) You know, the stunning visual of the satellite photo is something I always
think of, where North Korea
6) A dime a dozen at flea markets and antiques shops, cast-off
prisms make for stunning pendants.
7) These colorful latkes make a stunning presentation when served on a
platter with toppings such as sour cream, diced tomatoes…
8) His ability to get on top of cornerbacks and make stunning adjustments on
take-off routes is reflected in his 30 receptions for 40 or more yards…
9) In the center of this blackened rock they see a stunning sight- a lake of
boiling lava.
10) What I find stunning from a political point of view is I don't hear any
outrage from Democrats…
-ADVERB+STUNNING:
1) It is one of the most visually stunning moments in cinema for me and also
an awesome song.
2) Your photos are absolutely stunning and make gorgeous wallpaper that I
enjoy all day long.
3) This is an absolutely stunning book!
4) The violin's played by Tim Fain, who's just a stunning violinist.
29
CONCLUSIONS
This paper proposes a corpus-based inspection of eight nouns using corpus
linguistic techniques, as well as a gradience analysis of similarities and differences
between two near-synonyms. Two questions were presented in this paper.
The first question was related to discerning the similarities and differences
between adjectives STUNNING and CHARMING while the second question focused
on whether the results from a corpus-based methodology could be represented in any
form in addition to displaying the percentages from the corpora. The answer we
provided for each question was positive.
This study aimed to investigate the two synonyms STUNNING and
CHARMING in terms of their meanings and collocation with the help of the
Longman Dictionary of Contemporary English 6th edition (2014) and COCA. This
study was also aimed at investigating the similarities and differences between the
three synonyms. The findings of this research declare that these synonyms are similar
in their core meaning but are different in their detailed meanings and collocation. The
results also clarify that the corpus provides more additional information that is not
part of dictionaries.
The research paper is devoted to the corpus analysis of synonymous
adjectives STUNNING and CHARMING. First, the definitions of all six adjectives
are compared according to three dictionaries available on the Internet (The
Macmillan Dictionary, The Collins American Dictionary, and The Merriam-Webster
Dictionary) and the similarities and differences between the definitions are identified.
Second, the noun phrases of adjectives measured by raw frequency and MI count are
examined in the Corpus of Contemporary American English. The results of the top
list of noun phrases are grouped by lexical models. In addition, the stylistic variation
of six adjectives in five different genres represented by COCA (colloquial, fiction,
magazines, newspapers, and academic writing) is investigated. The results of the
study call into question some of the existing definitions of adjectives and, as a
consequence, suggest improvements in the description of closely related adjectives.
In addition to the three dictionaries, another important source of data was the
31
REFERENCES
1. Демська-Кульчицька О.М. Базові поняття корпусної лінгвістики //
Українська мова. – 2003. – №1. – С. 42-47.
2. Демська О.М. Текстовий корпус: ідея іншої форми. Київ : ВПЦ
НаУКМА, 2011. 282 с. 2.
3. Кочерган М.П. Загальне мовознавство: Підручник: – К.: Видавничий
центр “Академія”, 2003. – 464 с. (Альма-матер)
4. Biber, D., S. Conrad, and R. Reppen. (1998), Corpus Linguistics:
Investigating Language Structure and Use, Cambridge: Cambridge
University Press.
5. Chanchotphattha, K. (2013). A corpus-based study of English synonyms:
different, diverse, and various. Independent study. Language Institute,
Thammasat University, Bangkok.
6. Chung, S.-F. and K. Ahrens (2008), “MARVS Revisited: Incorporating
Sense Distribution and Mutual Information into Near-Synonym Analyses,”
Language and Linguistics: Lexicon, Grammar and Natural Language
Processing, 9(2), 415-434.
7. Collinson, W. E. (1939), “Comparative Synonymics,” Transactions of the
Philosophical Society, 54-77.
8. Croft, W. and D. A. Cruse (2004), Cognitive Linguistics. Cambridge:
Cambridge University Press.
9. Cruse, D. A. (1986), Lexical Semantics, Cambridge: Cambridge University
Press.
10.Edmonds, Philip; Hirst, Graeme. 2002. “Near-synonymy and lexical
choice”. Computational Linguistics 28(2), 105 – 144.
11.Dixon, R. M. W. (1991), A New Approach to English Grammar, on
Semantic Principles, New York: Oxford University Press.
12.Divjak, D. (2006). Ways of intending: Delineating and structuring near
synonyms. In Corpora in Cognitive Linguistics: Corpus-Based Approaches
to Syntax and Lexis (pp. 19-56). Germany: Mouton de Gruyter
34
17%
РИЗИК ПЛАГІАТУ
100%
ПЕРЕФРАЗУВАННЯ
1%
НЕПРАВИЛЬНІ ЦИТУВАННЯ
0%