You are on page 1of 19

(Language Corpus

Corpus))

Dr. M. Ganesan
Professor
CAS iin Li
Linguistics
i ti
Annamalai University
ganesan_au@yahoo.com

/ /



1.

2.


/
Competence / Performance

-
.

/ -

-
- 1960


- 1961

(Francis), (Kucera) -

(Brown Corpus)
-
(London - Lund Corpus)
- -
(Lancaster - Oslo - Pergen, LOB Corpus)


(British National Corpus, BNC)





-
, ,

, ,
,
,


- 1987,

-
(TDIL)
(CIIL)
1991 to
t 1995

18

30

-

-

((Eric Pederson))
- (LDC-IL)



,
, , ,
,
,
,

1981 - 1990

..........
6 ,

76

1.
2.
3
3.
4.
5.
6.
7.
8.
9.




1.
/

2.
3.
4.
5.
6.

-
1.
2
2.

3.
4.

-
1.
2.
3.


1.
2.
3.
4.

, , ,
(Word form)

(, ....)

(KWIC Concordance)
-

(1380)

(620)
(454)
(306)


-
-
-


-
- - -
-
-

?
- (Secondary corpus)

1.

45,370
,
2.
69,189
3.
84,129
4.
80,455
5.
73,455
6.
66,352
7. ,
1,15,444
8
8.

61 948
61,948
9. / /
73,608
10.
1,29,362
11.

67,258
,
12.
21,711
13. ()

.
1.
2.
3
3.
4.
5.
6.

77,899
43,779
24 023
24,023
8,908
8,229

2.
11.

2.
3.
4.
5. / /
6.
7.
8.
9. / /
10.
11.

12.
13. /
14.
15.
16.

17.
18. ()
19.
20
20.

21.
22. /

24,184
24
184
64,508
35,643
36,586
41,135
60,283
53,113
1,57,017
25,514
94,613
76,671
15,582
1,73,520
8,968
24,519
15 523
15,523
31,038
16,038

3. ,
1.
2.
3.
4.
5. -
6.
7.
8
8.

9.
10.
11.
12.
13.
14.
15.
16.
17.
18
18.

19.
20.

21,990
8,039
15,973
26,403
25,543
21,150
65,115
21,599
24,950
13 012
13,012
56,774
19,345

4.
1.
2.
3.
4.

10,680

5,911
16,361

5
5.




1.
2.
3.

6.
1.
2.
3.
4.
5.

59,676
18,576
19,576
23,686
27,040
16,749
--------------

3,609,888
---------------

You might also like