Professional Documents
Culture Documents
Translation
History of Machine Translation
• 17th century
• Before 1950
• the first wave
• MT Paradigms
17th century
• searching for a system of representing each
unique concept with corresponding linguistic
symbol, but the schemas of perfect reunion
of words and concepts doomed to fail
• the semiotic exploration laid foundations for
later semantic schemas
searching for a system of representing each unique concept with
corresponding linguistic symbol, but the schemas of perfect reunion of
words and concepts doomed to fail
• Francis Bacon
• Marin Mersenne
• René Descartes
Francis Bacon
•
Marin Mersenne
•
René Descartes
•
the semiotic exploration laid foundations for later
semantic schemas
•
Before 1950
• Georges Artsrouni (1933) a mechanical
bilingual dictionary
• Warren Weaver (1949) memorandum- the
first to suggest that computers might be
used in translatiion--cryptography
Georges Artsrouni (1933) a mechanical
bilingual dictionary
• 红色
Warren Weaver (1949) memorandum- the first to suggest that
computers might be used in translatiion--cryptography
• 橙色
Warren Weaver (1949) memorandum- the first to suggest that
computers might be used in translatiion--cryptography
•
morphological analysis, basic syntactic analysis and
transformation & reordering as fundamental processes proposed
•
1st MT conference (1952)
•
1st public MT demonstration
system(1954)
• a set of handcrafted rules and a small
vocabulary used to translation a fixed set
of sentences from Russian to English
a set of handcrafted rules and a small vocabulary used to
translation a fixed set of sentences from Russian to English
•
MT Paradigms
• Rule-based approaches
• Corpus-based approaches
• Current trends
Rule-based approaches
• Direct Transfer
• Analysis-transfer-generation
• Interlingua
Direct Transfer
•
Analysis-transfer-generation
• 3 processes are proposed (see Vauquois
Triangle)
3 processes are proposed (see Vauquois
Triangle)
•
Interlingua
• source text analysis resulted in language-
independent intermediate representation
• generating target text from intermediate
representation
source text analysis resulted in language-
independent intermediate representation
• 优先级 1
generating target text from intermediate
representation
• 优先级 2
source tokens transformed directly into target language,
with little or no source analysis
•
Corpus-based approaches
• Example-based MT
• Word-based statistical MT (1990s)
• Phrase-based SMT (early 20th cent.)
• Syntactic SMT (late 20th cent.)
Example-based MT
• 优先级 1
Word-based statistical MT (1990s)
• 优先级 2
Word-based statistical MT (1990s)
•
Phrase-based SMT (early 20th cent.)
• 优先级 3
Phrase-based SMT (early 20th cent.)
•
Moses- an extended, standard system
•
Google Translate
•
Syntactic SMT (late 20th cent.)
•
Current trends
• neural network
neural network
•
searching for a universal language that could recover the
lost language in biblical Adam