Professional Documents
Culture Documents
NMT Vs SMT: Dragos Munteanu
NMT Vs SMT: Dragos Munteanu
Dragos Munteanu
MT approaches
data
input output
Machine
Training + Decoding
model
SMT: Training
parallel
……… ……… monolingual ………
………
……… ………
french english english
……… ……… ………
Statistical Statistical
Analysis Analysis
P(s/t) P(t)
Translation Model Language Model
Translation Score
Language
Alignment Reordering
Model
Syntax
Morphology Smoothing Preordering
Model
Capitalization Transliteration
Word
Deletion
NMT Decoding
-0.2
-0.1
0.1
0.4
-0.3
1.1
ENCODER DECODER
4.3
-0.2
0.5
0.9
1.3
3.4
-5.3
Input
-6.2
Output
4.8
9.3
3.4
Text …
2.6
4.9 Text
0.1
2.6
8.3
-7.3
5.1
1.5
0.6
9.3
-6.2
2.9
1.4
-1.3
A Neural Network
PARAMETERS
A Deep Neural Network
A Deep Recurrent Neural Network
Word representations
Sparse
All words are equally different
Dense
Similar words have similar vectors
Word Embeddings
Predict
Update
Training: Dropout
Long Short Term Memory Units
Attention
NMT Advantages
• Improved quality
18
NMT Opportunities: Multilingual translation
NMT Opportunities: Low resource translation
Alignment Alignment
TRAINING TRANSLATING