Professional Documents
Culture Documents
Word Embedding
+
Recurrent Neural Networks
2
Word Embedding
● For example:
«Soviet» Word2Vec
Similar Context
Similar representation
«Union» Word2Vec
No similar Context
No Similar representation
«Queen» Word2Vec
Text
Some methods use 1D radial profiles obtained circular averaging of experimental
from circular averaging of experimental PSD Word Embedding
or by elliptical averaging. An inadequacy of
circular averaging is that it neglects
astigmatism. Astigmatism distorts the circular
shape of the Thon rings and thus decreases
their modulation depth in the obtained 1D
profile.
LSTM LSTM LSTM LSTM
Forward Cell Cell Cell Cell
Toy example:
5
Our Full solution
Text
circular averaging of experimental
Some methods use 1D radial profiles obtained
from circular averaging of experimental PSD Word Embedding
or by elliptical averaging. An inadequacy of
circular averaging is that it neglects
astigmatism. Astigmatism distorts the circular
shape of the Thon rings and thus decreases
their modulation depth in the obtained 1D
profile.
LSTM LSTM LSTM LSTM
Forward Cell Cell Cell Cell
+ + + +
● Inspec Dataset:
○ Experimental setting:
● Evaluation metrics:
● Test #1: Training a Word Embedding Network on Inspec dataset does not
perform well (Dataset too small)
8
Comparison Results
[1] Hulth: Improved automatic keyword extraction given more linguistic knowledge. In: Proc. of Conference on Empirical
Methods in Natural Language Processing (2003)
[2] Bougouin et al.: Graph-based topic ranking for keyphrase extraction. In: Proc. of Conference on Natural Language
Processing (2013)
[3] . Meng et al. : Deep Keyphrase Generation. In: Proc. of Annual Meeting of the Association for Computational
Linguistics (2017) 9
Some Examples #1
Legenda
● Highlighted words: Results of the proposed approach
● Underline bold: Ground truth
10
Some Examples #2
Note: probably the «scholarly publishing» was selected, because in the training set there
are several time the keyphrase «scholarly publishing model>
Legenda
● Highlighted words: Results of the proposed approach
● Underline bold: Ground truth
11