Professional Documents
Culture Documents
Cogsci Boot
Cogsci Boot
must be acquired from language specific A2 B1 early language corpus (e.g., selected this, • Robustness of results with respect to
X2 A2 X1,2,3 B2
distributional information.
A3 B3
not the, as a determiner seed)
seed selection (few seeds, ambiguity
A3 B3
• Popular methods in cognitive modeling • 220,000 words mapped to 7 categories
allowed)
A1 B1
(e.g., Redington et al. 1996) and X3 • The online learning algorithm leads to
A3 B2 • 25-fold cross-validation
unsupervised NLP (e.g., Haghighi & Klein developmental predictions and can
2006) are computationally expensive yet lexical and category frames • Accuracy 62.44% (baseline: 20.5%)
incorporate other mechanisms of word
still perform poorly
the_N_is: (the_is, D_is, the_V, D_V)⟹N learning (e.g., Lederer et al. 1999,
Exp 2: WSJ/Chinese Treebanks
• Simple and empirically motivated Stevens et al. 2016)
Two frame-based distance metrics • Haghighi & Klein (2006): large word vectors
methods (Mintz 2000) do not scale well • Future work: incorporation of structural
(Chemla et al. 2009)
Identity KL-clusters • Top 3 words per category (WSJ 45, Chinese information (e.g., morphology, non-local
w∈C, AwB, AxB
KLAB(w, x) clusterers
33): current model uses only text
contexts, abstract syntax)
• Need an effective model with better
⟹x∈C
⟹x∈argmin(x, C)