Professional Documents
Culture Documents
Recognizer Classifier and Linker
Recognizer Classifier and Linker
Concepts
ECHR: European Court of Human Rights
Summary
(What) A legal Named Entity Recognizer, Classifier and Linker is developed to identify
relevant parts of legal texts and connect them to a structured knowledge representation,
the LKIF ontology.
(How) The Named Entity Recognizer, Classifier and Linker is trained on the mentions of
entities in the Wikipedia (manually annotated examples) and is able to map the LKIF
ontology to the YAGO ontology and through it.
(Performance) The proposed approach achieves an around 80% F-measure for different
levels of granularity on two testing texts (one from wikipedia and another from a sample of
legal judgments), so this approach has potentiality to be applied to other legal sub-domains,
represented by different ontologies.
Three criticisms
(Data preprocessing) The author did not pre-process the training texts to balance the
classes for learners.
(Testing set not representative) The author's approach tested on holdout texts from the
Wikipedia and a small sample of judgments of the ECHR only is not representative enough to
ensure this approach can be ported to other legal sub-domains (A more representative
testing dataset is required to evaluate the performance).
(Endless loop) The author did not use strictly manually annotated texts as the training set to
train the Entity Recognizer, Classifier and Linker, and then gained an model to be able to pre-
annotate the legal domain articles of Wikipedia (trained on less strictness and used as less
strict application, pre-annotation).