
A Low-cost, High-coverage Legal Named Entity Recognizer, Classifier and Linker

Yu Chen, 03722670

Concepts
ECHR: European Court of Human Rights

Summary
(What) A legal Named Entity Recognizer, Classifier and Linker is developed to identify
relevant parts of legal texts and connect them to a structured knowledge representation,
the LKIF ontology.
(How) The Named Entity Recognizer, Classifier and Linker is trained on mentions of
entities in Wikipedia (in place of manually annotated examples) and relies on a mapping
from the LKIF ontology to the YAGO ontology and, through it, to Wikipedia entities.
(Performance) The proposed approach achieves an F-measure of around 80% at different
levels of granularity on two test sets (one from Wikipedia and one from a sample of
legal judgments), so it has the potential to be applied to other legal sub-domains
represented by different ontologies.
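The reported scores are not reproduced here, but the F-measure itself is straightforward to compute from true-positive, false-positive and false-negative counts. A minimal sketch (the counts below are hypothetical, chosen only so the result lands near the paper's 80% figure):

```python
def f_measure(tp, fp, fn):
    """F1 score: harmonic mean of precision and recall."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Hypothetical counts for one level of granularity:
# 80 correct entity links, 20 spurious, 20 missed.
score = f_measure(tp=80, fp=20, fn=20)
print(round(score, 2))  # 0.8
```

Note that with multiple entity classes, micro- and macro-averaged F-measures can differ sharply when the classes are imbalanced, which matters for the imbalance criticism below.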

Three positive aspects


Less effort is needed to build the training texts, since the author uses mentions of
entities in Wikipedia in place of manually annotated training samples.
The author identifies a plausible main source of classification errors: the bigger
classes (populated with more mentions in Wikipedia text) impose a very distinct
conceptualization.
The developed tools and resources are open source, freely available to anyone, and can
be reproduced for any legal sub-domain of interest.

Three criticisms
(Data preprocessing) The author did not pre-process the training texts to balance the
classes for the learners.
(Testing set not representative) The approach was tested only on holdout texts from
Wikipedia and a small sample of ECHR judgments, which is not representative enough to
show that it can be ported to other legal sub-domains; a more representative test
dataset is needed to evaluate its performance.
(Circularity) The author trained the Entity Recognizer, Classifier and Linker on texts
that were not strictly manually annotated, and then used the resulting model to
pre-annotate the legal-domain articles of Wikipedia: the model is trained on loosely
annotated data and applied to produce yet more loose annotations.
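The class-balancing step called for in the first criticism can be as simple as random oversampling of minority classes before training. A minimal sketch (the mention/label pairs are invented for illustration, not taken from the paper's data):

```python
import random

def oversample(examples, labels, seed=0):
    """Randomly duplicate minority-class examples until every class
    matches the size of the largest class (simple random oversampling)."""
    rng = random.Random(seed)
    by_class = {}
    for x, y in zip(examples, labels):
        by_class.setdefault(y, []).append(x)
    target = max(len(xs) for xs in by_class.values())
    out = []
    for y, xs in by_class.items():
        balanced = xs + [rng.choice(xs) for _ in range(target - len(xs))]
        out.extend((x, y) for x in balanced)
    return out

# Toy mention/label pairs: "Organisation" dominates "Document".
data = oversample(
    ["ECHR", "UN", "ICC", "Art. 6"],
    ["Organisation", "Organisation", "Organisation", "Document"],
)
print(sorted(label for _, label in data).count("Document"))  # 3
```

Duplicating examples is the crudest option; class weights in the learner or undersampling the dominant classes are common alternatives, and any of them should be validated against a held-out set, which connects this criticism to the third question below.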

Three questions to the author


Can mentions of entities in Wikipedia reliably replace strictly manually annotated
examples?
Can your approach serve as a standard tool for legal experts to create strictly
manually annotated training sets?
How would you verify that class balancing improves the performance of your
approach?
