A Practical Framework
Elwin Huaman and Dieter Fensel
Semantic Technology Institute (STI) Innsbruck
Department of Computer Science,
University of Innsbruck, Austria
IJCKG 2021
Outline
● What?
Basics - Research questions
● How?
Approach - Solution
● Why?
Motivation
Which KG is best for me?
What about their:
● Quality
● Correctness
● Completeness
● Verification
○ Check schema conformance and integrity constraints.
■ RDFUnit, SHACL, ShEx, SPIN, Stardog ICV, ...
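The verification step above can be sketched in a few lines. This is only an illustration of constraint checking over triples, not the API of SHACL, ShEx, or any tool listed; the `Triple` type, the `CONSTRAINTS` table, the `verify` function, and the sample data are all hypothetical.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Hypothetical constraints: per predicate, a value check and an optional
# maximum number of values per subject (None means unbounded).
CONSTRAINTS = {
    "schema:name": {
        "check": lambda v: len(v.strip()) > 0,
        "max_count": 1,
    },
    "schema:telephone": {
        "check": lambda v: v.replace("+", "").replace(" ", "").isdigit(),
        "max_count": None,
    },
}


def verify(triples):
    """Return a list of (subject, predicate, message) constraint violations."""
    violations = []
    counts = {}
    for t in triples:
        rule = CONSTRAINTS.get(t.predicate)
        if rule is None:
            continue  # no constraint declared for this predicate
        if not rule["check"](t.obj):
            violations.append((t.subject, t.predicate, f"invalid value {t.obj!r}"))
        key = (t.subject, t.predicate)
        counts[key] = counts.get(key, 0) + 1
    # Cardinality check: too many values for one subject and predicate.
    for (s, p), n in counts.items():
        max_count = CONSTRAINTS[p]["max_count"]
        if max_count is not None and n > max_count:
            violations.append((s, p, f"{n} values, at most {max_count} allowed"))
    return violations


# Made-up sample data for illustration only.
kg = [
    Triple("ex:hotel1", "schema:name", "Hotel Alpenblick"),
    Triple("ex:hotel1", "schema:name", "Alpenblick Hotel"),
    Triple("ex:hotel1", "schema:telephone", "+43 512 123456"),
    Triple("ex:hotel2", "schema:telephone", "call us"),
]
report = verify(kg)
for violation in report:
    print(violation)
```

In practice the constraints would be declared in a shapes language such as SHACL or ShEx rather than hard-coded as Python lambdas.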
❏ Detecting errors
❏ Correcting errors
● Validation
○ Compare with the "real" world, a.k.a. fact checking.
■ COPAAL, DeFacto, FactCheck, FacTify, Leopard, Surface, Tracy
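Validation can be illustrated as scoring a KG value against what external sources report. The sketch below assumes a weighted average of string similarities yielding a confidence in [0, 1]; the source names, weights, and the `validate_value` function are hypothetical and do not reproduce any of the listed fact-checking tools.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def validate_value(kg_value, source_values, weights):
    """Weighted average similarity between a KG value and external sources.

    source_values maps a source name to the value that source reports;
    weights maps a source name to its (assumed) importance.
    """
    total = sum(weights[name] for name in source_values)
    if total == 0:
        return 0.0
    score = sum(
        weights[name] * similarity(kg_value, value)
        for name, value in source_values.items()
    )
    return score / total


# Hypothetical sources and weights, for illustration only.
weights = {"source_a": 0.6, "source_b": 0.4}
score_good = validate_value(
    "Hotel Alpenblick",
    {"source_a": "Hotel Alpenblick", "source_b": "hotel alpenblick"},
    weights,
)
score_bad = validate_value(
    "Hotel Alpenblick",
    {"source_a": "Gasthof Sonne", "source_b": "Gasthof Sonne"},
    weights,
)
print(score_good, score_bad)
```

A low score only flags a triple for review; whether the KG or the external source is wrong still has to be decided separately.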
❏ Finding relevant KGs
❏ Duplicate detection
● Duplicate detection
○ Identifying duplicates of the same entity within a single KG or across several KGs.
■ ADEL, DDaaS, Dedupe, DuDe, Duke, Legato, LIMES, SERIMI, Silk, …
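A minimal sketch of duplicate detection, assuming entities are plain records compared pairwise on a name and a postal code. The threshold, the field names, and the `find_duplicates` function are assumptions made for illustration; tools such as Dedupe, LIMES, or Silk use far richer matching and blocking strategies.

```python
from difflib import SequenceMatcher
from itertools import combinations


def name_similarity(a, b):
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def find_duplicates(entities, threshold=0.85):
    """Return (id_a, id_b, score) for entity pairs that look like duplicates.

    Two entities are treated as duplicate candidates when their names are
    similar enough and their postal codes agree.
    """
    pairs = []
    for (id_a, a), (id_b, b) in combinations(entities.items(), 2):
        score = name_similarity(a["name"], b["name"])
        if score >= threshold and a.get("postal_code") == b.get("postal_code"):
            pairs.append((id_a, id_b, round(score, 2)))
    return pairs


# Made-up records from two hypothetical KGs.
entities = {
    "kg1:h1": {"name": "Hotel Alpenblick", "postal_code": "6020"},
    "kg2:x9": {"name": "hotel alpenblick", "postal_code": "6020"},
    "kg2:x7": {"name": "Gasthof Sonne", "postal_code": "6020"},
}
pairs = find_duplicates(entities)
print(pairs)
```

Comparing every pair is quadratic in the number of entities, so a real setup would first restrict candidates (for example by postal code) before scoring.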
[Figure: framework architecture diagram. Recoverable labels: Domain Specification, Mapping & Indexing, Weights, Quality Metrics, Assessing KGs, Assessment Report, Verification Constraints, Verifier, Verification Report, Validation Strategies, Validator, [0.1] Instance Validation, [0.1] Triple Validation, Validation Report, Configuration Learning, Instance Matching, Duplicates, Fusion Strategies, Entity Fusion, Fusion Report. Several components are marked <<datastore>>.]
Are they needed?
In May 2016, Joshua Brown was killed by his car because its autopilot mistook a very long vehicle (large wheelbase) for a traffic sign.