Welcome to Scribd!

Skip carousel

03 Hackatorial Addon PDF

Uploaded by

coldfrites

0% found this document useful (0 votes)

15 views17 pages

Original Title

03-hackatorial-addon.pdf

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

15 views17 pages

03 Hackatorial Addon PDF

Uploaded by

coldfrites

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 17

Search inside document

Addon

Learning Machine Learning

Nils Reiter

September 26-27, 2018

Nils Reiter (CRETA) Addon September 26-27, 2018 1 / 10

Context/Applications/Entities

Section 1

Context/Applications/Entities

Nils Reiter (CRETA) Addon September 26-27, 2018 2 / 10

Context/Applications/Entities

Preparations
▶ Annotation
▶ Manually, following guidelines, with an annotation tool
▶ Stored with character oﬀsets

Nils Reiter (CRETA) Addon September 26-27, 2018 3 / 10

Context/Applications/Entities

Nils Reiter (CRETA) Addon September 26-27, 2018 3 / 10

Context/Applications/Entities

Preparations
▶ Annotation
▶ Manually, following guidelines, with an annotation tool
▶ Stored with character oﬀsets
▶ Preprocessing
▶ Sentence splitting, part of speech tagging, lemmatization
▶ Done automatically, MHG tagger developed Echelmeyer et al. (2017)
▶ Conversion into TSV format
▶ Putting everything into a reasonable data structure
▶ Preparing some information to be available directly
▶ Let’s look into the data structure

Nils Reiter (CRETA) Addon September 26-27, 2018 3 / 10

Context/Applications/Entities

TSV/CSV vs. CoNLL

▶ It’s actually not a real table
▶ Sentence boundaries are marked with an empty line
▶ Most CSV/TSV libraries will skip those!
▶ Commonly used in NLP

Nils Reiter (CRETA) Addon September 26-27, 2018 3 / 10

Context/Applications/Entities

Entity References → Entities

Persons

Locations

Text World

Figure: Entity references and entities

Nils Reiter (CRETA) Addon September 26-27, 2018 4 / 10

Context/Applications/Entities

The Missing Link

si brâhte dar durch vlühtesal [des werden Gahmuretes kint]PER .

Nils Reiter (CRETA) Addon September 26-27, 2018 5 / 10

Context/Applications/Entities

The Missing Link

si brâhte dar durch vlühtesal [des werden Gahmuretes kint]PER .

▶ Knowing that ‘des werden Gahmuretes kint’ refers to a person is nice,

but not enough
▶ We want to know exactly which person it refers to (in the discourse
context/the ﬁctional world)

Nils Reiter (CRETA) Addon September 26-27, 2018 5 / 10

Context/Applications/Entities

The Missing Link

si brâhte dar durch vlühtesal [des werden Gahmuretes kint]PER .

▶ Knowing that ‘des werden Gahmuretes kint’ refers to a person is nice,

but not enough
▶ We want to know exactly which person it refers to (in the discourse
context/the ﬁctional world)
▶ Can we make this a classiﬁcation task?
▶ What are the objects to classify?
▶ What are the classes?
▶ Where to we get features? What are potential features?

Nils Reiter (CRETA) Addon September 26-27, 2018 5 / 10

Context/Applications/Entities

Option 1: Entities as classes

▶ Each discourse entity is a class

▶ Classiﬁcation of entity references into these classes
▶ Features
▶ Surface, lemma, context, meaning representation, gender, …

Nils Reiter (CRETA) Addon September 26-27, 2018 7 / 10

Context/Applications/Entities

Option 1: Entities as classes

▶ Each discourse entity is a class

▶ Classiﬁcation of entity references into these classes
▶ Features
▶ Surface, lemma, context, meaning representation, gender, …
▶ Direct answer to the problem
▶ Diﬃcult to generalize
▶ Can only be trained/applied within the same discourse!
▶ Nothing to be gained from one text to the next
▶ Unless they have the same set of entities
▶ Established task in BioNLP: Entity tagging

Nils Reiter (CRETA) Addon September 26-27, 2018 7 / 10

Context/Applications/Entities

Option 2: Detect Co-Reference

▶ Binary classiﬁcation: True/False

▶ Classiﬁcation of pairs of entity references into these classes
▶ Do these two entity references refer to the same thing?
▶ Features
▶ Case, number, gender, meaning, context, …

Nils Reiter (CRETA) Addon September 26-27, 2018 8 / 10

Context/Applications/Entities

Option 2: Detect Co-Reference

▶ Binary classiﬁcation: True/False

▶ Classiﬁcation of pairs of entity references into these classes
▶ Do these two entity references refer to the same thing?
▶ Features
▶ Case, number, gender, meaning, context, …
▶ Generalization across texts
▶ The computer can learn something about a general language
phenomenon
▶ Needs post-processing: Naming the co-referent entities
▶ Established task in NLP: Coreference Resolution
▶ Linguistically motivated, theory available

Nils Reiter (CRETA) Addon September 26-27, 2018 8 / 10

Context/Applications/Entities

Modularization!

▶ NLP: Highly modularized pipelines

▶ Each module can be/include a machine learning step
▶ Speciﬁc, small tasks
▶ Dependencies between linguistic layers

Nils Reiter (CRETA) Addon September 26-27, 2018 9 / 10

Context/Applications/Entities

References I

Echelmeyer, Nora, Nils Reiter, and Sarah Schulz. “Ein PoS–Tagger für
„das” Mittelhochdeutsche”. In: Book of Abstracts of DHd 2017. Bern,
Switzerland, Feb. 2017.

Nils Reiter (CRETA) Addon September 26-27, 2018 10 / 10

Conversion Chart T-Scores To Standard Scores PDF
Document1 page
Conversion Chart T-Scores To Standard Scores PDF
coldfrites
No ratings yet
Final Terms BASF-2018-2026 PDF
Document1 page
Final Terms BASF-2018-2026 PDF
coldfrites
No ratings yet
03 Hackatorial Addon PDF
Document17 pages
03 Hackatorial Addon PDF
coldfrites
No ratings yet
Final Terms BASF-2018-2026 PDF
Document1 page
Final Terms BASF-2018-2026 PDF
coldfrites
No ratings yet
Conversion Chart T-Scores To Standard Scores PDF
Document1 page
Conversion Chart T-Scores To Standard Scores PDF
coldfrites
No ratings yet
Textual Analysis in Voyant Tools
Document8 pages
Textual Analysis in Voyant Tools
coldfrites
No ratings yet
At-Risk Students Foreign Language Fact Sheet PDF
Document4 pages
At-Risk Students Foreign Language Fact Sheet PDF
coldfrites
No ratings yet
At-Risk Students Foreign Language Fact Sheet PDF
Document4 pages
At-Risk Students Foreign Language Fact Sheet PDF
coldfrites
No ratings yet
IronmanBarcelona15EAr Agegroup Results
Document48 pages
IronmanBarcelona15EAr Agegroup Results
coldfrites
No ratings yet
Test
Document336 pages
Test
coldfrites
No ratings yet
SyntAX Manual
Document70 pages
SyntAX Manual
coldfrites
No ratings yet
Oasis - Supersonic (Guitar Pro)
Document7 pages
Oasis - Supersonic (Guitar Pro)
coldfrites
No ratings yet
Cartoon Sound Effects Collection
Document179 pages
Cartoon Sound Effects Collection
coldfrites
No ratings yet
IronmanBarcelona15EAr Agegroup Results
Document48 pages
IronmanBarcelona15EAr Agegroup Results
coldfrites
No ratings yet
Tab Triggerfinger All This Dancin Around
Document1 page
Tab Triggerfinger All This Dancin Around
coldfrites
No ratings yet
BS Flyer Final
Document3 pages
BS Flyer Final
coldfrites
No ratings yet
Oasis - Supersonic (Guitar Pro)
Document7 pages
Oasis - Supersonic (Guitar Pro)
coldfrites
No ratings yet
Stylo R Script Mini Howto
Document6 pages
Stylo R Script Mini Howto
coldfrites
No ratings yet
Oasis - Live Forever (Guitar Pro)
Document14 pages
Oasis - Live Forever (Guitar Pro)
coldfrites
No ratings yet
The ABC of PDF With Itext
Document32 pages
The ABC of PDF With Itext
felipe_tavares_12
No ratings yet
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Rating: 4 out of 5 stars
4/5 (5794)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Rating: 3.5 out of 5 stars
3.5/5 (399)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Rating: 4.5 out of 5 stars
4.5/5 (537)
Yes Please
From Everand
Yes Please
Amy Poehler
Rating: 4 out of 5 stars
4/5 (1891)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Rating: 4.5 out of 5 stars
4.5/5 (838)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Rating: 4 out of 5 stars
4/5 (895)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Rating: 4 out of 5 stars
4/5 (98)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Rating: 3.5 out of 5 stars
3.5/5 (231)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Rating: 4 out of 5 stars
4/5 (588)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Rating: 4.5 out of 5 stars
4.5/5 (474)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Rating: 4 out of 5 stars
4/5 (73)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Rating: 4.5 out of 5 stars
4.5/5 (234)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
Rating: 4 out of 5 stars
4/5 (599)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Rating: 4.5 out of 5 stars
4.5/5 (271)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Rating: 4.5 out of 5 stars
4.5/5 (344)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Rating: 4.5 out of 5 stars
4.5/5 (266)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
Rating: 4.5 out of 5 stars
4.5/5 (1712)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Rating: 3.5 out of 5 stars
3.5/5 (137)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
Rating: 3.5 out of 5 stars
3.5/5 (738)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
Rating: 4.5 out of 5 stars
4.5/5 (440)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
Rating: 4 out of 5 stars
4/5 (45)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Rating: 3.5 out of 5 stars
3.5/5 (2219)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
Rating: 4.5 out of 5 stars
4.5/5 (806)
John Adams
From Everand
John Adams
David McCullough
Rating: 4.5 out of 5 stars
4.5/5 (2409)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brene Brown
Rating: 4 out of 5 stars
4/5 (1090)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
Rating: 4 out of 5 stars
4/5 (1015)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
Rating: 4 out of 5 stars
4/5 (1839)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Rating: 4.5 out of 5 stars
4.5/5 (119)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
Rating: 3.5 out of 5 stars
3.5/5 (2322)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
Rating: 4.5 out of 5 stars
4.5/5 (4609)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Toibin
Rating: 3.5 out of 5 stars
3.5/5 (1937)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
Rating: 4.5 out of 5 stars
4.5/5 (789)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
Rating: 4 out of 5 stars
4/5 (3811)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
Rating: 4.5 out of 5 stars
4.5/5 (2100)
Little Women
From Everand
Little Women
Louisa May Alcott
Rating: 4 out of 5 stars
4/5 (104)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Rating: 4 out of 5 stars
4/5 (1103)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
Rating: 3.5 out of 5 stars
3.5/5 (792)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Rating: 4 out of 5 stars
4/5 (4200)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Rating: 4 out of 5 stars
4/5 (821)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
Rating: 3.5 out of 5 stars
3.5/5 (104)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
Rating: 4.5 out of 5 stars
4.5/5 (1929)
MBT2 - Course of Study - Technological Globalization - 3 CUs
Document24 pages
MBT2 - Course of Study - Technological Globalization - 3 CUs
johann garcia
No ratings yet
CitectSCADA CSV - Include PDF
Document130 pages
CitectSCADA CSV - Include PDF
Michael Adu-boahen
No ratings yet
What Is A Bibliography
Document7 pages
What Is A Bibliography
Kaye Diamante Valleser
No ratings yet
IEC 61850 Driver: IEC61850 Device Config Tab
Document66 pages
IEC 61850 Driver: IEC61850 Device Config Tab
Milson Rodrigues
No ratings yet
AStudio 6 0 User Guide PDF
Document602 pages
AStudio 6 0 User Guide PDF
Dhirender Dagar
No ratings yet
The Complete Guide To Facebook Marketing PDF
Document60 pages
The Complete Guide To Facebook Marketing PDF
NataliaDjajic
100% (4)
SPI User Guide
Document1,812 pages
SPI User Guide
Satish Sohani
67% (3)
Chapter Six - Memory Social Media Bartoletti-With-Cover-Page
Document25 pages
Chapter Six - Memory Social Media Bartoletti-With-Cover-Page
Juan David Cárdenas
No ratings yet
UT Fulfiller Training - Final
Document122 pages
UT Fulfiller Training - Final
Krishna Moorthy
No ratings yet
Freshservice
Document36 pages
Freshservice
kamal haridan
No ratings yet
New Features Guide: Informatica (Version 9.1.0)
Document18 pages
New Features Guide: Informatica (Version 9.1.0)
Deepti Rasapalli
No ratings yet
Playbook Sports Guide
Document39 pages
Playbook Sports Guide
Filippe Oliveira
No ratings yet
What is a Multimedia Database
Document11 pages
What is a Multimedia Database
Pranali Narayankar
No ratings yet
Plugins Zotero Documentation
Document7 pages
Plugins Zotero Documentation
archivo_lc
No ratings yet
Parsing Json Appybuilder
Document4 pages
Parsing Json Appybuilder
Depri Pramana
No ratings yet
Turban Ec2012 PP 02
Document52 pages
Turban Ec2012 PP 02
dafes daniel
No ratings yet
Empower Youth with ICT Skills
Document5 pages
Empower Youth with ICT Skills
Kim Charlotte Balicat-Rojo Manzori
No ratings yet
Lesson 1 Introduction To Ict
Document49 pages
Lesson 1 Introduction To Ict
enajaral17
No ratings yet
Inspecting A Dataset
Document7 pages
Inspecting A Dataset
Nurmuslimahhh
No ratings yet
Blackhat Earning by Akimor
Document17 pages
Blackhat Earning by Akimor
Hboy
83% (6)
FortiGate - Using VMWare VASA With IBM Storwize Family and IBM XIV
Document24 pages
FortiGate - Using VMWare VASA With IBM Storwize Family and IBM XIV
Serge Palla
No ratings yet
TSAE Golding 1992 PHD
Document165 pages
TSAE Golding 1992 PHD
Jose Manuel Hernandez
No ratings yet
Presentation Completions
Document34 pages
Presentation Completions
ian84
No ratings yet
HTML Next Steps
Document18 pages
HTML Next Steps
VANSHIKA AGARWAL
No ratings yet
211 Apps
Document34 pages
211 Apps
Anonymous gVssqt
No ratings yet
DEVONthink Pro Office Manual PDF
Document146 pages
DEVONthink Pro Office Manual PDF
norzang
No ratings yet
Domino Tags
Document5 pages
Domino Tags
Amit Kumar
No ratings yet
UserGuide Indusoft
Document910 pages
UserGuide Indusoft
José
No ratings yet
Name of Student: - Learning Area: ICF 9 Date: - Quarter 3, Week 1
Document6 pages
Name of Student: - Learning Area: ICF 9 Date: - Quarter 3, Week 1
racquel jimenez
No ratings yet
Empowerment Technologies Gradee 11-12 Module 1
Document32 pages
Empowerment Technologies Gradee 11-12 Module 1
Eme Guill Fuertes
No ratings yet