Professional Documents
Culture Documents
Motivation
Massive Knowledge found as Natural Language Text based Question Answering (no tagging) Open Domain Question Answering Address Commonsense Reasoning Problem Linguistically motivated KRR
Integrate State of the Art Tools Ultimate goal: Getting closer to strong AI
Natural Language is parsed Utterances are represented using CGs Concepts and Relation types are mapped to Cyc equivalent counterparts Type hierarchies are computed Knowledge is accumulated If the input is a question
NLP
KRR
Commonsense
CCG
CGs
Open Cyc
C&C Tools
Cogitant
Background
C&C Tools
Cogitant
Conceptual Graphs
Open Cyc
Functions can be applied or composed Arguments can be picked up or turned into functors (Type raising) Small number of semantically transparent combinatory rules to combine CCG categories.
C&C Tools
Linguistically Motivated Large-Scale NLP with C&C and Boxer. (Curran, Clark, Bos, 2007)
C&C Parser POS Tagging, Supertagging Parsing, Chunking Named Entity Recognition Boxer Uses CCG parser output Generates DRS Semantic Representations
http://svn.ask.it.usyd.edu.au/trac/candc/wiki
C&C Tools
Large Scale NLP is possible with C&C and Boxer C&C Parser: state of the art parser for CCG Boxer: Semantic representations in DRS
Open Cyc
Open source version of Cyc system Cyc: greatest effort to encode Common Sense knowledge in machine processable way 500.000 concepts 26.000 relations and 5.000.000 assertions CycL language similar to Lisp We use Cyc to map parsed words to common sense counterparts such as person to #$Person (disambiguation)
(#$likesAsFriend #$GeorgeWBush #$AlGore) #$isa, #$genls (#$isa #$GeorgeWBush #$UnitedStatesPresident) (#$genls #$UnitedStatesPresident #$Person)
Cogitant
Library for Conceptual Graph operations Supports broad CG operations (Genest & Salvat, 1998)
Architecture - Modules
CG operations
Persistence of CGs
System Definition
User enters a sentence from web interface; This sentence is converted to CGIF using the NLP module; CGIF is converted to CGs using the reasoning module; Support is generated to CGs using the common sense module; Common sense rules gathered from common sense module are applied to CGs using reasoning module; CGs are merged to the previous ones using reasoning module; If the input sentence is a question sentence, same operations take place, except the resulting graph is used to query existing CGs using the reasoning service, and if there are projections from this query graph to previous CGs, results are displayed to the user; CGs are persisted using the storage module.
Cyc: (prettyString TERM STRING) Chain up to #$Thing using #$genls relations Same for relations using #$genlPreds Relation hierarchies are converted to forward rules
#$Place -> #$EnduringThing-Localized -> #$Location-Underspecified -> #$Thing -> #$SomethingExisting -> #$Individual -> #$Thing ^^ #$Trajector-Underspecified -> #$Location-Underspecified ^^ #$TemporallyExistingThing -> #$TemporalThing -> #$Individual ^^ #$SpatialThing-Localized -> #$TemporallyExistingThing ^^ #$SpatialThing -> #$Individual ^^ #$Boundary-Underspecified -> #$Region-Underspecified -> #$Location-Underspecified ^^ #$Landmark-Underspecified -> #$Individual ^^ #$Location-Underspecified ^^ #$SpatialThing-NonSituational -> #$SpatialThing ^^ #$Individual ^^ #$Location-Underspecified ^^
Answering Queries
Significance
Sentences like;
What are the intangible things in this situation? Was Mr. Hyde there while eating the apples? Does Mr. Hyde exist after eating the apples? Do the apples exist after Mr. Hyde ate them?
Deep Natural Language Understanding State of the art tools Open domain question answering
Difficulties
Does not work in Turkish locale (fixes are sent to maintainers) Still, provided API sends one IP packet per character, way too slow over network Custom socket API is developed and used over TCP Custom Lisp functions for generalization hierarchy and concept mapping
Java API is very limited (compared to C++) Only works over XML files
Cogitant problematic
Conclusion
Central Integrated Common Sense QAS CCG for Natural Language Processing Conceptual Graphs for KRR Cyc for Common Sense
Future Work
Implement Rule Induction Backward Chaining (Resolution) Improve NLP module and Common Sense mapping Probabilistic Reasoning Question Answering System (QAS) to be used in;
Education (Learning Management Systems) Semantic Search (Content Management Systems) Intelligent Help