SlideShare a Scribd company logo
From Text to Reasoning
Marko Grobelnik
Jozef Stefan Institute / Cycorp Europe, Slovenia
SWANK Workshop, Stanford, Apr 16th 2014Thanks to Michael Witbrock, Janez Starc, Luka Bradesko, Blaz Fortuna
Reflection on what should be the goal of
NLP
• The (mostly) forgotten long term aim of NLP is to understand the text
• …and not so much ‘processing’ itself (as NLP suggests)
• The curse of shallow solutions working well enough for too many
problems, made people (and researchers) happy for too long
• …as much as information retrieval and text mining are useful, they delayed
development of “text understanding”
Language vs. World
• …if we agree with the above statement, then at this point in time, we
have ‘language’, but the ‘world’ is more or less missing
• So – so what a ‘world’ or ‘world model’ could be?
CYC KNOWLEDGE BASE
Thing
Universe
isa
isa
Celestial
Body
isa
located in
Planet
subclass
Earth
isa
Animal
isa
Human
subclas
s
Physics
Money
Mathematics
Chemistry
Time
Learning
FoodVehicles
Event
Education
School
Language
LoveEmotions Going for a
walk
Death
Cat
Euro
Working
Words
Driving
RainStabbing someone
Nature
Tree
Hatred
Fear
Physics
Time
Learning
Vehicles
Event
Education
School
Emotions
Going for a
walk
Death
Cat
EuroWords
Driving
Rain
Stabbing someone
Nature
Tree
Hatred
Fear
Planet
Earth
isa
Human
Physics
Money
Mathematics
Chemistry
Time
Learning
FoodVehicles
Event
Education
Languag
e LoveEmotions Going for a
walk
Cat
Euro
Working
Words
Driving Rain
Tree
Hatred
Fear
Learning
Vehicles
Event
Education
School
Emotions
Euro
Driving
Stabbing someone
Hatred
Fear
Creating a World Model (top-down approach -
Cyc)
Model of the world…
• …beyond surface knowledge
• …to interconnect contextualized fragments
Why?
• To make reasoning capable of connecting
isolated fragments of knowledge
• To derive new knowledge beyond
materialized factual knowledge
World model
Top-down KA
Bottom-up KA
Multimodal data
Why we need a
World model?
Disambiguation with
a
world model
(CycKB)World model used as a set of common-sense semantic
constraints to disambiguate text
One of the challenges for the future: Micro-reading
• It is “easier” to understand millions of documents than one document
• …reading and understanding a single document is micro-reading
• The following experiment is on how much knowledge we can extract
from individual documents
• …extraction is in a form of first order inferentially productive Cyc logic
• …allowing us full reasoning to identify new facts
• …minimizing human involvement, optimizing precision and recall
Document Assertions Reasoning Dialogue
Example of text and extracted Cyc assertions
(1/2)
Automatically Extracted Assertions:
• (isa ?V1 ProsecutingEvent)
• (agent ?V1 RudyGiuliani)
• (genls Entity Agent)
• (isa RudyGiuliani Agent)
• (isa RudyGiuliani Entity)
• (isa ?V3 OrganizingEvent)
• (patient ?V3 (IntersectionFn
OrganizedCrime WallStreet))
• (isa (IntersectionFn OrganizedCrime
WallStreet) Patient)
• (genls Entity Patient)
• (isa OrganizedCrime Patient)
• (isa OrganizedCrime Entity)
• (isa WallStreet Patient)
• (isa WallStreet Entity)
Sentence:
He prosecuted a number of high-profile cases, including ones
against organized crime and Wall_Street financiers.
Example of text and extracted Cyc assertions
(2/2)
Automatically Extracted Assertions:
• (isa ?V1 SubstitutingEvent)
• (temporal ?V1 Lincoln)
• (genls Entity Agent)
• (isa Lincoln Agent)
• (genls Person Entity)
• (isa Lincoln Entity)
• (isa Lincoln Person)
• (isa ?V3 SucceedingEvent)
• (temporal ?V3 Grant)
• (isa Grant Agent)
• (isa Grant Entity)
• (isa Grant Person)
Sentence:
Each time a general failed, Lincoln substituted another
until finally Grant succeeded in 1865.
Reasoning on extracted assertions (Cyc)
Query:
(and
(isa ?Per Person)
(birthDate ?Per ?BD)
(occursBefore ?BD WorldWarII)
(thereExistsAtLeast 2 ?Role
(lifeRole ?Per ?Role)
(roleInIndustry ?Role FilmIndustry)
)
)
Answers:
Sir Derek_George_Jacobi
Sir Alexander_Korda
Victor Lonzo_Fleming
John_Francis_Junkin
Cornel_Wilde
George_Stevens
Bertrand_Blier
NL Query:
People born before World War II who had at least two roles in the film industry KB?
Knowledge Capture Knowledge Use
Rule:
(implies (and
(isa ?VENUE FoodTruck-Organization)
(lastVenue ?USER ?VENUE)
(suggestionsForCuriousCatQuestionType FoodTruckSecondaryTypeOfPlace-
CuriousCatQuestion ?SUGGESTIONLIST))
(curiousCatWantsToAskUser ?USER
(secondaryTypeOfPlace ?VENUE FoodTruck-Organization ?TYPE) ?SUGGESTIONLIST))
Witbrock, M., Bradeško, L., 2013,
Conversational Computation in
Michelucci, Pietro (Ed.)
Handbook of Human Computation,
531-543.
Intelligent
SIRI:
http://curiouscat.cc/
Some of the AI challenges for next years
• Background knowledge in a form of a World Model
• …to have knowledge contextualized
• Representing and scalable reasoning knowledge with
operational soft logic
• …to decrease brittleness of logic and increase scale
• Economically viable structured knowledge acquisition with
high precision and recall
• …to increase the reach of what we can acquire
• Emphasizing understanding vs. applying black box models

More Related Content

PPTX
Language as social sensor - Marko Grobelnik - Dubrovnik - HrTAL2016 - 30 Sep ...
PPTX
Global Media Monitor - Marko Grobelnik
PDF
Lecture: Semantic Word Clouds
PDF
Relation Extraction
PDF
Digital Humanities and “Digital” Social Sciences
PPTX
Introduction to nlp
DOC
Peace%20 building%20activites%20to%20faster%20peace%20culture%20in%20schools[1]
PPTX
Global Diaspora Services_Catalysts For Change Zone of Future Innovtion
Language as social sensor - Marko Grobelnik - Dubrovnik - HrTAL2016 - 30 Sep ...
Global Media Monitor - Marko Grobelnik
Lecture: Semantic Word Clouds
Relation Extraction
Digital Humanities and “Digital” Social Sciences
Introduction to nlp
Peace%20 building%20activites%20to%20faster%20peace%20culture%20in%20schools[1]
Global Diaspora Services_Catalysts For Change Zone of Future Innovtion

Viewers also liked (20)

PPT
ME 10 FAMOUSE NEW ZELANDEZ
PPT
Presentatie szigetonderzoek
PPTX
Mobile marketing-basics-101
PPT
Did you ever_think_
PPTX
The Nature of the Future: The Socialstructed World (Marina Gorbis Keynote)
PDF
2010.01.01 inventarisatie
PDF
Patents in a Knowledge Economy 2011, Bangalore, India
PPTX
South Africa
PPT
Networking sample1
PPTX
Adaptive Shelters_Catalysts For Change Zone of Future Innovtion
PPTX
Rural Youth Stewards_Catalysts For Change Zone of Future Innovtion
PDF
Social determinantshealthwho
PPTX
All Of The Above
PPT
Test
PDF
Aula android 03
PPT
Competencies to do in class
PDF
Transcript (2)
PDF
Tenth India Innovation Summit 2014 - Innovation for Inclusive Growth
PPTX
Innomantra - Intellectual Property Consulting & Services
PPTX
Advance Sql Server Store procedure Presentation
ME 10 FAMOUSE NEW ZELANDEZ
Presentatie szigetonderzoek
Mobile marketing-basics-101
Did you ever_think_
The Nature of the Future: The Socialstructed World (Marina Gorbis Keynote)
2010.01.01 inventarisatie
Patents in a Knowledge Economy 2011, Bangalore, India
South Africa
Networking sample1
Adaptive Shelters_Catalysts For Change Zone of Future Innovtion
Rural Youth Stewards_Catalysts For Change Zone of Future Innovtion
Social determinantshealthwho
All Of The Above
Test
Aula android 03
Competencies to do in class
Transcript (2)
Tenth India Innovation Summit 2014 - Innovation for Inclusive Growth
Innomantra - Intellectual Property Consulting & Services
Advance Sql Server Store procedure Presentation
Ad

Similar to From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014 (8)

PPT
Artificial intelligence
PPT
DS Mirrors Artificial Intelligence ppt.ppt
PPT
Intoduction of Artificial Intelligence
PPT
Artificial Intelligence
PPTX
Introduction to AI - Second Lecture
DOCX
1Running Head FUTURE AT THE VETERAN AFFAIRS2FUTURE AT T.docx
DOCX
1Running Head FUTURE AT THE VETERAN AFFAIRS2FUTURE AT T.docx
PPTX
KRR Unit-IV for btech Students helpful.pptx
Artificial intelligence
DS Mirrors Artificial Intelligence ppt.ppt
Intoduction of Artificial Intelligence
Artificial Intelligence
Introduction to AI - Second Lecture
1Running Head FUTURE AT THE VETERAN AFFAIRS2FUTURE AT T.docx
1Running Head FUTURE AT THE VETERAN AFFAIRS2FUTURE AT T.docx
KRR Unit-IV for btech Students helpful.pptx
Ad

Recently uploaded (20)

PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PDF
Microsoft Core Cloud Services powerpoint
PPT
DU, AIS, Big Data and Data Analytics.ppt
PDF
Microsoft 365 products and services descrption
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PDF
annual-report-2024-2025 original latest.
PDF
Navigating the Thai Supplements Landscape.pdf
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
PPTX
Business_Capability_Map_Collection__pptx
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
Leprosy and NLEP programme community medicine
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
DOCX
Factor Analysis Word Document Presentation
PDF
Transcultural that can help you someday.
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
Microsoft Core Cloud Services powerpoint
DU, AIS, Big Data and Data Analytics.ppt
Microsoft 365 products and services descrption
Optimise Shopper Experiences with a Strong Data Estate.pdf
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
annual-report-2024-2025 original latest.
Navigating the Thai Supplements Landscape.pdf
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
Business_Capability_Map_Collection__pptx
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Leprosy and NLEP programme community medicine
Topic 5 Presentation 5 Lesson 5 Corporate Fin
Factor Analysis Word Document Presentation
Transcultural that can help you someday.
Pilar Kemerdekaan dan Identi Bangsa.pptx

From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014

  • 1. From Text to Reasoning Marko Grobelnik Jozef Stefan Institute / Cycorp Europe, Slovenia SWANK Workshop, Stanford, Apr 16th 2014Thanks to Michael Witbrock, Janez Starc, Luka Bradesko, Blaz Fortuna
  • 2. Reflection on what should be the goal of NLP • The (mostly) forgotten long term aim of NLP is to understand the text • …and not so much ‘processing’ itself (as NLP suggests) • The curse of shallow solutions working well enough for too many problems, made people (and researchers) happy for too long • …as much as information retrieval and text mining are useful, they delayed development of “text understanding”
  • 3. Language vs. World • …if we agree with the above statement, then at this point in time, we have ‘language’, but the ‘world’ is more or less missing • So – so what a ‘world’ or ‘world model’ could be?
  • 4. CYC KNOWLEDGE BASE Thing Universe isa isa Celestial Body isa located in Planet subclass Earth isa Animal isa Human subclas s Physics Money Mathematics Chemistry Time Learning FoodVehicles Event Education School Language LoveEmotions Going for a walk Death Cat Euro Working Words Driving RainStabbing someone Nature Tree Hatred Fear Physics Time Learning Vehicles Event Education School Emotions Going for a walk Death Cat EuroWords Driving Rain Stabbing someone Nature Tree Hatred Fear Planet Earth isa Human Physics Money Mathematics Chemistry Time Learning FoodVehicles Event Education Languag e LoveEmotions Going for a walk Cat Euro Working Words Driving Rain Tree Hatred Fear Learning Vehicles Event Education School Emotions Euro Driving Stabbing someone Hatred Fear Creating a World Model (top-down approach - Cyc)
  • 5. Model of the world… • …beyond surface knowledge • …to interconnect contextualized fragments Why? • To make reasoning capable of connecting isolated fragments of knowledge • To derive new knowledge beyond materialized factual knowledge World model Top-down KA Bottom-up KA Multimodal data Why we need a World model?
  • 6. Disambiguation with a world model (CycKB)World model used as a set of common-sense semantic constraints to disambiguate text
  • 7. One of the challenges for the future: Micro-reading • It is “easier” to understand millions of documents than one document • …reading and understanding a single document is micro-reading • The following experiment is on how much knowledge we can extract from individual documents • …extraction is in a form of first order inferentially productive Cyc logic • …allowing us full reasoning to identify new facts • …minimizing human involvement, optimizing precision and recall Document Assertions Reasoning Dialogue
  • 8. Example of text and extracted Cyc assertions (1/2) Automatically Extracted Assertions: • (isa ?V1 ProsecutingEvent) • (agent ?V1 RudyGiuliani) • (genls Entity Agent) • (isa RudyGiuliani Agent) • (isa RudyGiuliani Entity) • (isa ?V3 OrganizingEvent) • (patient ?V3 (IntersectionFn OrganizedCrime WallStreet)) • (isa (IntersectionFn OrganizedCrime WallStreet) Patient) • (genls Entity Patient) • (isa OrganizedCrime Patient) • (isa OrganizedCrime Entity) • (isa WallStreet Patient) • (isa WallStreet Entity) Sentence: He prosecuted a number of high-profile cases, including ones against organized crime and Wall_Street financiers.
  • 9. Example of text and extracted Cyc assertions (2/2) Automatically Extracted Assertions: • (isa ?V1 SubstitutingEvent) • (temporal ?V1 Lincoln) • (genls Entity Agent) • (isa Lincoln Agent) • (genls Person Entity) • (isa Lincoln Entity) • (isa Lincoln Person) • (isa ?V3 SucceedingEvent) • (temporal ?V3 Grant) • (isa Grant Agent) • (isa Grant Entity) • (isa Grant Person) Sentence: Each time a general failed, Lincoln substituted another until finally Grant succeeded in 1865.
  • 10. Reasoning on extracted assertions (Cyc) Query: (and (isa ?Per Person) (birthDate ?Per ?BD) (occursBefore ?BD WorldWarII) (thereExistsAtLeast 2 ?Role (lifeRole ?Per ?Role) (roleInIndustry ?Role FilmIndustry) ) ) Answers: Sir Derek_George_Jacobi Sir Alexander_Korda Victor Lonzo_Fleming John_Francis_Junkin Cornel_Wilde George_Stevens Bertrand_Blier NL Query: People born before World War II who had at least two roles in the film industry KB?
  • 11. Knowledge Capture Knowledge Use Rule: (implies (and (isa ?VENUE FoodTruck-Organization) (lastVenue ?USER ?VENUE) (suggestionsForCuriousCatQuestionType FoodTruckSecondaryTypeOfPlace- CuriousCatQuestion ?SUGGESTIONLIST)) (curiousCatWantsToAskUser ?USER (secondaryTypeOfPlace ?VENUE FoodTruck-Organization ?TYPE) ?SUGGESTIONLIST)) Witbrock, M., Bradeško, L., 2013, Conversational Computation in Michelucci, Pietro (Ed.) Handbook of Human Computation, 531-543. Intelligent SIRI: http://curiouscat.cc/
  • 12. Some of the AI challenges for next years • Background knowledge in a form of a World Model • …to have knowledge contextualized • Representing and scalable reasoning knowledge with operational soft logic • …to decrease brittleness of logic and increase scale • Economically viable structured knowledge acquisition with high precision and recall • …to increase the reach of what we can acquire • Emphasizing understanding vs. applying black box models