SlideShare a Scribd company logo
Tetherless World Constellation

Why Watson Won: A
cognitive perspective
Jim Hendler
and Simon Ellis
Tetherless World Professor of Computer,Web and Cognitive Sciences
Director, Rensselaer Institute for Data Exploration and Applications

Rensselaer Polytechnic Institute (RPI)
http://www.cs.rpi.edu/~hendler
@jahendler (twitter)
IBM Watson
Tetherless World Constellation
How’d I get into it? Watson and Semantic Web
Tetherless World Constellation

IBM
Watson and Semantic Web
Tetherless World Constellation

IBM
Is Watson cognitive?

???

“The computer‟s techniques for unraveling Jeopardy! clues sounded
just like mine. That machine zeroes in on key words in a clue, then
combs its memory (in Watson‟s case, a 15-terabyte data bank of
human knowledge) for clusters of associations with those words. It
rigorously checks the top hits against all the contextual information it
can muster: the category name; the kind of answer being sought; the
time, place, and gender hinted at in the clue; and so on. And when it
feels „sure‟ enough, it decides to buzz. This is all an instant, intuitive
process for a human Jeopardy! player, but I felt convinced that under
the hood my brain was doing more or less the same thing.”
— Ken Jennings
Outline
Tetherless World Constellation

• Is Ken right?
– How Watson Works
– Watson as a cognitive architecture??
– Beyond Watson
Inside Watson

???

Watson pipeline as published by IBM; see IBM J Res & Dev 56 (3/4), May/July 2012, p.
Question Analysis

???
???

Question analysis


What is the question asking for?



Which terms in the question refer to the answer?



Given any natural language question, how can Watson
accurately discover this information?

Who is the president of
Rensselaer Polytechnic Institute?

Question
Analysis

Focus Terms:
“Who”, “president of Rensselaer
Polytechnic Institute”
Answer Types: Person, President
Parsing and semantic analysis


What information about a previously unseen piece of
English text can Watson determine?



???

How is this information useful?

Natural Language Parsing

Semantic Analysis

- grammatical structure

- meanings of words, phrases, etc.

- parts of speech

- synonyms, entailment

- relationships between words

- hypernyms, hyponyms

- ...etc.

- ...etc.
Question analysis pipeline

Unstructured
Question Text

Parsing
&
Semantic
Analysis

Structured Annotations
of Question:
Focus, answer types,
Useful search queries

Machine
Learning
Classifiers

???
Search Result Processing and Candidate
???
Generation
Primary Search

???



Primary Search is used to generate the corpus of
information from which to take candidate answers,
passages, supporting evidence, and essentially all textual
input to the system



It formulates queries based on the results of Question
Analysis



These queries are passed into a (cached) search engine
which returns a set number of highly relevant documents
and their ranks.


on the open Web this could be a regular search engine (our
Candidate Generation

???



Candidate Generation generates a wide net of possible
answers for the question from each document.



Using each document, and the passages created by
Search Result Processing, we generate candidates using
three techniques:


Title of Document (T.O.D.): Adds the title of the document as a
candidate.



Wikipedia Title Candidate Generation: Adds any noun phrases
within the document‟s passage texts that are also the titles of
Wikipedia articles.



Anchor Text Candidate Generation: Adds candidates based on
the hyperlinks and metadata within the document.
Search Result Processing and
Candidate Generation

???
Scoring & Ranking

???
Scoring


Analyzes how well a candidate answer relates to the
question



Two basic types of scoring algorithm


Context-independent scoring



Context-dependent scoring

???
Types of scorers


Context-independent


Question Analysis



Ontologies (DBpedia, YAGO, etc)





???

Type hierarchy reasoning

Context-dependent


Analyzes feature of the natural language environment where
candidates were found




Relies on “passages” found during search

Many special purpose ones used in Jeopardy
Scorers


Passage Term Match



Textual Alignment



???

Skip-Bigram


Each of these scores supportive evidence



These scores are then merged to produce a single candidate
score
Example:Textual Alignment

???



Finds an optimal alignment of a question and a passage



Assigns “partial credit” for close matches




“Who is the President of RPI?”
Who

President of RPI.

Shirley Ann Jackson is the President of RPI.
Skip-Bigram


Constructs a graph


Nodes represent terms (syntactic objects)





???

Edges represent relations

Extracts skip-bigrams






A skip-bigram is a pair of nodes either directly connected or
which have only one intermediate node
Skip-bigrams represent close relationships between terms

Scores based on number of common skip-bigrams
Example


Who authored
“The Good Earth”?



“Pearl Buck, author of
the good earth…”

???
Watson Summary
Tetherless World Constellation

• Watson works by
– Analyzing the question
• natural language parsing
• text extraction

– Generating a large number of
candidates
• mostly search heuristics

– Scoring each
• through multiple scorers
• with weights adjusted by learning algorithm

– Returning top candidate
MiniDeepQA (Not Watson!)


???

RPI students implementing a DeepQA pipeline to explore
the principles underlying this kind of Q/A system

(THIS IS NOT WATSON!)


Pipeline development



Data caching



Graphical and command line interfaces



Parsing



Scoring
???

Examples

Right answer
???

Examples

Right answer?
???

Examples

Right answer
???

Examples

Right
answer??
???

Examples

had to get this
one right!
Scoring

???



One of DeepQA‟s main strengths is aggregating a
number of different scoring algorithms capable of
running in parallel.



RPI scorers are primitive compared to IBM‟s, but


allow us to explore the principles



allow us to explore different algorithms for computing
scores



allow us to create new ones not tried by IBM
Scoring Principles: combine
evidence


He was the Prime Minister of Canada in 1993.




candidates could include Trudeau, Harper, Campbell, Chretien,
Mulroney…

Try (Research):


Trudeau was Prime Minister of Canada in 1993 (doesn‟t match)



Campbell was Prime Minister of Canada in 1993 (MATCH)



Chretien was Prime Minister of Canada in 1993 (MATCH)



Scoring Research & type match


Trudeau: Re-search NO; Type: Yes



Campbell: Re-search YES; Type: No



Chretien: Re-search YES; Type: Yes

WHO WAS CHRETIEN?

???
New Scoring types


???

We can explore how new kinds of information can be
added to the Watson scoring pipeline


Example: new NLP extraction techniques




Example: Specialized Web Sources




Adding a ML-based extractor built by Heng Ji
Database advisor project

Example: More complex inferencing


Jeopardy questions are unambiguous, real world questions aren‟t

•
•


Where is Montreal?
Who is Jim Hendler?

Example: Special purpose reasoning…
Special purpose reasoning

???

• Can we match simulation (or steer) large scale
simulations to help answer NL questions?

-

eg. Answer questions such as “Why” and “How”
integrated with large scale simulations
Alternate Universe Reasoning
(Contexts)


How can a Watson reasoner appropriately use Q/A
contexts?


Where was Yoda born?




Very little is known about Yoda's early life.
He was from a remote planet,
but which one remains a mystery.

Where was Yoda made?



designed and built by Stuart Freeborn





The Yoda puppet was originally
for LucasFilm and Industrial Light & Magic.

Where did Yoda live?




Jedi Master Yoda went into voluntary exile on Dagobah

Where did Yoda live in the Phantom Menace?

???
But back to the original question
Tetherless World Constellation

• Q: How does Watson fare as a
cognitive model?
• A: Poorly
– no conversational ability
– no concept of self
– no deeper reasoning

…

• Q: How does Watson fare as a model
of question answering?
Watson and Q/A
Tetherless World Constellation

• Watson’s feed-forward pipeline has
the following properties
– lots of candidates generated
• the more the better

– “ad hoc” filtering pipelines
• domain independent usually score lower
than domain dependent

– no “counter-reasoning” between
answers
• separately scored, only comparison is
numbers
Production rules, modules, etc
Tetherless World Constellation

Production Rule style Architectures cf ACT-R (Anderson 1974; …2012)
- modularization, but not Watson style
- parallelization, but in rule productions (procedural memory)
- declarative memory is fact based
Watson is not well correlated, except for using search for declarative
memory
Network based
Tetherless World Constellation

Network based architectures (cf. spreading activation (Collins 75),
marker-passing (Hendler 86) … Microsaint 2006)
- positive activations
- inhibitory nodes (or other negative enforcers)
Watson has no negative inhibition, does use network-based scorers
MAC/FAC
Tetherless World Constellation

MAC/FAC (Gentner & Forbus, 1991)
Many are chosen, few are called model of analogic reasoning
Strong correspondence in performance, not in mechanism
New work by Forbus (SME) uses a more feed-forward mechanism
(Discussions in progress)
Cognitive Architecture? Watson as “component”

Decision Making

Memory
Reasoning

Watson, Cogito, and Clarion

Office of Research
Summary
Tetherless World Constellation

• Watson won by a combination of
–
–
–
–
–

natural language processing
search technologies
semantic typing (minimal reasoning)
scoring heuristics
machine learning (scorer tuning)

• Watson Q/A has some interesting analogies to
cognitive architectures of the past
– but mainly at a “level of abstraction”

• Watson as a memory component in a more
complex cognitive system is a very intriguing
possibility
Questions?
Tetherless World Constellation

More Related Content

PPTX
Watson: An Academic's Perspective
PPT
KR in the age of Deep Learning
PPT
On Beyond OWL: challenges for ontologies on the Web
PPT
Wither OWL
PPTX
The Unreasonable Effectiveness of Metadata
PPT
Semantic Web: The Inside Story
PPT
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
PPTX
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web
Watson: An Academic's Perspective
KR in the age of Deep Learning
On Beyond OWL: challenges for ontologies on the Web
Wither OWL
The Unreasonable Effectiveness of Metadata
Semantic Web: The Inside Story
Knowledge Representation in the Age of Deep Learning, Watson, and the Semanti...
The Future of AI: Going Beyond Deep Learning, Watson, and the Semantic Web

What's hot (20)

PPT
Broad Data (India 2015)
PPT
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
PPTX
Watson at RPI - Summer 2013
PPTX
Digital Archiving, The Semantic Web, and Modern AI
PPT
Social Machines - 2017 Update (University of Iowa)
PPT
Data Big and Broad (Oxford, 2012)
PPT
The Semantic Web: It's for Real
PDF
Facilitating Web Science Collaboration through Semantic Markup
PPTX
Tragedy of the Data Commons (ODSC-East, 2021)
PPTX
"Why the Semantic Web will Never Work" (note the quotes)
PPTX
The Future(s) of the World Wide Web
PDF
HyperMembrane Structures for Open Source Cognitive Computing
PPT
The Semantic Web: 2010 Update
PPT
Broad Data
PDF
Towards An Improvement Community Platform for Service Innovation
PDF
CHIIR2017 - Tetris Model of Resolving Information Needs
PDF
Federating Cultures: Human Knowledge, Teachers, Students
PDF
SolrSherlock: Linkfinding among Biomolecules with Literature-based Discovery
PDF
The HCI Perspective on IR (DIR2016 Keynote)
PDF
DMTM Lecture 01 Introduction
Broad Data (India 2015)
Social Machines: The coming collision of Artificial Intelligence, Social Netw...
Watson at RPI - Summer 2013
Digital Archiving, The Semantic Web, and Modern AI
Social Machines - 2017 Update (University of Iowa)
Data Big and Broad (Oxford, 2012)
The Semantic Web: It's for Real
Facilitating Web Science Collaboration through Semantic Markup
Tragedy of the Data Commons (ODSC-East, 2021)
"Why the Semantic Web will Never Work" (note the quotes)
The Future(s) of the World Wide Web
HyperMembrane Structures for Open Source Cognitive Computing
The Semantic Web: 2010 Update
Broad Data
Towards An Improvement Community Platform for Service Innovation
CHIIR2017 - Tetris Model of Resolving Information Needs
Federating Cultures: Human Knowledge, Teachers, Students
SolrSherlock: Linkfinding among Biomolecules with Literature-based Discovery
The HCI Perspective on IR (DIR2016 Keynote)
DMTM Lecture 01 Introduction
Ad

Viewers also liked (20)

PDF
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
PDF
World of Watson 2016 - Architecting your Analytics House
PDF
2010 1 materialism1
PPT
Attitude Changes Everything
PDF
Every Crisis is Global, Social, Viral
PPT
Ch. 2 -_the_biological_perspective
PDF
An Evolutionary Perspective on Approximate RDF Query Answering
PPT
Drugan Notes- Biological Perspective
PPTX
Learning styles from a multicultural perspective
PPTX
Valuing ecosystem services: a biological perspective
PPTX
The bee effect: Action to effect change
PDF
Managing Multicultural Individuals
PPTX
Inculcate Self Confidence & Self Belief
PPTX
Watson System
PPTX
C:\Multicultural Education Powerpoint
PDF
IBM's watson
PDF
Watson on bluemix
PPT
Evolutionary perspective
PDF
IBM Watson & Open Source Software - LinuxCon 2012
PPTX
our behaviour is the foundation of our attitude and self perception
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
World of Watson 2016 - Architecting your Analytics House
2010 1 materialism1
Attitude Changes Everything
Every Crisis is Global, Social, Viral
Ch. 2 -_the_biological_perspective
An Evolutionary Perspective on Approximate RDF Query Answering
Drugan Notes- Biological Perspective
Learning styles from a multicultural perspective
Valuing ecosystem services: a biological perspective
The bee effect: Action to effect change
Managing Multicultural Individuals
Inculcate Self Confidence & Self Belief
Watson System
C:\Multicultural Education Powerpoint
IBM's watson
Watson on bluemix
Evolutionary perspective
IBM Watson & Open Source Software - LinuxCon 2012
our behaviour is the foundation of our attitude and self perception
Ad

Similar to Why Watson Won: A cognitive perspective (20)

PPTX
PPT 2.3.1.pptx_PPT 2.3.1.pptx_PPT 2.3.1.pptx
PPTX
PPT 2.3.1.pptx_PPT 2.3.1.pptx_PPT 2.3.1.pptx
PDF
Ibm cognitive seminar march 2015 watsonsim final
PPTX
PPT 2.3.2.pptx_PPT 2.3.2.pptx_PPT 2.3.2.pptx
PPTX
PPT 2.3.2.pptx_PPT 2.3.2.pptx_PPT 2.3.2.pptx
PPT
PPT
Eurolan 2005 Pedersen
PDF
Using and learning phrases
PDF
Iulia Pasov, Sixt. Trends in sentiment analysis. The entire history from rule...
PPT
Introduction to Natural Language Processing
PPT
Text Analytics for Semantic Computing
PPTX
NLP & DBpedia
PPTX
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
PPTX
Factoid based natural language question generation system
PPT
Web & text mining lecture10
PDF
Relation Extraction
PDF
Guided Persona-based AI surveys: Can we replicate personal mobility preferenc...
PPT
PPTX
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
PPTX
Knowledge acquisition using automated techniques
PPT 2.3.1.pptx_PPT 2.3.1.pptx_PPT 2.3.1.pptx
PPT 2.3.1.pptx_PPT 2.3.1.pptx_PPT 2.3.1.pptx
Ibm cognitive seminar march 2015 watsonsim final
PPT 2.3.2.pptx_PPT 2.3.2.pptx_PPT 2.3.2.pptx
PPT 2.3.2.pptx_PPT 2.3.2.pptx_PPT 2.3.2.pptx
Eurolan 2005 Pedersen
Using and learning phrases
Iulia Pasov, Sixt. Trends in sentiment analysis. The entire history from rule...
Introduction to Natural Language Processing
Text Analytics for Semantic Computing
NLP & DBpedia
Extracting Relevant Questions to an RDF Dataset Using Formal Concept Analysis
Factoid based natural language question generation system
Web & text mining lecture10
Relation Extraction
Guided Persona-based AI surveys: Can we replicate personal mobility preferenc...
Semantic Integration of Citizen Sensor Data and Multilevel Sensing: A compreh...
Knowledge acquisition using automated techniques

More from James Hendler (12)

PPTX
Knowing what AI Systems Don't know and Why it matters
PPTX
Exploring the Boundaries of Artificial Intelligence (or "Modern AI")
PPTX
Tragedy of the (Data) Commons
PPTX
Knowledge Graph Semantics/Interoperability
PPTX
Enhancing Precision Wellness with Personal Health Knowledge Graphs
PPTX
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
PPTX
Enhancing Precision Wellness with Knowledge Graphs and Semantic Analytics: O...
PPTX
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
PPTX
The Science of Data Science
PPT
Big Data and Computer Science Education
PPTX
The Rensselaer IDEA: Data Exploration
PPT
Future of the World WIde Web (India)
Knowing what AI Systems Don't know and Why it matters
Exploring the Boundaries of Artificial Intelligence (or "Modern AI")
Tragedy of the (Data) Commons
Knowledge Graph Semantics/Interoperability
Enhancing Precision Wellness with Personal Health Knowledge Graphs
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...
Enhancing Precision Wellness with Knowledge Graphs and Semantic Analytics: O...
Artificial Intelligence: Existential Threat or Our Best Hope for the Future?
The Science of Data Science
Big Data and Computer Science Education
The Rensselaer IDEA: Data Exploration
Future of the World WIde Web (India)

Recently uploaded (20)

DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Modernizing your data center with Dell and AMD
PDF
Machine learning based COVID-19 study performance prediction
PDF
Empathic Computing: Creating Shared Understanding
PDF
Electronic commerce courselecture one. Pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
KodekX | Application Modernization Development
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
The AUB Centre for AI in Media Proposal.docx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
cuic standard and advanced reporting.pdf
Spectral efficient network and resource selection model in 5G networks
The Rise and Fall of 3GPP – Time for a Sabbatical?
Understanding_Digital_Forensics_Presentation.pptx
Modernizing your data center with Dell and AMD
Machine learning based COVID-19 study performance prediction
Empathic Computing: Creating Shared Understanding
Electronic commerce courselecture one. Pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Network Security Unit 5.pdf for BCA BBA.
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
NewMind AI Monthly Chronicles - July 2025
Mobile App Security Testing_ A Comprehensive Guide.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
KodekX | Application Modernization Development
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows

Why Watson Won: A cognitive perspective

  • 1. Tetherless World Constellation Why Watson Won: A cognitive perspective Jim Hendler and Simon Ellis Tetherless World Professor of Computer,Web and Cognitive Sciences Director, Rensselaer Institute for Data Exploration and Applications Rensselaer Polytechnic Institute (RPI) http://www.cs.rpi.edu/~hendler @jahendler (twitter)
  • 3. How’d I get into it? Watson and Semantic Web Tetherless World Constellation IBM
  • 4. Watson and Semantic Web Tetherless World Constellation IBM
  • 5. Is Watson cognitive? ??? “The computer‟s techniques for unraveling Jeopardy! clues sounded just like mine. That machine zeroes in on key words in a clue, then combs its memory (in Watson‟s case, a 15-terabyte data bank of human knowledge) for clusters of associations with those words. It rigorously checks the top hits against all the contextual information it can muster: the category name; the kind of answer being sought; the time, place, and gender hinted at in the clue; and so on. And when it feels „sure‟ enough, it decides to buzz. This is all an instant, intuitive process for a human Jeopardy! player, but I felt convinced that under the hood my brain was doing more or less the same thing.” — Ken Jennings
  • 6. Outline Tetherless World Constellation • Is Ken right? – How Watson Works – Watson as a cognitive architecture?? – Beyond Watson
  • 7. Inside Watson ??? Watson pipeline as published by IBM; see IBM J Res & Dev 56 (3/4), May/July 2012, p.
  • 9. ??? Question analysis  What is the question asking for?  Which terms in the question refer to the answer?  Given any natural language question, how can Watson accurately discover this information? Who is the president of Rensselaer Polytechnic Institute? Question Analysis Focus Terms: “Who”, “president of Rensselaer Polytechnic Institute” Answer Types: Person, President
  • 10. Parsing and semantic analysis  What information about a previously unseen piece of English text can Watson determine?  ??? How is this information useful? Natural Language Parsing Semantic Analysis - grammatical structure - meanings of words, phrases, etc. - parts of speech - synonyms, entailment - relationships between words - hypernyms, hyponyms - ...etc. - ...etc.
  • 11. Question analysis pipeline Unstructured Question Text Parsing & Semantic Analysis Structured Annotations of Question: Focus, answer types, Useful search queries Machine Learning Classifiers ???
  • 12. Search Result Processing and Candidate ??? Generation
  • 13. Primary Search ???  Primary Search is used to generate the corpus of information from which to take candidate answers, passages, supporting evidence, and essentially all textual input to the system  It formulates queries based on the results of Question Analysis  These queries are passed into a (cached) search engine which returns a set number of highly relevant documents and their ranks.  on the open Web this could be a regular search engine (our
  • 14. Candidate Generation ???  Candidate Generation generates a wide net of possible answers for the question from each document.  Using each document, and the passages created by Search Result Processing, we generate candidates using three techniques:  Title of Document (T.O.D.): Adds the title of the document as a candidate.  Wikipedia Title Candidate Generation: Adds any noun phrases within the document‟s passage texts that are also the titles of Wikipedia articles.  Anchor Text Candidate Generation: Adds candidates based on the hyperlinks and metadata within the document.
  • 15. Search Result Processing and Candidate Generation ???
  • 17. Scoring  Analyzes how well a candidate answer relates to the question  Two basic types of scoring algorithm  Context-independent scoring  Context-dependent scoring ???
  • 18. Types of scorers  Context-independent  Question Analysis  Ontologies (DBpedia, YAGO, etc)   ??? Type hierarchy reasoning Context-dependent  Analyzes feature of the natural language environment where candidates were found   Relies on “passages” found during search Many special purpose ones used in Jeopardy
  • 19. Scorers  Passage Term Match  Textual Alignment  ??? Skip-Bigram  Each of these scores supportive evidence  These scores are then merged to produce a single candidate score
  • 20. Example:Textual Alignment ???  Finds an optimal alignment of a question and a passage  Assigns “partial credit” for close matches    “Who is the President of RPI?” Who President of RPI. Shirley Ann Jackson is the President of RPI.
  • 21. Skip-Bigram  Constructs a graph  Nodes represent terms (syntactic objects)   ??? Edges represent relations Extracts skip-bigrams    A skip-bigram is a pair of nodes either directly connected or which have only one intermediate node Skip-bigrams represent close relationships between terms Scores based on number of common skip-bigrams
  • 22. Example  Who authored “The Good Earth”?  “Pearl Buck, author of the good earth…” ???
  • 23. Watson Summary Tetherless World Constellation • Watson works by – Analyzing the question • natural language parsing • text extraction – Generating a large number of candidates • mostly search heuristics – Scoring each • through multiple scorers • with weights adjusted by learning algorithm – Returning top candidate
  • 24. MiniDeepQA (Not Watson!)  ??? RPI students implementing a DeepQA pipeline to explore the principles underlying this kind of Q/A system (THIS IS NOT WATSON!)  Pipeline development  Data caching  Graphical and command line interfaces  Parsing  Scoring
  • 29. ??? Examples had to get this one right!
  • 30. Scoring ???  One of DeepQA‟s main strengths is aggregating a number of different scoring algorithms capable of running in parallel.  RPI scorers are primitive compared to IBM‟s, but  allow us to explore the principles  allow us to explore different algorithms for computing scores  allow us to create new ones not tried by IBM
  • 31. Scoring Principles: combine evidence  He was the Prime Minister of Canada in 1993.   candidates could include Trudeau, Harper, Campbell, Chretien, Mulroney… Try (Research):  Trudeau was Prime Minister of Canada in 1993 (doesn‟t match)  Campbell was Prime Minister of Canada in 1993 (MATCH)  Chretien was Prime Minister of Canada in 1993 (MATCH)  Scoring Research & type match  Trudeau: Re-search NO; Type: Yes  Campbell: Re-search YES; Type: No  Chretien: Re-search YES; Type: Yes WHO WAS CHRETIEN? ???
  • 32. New Scoring types  ??? We can explore how new kinds of information can be added to the Watson scoring pipeline  Example: new NLP extraction techniques   Example: Specialized Web Sources   Adding a ML-based extractor built by Heng Ji Database advisor project Example: More complex inferencing  Jeopardy questions are unambiguous, real world questions aren‟t • •  Where is Montreal? Who is Jim Hendler? Example: Special purpose reasoning…
  • 33. Special purpose reasoning ??? • Can we match simulation (or steer) large scale simulations to help answer NL questions? - eg. Answer questions such as “Why” and “How” integrated with large scale simulations
  • 34. Alternate Universe Reasoning (Contexts)  How can a Watson reasoner appropriately use Q/A contexts?  Where was Yoda born?   Very little is known about Yoda's early life. He was from a remote planet, but which one remains a mystery. Where was Yoda made?   designed and built by Stuart Freeborn   The Yoda puppet was originally for LucasFilm and Industrial Light & Magic. Where did Yoda live?   Jedi Master Yoda went into voluntary exile on Dagobah Where did Yoda live in the Phantom Menace? ???
  • 35. But back to the original question Tetherless World Constellation • Q: How does Watson fare as a cognitive model? • A: Poorly – no conversational ability – no concept of self – no deeper reasoning … • Q: How does Watson fare as a model of question answering?
  • 36. Watson and Q/A Tetherless World Constellation • Watson’s feed-forward pipeline has the following properties – lots of candidates generated • the more the better – “ad hoc” filtering pipelines • domain independent usually score lower than domain dependent – no “counter-reasoning” between answers • separately scored, only comparison is numbers
  • 37. Production rules, modules, etc Tetherless World Constellation Production Rule style Architectures cf ACT-R (Anderson 1974; …2012) - modularization, but not Watson style - parallelization, but in rule productions (procedural memory) - declarative memory is fact based Watson is not well correlated, except for using search for declarative memory
  • 38. Network based Tetherless World Constellation Network based architectures (cf. spreading activation (Collins 75), marker-passing (Hendler 86) … Microsaint 2006) - positive activations - inhibitory nodes (or other negative enforcers) Watson has no negative inhibition, does use network-based scorers
  • 39. MAC/FAC Tetherless World Constellation MAC/FAC (Gentner & Forbus, 1991) Many are chosen, few are called model of analogic reasoning Strong correspondence in performance, not in mechanism New work by Forbus (SME) uses a more feed-forward mechanism (Discussions in progress)
  • 40. Cognitive Architecture? Watson as “component” Decision Making Memory Reasoning Watson, Cogito, and Clarion Office of Research
  • 41. Summary Tetherless World Constellation • Watson won by a combination of – – – – – natural language processing search technologies semantic typing (minimal reasoning) scoring heuristics machine learning (scorer tuning) • Watson Q/A has some interesting analogies to cognitive architectures of the past – but mainly at a “level of abstraction” • Watson as a memory component in a more complex cognitive system is a very intriguing possibility