© 2011 IBM Corporation
Building Watson
A Brief Overview of the Jeopardy! Challenge
Dr. Mark Sherman
IBM Software Group Strategy
A Grand Challenge Opportunity
• Capture the imagination
– The Next Deep Blue
• Engage the scientific community
– Envision new ways for computers to impact society & science
– Drive important and measurable scientific advances
• Be Relevant to IBM Customers
– Enable better, faster decision making
– Business Intelligence, Knowledge Discovery and Management, Government,
Compliance, Publishing, Legal, Healthcare, Business Integrity, Customer
Relationship Management, Web Self-Service, Product Support, etc.
Informed Decision Making: Search vs. Expert Q&A
Search
Decision Maker
– Has Question
– Distills to 2-3 Keywords
Search Engine
– Finds Documents containing Keywords
– Delivers Documents based on Popularity
Decision Maker
– Reads Documents, Finds Answers
– Finds & Analyzes Evidence
Expert Q&A
Decision Maker
– Asks NL Question
Expert
– Understands Question
– Produces Possible Answers & Evidence
– Analyzes Evidence, Computes Confidence
– Delivers Response, Evidence & Confidence
Decision Maker
– Considers Answer & Evidence
Broad Domain
Our focus is on reusable NLP technology for analyzing vast volumes of as-is text.
Structured sources (DBs and KBs) provide background knowledge for interpreting the text.
We do NOT attempt to anticipate all questions and build databases.
In a random sample of 20,000 questions we found 2,500 distinct types*. The most frequent occurs <3% of the time.
The distribution has a very long tail.
And for each of these types, 1000s of different things may be asked.
*13% are non-distinct (e.g., it, this, these or NA)
Even going for the head of the tail will barely make a dent.
We do NOT try to build a formal model of the world.
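The sample figures above imply some simple arithmetic, sketched here in Python. The three inputs are the slide's own numbers; the 3% figure is treated as an upper bound, and the variable names are illustrative only.

```python
# Figures quoted on the slide.
sample_size = 20_000      # questions in the random sample
distinct_types = 2_500    # distinct types found in the sample
top_type_share = 0.03     # most frequent type occurs < 3% of the time (upper bound)

# Average number of sampled questions per type.
avg_questions_per_type = sample_size / distinct_types

# Upper bound on the sampled questions covered by the single most frequent type.
max_questions_top_type = top_type_share * sample_size

# Share of questions left uncovered even if the most frequent type were handled perfectly.
min_share_uncovered = 1 - top_type_share

print(avg_questions_per_type)
print(max_questions_top_type)
print(min_share_uncovered)
```

On these numbers a type averages 8 sampled questions, and a hand-built solution for the single most common type would still leave more than 97% of questions untouched, which is the point of the "barely make a dent" remark.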
What It Takes to compete against Top Human Jeopardy! Players
Our Analysis Reveals the Winner’s Cloud
[Figure: scatter plot running from More Confident to Less Confident. Each dot is an actual historical human Jeopardy! game. Labels: Winning Human Performance, Grand Champion Human Performance, 2007 QA Computer System.]
Computers? Not So Good.
DeepQA: Incremental Progress in Answering Precision: 6/2007-4/2010
Baseline
v0.1 12/07
v0.2 05/08
v0.3 08/08
v0.4 12/08
v0.5 05/09
v0.6 10/09
v0.7 04/10
One Jeopardy! question can take 2 hours on a single 2.6 GHz core.
Optimized and scaled out on a 2880-core IBM HPC system using UIMA-AS,
Watson answers in 2-6 seconds.
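As a rough check on these timings, the Python sketch below compares the single-core figure with the scaled-out one. The perfectly linear speedup is an assumption made only for illustration; the slide does not say how close the real system came to it.

```python
single_core_seconds = 2 * 60 * 60   # 2 hours on one core, from the slide
cores = 2880                        # cores in the scaled-out system, from the slide

# Assumption: perfectly linear speedup across all cores (an idealization).
ideal_seconds = single_core_seconds / cores

# Speedups implied by the two ends of the reported 2-6 second range.
speedup_at_2s = single_core_seconds / 2
speedup_at_6s = single_core_seconds / 6

print(ideal_seconds)
print(speedup_at_2s, speedup_at_6s)
```

The ideal figure of 2.5 seconds falls inside the reported 2-6 second range. A 2-second answer implies a speedup larger than the core count, which would be consistent with the slide crediting optimization as well as scale-out.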
Question
– Question & Topic Analysis
– Question Decomposition (Multiple Interpretations)
– Hypothesis Generation (100s sources, 100s Possible Answers)
– Hypothesis and Evidence Scoring (1000s of Pieces of Evidence; 100,000s of scores from many simultaneous Text Analysis Algorithms)
– Synthesis
– Final Confidence Merging & Ranking
Answer & Confidence
(Hypothesis Generation and Hypothesis and Evidence Scoring are shown as several parallel tracks.)
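The stages named on this slide can be illustrated with a toy Python sketch. This is not Watson's code: the two-entry corpus, the word-overlap scorers, the weights, and every function name are hypothetical stand-ins chosen to show the shape of "generate hypotheses, score evidence, merge and rank by confidence".

```python
from dataclasses import dataclass, field


@dataclass
class Candidate:
    answer: str
    scores: list = field(default_factory=list)  # one score per evidence scorer
    confidence: float = 0.0


def words(text):
    return set(text.lower().split())


def generate_hypotheses(question, corpus):
    """Toy hypothesis generation: keep every entry sharing a word with the question."""
    q = words(question)
    return [Candidate(title) for title, text in corpus.items() if q & words(text)]


def overlap_score(question, passage):
    """Fraction of question words that appear in the passage."""
    q = words(question)
    return len(q & words(passage)) / len(q)


def jaccard_score(question, passage):
    """Shared words divided by all distinct words in question and passage."""
    q, p = words(question), words(passage)
    return len(q & p) / len(q | p)


def score_evidence(question, candidates, corpus, scorers):
    """Run every scorer on every candidate's supporting passage."""
    for c in candidates:
        c.scores = [scorer(question, corpus[c.answer]) for scorer in scorers]


def merge_and_rank(candidates, weights):
    """Combine scores into one confidence (hypothetical fixed weights) and sort."""
    for c in candidates:
        c.confidence = sum(w * s for w, s in zip(weights, c.scores))
    return sorted(candidates, key=lambda c: c.confidence, reverse=True)


# Hypothetical miniature corpus: answer -> supporting passage.
corpus = {
    "Deep Blue": "ibm chess computer that beat garry kasparov in 1997",
    "Watson": "ibm question answering computer that played jeopardy in 2011",
}
question = "which ibm computer played jeopardy"

candidates = generate_hypotheses(question, corpus)
score_evidence(question, candidates, corpus, [overlap_score, jaccard_score])
ranked = merge_and_rank(candidates, weights=[0.7, 0.3])

for c in ranked:
    print(f"{c.answer}: {c.confidence:.2f}")
```

Each candidate's scoring is independent of the others, which is one reason a pipeline of this shape lends itself to being spread across many cores. The fixed weights here stand in for whatever method the real system uses to combine scores; the slide does not describe it.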
Potential Business Applications
Tech Support: Help-desk, Contact Centers
Healthcare / Life Sciences: Diagnostic Assistance, Evidence-Based, Collaborative Medicine
Enterprise Knowledge Management and Business Intelligence
Government: Improved Information Sharing and Security
The Core Technical Team
Researchers and Engineers in NLP, ML, IR, KR&R and CL at
IBM Labs and a growing number of universities
THANK YOU

CMU 2011 Watson Event
