The Future
2
The Big Dream (once again)
Dave Bowman: “Open the pod bay doors, HAL”
HAL 9000: “I’m sorry Dave. I’m afraid I can’t do that.”
3
Two Important Directions We Touched
• Semantics
• Machine Translation
Critical for the overall
advancement of the field
Practical, within the reach
of current technology
4
Two Important Directions We Touched
•Semantics
• Machine Translation
5
Semantics: Revolution is Needed?
• If we want the dream come true, we should
– not rely on superficial statistics alone
– need to get to the meaning of text
• A revolution in semantics is needed
– looking at words is not enough
– we need better models for
• multi-word expressions (~70% of terminology)
• semantic relations (meaning is in the links!)
• The revolution will be supported by
– Web-scale corpora
– linguistic knowledge
“Moving Lexical Semantics
from Alchemy to Science”
Discussion on [Corpora-List]
• This is what Chomsky has
done with syntax.
• Should we expect the same
for lexical semantics?
6
NAACL’2015: Accept/Reject by Area
0 10 20 30 40 50 60 70 80 90 100
Semantics
NLP for Web and Social Media and Social…
Machine Translation
Information Extraction and Question Answering
Tagging and Chunking and Syntax and Parsing
Machine Learning for NLP
Generation and Summarization
Language Resources and Evaluation
Text Categorization and Topic Models
Sentiment Analysis and Opinion Mining
Phonology and Morphology and Word…
Spoken Language Processing
Discourse and Pragmatics
NLP-enabled Technology
Linguistic and Psycholinguistic Aspects of CL
Dialogue and Interactive Systems
Information Retrieval
Language and Vision
Semantics has emerged
from a marginal to a
dominant position.
7
Two Important Directions
We Have Touched
• Semantics
•Machine Translation
8
Machine Translation: Revolution?
• Revolution?
– Two great revolutions so far
• 1993: statistical word-based translation
• 2003: statistical phrase-based translation
9
Machine Translation: Revolution?
• Revolution?
– Two great revolutions so far
• 1993: statistical word-based translation
• 2003: statistical phrase-based translation
– Overdue for the next revolution?
• 2013: ???
– Syntactic translation?
– Semantic translation?
SOURCE TARGET
words words
syntax syntax
semantics semantics
interlingua
phrases phrases
10
Machine Translation: Revolution?
• Revolution?
– Two great revolutions so far
• 1993: statistical word-based translation
• 2003: statistical phrase-based translation
– Overdue for the next revolution?
• 2014: revolution in progress?
Deep neural networks – the new revolution:
• Speech recognition
• Machine translation
• Semantics
BUT can they:
- scale to the Web?
- model linguistic structure
- handle MWEs
- use linguistic knowledge
11
The Future?
Three words: Web, semantics, linguistics
and deep neural networks?
12
The Future
v. 2.0
13
Human or Computer?
14
Human or Computer?
15
Human or Computer?
16
Human or Computer?
17
Human or Computer?
18
Human or Computer?
19
Human or Computer?
20
Human or Computer?
21
Human or Computer?
22
Human or Computer?
23
Human or Computer?
24
Human or Computer?
25
Human or Computer?
26
Human or Computer?
27
Human or Computer?
28
Human or Computer?
29
The Future is Now?
• Books
- Algorithm by Philip Parker, Insead
o 1,000,000+ books generated
o 100,000+ being sold at Amazon
• Robo journalism
- tons of articles generated today
- by 2025, can cover 90%
• What is next
- Fake reviews?
- Computers mining text written by other computers?
- From Computational to Computer Linguistics?
- …
30
The Future
v. 3.0
31
Hiroshima & Nagasaki
• 26/07/1945: Potsdam declaration was
an ultimatum to Japan: capitulate or face
“prompt and utter destruction”
• Japan’s prime minister Kantaro Suzuki
at a press-conference: “No comment.
We keep discussing.”
• He used the word mokusatsu, which
can mean (a) no comment, or (b) we
reject.
• 10 days later…
32
The Future?
Will the next nuclear war
start because of a
computer translation?
33
Moore’s Law: in 10 years…
Can NNs scale?
34
In 17 years: More Robots than Humans!
35
Maybe you do not believe it...
36
37
Artificial Intelligence as a Threat
“The development of full artificial intelligence
could spell the end of the human race.”
Stephen Hawking
“I am in the camp that is concerned
about super intelligence. ...and don't
understand why some people are not
concerned..” Bill Gates
“I think we should be very careful about artificial
intelligence. If I had to guess at what our biggest
existential threat is, it’s probably that.”
Elon Musk (donated $10 million to Future of Life Institute)
38
The Future: SkyNet?
39
The Future?
40
The Future?
41

More Related Content

PDF
M3 l16 translation at facebook
PDF
DH 199 Social Media Analytics
ODP
Christmas Presentation at Aarhus: What I do
PDF
Image Processing of Food Labels
PPTX
Preslav Nakov - The Web as a Training Set Part 2
PPTX
Preslav Nakov - The Web as a Training Set Part 1
PDF
Information retrieval to recommender systems
PPT
Tweeting beyond Facts – The Need for a Linguistic Perspective
M3 l16 translation at facebook
DH 199 Social Media Analytics
Christmas Presentation at Aarhus: What I do
Image Processing of Food Labels
Preslav Nakov - The Web as a Training Set Part 2
Preslav Nakov - The Web as a Training Set Part 1
Information retrieval to recommender systems
Tweeting beyond Facts – The Need for a Linguistic Perspective

Viewers also liked (9)

PDF
Credit risk predictive analytics
PPTX
The future of Big Data tooling
PPTX
Real-time analytics with HBase
PDF
Sentiment Analysis
PPT
Big Data: Improving capacity utilization of transport companies
PDF
Real-time information analysis: social networks and open data
PDF
Demand model development for the retail sector of industry
PDF
Machine learning for NLP
PPTX
Location Intelligence - the Next Evolution of Business Applications
Credit risk predictive analytics
The future of Big Data tooling
Real-time analytics with HBase
Sentiment Analysis
Big Data: Improving capacity utilization of transport companies
Real-time information analysis: social networks and open data
Demand model development for the retail sector of industry
Machine learning for NLP
Location Intelligence - the Next Evolution of Business Applications
Ad

Similar to Preslav Nakov - The Web as a Training Set Part 3 (20)

PDF
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
DOCX
Morse, Christian - LIBR 202 - The Future of Natural Language Processing
PPTX
NATURAL LANGUAGE PROCESSING in ARTIFICIAL INTELLIGENCE.pptx
PPTX
Natural lanaguage processing
PPTX
Natural Language Processing - Lecture.pptx
PPT
Text Analytics: Yesterday, Today and Tomorrow
PDF
A Comprehensive Analytical Study Of Traditional And Recent Development In Nat...
PPTX
Exosphere Chile Talk: Semantics, Deep Learning, and the Transformation of Bus...
PPT
NLP Introduction.ppt machine learning presentation
PDF
An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...
PPTX
The singularity is coming by Takuya Matsuda - CODE BLUE 2015
PPTX
NLP (4) for class 9 (1).pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnn
PDF
An Overview Of Natural Language Processing
PDF
Ontology And The Lexicon A Natural Language Processing Perspective Churen Hua...
PDF
Iulia Pasov, Sixt. Trends in sentiment analysis. The entire history from rule...
PDF
Intelligence is not Artificial - Stanford, June 2016
PPTX
NATURAL LANGUAGE PROCESSING.pptx
PPT
Artificialintelignce lecture1 BCS7
PPTX
Natural Language Processing (NLP).pptx
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Morse, Christian - LIBR 202 - The Future of Natural Language Processing
NATURAL LANGUAGE PROCESSING in ARTIFICIAL INTELLIGENCE.pptx
Natural lanaguage processing
Natural Language Processing - Lecture.pptx
Text Analytics: Yesterday, Today and Tomorrow
A Comprehensive Analytical Study Of Traditional And Recent Development In Nat...
Exosphere Chile Talk: Semantics, Deep Learning, and the Transformation of Bus...
NLP Introduction.ppt machine learning presentation
An In-Depth Exploration of Natural Language Processing: Evolution, Applicatio...
The singularity is coming by Takuya Matsuda - CODE BLUE 2015
NLP (4) for class 9 (1).pptx nnnnnnnnnnnnnnnnnnnnnnnnnnnnn
An Overview Of Natural Language Processing
Ontology And The Lexicon A Natural Language Processing Perspective Churen Hua...
Iulia Pasov, Sixt. Trends in sentiment analysis. The entire history from rule...
Intelligence is not Artificial - Stanford, June 2016
NATURAL LANGUAGE PROCESSING.pptx
Artificialintelignce lecture1 BCS7
Natural Language Processing (NLP).pptx
Ad

More from Data Science Society (20)

PDF
[Data Meetup] Data Science in Finance - Factor Models in Finance
PDF
[Data Meetup] Data Science in Finance - Building a Quant ML pipeline
PPTX
[Data Meetup] Data Science in Journalism - Tanbih, QCRI and MIT
PPTX
Computer Vision in Real Estate
PPTX
ML in Proptech - Concept to Production
PPTX
Lessons Learned: Linked Open Data implemented in 2 Use Cases
PPT
AI methods for localization in noisy environment
PPTX
Object Identification and Detection Hackathon Solution
PPTX
Data Science for Open Innovation in SMEs and Large Corporations
PDF
Air Pollution in Sofia - Solution through Data Science by Kiwi team
PPTX
Machine Learning in Astrophysics
PPTX
#AcademiaDatathon Finlists' Solution of Crypto Datathon Case
PPTX
Coreference Extraction from Identric’s Documents - Solution of Datathon 2018
PDF
DNA Analytics - What does really goes into Sausages - Datathon2018 Solution
PDF
Relationships between research tasks and data structure (basic methods and a...
PDF
Data science tools - A.Marchev and K.Haralampiev
PDF
Problems of Application of Machine Learning in the CRM - panel
PDF
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
PDF
Intelligent Question Answering Using the Wisdom of the Crowd, Preslav Nakov
PDF
Master class Hristo Hadjitchonev - Aubg
[Data Meetup] Data Science in Finance - Factor Models in Finance
[Data Meetup] Data Science in Finance - Building a Quant ML pipeline
[Data Meetup] Data Science in Journalism - Tanbih, QCRI and MIT
Computer Vision in Real Estate
ML in Proptech - Concept to Production
Lessons Learned: Linked Open Data implemented in 2 Use Cases
AI methods for localization in noisy environment
Object Identification and Detection Hackathon Solution
Data Science for Open Innovation in SMEs and Large Corporations
Air Pollution in Sofia - Solution through Data Science by Kiwi team
Machine Learning in Astrophysics
#AcademiaDatathon Finlists' Solution of Crypto Datathon Case
Coreference Extraction from Identric’s Documents - Solution of Datathon 2018
DNA Analytics - What does really goes into Sausages - Datathon2018 Solution
Relationships between research tasks and data structure (basic methods and a...
Data science tools - A.Marchev and K.Haralampiev
Problems of Application of Machine Learning in the CRM - panel
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
Intelligent Question Answering Using the Wisdom of the Crowd, Preslav Nakov
Master class Hristo Hadjitchonev - Aubg

Recently uploaded (20)

PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PDF
Transcultural that can help you someday.
PPTX
IMPACT OF LANDSLIDE.....................
PDF
Global Data and Analytics Market Outlook Report
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PDF
Data Engineering Interview Questions & Answers Data Modeling (3NF, Star, Vaul...
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPTX
Managing Community Partner Relationships
PPT
statistic analysis for study - data collection
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPT
Predictive modeling basics in data cleaning process
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PPTX
A Complete Guide to Streamlining Business Processes
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
New ISO 27001_2022 standard and the changes
PPTX
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
Topic 5 Presentation 5 Lesson 5 Corporate Fin
Transcultural that can help you someday.
IMPACT OF LANDSLIDE.....................
Global Data and Analytics Market Outlook Report
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
Data Engineering Interview Questions & Answers Data Modeling (3NF, Star, Vaul...
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
Managing Community Partner Relationships
statistic analysis for study - data collection
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Predictive modeling basics in data cleaning process
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
A Complete Guide to Streamlining Business Processes
retention in jsjsksksksnbsndjddjdnFPD.pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx
New ISO 27001_2022 standard and the changes
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx

Preslav Nakov - The Web as a Training Set Part 3

  • 2. 2 The Big Dream (once again) Dave Bowman: “Open the pod bay doors, HAL” HAL 9000: “I’m sorry Dave. I’m afraid I can’t do that.”
  • 3. 3 Two Important Directions We Touched • Semantics • Machine Translation Critical for the overall advancement of the field Practical, within the reach of current technology
  • 4. 4 Two Important Directions We Touched •Semantics • Machine Translation
  • 5. 5 Semantics: Revolution is Needed? • If we want the dream come true, we should – not rely on superficial statistics alone – need to get to the meaning of text • A revolution in semantics is needed – looking at words is not enough – we need better models for • multi-word expressions (~70% of terminology) • semantic relations (meaning is in the links!) • The revolution will be supported by – Web-scale corpora – linguistic knowledge “Moving Lexical Semantics from Alchemy to Science” Discussion on [Corpora-List] • This is what Chomsky has done with syntax. • Should we expect the same for lexical semantics?
  • 6. 6 NAACL’2015: Accept/Reject by Area 0 10 20 30 40 50 60 70 80 90 100 Semantics NLP for Web and Social Media and Social… Machine Translation Information Extraction and Question Answering Tagging and Chunking and Syntax and Parsing Machine Learning for NLP Generation and Summarization Language Resources and Evaluation Text Categorization and Topic Models Sentiment Analysis and Opinion Mining Phonology and Morphology and Word… Spoken Language Processing Discourse and Pragmatics NLP-enabled Technology Linguistic and Psycholinguistic Aspects of CL Dialogue and Interactive Systems Information Retrieval Language and Vision Semantics has emerged from a marginal to a dominant position.
  • 7. 7 Two Important Directions We Have Touched • Semantics •Machine Translation
  • 8. 8 Machine Translation: Revolution? • Revolution? – Two great revolutions so far • 1993: statistical word-based translation • 2003: statistical phrase-based translation
  • 9. 9 Machine Translation: Revolution? • Revolution? – Two great revolutions so far • 1993: statistical word-based translation • 2003: statistical phrase-based translation – Overdue for the next revolution? • 2013: ??? – Syntactic translation? – Semantic translation? SOURCE TARGET words words syntax syntax semantics semantics interlingua phrases phrases
  • 10. 10 Machine Translation: Revolution? • Revolution? – Two great revolutions so far • 1993: statistical word-based translation • 2003: statistical phrase-based translation – Overdue for the next revolution? • 2014: revolution in progress? Deep neural networks – the new revolution: • Speech recognition • Machine translation • Semantics BUT can they: - scale to the Web? - model linguistic structure - handle MWEs - use linguistic knowledge
  • 11. 11 The Future? Three words: Web, semantics, linguistics and deep neural networks?
  • 29. 29 The Future is Now? • Books - Algorithm by Philip Parker, Insead o 1,000,000+ books generated o 100,000+ being sold at Amazon • Robo journalism - tons of articles generated today - by 2025, can cover 90% • What is next - Fake reviews? - Computers mining text written by other computers? - From Computational to Computer Linguistics? - …
  • 31. 31 Hiroshima & Nagasaki • 26/07/1945: Potsdam declaration was an ultimatum to Japan: capitulate or face “prompt and utter destruction” • Japan’s prime minister Kantaro Suzuki at a press-conference: “No comment. We keep discussing.” • He used the word mokusatsu, which can mean (a) no comment, or (b) we reject. • 10 days later…
  • 32. 32 The Future? Will the next nuclear war start because of a computer translation?
  • 33. 33 Moore’s Law: in 10 years… Can NNs scale?
  • 34. 34 In 17 years: More Robots than Humans!
  • 35. 35 Maybe you do not believe it...
  • 36. 36
  • 37. 37 Artificial Intelligence as a Threat “The development of full artificial intelligence could spell the end of the human race.” Stephen Hawking “I am in the camp that is concerned about super intelligence. ...and don't understand why some people are not concerned..” Bill Gates “I think we should be very careful about artificial intelligence. If I had to guess at what our biggest existential threat is, it’s probably that.” Elon Musk (donated $10 million to Future of Life Institute)
  • 41. 41