SlideShare a Scribd company logo
DEEP LEARNING FOR NLP
IVANVERGILIEV
• @IvanVergiliev
• IvanVergiliev.github.io
WHAT IS DEEP LEARNING?
IT’S BASICALLY A NEURAL
NETWORK…
BUT WITH LOTS OF
HIDDEN LAYERS
ISN’TTHAT FROMTHE ‘80S?
NO
IT’S ACTUALLY FROMTHE ‘40S
WE HAVE
GPUS NOW
AS WELL AS SOME NEW IDEAS
e.g. Convolutional Neural Nets
WHY DO WE NEED
NLP
NATURAL LANGUAGES ARE
HARD
LOOK AT
JAVASCRIPT
Developed by some of
the most innovative
developers
BULGARIAN
~1300 years
> 7 million just today
ESPERANTO IS PROBABLY
EASIER
LANGUAGE MODELS
WORD FREQUENCIES
• wow, a word
cloud
N-GRAM MODELS
COUNT WORD
OCCURRENCES
Machine learning for NLP
APPLICATIONS
• Spelling correction
• Optical Character Recognition
• Speech Recognition
TEXT GENERATORS
not very useful, but fun
HACKERNEWSTITLES
• How Facebook is killing Linux on the desktop
• Facebook claims it can read your e-mail
without a data plan
• Only a few countries are teaching children
how to drive customers away
http://what-would-i-say.com/
WORD CLASSES
• 2014 was a good year.
• <YEAR> was a good year.
NOT GOOD ENOUGH
• I like cake
• I love pie
PART OF SPEECH
TAGGING
The lecturer criticised the person.
NEED A REPRESENTATION
OF MEANING
DISTRIBUTED REPRESENTATION
• city = [-0.5, 0.3, …, 0.7]
• town = [-0.52, 0.35, …, 0.8]
TRAIN A
NEURAL
NETWORK
THEVECTORS ACTUALLY
MAKE SENSE
DEMOTIME
MORE SENSETHAN
EXPECTED
W(‘’WOMAN")−W(‘‘MAN")
≃
W(‘‘AUNT")−W(‘‘UNCLE")
W(‘’WOMAN”)−W(‘‘MAN")
≃
W(‘‘QUEEN")−W(‘‘KING")
DEMOTIME
AGAIN
NEURAL LANGUAGE MODELS
FEEDFORWARD NEURAL
NETWORK BASED
LANGUAGE MODEL
It even sounds fancy
FEEDFORWARD NEURAL NETWORK
BASED LANGUAGE MODEL
LOCAL CONTEXT ONLY
Yesterday - the third day of the month -
I went out.
RECURRENT NEURAL
NETWORK
WHERE DO WE GO NOW?
PARAGRAPHVECTORS
WHY DEFINE WORDS AT
ALL?
Can’t we learn from raw data like the image nets do?
TEXT UNDERSTANDING
FROM SCRATCH
SHARED REPRESENTATIONS
CAN WE PUT OTHERTHINGS
INTHE SAME SPACE?
APPARENTLY,YES
Machine learning for NLP
BUT WHY JUSTTEXT?
AUTOMATED
IMAGE
CAPTIONING
DEMO
YEAH,AGAIN
TRIVIA
“WHAT I LEARNED FROM
COMPETING AGAINST A
CONVNET ON IMAGENET”
“AWW,A CUTE DOG!”
HUMAN WON
ATOUGH RACETHOUGH
BREAKING NEURAL
NETWORKS
IT’S A PANDA
AND NOW?
NOT A PANDA
ANYMORE
THANKS!
QUESTIONS?
REFERENCES
• http://colah.github.io/posts/2014-07-Conv-Nets-Modular/
• http://www.fit.vutbr.cz/~imikolov/rnnlm/thesis.pdf
• http://cs.stanford.edu/~quocle/paragraph_vector.pdf
• http://arxiv.org/pdf/1502.01710v2.pdf

More Related Content

PDF
Let's talk about voice
KEY
Wonderful Web App UX
PDF
SYBIL making process
PPTX
Give A/P Shared Services Reps an Extra Brain with Intelligent SAP-Empowered V...
PPTX
Non technical james nash
PPT
Python Learning for Natural Language Processing
PDF
Deep Learning for NLP
PDF
Machine learning for Music
Let's talk about voice
Wonderful Web App UX
SYBIL making process
Give A/P Shared Services Reps an Extra Brain with Intelligent SAP-Empowered V...
Non technical james nash
Python Learning for Natural Language Processing
Deep Learning for NLP
Machine learning for Music

Viewers also liked (20)

PDF
Image Processing of Food Labels
PPTX
Preslav Nakov - The Web as a Training Set Part 2
PPTX
Preslav Nakov - The Web as a Training Set Part 3
PPTX
Preslav Nakov - The Web as a Training Set Part 1
PDF
Information retrieval to recommender systems
PDF
Practical Deep Learning for NLP
PDF
Credit risk predictive analytics
PPT
Tweeting beyond Facts – The Need for a Linguistic Perspective
PPTX
The future of Big Data tooling
PPTX
Real-time analytics with HBase
PDF
Sentiment Analysis
PPT
Big Data: Improving capacity utilization of transport companies
PDF
Real-time information analysis: social networks and open data
PDF
Demand model development for the retail sector of industry
PDF
Programming for MIDI controllers
PPTX
The Profitable Startup
PPTX
Curso del-juego-go
PDF
일하는여성 82호(본문)(2010.4.30)
DOCX
Mapas conceptuales de unidades paola
PDF
ZineINC™ - komunikacja wewnętrzna w nowym wydaniu
Image Processing of Food Labels
Preslav Nakov - The Web as a Training Set Part 2
Preslav Nakov - The Web as a Training Set Part 3
Preslav Nakov - The Web as a Training Set Part 1
Information retrieval to recommender systems
Practical Deep Learning for NLP
Credit risk predictive analytics
Tweeting beyond Facts – The Need for a Linguistic Perspective
The future of Big Data tooling
Real-time analytics with HBase
Sentiment Analysis
Big Data: Improving capacity utilization of transport companies
Real-time information analysis: social networks and open data
Demand model development for the retail sector of industry
Programming for MIDI controllers
The Profitable Startup
Curso del-juego-go
일하는여성 82호(본문)(2010.4.30)
Mapas conceptuales de unidades paola
ZineINC™ - komunikacja wewnętrzna w nowym wydaniu
Ad

Similar to Machine learning for NLP (18)

PPTX
Delhi NCR JUG meetup - NLP - APIs - By Vikas Malik
PDF
Let’s learn how to use JavaScript responsibly and stay up-to-date.
PDF
Creating Chatbots Using TensorFlow | Chatbot Tutorial | Deep Learning Trainin...
PDF
gPBL - Reading Assistant for Blind - Project Proposal
PPTX
Personal voice assistant - jarvis
PPTX
Why Java Is Great - Unbiased Perspective!
PPTX
Noam Kfir - There is no Java Script - code.talks 2015
PPTX
There Is No JavaScript
PDF
Forget Ruby. Forget CoffeeScript. Do SOA
PDF
Dawn of the Devices
PPTX
Introduction to deep learning
PDF
SFScon19 - Andrea Janes - API fluency remembering APIs to become more effective
PDF
Why natural language is next step in the AI evolution
PDF
C# o basico
PPTX
history and evaluation of java.pptx
PDF
Dr. Russell Keith-Magee: Building a development community
PPTX
Bert algorithm 2
PDF
PyData Frankfurt - (Efficient) Data Exchange with "Foreign" Ecosystems
Delhi NCR JUG meetup - NLP - APIs - By Vikas Malik
Let’s learn how to use JavaScript responsibly and stay up-to-date.
Creating Chatbots Using TensorFlow | Chatbot Tutorial | Deep Learning Trainin...
gPBL - Reading Assistant for Blind - Project Proposal
Personal voice assistant - jarvis
Why Java Is Great - Unbiased Perspective!
Noam Kfir - There is no Java Script - code.talks 2015
There Is No JavaScript
Forget Ruby. Forget CoffeeScript. Do SOA
Dawn of the Devices
Introduction to deep learning
SFScon19 - Andrea Janes - API fluency remembering APIs to become more effective
Why natural language is next step in the AI evolution
C# o basico
history and evaluation of java.pptx
Dr. Russell Keith-Magee: Building a development community
Bert algorithm 2
PyData Frankfurt - (Efficient) Data Exchange with "Foreign" Ecosystems
Ad

More from Data Science Society (20)

PDF
[Data Meetup] Data Science in Finance - Factor Models in Finance
PDF
[Data Meetup] Data Science in Finance - Building a Quant ML pipeline
PPTX
[Data Meetup] Data Science in Journalism - Tanbih, QCRI and MIT
PPTX
Computer Vision in Real Estate
PPTX
ML in Proptech - Concept to Production
PPTX
Lessons Learned: Linked Open Data implemented in 2 Use Cases
PPT
AI methods for localization in noisy environment
PPTX
Object Identification and Detection Hackathon Solution
PPTX
Data Science for Open Innovation in SMEs and Large Corporations
PDF
Air Pollution in Sofia - Solution through Data Science by Kiwi team
PPTX
Machine Learning in Astrophysics
PPTX
#AcademiaDatathon Finlists' Solution of Crypto Datathon Case
PPTX
Coreference Extraction from Identric’s Documents - Solution of Datathon 2018
PDF
DNA Analytics - What does really goes into Sausages - Datathon2018 Solution
PDF
Relationships between research tasks and data structure (basic methods and a...
PDF
Data science tools - A.Marchev and K.Haralampiev
PDF
Problems of Application of Machine Learning in the CRM - panel
PDF
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
PDF
Intelligent Question Answering Using the Wisdom of the Crowd, Preslav Nakov
PDF
Master class Hristo Hadjitchonev - Aubg
[Data Meetup] Data Science in Finance - Factor Models in Finance
[Data Meetup] Data Science in Finance - Building a Quant ML pipeline
[Data Meetup] Data Science in Journalism - Tanbih, QCRI and MIT
Computer Vision in Real Estate
ML in Proptech - Concept to Production
Lessons Learned: Linked Open Data implemented in 2 Use Cases
AI methods for localization in noisy environment
Object Identification and Detection Hackathon Solution
Data Science for Open Innovation in SMEs and Large Corporations
Air Pollution in Sofia - Solution through Data Science by Kiwi team
Machine Learning in Astrophysics
#AcademiaDatathon Finlists' Solution of Crypto Datathon Case
Coreference Extraction from Identric’s Documents - Solution of Datathon 2018
DNA Analytics - What does really goes into Sausages - Datathon2018 Solution
Relationships between research tasks and data structure (basic methods and a...
Data science tools - A.Marchev and K.Haralampiev
Problems of Application of Machine Learning in the CRM - panel
Disruptive as Usual: New Technologies and Data Value Professor Severino Mereg...
Intelligent Question Answering Using the Wisdom of the Crowd, Preslav Nakov
Master class Hristo Hadjitchonev - Aubg

Recently uploaded (20)

PPTX
1_Introduction to advance data techniques.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
annual-report-2024-2025 original latest.
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPT
Quality review (1)_presentation of this 21
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
Business Analytics and business intelligence.pdf
PDF
Introduction to Data Science and Data Analysis
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Computer network topology notes for revision
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Introduction to machine learning and Linear Models
1_Introduction to advance data techniques.pptx
Fluorescence-microscope_Botany_detailed content
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
annual-report-2024-2025 original latest.
Miokarditis (Inflamasi pada Otot Jantung)
Quality review (1)_presentation of this 21
climate analysis of Dhaka ,Banglades.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Business Analytics and business intelligence.pdf
Introduction to Data Science and Data Analysis
Data_Analytics_and_PowerBI_Presentation.pptx
Introduction to Knowledge Engineering Part 1
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Computer network topology notes for revision
SAP 2 completion done . PRESENTATION.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
[EN] Industrial Machine Downtime Prediction
STUDY DESIGN details- Lt Col Maksud (21).pptx
Introduction to machine learning and Linear Models

Machine learning for NLP