NLP “Crash Course”
Charlie Greenbacker
dcnlp.org
Agenda
• Introduction & Motivation
• Famous Examples
• Basics
• Major Task Areas
• Protips
• Resources
Introduction & Motivation
By “NLP” we mean...
Natural Language Processing
(#NLProc)
aka Computational Linguistics, Text Analytics, etc.
not Neuro-linguistic Programming! (#NLP)
Natural Language Processing is...
Using computers to process (i.e., analyze,
understand, generate, etc.) natural human
languages (e.g., English, Chinese, Klingon).
Hello, world! 你好,世界!
That sounds hard... why should I care?
• Most of the knowledge created by humans
is unstructured text (information overload)
• Need some way to make sense of it all
• Enable quantitative analysis of text data
Famous Examples
Siri (Apple, SRI, Nuance)
Speech Recognition/Generation
IBM Watson
Question Answering
Google Translate
Machine Translation
Basics
• Segmentation
• Part-of-speech tagging
• Noun phrase (NP) chunking
• Parsing
• Word sense disambiguation
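The first item, segmentation, can be sketched with regular expressions from the Python standard library. The function names and sample text are assumptions made for this illustration, and the rules are deliberately naive (an abbreviation such as "Dr." would be split incorrectly); toolkits usually ship more robust segmenters.

```python
import re

def split_sentences(text):
    """Naive sentence segmentation: break after ., ! or ? followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]

def tokenize(sentence):
    """Naive word segmentation: runs of word characters, or single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)

text = "Hello, world! NLP sounds hard. Why should I care?"
sentences = split_sentences(text)
tokens = [tokenize(s) for s in sentences]

print(sentences)   # ['Hello, world!', 'NLP sounds hard.', 'Why should I care?']
print(tokens[0])   # ['Hello', ',', 'world', '!']
```

Later steps such as part-of-speech tagging and parsing typically consume token lists like these, so segmentation mistakes carry forward.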
• Stop words, stemming/lemmatization
• Frequency analysis
(terms, n-grams, TF-IDF)
• Machine learning (classification,
clustering, recommendation)
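A minimal sketch of stop-word removal, n-grams, and TF-IDF in plain Python follows. The stop list, the helper names, and the three toy documents are assumptions for illustration only. The weighting is the textbook tf * log(N / df) form; real toolkits often use smoothed variants.

```python
import math
from collections import Counter

# Tiny illustrative stop list (an assumption, not a real stop-word resource).
STOP_WORDS = {"the", "a", "is", "of", "on"}

def terms(text):
    """Lowercase, split on whitespace, and drop stop words."""
    return [w for w in text.lower().split() if w not in STOP_WORDS]

def ngrams(tokens, n):
    """All contiguous n-token windows, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def tf_idf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    tokenized = [terms(d) for d in docs]
    # Document frequency: in how many documents each term appears.
    df = Counter(t for doc in tokenized for t in set(doc))
    n_docs = len(docs)
    weights = []
    for doc in tokenized:
        tf = Counter(doc)
        weights.append({t: (c / len(doc)) * math.log(n_docs / df[t])
                        for t, c in tf.items()})
    return weights

docs = ["the cat sat on the mat",
        "the dog sat on the log",
        "the cat chased the dog"]
bigrams = ngrams(terms(docs[0]), 2)
weights = tf_idf(docs)

print(bigrams)  # [('cat', 'sat'), ('sat', 'mat')]
# "mat" occurs in only one document, so it outweighs "cat" in the first one.
print(weights[0]["mat"] > weights[0]["cat"])  # True
```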
Major Task Areas
Question Answering
• Match query with knowledge base
• Closed domain vs open domain
• Reasoning about intent of question
Speech Recognition
• Speech to text
• Trained/untrained user models
• Voice-based interfaces
Named Entity Recognition
• Entity extraction
• Persons, organizations, locations
• Grammar, syntax, phrasing
Entity Resolution
• Linking names to ground truth
• Disambiguating similar names
Co-reference Resolution
• Finding antecedents for pronouns
• Name resolution
Relationship Extraction
• Attribute values
• SVO triples
• Populating ontologies
Information Retrieval
• Query expansion
• Relevancy of results
• “More like this”
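One common way to approximate relevancy and "more like this" is cosine similarity over bag-of-words vectors. The sketch below assumes whitespace tokenization and raw term counts; the function names and the three sample documents are made up for the example.

```python
import math
from collections import Counter

def bag_of_words(text):
    """Term counts for a lowercased, whitespace-split text."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def more_like_this(query, docs):
    """Rank documents by similarity to the query, most similar first."""
    q = bag_of_words(query)
    return sorted(docs, key=lambda d: cosine(q, bag_of_words(d)), reverse=True)

docs = [
    "speech recognition turns audio into text",
    "machine translation maps source text to target text",
    "sentiment analysis scores polarity",
]
ranked = more_like_this("translation of source text", docs)
print(ranked[0])  # machine translation maps source text to target text
```

Query expansion would add related terms to the query before scoring, and swapping raw counts for TF-IDF weights is a frequent refinement.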
Assistive Technologies
• Text simplification
• Predictive text input
• Alternative interfaces
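Predictive text input can be sketched with bigram counts: suggest the words that most often followed the current word in some training text. The tiny corpus and the function names below are assumptions for illustration; production keyboards rely on far larger models.

```python
from collections import Counter, defaultdict

def train_bigrams(corpus):
    """Count, for each word, which words follow it in the corpus."""
    model = defaultdict(Counter)
    for sentence in corpus:
        tokens = sentence.lower().split()
        for prev, nxt in zip(tokens, tokens[1:]):
            model[prev][nxt] += 1
    return model

def suggest(model, word, k=3):
    """Up to k most frequent next words after `word` (empty list if unseen)."""
    followers = model.get(word.lower(), Counter())
    return [w for w, _ in followers.most_common(k)]

corpus = [
    "natural language processing",
    "natural language generation",
    "natural language processing with python",
]
model = train_bigrams(corpus)
suggestions = suggest(model, "language")
print(suggestions)  # ['processing', 'generation']
```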
NLG + Automatic Summarization
• Generating text from data
• Extractive summarization
• Abstractive summarization
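Extractive summarization selects existing sentences rather than writing new ones. A minimal frequency-based sketch, assuming naive regex segmentation and an average word-frequency score (both are choices made for this example, not taken from the slides):

```python
import re
from collections import Counter

def summarize(text, k=1):
    """Return the k highest-scoring sentences, kept in their original order."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    freq = Counter(re.findall(r"[a-z']+", text.lower()))

    def score(sentence):
        # Average corpus frequency of the words in the sentence.
        tokens = re.findall(r"[a-z']+", sentence.lower())
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    top = sorted(range(len(sentences)),
                 key=lambda i: score(sentences[i]), reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]

text = ("Text data is everywhere. Most text data is unstructured. "
        "Computers can help.")
summary = summarize(text, k=1)
print(summary)  # ['Text data is everywhere.']
```

Without stop-word removal, frequent function words dominate the score, which is one reason the basic preprocessing steps matter. Abstractive summarization, by contrast, generates new sentences and is not covered by this sketch.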
Machine Translation
• From source to target, and back!
• Single terms work... sometimes
• Idioms, metaphors, cultural references
Sentiment Analysis
• Polarity, intensity, direction
• "Easy" for movie/product reviews
• "Impossible" for nearly anything else
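A lexicon-based polarity scorer shows both why reviews look "easy" and why other text does not. The word lists below are tiny, made-up assumptions, and the negation rule only flips the word immediately after a negator.

```python
import re

# Toy lexicons (assumptions for illustration, not a real sentiment resource).
POSITIVE = {"great", "good", "love", "excellent"}
NEGATIVE = {"bad", "boring", "hate", "terrible"}
NEGATORS = {"not", "never", "no"}

def polarity(text):
    """Sum +1/-1 per sentiment word; a negator flips the next word's sign."""
    score = 0
    negate = False
    for word in re.findall(r"[a-z']+", text.lower()):
        if word in NEGATORS:
            negate = True
            continue
        value = (word in POSITIVE) - (word in NEGATIVE)
        score += -value if negate else value
        negate = False
    return score

print(polarity("I love this great movie"))               # 2
print(polarity("not good, just boring"))                 # -2
# Sarcasm defeats the lexicon: this complaint scores as positive.
print(polarity("Great, my flight got cancelled again"))  # 1
```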
Protips
• Domain adaptation
(retrain your models, social media != news)
• Assume everything is in beta
(error rates compound, translate last,
consult the research literature)
• Evaluation is essential
(human judges, “gold standard” data,
cross-validation, appropriate metrics)
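Two of these tips reduce to simple arithmetic. The sketch below assumes pipeline stages fail independently (a simplification) and uses made-up stage accuracies and entity sets; the function names are likewise hypothetical.

```python
import math

def pipeline_accuracy(stage_accuracies):
    """Overall accuracy if every stage must be right and errors are independent."""
    return math.prod(stage_accuracies)

def precision_recall_f1(predicted, gold):
    """Compare a set of system outputs against a gold-standard set."""
    predicted, gold = set(predicted), set(gold)
    tp = len(predicted & gold)  # true positives
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1

# Three stages at 95%, 90%, and 85% leave only about 73% end to end.
overall = pipeline_accuracy([0.95, 0.90, 0.85])

# Hypothetical entity extraction output scored against gold annotations.
p, r, f1 = precision_recall_f1(
    predicted={"Apple", "SRI", "Siri"},
    gold={"Apple", "SRI", "Nuance", "IBM"},
)
print(round(overall, 3), round(p, 3), round(r, 3), round(f1, 3))
```

Which metric is appropriate depends on the task: precision matters when false alarms are costly, recall when misses are.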
Resources
(toolkits)
Stanford CoreNLP
Java, GPL
Apache OpenNLP
Java, Apache License
NLTK
Python, Apache License
Resources
(books)
Natural Language
Processing with Python
Bird, Klein, and Loper
Speech and Language
Processing
Jurafsky and Martin
Foundations of Statistical
Natural Language Processing
Manning and Schütze
Resources
(groups)
ACL (Association for
Computational Linguistics)
Conferences, Workshops, Journals, SIGs
DC NLP
NLP Meetups
Data Community DC
NLP Workshops
Questions?
Charlie Greenbacker
dcnlp.org
@greenbacker
