Natural Language Processing
SoSe 2016
Introduction to Natural Language Processing
Dr. Mariana Neves April 11th, 2016
http://bit.ly/2Mub6xP
Outline
● Introduction to Language
● NLP Applications
● NLP Techniques
● Linguistic Knowledge
● Challenges
● NLP course
(http://www.transparent.com/learn-japanese/articles/dec_99.html)(http://expertenough.com/2392/german-language-hacks)
Natural Language
Artificial Language
(https://netbeans.org/features/java/) (http://noobite.com/learn-programming-start-with-python/)
Language
A vocabulary consists of a set of words (w_i)
(http://learnenglish.britishcouncil.org/en/vocabulary-games)
A text is composed of a sequence of words from a vocabulary
A language is the set of all possible texts
(http://www.nature.com/polopoly_fs/1.16929!/menu/main/topColumns/topLeftColumn/pdf/518273a.pdf)
(http://www.old-engli.sh/language.php)
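The three definitions above can be sketched in a few lines of Python. This is only an illustration: the function name `is_text_over` and the toy vocabulary are assumptions, not part of the lecture.

```python
from typing import Sequence, Set


def is_text_over(vocabulary: Set[str], words: Sequence[str]) -> bool:
    """Return True if every word in the sequence belongs to the vocabulary."""
    return all(word in vocabulary for word in words)


# A toy vocabulary (a set of words) and a text (a sequence of words).
vocabulary = {"john", "reads", "a", "book"}
text = ["john", "reads", "a", "book"]

print(is_text_over(vocabulary, text))                # True
print(is_text_over(vocabulary, ["john", "sleeps"]))  # False
```

Under this view, a language is a set of such word sequences, which is usually far too large to list explicitly.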
Spell and Grammar Checking
● Checking spelling and grammar
● Suggesting alternatives for the errors
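One common way to suggest alternatives for a misspelled word is to rank dictionary entries by edit distance. The sketch below assumes a tiny hypothetical dictionary and a hypothetical `suggest` helper; real spell checkers also use word frequency and context.

```python
from typing import List


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def suggest(word: str, dictionary: List[str], max_distance: int = 2) -> List[str]:
    """Return dictionary words within max_distance, closest first."""
    scored = sorted((edit_distance(word, cand), cand) for cand in dictionary)
    return [cand for dist, cand in scored if dist <= max_distance]


# Hypothetical dictionary, for illustration only.
print(suggest("syntatic", ["syntactic", "semantic", "syntax"]))  # ['syntactic']
```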
Word Prediction
● Predicting the next word that the user is most likely to type
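A minimal sketch of word prediction with bigram counts follows. It assumes a tiny hypothetical corpus and hypothetical helper names; it predicts the word that most often followed the previous word in that corpus.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Optional


def train_bigrams(sentences: List[str]) -> Dict[str, Counter]:
    """Count, for every word, which words follow it and how often."""
    counts: Dict[str, Counter] = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts


def predict_next(counts: Dict[str, Counter], word: str) -> Optional[str]:
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Hypothetical training corpus, for illustration only.
corpus = [
    "when will my book arrive",
    "when will i receive my book",
    "john reads a book",
]
model = train_bigrams(corpus)
print(predict_next(model, "when"))  # will
print(predict_next(model, "my"))    # book
```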
Information Retrieval
● Finding information relevant to the user’s query
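One widely used way to rank documents against a query is TF-IDF weighting. The sketch below is an assumption about how such a ranking could look, not the method used in the lecture; the documents and function names are hypothetical.

```python
import math
from collections import Counter
from typing import Dict, List, Tuple


def tf_idf_vectors(docs: List[str]) -> Tuple[List[Dict[str, float]], Dict[str, float]]:
    """Compute a TF-IDF weight for every term of every document."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents does each term occur?
    df = Counter(term for tokens in tokenized for term in set(tokens))
    idf = {term: math.log(n_docs / df[term]) for term in df}
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({t: (c / len(tokens)) * idf[t] for t, c in tf.items()})
    return vectors, idf


def rank(query: str, docs: List[str]) -> List[int]:
    """Return document indices sorted by summed TF-IDF of the query terms."""
    vectors, _ = tf_idf_vectors(docs)
    terms = query.lower().split()
    scores = [sum(vec.get(t, 0.0) for t in terms) for vec in vectors]
    return sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)


# Hypothetical document collection, for illustration only.
docs = [
    "the astronomer loves the star",
    "john reads a book",
    "the celebrity is a star",
]
print(rank("star astronomer", docs))  # [0, 2, 1]
```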
Text Categorization
● Assigning one or more pre-defined categories to a text
http://www.uclassify.com/browse/mvazquez/News-Classifier
Summarization
● Generating a short summary from one or more documents, sometimes based on a given query
http://smmry.com/
Question answering
● Answering questions with a short answer
http://start.csail.mit.edu/index.php
Question Answering & Summarization
Question answering
● IBM Watson in Jeopardy!
https://www.youtube.com/watch?v=WFR3lOm_xhE
Information Extraction
● Extracting important concepts from texts and assigning them to slots in a certain template
Information Extraction
● Includes named-entity recognition
http://cogcomp.cs.illinois.edu/page/demo_view/Wikifier
Machine Translation
● Translating a text from one language to another
Sentiment Analysis
● Identifying sentiments and opinions stated in a text
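The simplest form of sentiment analysis counts words from a polarity lexicon. The sketch below uses a tiny hypothetical lexicon and ignores negation and context, so it only illustrates the idea.

```python
# Hypothetical polarity lexicon, for illustration only.
POSITIVE = {"good", "great", "love", "excellent"}
NEGATIVE = {"bad", "terrible", "hate", "poor"}


def sentiment(text: str) -> str:
    """Label a text by comparing counts of positive and negative words."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


print(sentiment("I love this great book!"))  # positive
print(sentiment("The plot was terrible."))   # negative
print(sentiment("John reads a book."))       # neutral
```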
Optical Character Recognition
● Recognizing printed or handwritten text and converting it to computer-readable text
Speech recognition
● Recognizing spoken language and transforming it into text
Speech synthesis
● Producing spoken language from a text
Spoken dialog systems
● Conducting a dialog between the user and the system
(http://dialog-demo.mybluemix.net/)
Levels of difficulty
● Easy (mostly solved)
– Spell and grammar checking
– Some text categorization tasks
– Some named-entity recognition tasks
● Intermediate (good progress)
– Information retrieval
– Sentiment analysis
– Machine translation
– Information extraction
● Difficult (still hard)
– Question answering
– Summarization
– Dialog systems
Section splitting
● Splitting a text into sections
Sentence splitting
● Splitting a text into sentences
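A naive sentence splitter can be sketched with a regular expression that breaks after ".", "!" or "?" when whitespace and an uppercase letter follow. The rule and the function name are assumptions for illustration; the second call shows why abbreviations make the task harder than it looks.

```python
import re
from typing import List

# Split at whitespace that follows ., ! or ? and precedes an uppercase letter.
_BOUNDARY = re.compile(r"(?<=[.!?])\s+(?=[A-Z])")


def split_sentences(text: str) -> List[str]:
    """Split a text into sentences with a simple punctuation rule."""
    return [s for s in _BOUNDARY.split(text.strip()) if s]


print(split_sentences("John reads a book. He borrowed it from his friend."))
# ['John reads a book.', 'He borrowed it from his friend.']

# The rule wrongly splits after an abbreviation:
print(split_sentences("Dr. Neves teaches NLP."))
# ['Dr.', 'Neves teaches NLP.']
```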
Part-of-speech tagging
● Assigning a syntactic tag to each word in a sentence
http://nlp.stanford.edu:8080/corenlp/
Parsing
● Building the syntactic tree of a sentence
http://nlp.stanford.edu:8080/corenlp/
Named-entity recognition
● Identifying pre-defined entity types in a sentence
Word sense disambiguation
● Figuring out the intended meaning of a word or entity in its context
http://www.thefreedictionary.com/tie
http://www.ling.gu.se/~lager/Home/pwe_ui.html
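One classic approach to word sense disambiguation picks the sense whose dictionary gloss shares the most words with the context (a simplified Lesk-style overlap). The sketch below is an illustration under that assumption; the stop-word list, sense labels, and function names are hypothetical.

```python
from typing import Dict, Set

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "of", "or", "to", "in", "and", "is", "on"}


def content_words(text: str) -> Set[str]:
    """Lowercase the text, strip punctuation, and drop stop words."""
    return {w.strip(".,").lower() for w in text.split()} - STOP_WORDS


def disambiguate(word: str, context: str, senses: Dict[str, str]) -> str:
    """Return the sense label whose gloss overlaps most with the context."""
    ctx = content_words(context) - {word.lower()}
    return max(senses, key=lambda s: len(ctx & content_words(senses[s])))


# Two senses of "fall" with short glosses.
senses = {
    "season": "The third season of the year",
    "movement": "Moving down towards the ground or towards a lower position",
}
print(disambiguate("fall", "Leaves turn red in the fall season of the year", senses))
# season
print(disambiguate("fall", "Be careful not to fall down to the ground", senses))
# movement
```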
Semantic role labeling
● Identifying the predicate of a sentence and the roles of its arguments, e.g., extracting subject-predicate-object triples
http://cogcomp.cs.illinois.edu/page/demo_view/srl
Phonetics and phonology
● The study of linguistic sounds and their relations to words
http://german.about.com/library/blfunkabc.htm
Morphology
● The study of the internal structure of words and how they can be modified
● Parsing complex words into their components
(http://allthingslinguistic.com/post/50939757945/morphological-typology-illustrations-from)
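Splitting a word into its components can be sketched with naive suffix stripping. The suffix list and the minimum stem length are assumptions for illustration; real morphological analyzers rely on lexicons and spelling rules.

```python
from typing import Tuple

# Hypothetical list of English inflectional suffixes, longest first.
SUFFIXES = ["ing", "ed", "s"]


def split_suffix(word: str) -> Tuple[str, str]:
    """Split a word into (stem, suffix); suffix is '' if none applies."""
    for suffix in SUFFIXES:
        # Require a stem of at least three letters to avoid over-stripping.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)], suffix
    return word, ""


print(split_suffix("reads"))     # ('read', 's')
print(split_suffix("borrowed"))  # ('borrow', 'ed')
print(split_suffix("book"))      # ('book', '')
# Spelling changes are not handled: "moving" yields ('mov', 'ing').
print(split_suffix("moving"))
```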
Syntax
● The study of the structural relationships between words in a sentence
Semantics
● The study of the meaning of words, and how these combine to form the meanings of sentences
– Synonymy: fall & autumn
– Hypernymy & hyponymy (is a): animal & dog
– Meronymy (part of): finger & hand
– Homonymy: fall (verb & season)
– Antonymy: big & small
Pragmatics
● Social use of language
● The study of how language is used to accomplish goals, and the influence of context on meaning
● Understanding the aspects of a language that depend on situation and world knowledge
Give me the salt!
Could you please give me the salt?
Discourse
● The study of linguistic units larger than a single statement
John reads a book. He borrowed it from his friend.
(http://en.wikipedia.org/wiki/Berlin)
Paraphrasing
● Different words/sentences express the same meaning
– Season of the year
  ● Fall
  ● Autumn
– Book delivery time
  ● When will my book arrive?
  ● When will I receive my book?
Ambiguity
● One word/sentence can have different meanings
– Fall
  ● The third season of the year
  ● Moving down towards the ground or towards a lower position
– The door is open.
  ● Expressing a fact
  ● A request to close the door
Phonetics and Phonology
http://worldsgreatestsmile.com/html/phonological_ambiguity.html
Syntax and ambiguity
● I saw the man with a telescope.
– Who had the telescope?
(http://www.realtytrac.com/landing/2009-year-end-foreclosure-report.html)
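The two readings of the telescope sentence correspond to two different syntactic trees. The sketch below encodes both as nested tuples and reports where the prepositional phrase (PP) attaches. The tree encoding and the helper `pp_parent` are assumptions for illustration.

```python
from typing import Optional, Tuple

# Reading 1: the PP modifies the noun phrase (the man has the telescope).
np_attachment = (
    "S", ("NP", "I"),
    ("VP", ("V", "saw"),
     ("NP", ("NP", "the man"), ("PP", "with a telescope"))),
)

# Reading 2: the PP modifies the verb phrase (I used the telescope to see).
vp_attachment = (
    "S", ("NP", "I"),
    ("VP", ("V", "saw"), ("NP", "the man"), ("PP", "with a telescope")),
)


def pp_parent(tree: Tuple) -> Optional[str]:
    """Return the label of the node that directly dominates the PP."""
    label, *children = tree
    for child in children:
        if isinstance(child, tuple):
            if child[0] == "PP":
                return label
            found = pp_parent(child)
            if found:
                return found
    return None


print(pp_parent(np_attachment))  # NP
print(pp_parent(vp_attachment))  # VP
```

Both trees cover exactly the same words, which is why a parser needs more than word order to choose between them.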
Semantics
● The astronomer loves the star.
– Star in the sky
– Celebrity
(http://www.businessnewsdaily.com/2023-celebrity-hiring.html)
(http://en.wikipedia.org/wiki/Star#/media/File:Starsinthesky.jpg)
Discourse analysis
● Alice understands that you like your mother, but she …
– Does "she" refer to Alice or to your mother?
NLP Course
● Home page:
– http://hpi.de/plattner/teaching/summer-term-2016/natural-language-processing.html
● Lecture
–
– HS3
– 3 credit points
NLP for Machine Learning Featuring
Label Encoding
One hot encoding
Synonym treatment
Stemming
Lemmatization
Stop words
Parts Of Speech Tagging
TF-IDF and the math behind it