A LIGHT INTRODUCTION TO
TRANSFER LEARNING FOR NLP
NATURAL LANGUAGE PROCESSING
• NLP INVOLVES MACHINES OR ROBOTS LEARNING TO UNDERSTAND HUMAN LANGUAGE.
• IT PROCESSES THE LANGUAGE THAT HUMANS SPEAK.
TRANSFER LEARNING
• TRANSFER LEARNING IS THE PROCESS OF TRAINING A MODEL ON A LARGE-
SCALE DATASET AND THEN USING THAT PRETRAINED MODEL TO CONDUCT
LEARNING FOR ANOTHER DOWNSTREAM TASK (I.E., TARGET TASK)
• TRANSFER LEARNING WAS POPULARIZED IN THE FIELD OF COMPUTER VISION
THANKS TO THE IMAGENET DATASET
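• A MINIMAL SKETCH OF THE PRETRAIN-THEN-FINE-TUNE IDEA, ASSUMING A TOY LINEAR MODEL IN PLACE OF A REAL NLP MODEL. THE DATASETS, THE `train` AND `mse` HELPERS, AND ALL HYPERPARAMETERS ARE MADE UP FOR ILLUSTRATION.

```python
def train(weights, data, steps, lr=0.05):
    """Plain gradient descent on mean squared error for y = w * x + b."""
    w, b = weights
    n = len(data)
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def mse(weights, data):
    """Mean squared error of the linear model on a dataset."""
    w, b = weights
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)


# Hypothetical large-scale source task: y = 2x + 1, many examples.
source = [(k / 100, 2 * (k / 100) + 1) for k in range(-100, 101)]
# Hypothetical downstream (target) task: y = 2x + 1.5, only three examples.
target = [(-0.5, 0.5), (0.0, 1.5), (0.5, 2.5)]

pretrained = train((0.0, 0.0), source, steps=500)  # pretraining
finetuned = train(pretrained, target, steps=5)     # start from pretrained weights
scratch = train((0.0, 0.0), target, steps=5)       # same budget, no pretraining

print("fine-tuned error:", mse(finetuned, target))
print("from-scratch error:", mse(scratch, target))
```

• WITH THE SAME SMALL TRAINING BUDGET ON THE TARGET TASK, THE MODEL THAT STARTS FROM THE PRETRAINED WEIGHTS SHOULD END WITH A LOWER ERROR THAN THE ONE TRAINED FROM SCRATCH.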
TRANSFER LEARNING FOR NLP
• TRAINING AND TEST DISTRIBUTIONS ARE OFTEN DIFFERENT, FOR EXAMPLE:
• DIFFERENT TEXT TYPES.
• DIFFERENT ACCENTS/AGES.
• DIFFERENT TOPICS/CATEGORIES.
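• ONE ROUGH WAY TO SEE SUCH A MISMATCH IS TO COUNT HOW MANY TEST WORDS NEVER APPEARED IN TRAINING. THE SKETCH BELOW ASSUMES WHITESPACE TOKENIZATION, AND THE SENTENCES AND THE `oov_rate` HELPER ARE INVENTED FOR ILLUSTRATION.

```python
def oov_rate(train_texts, test_texts):
    """Fraction of test tokens that never occur in the training texts."""
    vocab = {tok for text in train_texts for tok in text.lower().split()}
    test_tokens = [tok for text in test_texts for tok in text.lower().split()]
    unseen = sum(1 for tok in test_tokens if tok not in vocab)
    return unseen / len(test_tokens)


# Made-up training data from one text type (news-like sentences).
news_train = ["the market closed higher today", "the bank raised interest rates"]
# Made-up test data: same text type versus a different text type.
news_test = ["the bank closed today"]
chat_test = ["lol the market is wild rn"]

print("same text type:", oov_rate(news_train, news_test))
print("different text type:", oov_rate(news_train, chat_test))
```

• A HIGHER OUT-OF-VOCABULARY RATE ON THE SECOND TEST SET IS ONE SIMPLE SIGNAL THAT THE TEST DISTRIBUTION HAS SHIFTED AWAY FROM THE TRAINING DISTRIBUTION.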
PRETRAINED NLP MODELS
• ULMFIT
• ELMO
• GLOMO
• OPENAI TRANSFORMER
ULMFIT
• ULMFIT WAS PROPOSED AND DESIGNED BY FAST.AI’S JEREMY HOWARD AND
DEEPMIND’S SEBASTIAN RUDER. YOU COULD SAY THAT ULMFIT WAS THE
RELEASE THAT GOT THE TRANSFER LEARNING PARTY STARTED IN 2018.
• ULMFIT STANDS FOR UNIVERSAL LANGUAGE MODEL FINE-TUNING FOR TEXT
CLASSIFICATION
ELMO
• ELMO STANDS FOR EMBEDDINGS FROM LANGUAGE MODELS
• IT PRETRAINS AN ENTIRE MODEL THAT PRODUCES DEEP
CONTEXTUALIZED REPRESENTATIONS THROUGH STACKED
NEURAL LAYERS
• ELMO IS A NOVEL WAY OF REPRESENTING WORDS AS
VECTORS (EMBEDDINGS). THESE ELMO WORD EMBEDDINGS
ACHIEVED STATE-OF-THE-ART RESULTS ON MULTIPLE NLP
TASKS WHEN THEY WERE INTRODUCED
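• IN THE ELMO PAPER, A DOWNSTREAM TASK COMBINES THE VECTORS FROM THE STACKED LAYERS WITH SOFTMAX-NORMALIZED WEIGHTS AND A SCALE FACTOR. A MINIMAL SKETCH OF THAT WEIGHTED SUM FOR ONE TOKEN FOLLOWS. THE `scalar_mix` NAME, THE VECTORS, AND THE SCORES ARE ASSUMPTIONS FOR ILLUSTRATION, NOT OUTPUT FROM A REAL ELMO MODEL.

```python
import math


def scalar_mix(layer_vectors, layer_scores, gamma=1.0):
    """Weighted sum of one token's per-layer vectors.

    layer_scores are unnormalized; a softmax turns them into weights
    that sum to 1, and gamma rescales the whole result.
    """
    exps = [math.exp(s) for s in layer_scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(layer_vectors[0])
    return [
        gamma * sum(w * vec[i] for w, vec in zip(weights, layer_vectors))
        for i in range(dim)
    ]


# Made-up 2-dimensional vectors for one token from three stacked layers.
layers = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Equal scores give each layer the same weight; in practice they are learned.
embedding = scalar_mix(layers, layer_scores=[0.0, 0.0, 0.0])
print(embedding)
```

• BECAUSE THE WEIGHTS ARE LEARNED PER TASK, ONE TASK CAN LEAN ON LOWER LAYERS WHILE ANOTHER LEANS ON HIGHER LAYERS OF THE SAME PRETRAINED MODEL.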
GLOMO
• GLOMO: UNSUPERVISEDLY LEARNED RELATIONAL GRAPHS AS TRANSFERABLE
REPRESENTATIONS
• MODERN DEEP TRANSFER LEARNING APPROACHES HAVE MAINLY FOCUSED ON
LEARNING GENERIC FEATURE VECTORS FROM ONE TASK THAT ARE
TRANSFERABLE TO OTHER TASKS, SUCH AS WORD EMBEDDINGS IN LANGUAGE
AND PRETRAINED CONVOLUTIONAL FEATURES IN VISION.
• THESE APPROACHES USUALLY TRANSFER UNARY FEATURES AND LARGELY
IGNORE MORE STRUCTURED GRAPHICAL REPRESENTATIONS.
• THIS WORK EXPLORES THE POSSIBILITY OF LEARNING GENERIC LATENT
RELATIONAL GRAPHS THAT CAPTURE DEPENDENCIES BETWEEN PAIRS OF DATA
UNITS (E.G., WORDS OR PIXELS)
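• TO MAKE "RELATIONAL GRAPH" CONCRETE, HERE IS A TOY SKETCH OF AN AFFINITY MATRIX OVER PAIRS OF UNITS AND HOW IT CAN MIX DOWNSTREAM FEATURES. THIS IS NOT THE GLOMO ARCHITECTURE: THE SQUARED-RELU DOT-PRODUCT SCORE, THE FUNCTION NAMES, AND THE NUMBERS ARE ALL ASSUMPTIONS FOR ILLUSTRATION.

```python
def affinity_graph(features):
    """Row-normalized pairwise affinities between units (e.g., words)."""
    n = len(features)
    graph = []
    for i in range(n):
        # Squared ReLU of the dot product keeps scores non-negative and sparse.
        row = [
            max(0.0, sum(a * b for a, b in zip(features[i], features[j]))) ** 2
            for j in range(n)
        ]
        total = sum(row)
        graph.append([v / total if total > 0 else 0.0 for v in row])
    return graph


def propagate(graph, features):
    """Replace each unit's features with an affinity-weighted mix of all units."""
    dim = len(features[0])
    return [
        [sum(w * feat[d] for w, feat in zip(row, features)) for d in range(dim)]
        for row in graph
    ]


# Made-up unit features: units 0 and 1 are similar, unit 2 is unrelated.
units = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
graph = affinity_graph(units)

# The graph, not the unit features, is what gets reused on another task.
downstream_features = [[2.0], [4.0], [10.0]]
mixed = propagate(graph, downstream_features)
print(graph)
print(mixed)
```

• THE POINT OF THE SKETCH IS THE SEPARATION: THE GRAPH IS COMPUTED FROM ONE SET OF FEATURES AND THEN APPLIED TO A DIFFERENT SET, WHICH IS WHAT MAKES IT A TRANSFERABLE STRUCTURE RATHER THAN A TRANSFERABLE FEATURE VECTOR.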
OPENAI TRANSFORMER
• IMPROVING LANGUAGE UNDERSTANDING WITH UNSUPERVISED LEARNING
• THE TRANSFORMER ARCHITECTURE IS AT THE CORE OF ALMOST ALL THE
RECENT MAJOR DEVELOPMENTS IN NLP. IT WAS INTRODUCED IN 2017 BY
GOOGLE. BACK THEN, RECURRENT NEURAL NETWORKS (RNNS) WERE THE USUAL
CHOICE FOR LANGUAGE TASKS SUCH AS MACHINE TRANSLATION AND QUESTION
ANSWERING.
• GOOGLE RELEASED AN IMPROVED VERSION OF THE TRANSFORMER IN 2018,
CALLED THE UNIVERSAL TRANSFORMER.
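• THE TRANSFORMER REPLACES RECURRENCE WITH ATTENTION. A MINIMAL PURE-PYTHON SKETCH OF SCALED DOT-PRODUCT ATTENTION FOLLOWS. IT IS NOT THE OPENAI OR GOOGLE IMPLEMENTATION, AND THE FUNCTION NAMES AND INPUT NUMBERS ARE ASSUMPTIONS FOR ILLUSTRATION.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys
        ]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        outputs.append(
            [
                sum(w * v[d] for w, v in zip(weights, values))
                for d in range(len(values[0]))
            ]
        )
    return outputs


# Made-up inputs: one query, two identical keys, two different values.
queries = [[1.0, 1.0]]
keys = [[1.0, 0.0], [1.0, 0.0]]
values = [[0.0, 2.0], [4.0, 6.0]]
result = attention(queries, keys, values)
print(result)
```

• UNLIKE AN RNN, NOTHING HERE DEPENDS ON PROCESSING POSITIONS ONE AFTER ANOTHER: EVERY QUERY IS COMPARED WITH EVERY KEY INDEPENDENTLY, SO ALL POSITIONS CAN BE COMPUTED IN PARALLEL.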

