SlideShare a Scribd company logo
Neural Machine Translation
By Jointly Learning To Align And Translation
Published as a conference paper as ICLR 2015
What is this paper?
 In neural machine translation, it proposed the model that extended function which automatically
(soft-)search for parts of an input sentence that are relevant to predicting a target word, without
having to form these parts as a hard segment explicitly.
What’s better than previous paper?
 This model surpassed previous model significantly, regardless of the sentence length and is much
stronger to the length of an input sentence.
 (Previous model’s performance becomes worse as increase the length of input sentence.)
What’re important parts of
technics and method?
 Unlike previous, the new architecture consists of a bidirectional RNN.
(Bidirectional means two opposite directions)
 By letting the decoder have an attention mechanism, this does not need to encode all
information in the input sentence into a fixed-length vector.
How did they verify it?
 On the task of English-to-French, they compared RNN search (this model) with RNN Encoder-
Decoder (previous model) in BLUE score using ACL WMT ‘14.
(About BLUE score… https://to-in.com/blog/102282)
Is there a discussion?
 Considering that the proposed architecture, or the whole family of neural machine translation,
has only been proposed as recently as this year.
 One if challenges left for the future is to better handle unknown, or rare words.
Is there a paper to read next?
 Sequence to sequence
 RNN Encoder-Decoder
Paper Information
 https://arxiv.org/abs/1409.0473

More Related Content

PPTX
A neural conversational_model
PDF
[Impl] neural machine translation
PDF
Neural Machine Translation (D3L4 Deep Learning for Speech and Language UPC 2017)
PPTX
Natural Language to Visualization by Neural Machine Translation
PDF
Deep learning for NLP and Transformer
PDF
Transformer Introduction (Seminar Material)
PDF
NLP using transformers
PPTX
Notes on attention mechanism
A neural conversational_model
[Impl] neural machine translation
Neural Machine Translation (D3L4 Deep Learning for Speech and Language UPC 2017)
Natural Language to Visualization by Neural Machine Translation
Deep learning for NLP and Transformer
Transformer Introduction (Seminar Material)
NLP using transformers
Notes on attention mechanism

What's hot (20)

PPTX
Deep Learning for Machine Translation
PPTX
PPTX
Attention Mechanism in Language Understanding and its Applications
PPTX
An LSTM-Based Neural Network Architecture for Model Transformations
PPTX
Thomas Wolf "Transfer learning in NLP"
PDF
Learning to Translate with Joey NMT
PDF
BERT: Bidirectional Encoder Representations from Transformers
PPTX
Master Thesis of Computer Engineering: OpenTranslator
PPTX
[Paper review] BERT
PDF
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
PPTX
Scalable image recognition model with deep embedding
PDF
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
PPTX
PDF
Deep Learning Architectures for NLP (Hungarian NLP Meetup 2016-09-07)
PDF
Machine Translation Introduction
PPTX
Deep Learning for Machine Translation, by Satoshi Enoue, SYSTRAN
PDF
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
PPTX
Word embeddings, RNN, GRU and LSTM
PPTX
Natural language processing techniques transition from machine learning to de...
PDF
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Deep Learning for Machine Translation
Attention Mechanism in Language Understanding and its Applications
An LSTM-Based Neural Network Architecture for Model Transformations
Thomas Wolf "Transfer learning in NLP"
Learning to Translate with Joey NMT
BERT: Bidirectional Encoder Representations from Transformers
Master Thesis of Computer Engineering: OpenTranslator
[Paper review] BERT
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Scalable image recognition model with deep embedding
Deep Learning for NLP (without Magic) - Richard Socher and Christopher Manning
Deep Learning Architectures for NLP (Hungarian NLP Meetup 2016-09-07)
Machine Translation Introduction
Deep Learning for Machine Translation, by Satoshi Enoue, SYSTRAN
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
Word embeddings, RNN, GRU and LSTM
Natural language processing techniques transition from machine learning to de...
Zhongyuan Zhu - 2015 - Evaluating Neural Machine Translation in English-Japan...
Ad

Similar to Neural machine translation by jointly learning to align and translate (20)

PDF
[論文読み会資料] Asynchronous Bidirectional Decoding for Neural Machine Translation
PDF
Advanced Neural Machine Translation (D4L2 Deep Learning for Speech and Langua...
PPTX
[PACLING2019] Improving Context-aware Neural Machine Translation with Target-...
PDF
Building a Neural Machine Translation System From Scratch
PDF
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
PDF
MIT-Lec-11- RSA.pdf
PDF
Pointing the Unknown Words
PDF
Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...
PDF
05-transformers.pdf
PDF
IRJET- Visual Information Narrator using Neural Network
PPTX
A Tour of Neural Sequence Generators
PPTX
Transformers AI PPT.pptx
PDF
Tensor flow05 neural-machine-translation-seq2seq
PDF
Deep Learning, Where Are You Going?
PPTX
[Paper Reading] Attention is All You Need
PDF
ANALYZING ARCHITECTURES FOR NEURAL MACHINE TRANSLATION USING LOW COMPUTATIO...
PDF
ANALYZING ARCHITECTURES FOR NEURAL MACHINE TRANSLATION USING LOW COMPUTATIONA...
PDF
ANALYZING ARCHITECTURES FOR NEURAL MACHINE TRANSLATION USING LOW COMPUTATIONA...
PPTX
Monotonic Multihead Attention review
PDF
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
[論文読み会資料] Asynchronous Bidirectional Decoding for Neural Machine Translation
Advanced Neural Machine Translation (D4L2 Deep Learning for Speech and Langua...
[PACLING2019] Improving Context-aware Neural Machine Translation with Target-...
Building a Neural Machine Translation System From Scratch
Hyoung-Gyu Lee - 2015 - NAVER Machine Translation System for WAT 2015
MIT-Lec-11- RSA.pdf
Pointing the Unknown Words
Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...
05-transformers.pdf
IRJET- Visual Information Narrator using Neural Network
A Tour of Neural Sequence Generators
Transformers AI PPT.pptx
Tensor flow05 neural-machine-translation-seq2seq
Deep Learning, Where Are You Going?
[Paper Reading] Attention is All You Need
ANALYZING ARCHITECTURES FOR NEURAL MACHINE TRANSLATION USING LOW COMPUTATIO...
ANALYZING ARCHITECTURES FOR NEURAL MACHINE TRANSLATION USING LOW COMPUTATIONA...
ANALYZING ARCHITECTURES FOR NEURAL MACHINE TRANSLATION USING LOW COMPUTATIONA...
Monotonic Multihead Attention review
Advancements in Hindi-English Neural Machine Translation: Leveraging LSTM wit...
Ad

Recently uploaded (20)

PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PPTX
1. Introduction to Computer Programming.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
Encapsulation theory and applications.pdf
PDF
Getting Started with Data Integration: FME Form 101
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Mobile App Security Testing_ A Comprehensive Guide.pdf
A comparative study of natural language inference in Swahili using monolingua...
Group 1 Presentation -Planning and Decision Making .pptx
Machine learning based COVID-19 study performance prediction
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Heart disease approach using modified random forest and particle swarm optimi...
1. Introduction to Computer Programming.pptx
Empathic Computing: Creating Shared Understanding
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Network Security Unit 5.pdf for BCA BBA.
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Encapsulation theory and applications.pdf
Getting Started with Data Integration: FME Form 101
Digital-Transformation-Roadmap-for-Companies.pptx
A Presentation on Artificial Intelligence
Advanced methodologies resolving dimensionality complications for autism neur...
Programs and apps: productivity, graphics, security and other tools
gpt5_lecture_notes_comprehensive_20250812015547.pdf

Neural machine translation by jointly learning to align and translate

  • 1. Neural Machine Translation By Jointly Learning To Align And Translation Published as a conference paper as ICLR 2015
  • 2. What is this paper?  In neural machine translation, it proposed the model that extended function which automatically (soft-)search for parts of an input sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
  • 3. What’s better than previous paper?  This model surpassed previous model significantly, regardless of the sentence length and is much stronger to the length of an input sentence.  (Previous model’s performance becomes worse as increase the length of input sentence.)
  • 4. What’re important parts of technics and method?  Unlike previous, the new architecture consists of a bidirectional RNN. (Bidirectional means two opposite directions)  By letting the decoder have an attention mechanism, this does not need to encode all information in the input sentence into a fixed-length vector.
  • 5. How did they verify it?  On the task of English-to-French, they compared RNN search (this model) with RNN Encoder- Decoder (previous model) in BLUE score using ACL WMT ‘14. (About BLUE score… https://to-in.com/blog/102282)
  • 6. Is there a discussion?  Considering that the proposed architecture, or the whole family of neural machine translation, has only been proposed as recently as this year.  One if challenges left for the future is to better handle unknown, or rare words.
  • 7. Is there a paper to read next?  Sequence to sequence  RNN Encoder-Decoder