SlideShare a Scribd company logo
A Statistical Approach to
  Machine Translation
  Peter F. Brown, John Cocke, Stephen A. Della Pietra,
  Vincent J. Della Pietra, Fredrick Jeinek, John D. Lafferty,
  Robert L. Mercer, and Paul S. Roossin

                                     Published in:
                                     · Journal
                                     Computational Linguistics archive
                                     Volume 16 Issue 2, June 1990
                                     Pages 79-85
                                     MIT Press Cambridge, MA, USA
Introduction
• (S, T): Every pair of sentences
• Pr(T|S): probability of T when presented with S
• Given a sentence T in the target language, seek
  the sentence S from which the translator
  produced T.
• Chance of error is minimized by choosing the
  most probable sentence S given T
  –   In other words, to maximize Pr(S|T)
  –   Pr(S|T) = Pr(S) Pr(T|S) / Pr(T) : Bayes’ theorem
  –   Pr(S): language model probability of S
  –   Pr(T|S): translation probability of T given S
• Statistical Translation System requires:
  – A method for computing language model
    probabilities
  – A method for computing translation probabilities
  – A method for searching among possible source
    sentences S for the one that gives the greatest
    value for Pr(S)Pr(T|S)
The Language Model
• a word string S1, S2, … Sn
• Pr(S1, S2, … Sn)
              = Pr(S1) Pr(S2|S1)…Pr(Sn|S1S2…Sn-1)
• Given a history S1S2…Sj-1, you must know the
  probability of object word Sj
• Histories are too long, probability parameters cannot
  be separated
• To reduce parameters:
   – Categorize each history into same class
   – Probability of object depends on the history in same class
The Translation Model
• A word can be translated into more than one word
   – Fertility: num. of T words that an S word produces
• Notation for alignment:
   – (Jean aime Marie | John(1) loves(2) Mary(3) )
      • ( T | S)
      • Num. in S words are positions in T
• Computing the probability of the alignment:
   – (Le chien est battu par Jean | John(6) does beat(3, 4)
     the(1) dog(2))
The Translation Model
• In English adjectives precede nouns, in French
  adj. follows nouns
  – Distortion: T word appears far from S word in
    alignment
• Distortion probability:
  – Pr(i|j, l)
     • i: a target position
     • J: a source position
     • l: the target length
Searching
• Searching for the sentence S to maximize
  Pr(S)Pr(T|S)
• Uses stack search
  – A list of partial alignment hypothesis
  – (Jean aime Marie| * )
     • *: a place holder for unknown sequence of S
  – In iteration, extends most promising entries and adds
    to its hypothesis
  – Ends when complete alignment is the most promising
    significantly
Parameter Estimation
• Both LM and TM have many parameters
• To estimate, needs pairs of translations
• For this experiment, they used Canadian
  parliament’s records translated in
  English/French
  – Param. & LM/TM can be estimated from this
Two Pilot Experiments
• First experiment: to estimate params for TM
• 9000 most common words in English/French
  from Hansard data
Two Pilot Experiment
• Second experiment:
   – French to English
   – 1,000 most frequently used English words
   – 1,700 most frequently used French words ( covered by the 1k
     English words)
   – Estimated 17 million parameters of translation model from
     117,000 pairs of sentences
   – Est. bigram language model from 570,000 sentences from the
     English part of Hansard
   – Evaluation:
      •   Exact: Decoded sentence was exactly the same
      •   Alternate: same meaning, slightly different words
      •   Different: not covey the same meaning as the translation
      •   Wrong: makes a sense but not interpreted
      •   Ungrammatical: grammatically deficient
Translation Example/Result

More Related Content

PPT
Lecture 7: Definite Clause Grammars
PPTX
sentence patterns
PDF
Codeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
PDF
Towards OpenLogos Hybrid Machine Translation - Anabela Barreiro
PDF
Statistical machine translation in a few slides
PPTX
Machine translation with statistical approach
PDF
Summary of Rule-based Reordering Space in Statistical Machine Translation
PDF
7. ebmt based on st sm
Lecture 7: Definite Clause Grammars
sentence patterns
Codeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
Towards OpenLogos Hybrid Machine Translation - Anabela Barreiro
Statistical machine translation in a few slides
Machine translation with statistical approach
Summary of Rule-based Reordering Space in Statistical Machine Translation
7. ebmt based on st sm

Viewers also liked (20)

PPTX
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
PPTX
WEBINAR: TAUS Outlook 2013
PDF
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
PPTX
Machine Translation: What it is?
PPT
Google services
PDF
Designing e-Learning Content for Localization
PPT
Sec16.3: Reordering Integration
PPTX
Escaping style and script data
PPTX
Translationusing moses1
PPT
Assamese to English Statistical Machine Translation
PPTX
Data Localization and Translation
PDF
Going Global? The ABC of Localization-Friendly Content
PPTX
Translation & Localization
PPTX
Machine Translation: Latest Innovations and their Impact on Commercial Transl...
 
PPTX
TAUS webinar The Big Picture View On The Translation Industry, March 2013
PPTX
Localization and globalization in c#
PPT
Localization framework
PDF
A Computational Model of Staged Language Acquisition
PPT
Statistical machine translation for indian language copy
PPTX
C#: Globalization and localization
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
WEBINAR: TAUS Outlook 2013
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
Machine Translation: What it is?
Google services
Designing e-Learning Content for Localization
Sec16.3: Reordering Integration
Escaping style and script data
Translationusing moses1
Assamese to English Statistical Machine Translation
Data Localization and Translation
Going Global? The ABC of Localization-Friendly Content
Translation & Localization
Machine Translation: Latest Innovations and their Impact on Commercial Transl...
 
TAUS webinar The Big Picture View On The Translation Industry, March 2013
Localization and globalization in c#
Localization framework
A Computational Model of Staged Language Acquisition
Statistical machine translation for indian language copy
C#: Globalization and localization
Ad

Similar to A statistical approach to machine translation (20)

PPTX
Statistical machine translation
PDF
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
PPT
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
 
PDF
Tamil-English Document Translation Using Statistical Machine Translation Appr...
PDF
Improving the role of language model in statistical machine translation (Indo...
PDF
Machine translation course program (in English)
PDF
Deep Learning for Machine Translation - A dramatic turn of paradigm
PDF
NEURAL AND STATISTICAL MACHINE TRANSLATION: CONFRONTING THE STATE OF THE ART
PDF
NEURAL AND STATISTICAL MACHINE TRANSLATION: CONFRONTING THE STATE OF THE ART
PDF
PDF
Koreanizer : Statistical Machine Translation based Ro-Ko Transliterator
PDF
The Effect of Translationese on Statistical Machine Translation
PPTX
Statistical Machine Translation for Language Localisation
PPTX
Real-time DirectTranslation System for Sinhala and Tamil Languages.
PDF
Seminar report on a statistical approach to machine
PDF
7. Trevor Cohn (usfd) Statistical Machine Translation
PPTX
project present
PPTX
Experiments with Different Models of Statistcial Machine Translation
PPTX
Experiments with Different Models of Statistcial Machine Translation
PDF
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
Statistical machine translation
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
 
Tamil-English Document Translation Using Statistical Machine Translation Appr...
Improving the role of language model in statistical machine translation (Indo...
Machine translation course program (in English)
Deep Learning for Machine Translation - A dramatic turn of paradigm
NEURAL AND STATISTICAL MACHINE TRANSLATION: CONFRONTING THE STATE OF THE ART
NEURAL AND STATISTICAL MACHINE TRANSLATION: CONFRONTING THE STATE OF THE ART
Koreanizer : Statistical Machine Translation based Ro-Ko Transliterator
The Effect of Translationese on Statistical Machine Translation
Statistical Machine Translation for Language Localisation
Real-time DirectTranslation System for Sinhala and Tamil Languages.
Seminar report on a statistical approach to machine
7. Trevor Cohn (usfd) Statistical Machine Translation
project present
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translation
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
Ad

More from Hiroshi Matsumoto (17)

PPTX
Phrase linguistic classification and generalization for improving statistical...
PDF
Paraphrasing Swedish Compound Nouns in Machine Translation
PPTX
Graph Propagation for Paraphrasing Out-of-Vocabulary Words in Statistical Mac...
PPTX
Summary of Dialectal to standard Arabic paraphrasing to improve Arabic-Englis...
PDF
Improving translation via targeted paraphrasing
PDF
Summary: A Sense-Based Translation Model for Statistical Machine Translation
PDF
Predicting Power Relations between Participants in Written Dialog from a Sing...
PPTX
Modeling Irony in Twitter
PPTX
Factored translationmodel
PDF
10.combination of sm_tn_rbmt
PDF
9. cgc parser with_norml_std
PDF
8. relearnt rbmt
PDF
Summary of English Japanese Translation by MSR-MT
PPTX
PPTX
Mt framework nagao_makoto
PPTX
Approach to japanese english automatic translation by Susumu Kuno
PPTX
Machine translation
Phrase linguistic classification and generalization for improving statistical...
Paraphrasing Swedish Compound Nouns in Machine Translation
Graph Propagation for Paraphrasing Out-of-Vocabulary Words in Statistical Mac...
Summary of Dialectal to standard Arabic paraphrasing to improve Arabic-Englis...
Improving translation via targeted paraphrasing
Summary: A Sense-Based Translation Model for Statistical Machine Translation
Predicting Power Relations between Participants in Written Dialog from a Sing...
Modeling Irony in Twitter
Factored translationmodel
10.combination of sm_tn_rbmt
9. cgc parser with_norml_std
8. relearnt rbmt
Summary of English Japanese Translation by MSR-MT
Mt framework nagao_makoto
Approach to japanese english automatic translation by Susumu Kuno
Machine translation

Recently uploaded (20)

PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Basic Mud Logging Guide for educational purpose
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Cell Types and Its function , kingdom of life
PDF
RMMM.pdf make it easy to upload and study
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Computing-Curriculum for Schools in Ghana
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
master seminar digital applications in india
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Pharma ospi slides which help in ospi learning
Supply Chain Operations Speaking Notes -ICLT Program
Basic Mud Logging Guide for educational purpose
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Anesthesia in Laparoscopic Surgery in India
Module 4: Burden of Disease Tutorial Slides S2 2025
Cell Types and Its function , kingdom of life
RMMM.pdf make it easy to upload and study
Microbial disease of the cardiovascular and lymphatic systems
FourierSeries-QuestionsWithAnswers(Part-A).pdf
O7-L3 Supply Chain Operations - ICLT Program
2.FourierTransform-ShortQuestionswithAnswers.pdf
Final Presentation General Medicine 03-08-2024.pptx
PPH.pptx obstetrics and gynecology in nursing
Computing-Curriculum for Schools in Ghana
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
master seminar digital applications in india
O5-L3 Freight Transport Ops (International) V1.pdf
Pharma ospi slides which help in ospi learning

A statistical approach to machine translation

  • 1. A Statistical Approach to Machine Translation Peter F. Brown, John Cocke, Stephen A. Della Pietra, Vincent J. Della Pietra, Fredrick Jeinek, John D. Lafferty, Robert L. Mercer, and Paul S. Roossin Published in: · Journal Computational Linguistics archive Volume 16 Issue 2, June 1990 Pages 79-85 MIT Press Cambridge, MA, USA
  • 2. Introduction • (S, T): Every pair of sentences • Pr(T|S): probability of T when presented with S • Given a sentence T in the target language, seek the sentence S from which the translator produced T. • Chance of error is minimized by choosing the most probable sentence S given T – In other words, to maximize Pr(S|T) – Pr(S|T) = Pr(S) Pr(T|S) / Pr(T) : Bayes’ theorem – Pr(S): language model probability of S – Pr(T|S): translation probability of T given S
  • 3. • Statistical Translation System requires: – A method for computing language model probabilities – A method for computing translation probabilities – A method for searching among possible source sentences S for the one that gives the greatest value for Pr(S)Pr(T|S)
  • 4. The Language Model • a word string S1, S2, … Sn • Pr(S1, S2, … Sn) = Pr(S1) Pr(S2|S1)…Pr(Sn|S1S2…Sn-1) • Given a history S1S2…Sj-1, you must know the probability of object word Sj • Histories are too long, probability parameters cannot be separated • To reduce parameters: – Categorize each history into same class – Probability of object depends on the history in same class
  • 5. The Translation Model • A word can be translated into more than one word – Fertility: num. of T words that an S word produces • Notation for alignment: – (Jean aime Marie | John(1) loves(2) Mary(3) ) • ( T | S) • Num. in S words are positions in T • Computing the probability of the alignment: – (Le chien est battu par Jean | John(6) does beat(3, 4) the(1) dog(2))
  • 6. The Translation Model • In English adjectives precede nouns, in French adj. follows nouns – Distortion: T word appears far from S word in alignment • Distortion probability: – Pr(i|j, l) • i: a target position • J: a source position • l: the target length
  • 7. Searching • Searching for the sentence S to maximize Pr(S)Pr(T|S) • Uses stack search – A list of partial alignment hypothesis – (Jean aime Marie| * ) • *: a place holder for unknown sequence of S – In iteration, extends most promising entries and adds to its hypothesis – Ends when complete alignment is the most promising significantly
  • 8. Parameter Estimation • Both LM and TM have many parameters • To estimate, needs pairs of translations • For this experiment, they used Canadian parliament’s records translated in English/French – Param. & LM/TM can be estimated from this
  • 9. Two Pilot Experiments • First experiment: to estimate params for TM • 9000 most common words in English/French from Hansard data
  • 10. Two Pilot Experiment • Second experiment: – French to English – 1,000 most frequently used English words – 1,700 most frequently used French words ( covered by the 1k English words) – Estimated 17 million parameters of translation model from 117,000 pairs of sentences – Est. bigram language model from 570,000 sentences from the English part of Hansard – Evaluation: • Exact: Decoded sentence was exactly the same • Alternate: same meaning, slightly different words • Different: not covey the same meaning as the translation • Wrong: makes a sense but not interpreted • Ungrammatical: grammatically deficient