SlideShare a Scribd company logo
ErrorAnalysisofRule-basedMachineTranslationOutput
A CaseStudyonEnglish– PersianMTSystems
‫ترجمه؟‬ ‫دستگاه‬‫خروجی‬‫بر‬‫مبتنی‬ ‫حکومت‬‫از‬‫خطا‬‫تحلیل‬ ‫و‬‫تجزیه‬
‫انگلیسی‬‫زبان‬ ‫در‬ ‫موردی‬ ‫مطالعه‬-‫های‬‫سیستم‬ ‫ارسی‬‫ف‬MT
ZahraPourniksefat
IslamicAzadUniversity– Science& ResearchBranch
Error Analysis of Rule-based Machine Translation Outputs
Agenda
Introduction
 Machine Translation Overview
 Evaluation of MT systems
Methods & Materials
Error Categories & Description
Results & Discussion
Machine Translation Overview
Definition: The term Machine Translation (MT) is used for translating text or
speech from one natural language to another by using computers and software.
• Systran: MT is much faster than human translators because it is much cheaper
and has a better memory than human translators.
• Shahba 2002 believed that “It’s better to spend our time on the actual act of
translation rather than typing the English text or scanning it for the MT to
translate. Efforts in MT are by themselves valuable as they at least satisfy one of
the needs of human beings: need for innovation and discovery”
• MT is more economic on time and money, but it is less accurate than human
translators (Frederking, 2004).
Why MT matters?
According to Hatim and Munday it’s an important topic - socially, politically, commercially,
scientifically, and intellectually or philosophically (2004)
• The social or political importance of MT arises from the socio- political importance of translation in
communities where more than one language is generally spoken. So translation is necessary for
communication- for ordinary human interaction, and for gathering the information one needs to play
a full part in society.
• The commercial importance of MT is a result of related factors. First, translation itself is
commercially important. Second, translation is expensive.
• Scientifically, MT is interesting, because it is an obvious application and testing ground for many
ideas in Computer Science, Artificial Intelligence, and Linguistics.
• Philosophically , MT is interesting, because it represents an attempt to automate an activity that can
require the full range of human knowledge.
Some Misconceptions about MT
MT is a waste of time because you will never make a machine that can translate
Shakespeare. This criticism that MT systems cannot translate Shakespeare is a bit like
the criticism of industrial robots for not being able to dance.(Hatim and Munday, 2004)
• First, translating literature requires special literary skills – it is not the kind of
thing that the average professional translators normally attempt
• Second, literary translation is a small proportion of the translation that has to be
done.
• Finally, one may wonder who would ever want to translate Shakespeare by
machine – it’s a job that human translators find challenging and rewarding, and
it’s not a job that MT systems have been designed for.
Approaches to MT
• Direct Machine Translation Approach
The first developed MT systems where a word–for–word translation from the source language to the target
language is performed.
• Transfer Machine Translation Approach
1. The analysis stage that is the direct strategy which takes benefits of a dictionary in source language to
demonstrate the source language from linguistic point of view.
2. The transfer stage varies the outputs of the analysis stage to produce structural and linguistic equivalents
between the two languages.
3. The generation stage is the third stage in which a target language dictionary is applied to result the target
language document on the basis of linguistic information. (Steiner, 1988)
• Interlingua Machine Translation Approach
 First the source text meaning is decoded
 Second the resulted meaning is re-encoded in the target language
Approaches to MT cont’d.
• Rule-based Machine Translation Approach
It operates on the linguistic data on source and target languages fundamentally
taken from bilingual dictionaries and the basic semantic, morphological, and
syntactic grammar of the individual language (Gelbukh, 2011).
Minimally, to get a Persian translation of English sentence one needs:
1. A dictionary that will map each English word to an appropriate Persian
word.
2. Rules representing regular English sentence structure
3. Rules representing regular Persian sentence structure
4. And finally, we need rules according to which one can relate these two
structures together.
Approaches to MT cont’d.
• Statistical Machine Translation Approach
This system uses a corpus or database as a translated example for analyzing and decoding
source language. In comparison with the machine translation of about three decades ago,
Google Translate as an example of more contemporary automated engine for the task of
translation has taken a giant leap. However, it is still too imperfect. (Nierenberg, 1998)
• Hybrid Machine Translation Approach
1. Rules post-processed by statistics in which translation are practiced on the pivot of
rule-based engine. Next statistics are applied to correct the output.
2. Statistics guided by rules in which rules have an important role to pre-process date
to quite the statistical representation to normalize. This approach is powerful,
flexible and under more control when it’s translating.
Evaluation of MT Systems
• Human translation assessment (Secară 2005; Williams 2001) has been
moving from microtextual, word- or sentence-level error analysis methods
toward more macrotextual methods focused on the function, purpose and
effect of the text.At the same time, machine translation assessment has
mainly been microtextual and focused on the aspects of accuracy and
fluency.
• Hovy (2002) discussed the complexity of MT evaluation, and stressed the
importance of adjusting evaluation to the purpose and context of the
translation.
Evaluation of MT Systems cont’d.
Mary A. Flangan Believed that Machine translation quality can be difficult to
quantify for a number of reasons:
1) A text can have several different translations, all of which are correct.
2) Defining the boundaries of errors in MT output is often difficult. Errors sometimes
involve only single words, but more often involve phrases, discontinuous
expressions, word order or relationships across sentence boundaries. Therefore,
simply counting the number of wrong words in the translation is not meaningful.
3) One error can lead to another. For example, if the part of speech of a word is
identified incorrectly by the MT software, the entire analysis of the sentence may be
affected, creating a chain of errors.
4) The cause of errors in MT output is not always apparent. The evaluator usually does
not have access to a trace of the software's tests and actions. Thus it can be difficult
to identify what went wrong in the translation of a sentence.
Evaluation of MT Systems cont’d.
Types of Evaluation
 Automatic Evaluation
the Word Error Rate (WER), the Position independent word Error Rate (PER), the
BLEU (Papineni et al., 2002) and the NIST (Doddington, 2002) where the MT
output is compared to one or more human reference translations.
 Human Evaluation
Due to the complexity of natural language, manual evaluation seems more reliable
1. Three passages were selected and translated by Rule-based MT Systems
and compared with one Statistical MT System and Human translator
2. Error categories were derived after the analysis of each text
Methods & Materials
• Three passages were translated by two different MT systems and also a human
translator.
• From each text type a passage of approximately 400 words was taken from story,
user guide and magazine.
• The rule-based MT – Arya TM– system was designed based on thousands of lexical
and grammatical rules.
• The statistical system, Google Translate by Google Inc., is based on the use of large
monolingual and parallel corpora for translation.
• The unit of analysis was set to a sentence level because it’s the largest unit which
can be easily recognized in MT systems and ST sentence can be clearly
corresponded to its TT pairs.
Table of Source Text Passages for Analysis
Number of Words Number of Sentences
Short Story
The Lottery 398 13
User Guide
Microsoft Access 2012 390 16
Magazine
Academic article 415 15
Errors Category
Syntactic
Word
Order
Missing
Words
Punctuation
Parts
of
Speech
Conjugation
Unknown
Words
Semantic
Incorrect
Words
Polysemy
Idiomatic
Expressions
• For English-to-Persian Rule- based MT systems the following categories were
derived
Error Categories & Descriptions
Error Categories & Descriptions cont’d.
Description of Error Categories:
• Syntactic Errors: Those errors that are related to the grammar of the language such as parts
of speech or conjugation
Word order that means sentence elements ordered incorrectly
Example: Commands generally take the form of buttons and lists. (User Guide)
Missing words: incorrect elision of some words
Example: This requires better data collection and analysis tools for studying outcomes and
consistent use of these tools across individual studies. (Magazine)
Arya Translation System ‫ها‬ ‫فهرست‬ ‫و‬ ‫گیرد‬ ‫می‬ ‫شاگرد‬ ‫فرم‬ ‫کلی‬ ‫بطور‬ ‫ها‬ ‫دستور‬.
Google Translate ‫لیست‬ ‫و‬ ‫ها‬ ‫دکمه‬ ‫شکل‬ ‫به‬ ‫کلی‬ ‫طور‬ ‫به‬ ‫دستورات‬.
Arya Translation System ‫می‬ ‫مطالعه‬ ‫برای‬ ‫تحلیل‬ ‫ها‬ ‫ابزار‬ ‫و‬ ‫های‬ ‫نیاز‬ ‫بهتر‬ ‫اطالعات‬ ‫مجموعه‬ ‫این‬
‫کند‬ ‫می‬ ‫استفاده‬ ‫سازگار‬ ‫و‬ ‫ها‬ ‫حاصل‬ ‫کن‬.
Google Translate ‫و‬ ‫تجزیه‬ ‫ابزار‬ ‫از‬ ‫استفاده‬ ‫با‬ ‫و‬ ‫بهتر‬ ‫ها‬ ‫داده‬ ‫آوری‬ ‫جمع‬ ‫مستلزم‬ ‫امر‬ ‫این‬
‫سراسر‬ ‫در‬ ‫ابزار‬ ‫این‬ ‫از‬ ‫مداوم‬ ‫استفاده‬ ‫و‬ ‫نتایج‬ ‫بررسی‬ ‫برای‬ ‫تحلیل‬‫مطالعات‬
‫فردی‬.
Error Categories & Descriptions cont’d.
Unknown words: word not in a dictionary
Example: The women, wearing faded house dresses and sweaters, came shortly after their
menfolk.( Story)
Punctuation: incorrect punctuation
Example: The children assembled first, of course. (Story)
Arya Translation System ‫ها‬ ‫زن‬,‫پس‬ ‫زودی‬ ‫به‬ ، ‫کردند‬ ‫محو‬ ‫ها‬ ‫ژاکت‬ ‫و‬ ‫خانه‬ ‫ها‬ ‫لباس‬ ‫کننده‬ ‫خسته‬
‫از‬menfolk‫شان‬‫آمدند‬
Google Translate ‫از‬ ‫پس‬ ‫کوتاهی‬ ‫مدت‬ ‫آمد‬ ‫در‬ ،‫خانه‬ ‫پژمرده‬ ‫ژاکت‬ ‫و‬ ‫لباس‬ ‫پوشیدن‬ ،‫زنان‬
menfolk‫را‬ ‫خود‬.
Arya Translations ‫اول‬ ‫ها‬ ‫بچه‬,‫کردند‬ ‫جمع‬ ‫البته‬.
Google Translations ‫البته‬ ،‫اول‬ ‫مونتاژ‬ ‫کودکان‬.
Error Categories & Descriptions cont’d.
Parts of speech: errors in identifying pars of speech such as noun or verb
Example: If you decrease the width of the ribbon, small button labels disappear. (User Guide)
Conjugation: incorrectly formed verb or wrong tense
Example: Soon the women, standing by their husbands, began to call to their children, and the
children came reluctantly, having to be called four or five times.
Arya Translation System ‫زند‬ ‫می‬ ‫برچسب‬ ‫ناپدید‬ ‫کوچک‬ ‫دکمه‬ ، ‫کاهشبیابید‬ ‫نوار‬ ‫پهنا‬ ‫شما‬ ‫اگر‬
Google Translate ‫شون‬ ‫می‬ ‫ناپدید‬ ‫کوچک‬ ‫دکمه‬ ‫ها‬ ‫برچسب‬ ،‫دهد‬ ‫کاهش‬ ‫را‬ ‫شما‬ ‫نوار‬ ‫عرض‬ ‫اگر‬‫د‬
Arya Translations ‫ها‬ ‫زن‬ ‫بزودی‬,‫شان‬ ‫های‬ ‫شوهر‬ ‫کن‬ ‫می‬ ‫حمایت‬,‫ها‬ ‫بچه‬ ‫به‬ ‫صدا‬ ‫کردن‬ ‫شروع‬ ‫به‬
‫شان‬,‫دوره‬ ‫پنج‬ ‫یا‬ ‫چهار‬ ‫زده‬ ‫صدا‬ ‫اشد‬ ‫که‬ ‫دارد‬ ‫می‬ ، ‫آمدند‬ ‫اکراه‬ ‫با‬ ‫ها‬ ‫بچه‬ ‫و‬.
Google Translations ‫ب‬ ‫ها‬ ‫بچه‬ ‫و‬ ،‫خود‬ ‫فرزندان‬ ‫به‬ ‫تماس‬ ‫به‬ ‫شروع‬ ،‫ایستاده‬ ‫خود‬ ‫شوهران‬ ،‫زنان‬ ‫زودی‬ ‫به‬‫ه‬
‫بار‬ ‫پنج‬ ‫یا‬ ‫چهار‬ ‫نام‬ ‫به‬ ،‫اکراه‬.
Error Categories & Descriptions cont’d.
• Semantic Errors: Those errors that are related to the meaning such as incorrect meaning
of words or expressions which caused the incorrect meaning of the whole sentence.
Incorrect word: completely incorrect meaning
Polysemy: incorrect selection of the meaning of the words with more than one meaning
Example: The people of the village began to gather in the square, between the post office and the bank,
around ten o'clock.
Style and idiomatic expression : incorrect translation of multi-word expression
Example: They greeted one another and exchanged bits of gossip as they went to join their husbands.
Arya Translations ‫و‬ ‫که‬ ‫رفتند‬ ‫آنها‬ ‫کردند‬ ‫معاوضه‬ ‫غیبت‬ ‫ذره‬ ‫و‬ ‫همدیگر‬ ‫سالم‬ ‫آنها‬‫صل‬
‫شان‬ ‫های‬ ‫شوهر‬ ‫کنند‬.
Google Translations ‫اساس‬ ‫بی‬ ‫شایعات‬ ‫از‬ ‫بیت‬ ‫بدل‬ ‫و‬ ‫رد‬ ‫و‬ ‫یکدیگر‬ ‫استقبال‬ ‫آنها‬‫به‬ ‫را‬
‫رفت‬ ‫خود‬ ‫شوهر‬ ‫به‬ ‫پیوستن‬ ‫برای‬ ‫را‬ ‫آنها‬ ‫عنوان‬.
Arya Translations ‫مربع‬ ‫در‬ ‫شوند‬ ‫جمع‬ ‫که‬ ‫کردند‬ ‫شروع‬ ‫روستا‬ ‫مردم‬,‫در‬
‫ساعت‬ ‫َه‬‫د‬ ‫حدود‬ ، ‫بانک‬ ‫و‬ ‫پستخانه‬ ‫میان‬
Google Translations ‫اداره‬ ‫بین‬ ،‫آوری‬ ‫جمع‬ ‫به‬ ‫شروع‬ ‫میدان‬ ‫در‬ ‫روستا‬ ‫مردم‬
‫ده‬ ‫حدود‬ ‫ساعت‬ ،‫بانک‬ ‫و‬ ‫پست‬.
Results & Discussions
RBMT
SMT
Human
Word Order Missing Words Unknown
Words
Punctuation Parts of Speech Conjugation
Story 12 6 3 12 8 9
User Guide 17 5 1 10 5 13
Magazine 14 4 10 7 15
Story 11 7 3 12 7 8
User Guide 17 5 1 9 7 11
Magazine 15 2 5 9 6 14
Story 1 2 0 3 1 3
User Guide 0 0 2 1 0 1
Magazine 2 1 1 1 2 2
Syntactic Category
Word Order
Missing
Words
Unknown
Words Punctuation
Parts of
Speech Conjugation
TableofSyntacticErrors
RBMT
SMT
Human
Incorrect Lexicon Polysemy Idiomatic Expression
Story 10 7 12
User Guide 7 9 5
Magazine 7 11 9
Story 8 8 9
User Guide 3 7 3
Magazine 5 13 8
Story 0 0 2
User Guide 1 0 0
Magazine 0 1 1
Semantic Category
Incorrect Lexicon Polysemy Idiomatic Expression
TableofSemanticErrors
Results & Discussions contd.
Results & Discussions cont’d.
• Both systems made the least errors with the simpler sentences and the most ones with the
compound- complex sentences, as well as lexically or structurally ambiguous texts. This is
because ambiguous source texts with different contents can correspond with more than one
representation.
• For the rule-based system, the most typical errors are in conjugation, word order and also in
rendering polysemous words and idiomatic expressions. For the statistical system the most
common error is in conjugating and determining the tense. However, it has also some
problems in translating words with multiple meaning and idiomatic expression.
• To see whether machine translation accuracy is affected by text-type three different genres
were analyzed thoroughly. And for the different text types, the rule- based system had
similar amounts of syntactic and semantic errors in each text.
Future!
• Evaluating MT quality is necessarily a subjective process because it involves
human judgments.
• Determining the best category for an error in MT output is not easy because we
have to place them on how they are realized rather than the cause of errors and
many machine translated sentences contained multiple, linked errors.
• Future work will therefore be focused on the cause of errors and ranking error
categories. The error categories presented here is flexible, allowing for the
deletion or addition of more categories.
Error Analysis of Rule-based Machine Translation Outputs

More Related Content

PPTX
Volumetric modulated Arc-Therapy
PPTX
Mga Angkop na Salita sa Pagbuo ng Tula.pptx
PPTX
LESSON 5 APPRAISE THE UNITY OF PLOT, SETTING AND CHARACTERIZATION
PPTX
Prof.Dr. Mustafa Esassolak Radyasyon Onkolojisi Uzmanı Baş Boyun Kanserlerind...
PDF
DepEd k12 English 7 fourth quarter module 2
PPTX
Multi lingual corpus for machine aided translation
PPTX
Traductores de Nicaragua (505)2289-4596
PDF
6. Khalil Sima'an (UVA) Statistical Machine Translation
Volumetric modulated Arc-Therapy
Mga Angkop na Salita sa Pagbuo ng Tula.pptx
LESSON 5 APPRAISE THE UNITY OF PLOT, SETTING AND CHARACTERIZATION
Prof.Dr. Mustafa Esassolak Radyasyon Onkolojisi Uzmanı Baş Boyun Kanserlerind...
DepEd k12 English 7 fourth quarter module 2
Multi lingual corpus for machine aided translation
Traductores de Nicaragua (505)2289-4596
6. Khalil Sima'an (UVA) Statistical Machine Translation

Viewers also liked (13)

PPTX
A review on Google Translator
PDF
Human vs machine translation
PPTX
Google translator
PPT
Mixing Computer-Assisted Translation and Machine Translation
PPTX
Machine translation vs human translation
PPTX
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
PPT
Machine Translation And Computer Assisted Translation
PPS
Google Translate Update
PPTX
Google translate
PPTX
Machine Translation=Google Translator
PPTX
Machine translation
PPTX
Machine Translation
PPTX
Machine Translation: What it is?
A review on Google Translator
Human vs machine translation
Google translator
Mixing Computer-Assisted Translation and Machine Translation
Machine translation vs human translation
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Machine Translation And Computer Assisted Translation
Google Translate Update
Google translate
Machine Translation=Google Translator
Machine translation
Machine Translation
Machine Translation: What it is?
Ad

Similar to Error Analysis of Rule-based Machine Translation Outputs (20)

PPT
mt_cat_presentations CAT TRANSLATION PPT
PDF
Machine Translation Approaches and Design Aspects
PPT
Arabic MT Project
PDF
Survey on Indian CLIR and MT systems in Marathi Language
PDF
A Novel Approach for Rule Based Translation of English to Marathi
PDF
A Novel Approach for Rule Based Translation of English to Marathi
PDF
A Novel Approach for Rule Based Translation of English to Marathi
PDF
A Novel Approach for Rule Based Translation of English to Marathi
PDF
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
PDF
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
PDF
Lepor: augmented automatic MT evaluation metric
PDF
Unsupervised Quality Estimation Model for English to German Translation and I...
PDF
Ac04507168175
PDF
Integration of speech recognition with computer assisted translation
DOCX
THESIS PROPOSAL
PDF
INTRODUCTION TO Natural language processing
PDF
LLM.pdf
PDF
Natural Language Processing, Techniques, Current Trends and Applications in I...
PDF
MT(1).pdf
 
PDF
Design and Development of a Malayalam to English Translator- A Transfer Based...
mt_cat_presentations CAT TRANSLATION PPT
Machine Translation Approaches and Design Aspects
Arabic MT Project
Survey on Indian CLIR and MT systems in Marathi Language
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathi
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
Lepor: augmented automatic MT evaluation metric
Unsupervised Quality Estimation Model for English to German Translation and I...
Ac04507168175
Integration of speech recognition with computer assisted translation
THESIS PROPOSAL
INTRODUCTION TO Natural language processing
LLM.pdf
Natural Language Processing, Techniques, Current Trends and Applications in I...
MT(1).pdf
 
Design and Development of a Malayalam to English Translator- A Transfer Based...
Ad

Error Analysis of Rule-based Machine Translation Outputs

  • 1. ErrorAnalysisofRule-basedMachineTranslationOutput A CaseStudyonEnglish– PersianMTSystems ‫ترجمه؟‬ ‫دستگاه‬‫خروجی‬‫بر‬‫مبتنی‬ ‫حکومت‬‫از‬‫خطا‬‫تحلیل‬ ‫و‬‫تجزیه‬ ‫انگلیسی‬‫زبان‬ ‫در‬ ‫موردی‬ ‫مطالعه‬-‫های‬‫سیستم‬ ‫ارسی‬‫ف‬MT ZahraPourniksefat IslamicAzadUniversity– Science& ResearchBranch
  • 3. Agenda Introduction  Machine Translation Overview  Evaluation of MT systems Methods & Materials Error Categories & Description Results & Discussion
  • 4. Machine Translation Overview Definition: The term Machine Translation (MT) is used for translating text or speech from one natural language to another by using computers and software. • Systran: MT is much faster than human translators because it is much cheaper and has a better memory than human translators. • Shahba 2002 believed that “It’s better to spend our time on the actual act of translation rather than typing the English text or scanning it for the MT to translate. Efforts in MT are by themselves valuable as they at least satisfy one of the needs of human beings: need for innovation and discovery” • MT is more economic on time and money, but it is less accurate than human translators (Frederking, 2004).
  • 5. Why MT matters? According to Hatim and Munday it’s an important topic - socially, politically, commercially, scientifically, and intellectually or philosophically (2004) • The social or political importance of MT arises from the socio- political importance of translation in communities where more than one language is generally spoken. So translation is necessary for communication- for ordinary human interaction, and for gathering the information one needs to play a full part in society. • The commercial importance of MT is a result of related factors. First, translation itself is commercially important. Second, translation is expensive. • Scientifically, MT is interesting, because it is an obvious application and testing ground for many ideas in Computer Science, Artificial Intelligence, and Linguistics. • Philosophically , MT is interesting, because it represents an attempt to automate an activity that can require the full range of human knowledge.
  • 6. Some Misconceptions about MT MT is a waste of time because you will never make a machine that can translate Shakespeare. This criticism that MT systems cannot translate Shakespeare is a bit like the criticism of industrial robots for not being able to dance.(Hatim and Munday, 2004) • First, translating literature requires special literary skills – it is not the kind of thing that the average professional translators normally attempt • Second, literary translation is a small proportion of the translation that has to be done. • Finally, one may wonder who would ever want to translate Shakespeare by machine – it’s a job that human translators find challenging and rewarding, and it’s not a job that MT systems have been designed for.
  • 7. Approaches to MT • Direct Machine Translation Approach The first developed MT systems where a word–for–word translation from the source language to the target language is performed. • Transfer Machine Translation Approach 1. The analysis stage that is the direct strategy which takes benefits of a dictionary in source language to demonstrate the source language from linguistic point of view. 2. The transfer stage varies the outputs of the analysis stage to produce structural and linguistic equivalents between the two languages. 3. The generation stage is the third stage in which a target language dictionary is applied to result the target language document on the basis of linguistic information. (Steiner, 1988) • Interlingua Machine Translation Approach  First the source text meaning is decoded  Second the resulted meaning is re-encoded in the target language
  • 8. Approaches to MT cont’d. • Rule-based Machine Translation Approach It operates on the linguistic data on source and target languages fundamentally taken from bilingual dictionaries and the basic semantic, morphological, and syntactic grammar of the individual language (Gelbukh, 2011). Minimally, to get a Persian translation of English sentence one needs: 1. A dictionary that will map each English word to an appropriate Persian word. 2. Rules representing regular English sentence structure 3. Rules representing regular Persian sentence structure 4. And finally, we need rules according to which one can relate these two structures together.
  • 9. Approaches to MT cont’d. • Statistical Machine Translation Approach This system uses a corpus or database as a translated example for analyzing and decoding source language. In comparison with the machine translation of about three decades ago, Google Translate as an example of more contemporary automated engine for the task of translation has taken a giant leap. However, it is still too imperfect. (Nierenberg, 1998) • Hybrid Machine Translation Approach 1. Rules post-processed by statistics in which translation are practiced on the pivot of rule-based engine. Next statistics are applied to correct the output. 2. Statistics guided by rules in which rules have an important role to pre-process date to quite the statistical representation to normalize. This approach is powerful, flexible and under more control when it’s translating.
  • 10. Evaluation of MT Systems • Human translation assessment (Secară 2005; Williams 2001) has been moving from microtextual, word- or sentence-level error analysis methods toward more macrotextual methods focused on the function, purpose and effect of the text.At the same time, machine translation assessment has mainly been microtextual and focused on the aspects of accuracy and fluency. • Hovy (2002) discussed the complexity of MT evaluation, and stressed the importance of adjusting evaluation to the purpose and context of the translation.
  • 11. Evaluation of MT Systems cont’d. Mary A. Flangan Believed that Machine translation quality can be difficult to quantify for a number of reasons: 1) A text can have several different translations, all of which are correct. 2) Defining the boundaries of errors in MT output is often difficult. Errors sometimes involve only single words, but more often involve phrases, discontinuous expressions, word order or relationships across sentence boundaries. Therefore, simply counting the number of wrong words in the translation is not meaningful. 3) One error can lead to another. For example, if the part of speech of a word is identified incorrectly by the MT software, the entire analysis of the sentence may be affected, creating a chain of errors. 4) The cause of errors in MT output is not always apparent. The evaluator usually does not have access to a trace of the software's tests and actions. Thus it can be difficult to identify what went wrong in the translation of a sentence.
  • 12. Evaluation of MT Systems cont’d. Types of Evaluation  Automatic Evaluation the Word Error Rate (WER), the Position independent word Error Rate (PER), the BLEU (Papineni et al., 2002) and the NIST (Doddington, 2002) where the MT output is compared to one or more human reference translations.  Human Evaluation Due to the complexity of natural language, manual evaluation seems more reliable 1. Three passages were selected and translated by Rule-based MT Systems and compared with one Statistical MT System and Human translator 2. Error categories were derived after the analysis of each text
  • 13. Methods & Materials • Three passages were translated by two different MT systems and also a human translator. • From each text type a passage of approximately 400 words was taken from story, user guide and magazine. • The rule-based MT – Arya TM– system was designed based on thousands of lexical and grammatical rules. • The statistical system, Google Translate by Google Inc., is based on the use of large monolingual and parallel corpora for translation. • The unit of analysis was set to a sentence level because it’s the largest unit which can be easily recognized in MT systems and ST sentence can be clearly corresponded to its TT pairs. Table of Source Text Passages for Analysis Number of Words Number of Sentences Short Story The Lottery 398 13 User Guide Microsoft Access 2012 390 16 Magazine Academic article 415 15
  • 14. Errors Category Syntactic Word Order Missing Words Punctuation Parts of Speech Conjugation Unknown Words Semantic Incorrect Words Polysemy Idiomatic Expressions • For English-to-Persian Rule- based MT systems the following categories were derived Error Categories & Descriptions
  • 15. Error Categories & Descriptions cont’d. Description of Error Categories: • Syntactic Errors: Those errors that are related to the grammar of the language such as parts of speech or conjugation Word order that means sentence elements ordered incorrectly Example: Commands generally take the form of buttons and lists. (User Guide) Missing words: incorrect elision of some words Example: This requires better data collection and analysis tools for studying outcomes and consistent use of these tools across individual studies. (Magazine) Arya Translation System ‫ها‬ ‫فهرست‬ ‫و‬ ‫گیرد‬ ‫می‬ ‫شاگرد‬ ‫فرم‬ ‫کلی‬ ‫بطور‬ ‫ها‬ ‫دستور‬. Google Translate ‫لیست‬ ‫و‬ ‫ها‬ ‫دکمه‬ ‫شکل‬ ‫به‬ ‫کلی‬ ‫طور‬ ‫به‬ ‫دستورات‬. Arya Translation System ‫می‬ ‫مطالعه‬ ‫برای‬ ‫تحلیل‬ ‫ها‬ ‫ابزار‬ ‫و‬ ‫های‬ ‫نیاز‬ ‫بهتر‬ ‫اطالعات‬ ‫مجموعه‬ ‫این‬ ‫کند‬ ‫می‬ ‫استفاده‬ ‫سازگار‬ ‫و‬ ‫ها‬ ‫حاصل‬ ‫کن‬. Google Translate ‫و‬ ‫تجزیه‬ ‫ابزار‬ ‫از‬ ‫استفاده‬ ‫با‬ ‫و‬ ‫بهتر‬ ‫ها‬ ‫داده‬ ‫آوری‬ ‫جمع‬ ‫مستلزم‬ ‫امر‬ ‫این‬ ‫سراسر‬ ‫در‬ ‫ابزار‬ ‫این‬ ‫از‬ ‫مداوم‬ ‫استفاده‬ ‫و‬ ‫نتایج‬ ‫بررسی‬ ‫برای‬ ‫تحلیل‬‫مطالعات‬ ‫فردی‬.
  • 16. Error Categories & Descriptions cont’d. Unknown words: word not in a dictionary Example: The women, wearing faded house dresses and sweaters, came shortly after their menfolk.( Story) Punctuation: incorrect punctuation Example: The children assembled first, of course. (Story) Arya Translation System ‫ها‬ ‫زن‬,‫پس‬ ‫زودی‬ ‫به‬ ، ‫کردند‬ ‫محو‬ ‫ها‬ ‫ژاکت‬ ‫و‬ ‫خانه‬ ‫ها‬ ‫لباس‬ ‫کننده‬ ‫خسته‬ ‫از‬menfolk‫شان‬‫آمدند‬ Google Translate ‫از‬ ‫پس‬ ‫کوتاهی‬ ‫مدت‬ ‫آمد‬ ‫در‬ ،‫خانه‬ ‫پژمرده‬ ‫ژاکت‬ ‫و‬ ‫لباس‬ ‫پوشیدن‬ ،‫زنان‬ menfolk‫را‬ ‫خود‬. Arya Translations ‫اول‬ ‫ها‬ ‫بچه‬,‫کردند‬ ‫جمع‬ ‫البته‬. Google Translations ‫البته‬ ،‫اول‬ ‫مونتاژ‬ ‫کودکان‬.
  • 17. Error Categories & Descriptions cont’d. Parts of speech: errors in identifying pars of speech such as noun or verb Example: If you decrease the width of the ribbon, small button labels disappear. (User Guide) Conjugation: incorrectly formed verb or wrong tense Example: Soon the women, standing by their husbands, began to call to their children, and the children came reluctantly, having to be called four or five times. Arya Translation System ‫زند‬ ‫می‬ ‫برچسب‬ ‫ناپدید‬ ‫کوچک‬ ‫دکمه‬ ، ‫کاهشبیابید‬ ‫نوار‬ ‫پهنا‬ ‫شما‬ ‫اگر‬ Google Translate ‫شون‬ ‫می‬ ‫ناپدید‬ ‫کوچک‬ ‫دکمه‬ ‫ها‬ ‫برچسب‬ ،‫دهد‬ ‫کاهش‬ ‫را‬ ‫شما‬ ‫نوار‬ ‫عرض‬ ‫اگر‬‫د‬ Arya Translations ‫ها‬ ‫زن‬ ‫بزودی‬,‫شان‬ ‫های‬ ‫شوهر‬ ‫کن‬ ‫می‬ ‫حمایت‬,‫ها‬ ‫بچه‬ ‫به‬ ‫صدا‬ ‫کردن‬ ‫شروع‬ ‫به‬ ‫شان‬,‫دوره‬ ‫پنج‬ ‫یا‬ ‫چهار‬ ‫زده‬ ‫صدا‬ ‫اشد‬ ‫که‬ ‫دارد‬ ‫می‬ ، ‫آمدند‬ ‫اکراه‬ ‫با‬ ‫ها‬ ‫بچه‬ ‫و‬. Google Translations ‫ب‬ ‫ها‬ ‫بچه‬ ‫و‬ ،‫خود‬ ‫فرزندان‬ ‫به‬ ‫تماس‬ ‫به‬ ‫شروع‬ ،‫ایستاده‬ ‫خود‬ ‫شوهران‬ ،‫زنان‬ ‫زودی‬ ‫به‬‫ه‬ ‫بار‬ ‫پنج‬ ‫یا‬ ‫چهار‬ ‫نام‬ ‫به‬ ،‫اکراه‬.
  • 18. Error Categories & Descriptions cont’d. • Semantic Errors: Those errors that are related to the meaning such as incorrect meaning of words or expressions which caused the incorrect meaning of the whole sentence. Incorrect word: completely incorrect meaning Polysemy: incorrect selection of the meaning of the words with more than one meaning Example: The people of the village began to gather in the square, between the post office and the bank, around ten o'clock. Style and idiomatic expression : incorrect translation of multi-word expression Example: They greeted one another and exchanged bits of gossip as they went to join their husbands. Arya Translations ‫و‬ ‫که‬ ‫رفتند‬ ‫آنها‬ ‫کردند‬ ‫معاوضه‬ ‫غیبت‬ ‫ذره‬ ‫و‬ ‫همدیگر‬ ‫سالم‬ ‫آنها‬‫صل‬ ‫شان‬ ‫های‬ ‫شوهر‬ ‫کنند‬. Google Translations ‫اساس‬ ‫بی‬ ‫شایعات‬ ‫از‬ ‫بیت‬ ‫بدل‬ ‫و‬ ‫رد‬ ‫و‬ ‫یکدیگر‬ ‫استقبال‬ ‫آنها‬‫به‬ ‫را‬ ‫رفت‬ ‫خود‬ ‫شوهر‬ ‫به‬ ‫پیوستن‬ ‫برای‬ ‫را‬ ‫آنها‬ ‫عنوان‬. Arya Translations ‫مربع‬ ‫در‬ ‫شوند‬ ‫جمع‬ ‫که‬ ‫کردند‬ ‫شروع‬ ‫روستا‬ ‫مردم‬,‫در‬ ‫ساعت‬ ‫َه‬‫د‬ ‫حدود‬ ، ‫بانک‬ ‫و‬ ‫پستخانه‬ ‫میان‬ Google Translations ‫اداره‬ ‫بین‬ ،‫آوری‬ ‫جمع‬ ‫به‬ ‫شروع‬ ‫میدان‬ ‫در‬ ‫روستا‬ ‫مردم‬ ‫ده‬ ‫حدود‬ ‫ساعت‬ ،‫بانک‬ ‫و‬ ‫پست‬.
  • 19. Results & Discussions RBMT SMT Human Word Order Missing Words Unknown Words Punctuation Parts of Speech Conjugation Story 12 6 3 12 8 9 User Guide 17 5 1 10 5 13 Magazine 14 4 10 7 15 Story 11 7 3 12 7 8 User Guide 17 5 1 9 7 11 Magazine 15 2 5 9 6 14 Story 1 2 0 3 1 3 User Guide 0 0 2 1 0 1 Magazine 2 1 1 1 2 2 Syntactic Category Word Order Missing Words Unknown Words Punctuation Parts of Speech Conjugation TableofSyntacticErrors
  • 20. RBMT SMT Human Incorrect Lexicon Polysemy Idiomatic Expression Story 10 7 12 User Guide 7 9 5 Magazine 7 11 9 Story 8 8 9 User Guide 3 7 3 Magazine 5 13 8 Story 0 0 2 User Guide 1 0 0 Magazine 0 1 1 Semantic Category Incorrect Lexicon Polysemy Idiomatic Expression TableofSemanticErrors Results & Discussions contd.
  • 21. Results & Discussions cont’d. • Both systems made the least errors with the simpler sentences and the most ones with the compound- complex sentences, as well as lexically or structurally ambiguous texts. This is because ambiguous source texts with different contents can correspond with more than one representation. • For the rule-based system, the most typical errors are in conjugation, word order and also in rendering polysemous words and idiomatic expressions. For the statistical system the most common error is in conjugating and determining the tense. However, it has also some problems in translating words with multiple meaning and idiomatic expression. • To see whether machine translation accuracy is affected by text-type three different genres were analyzed thoroughly. And for the different text types, the rule- based system had similar amounts of syntactic and semantic errors in each text.
  • 22. Future! • Evaluating MT quality is necessarily a subjective process because it involves human judgments. • Determining the best category for an error in MT output is not easy because we have to place them on how they are realized rather than the cause of errors and many machine translated sentences contained multiple, linked errors. • Future work will therefore be focused on the cause of errors and ranking error categories. The error categories presented here is flexible, allowing for the deletion or addition of more categories.