Machine Translation Quality Estimation - A Linguist's Approach

MACHINE TRANSLATION
QUALITY ESTIMATION
A Linguist’s Approach

WHAT IS MT QUALITY ESTIMATION?
Automatically providing a quality indicator for machine
translation output without depending on human reference
translations.
Our objective:
Estimate quality and post-editing effort for eBay listing titles
and descriptions
MT QUALITY ESTIMATION – A LINGUIST’S APPROACH 2

ONE big CHALLENGE
min W Ʃ T
t=1 ||(W(t)X(t) − Y (t) )||2 2 + λs||S||1 + λb||B||1,∞ subject to: W = S + B
or
“State-of-the-art QE explores different supervised linear or non-linear learning methods
for regression or classification such as Support Vector Machines (SVM), different types
of Decision Trees, Neural Networks, Elastic-Net, Gaussian Processes, Naive Bayes,
among others”
(Machine Translation Quality Estimation Across Domains, de Souza et al, 2014)

A LINGUIST’S APPROACH
Using linguistic features from 3 dimensions:
COMPLEXITY ADEQUACY FLUENCY

FEATURES
Complexity:
• Length
• Polysemy
Adequacy:
• QA
 Terminology
 Patterns
 Blacklist
 Numbers
• Automated
Post-Editing
• (POS)
• (NER)
Fluency:
• Misspellings
• Grammar errors

IMPLEMENTATION
Checkmate+LanguageTool
Reusable Profile
Detailed Report
Score

TESTING
• One Language (es-LA)
• Short samples (~300 words)
• Bigger samples (~1000 words)
• Post-Edited files (~50,000 words)
• pt-BR, ru-RU, zh-CN

MEASURING RESULTS

SAMPLES - SCORE AND TIME ALIGN

FILES - SCORE AND ED ALIGN
Average ED (es-LA, descriptions) = 72

MT QE OVER TIME

SAMPLES - OTHER LANGUAGES

CHALLENGES
• False positives
• Matching score and post-editing effort
• Same weight for all features

WHAT’S NEXT
• Tracking scores over time
• Adding scores to our post-editing tool
• Adding new languages
• Researching new features

HOW CAN YOU USE THIS?
• Tailor the model to your needs
• Estimate quality at the file/segment level
• Target post-editing, discard bad content
• Estimate post-editing effort/time
• Compare MT systems
• Monitor MT system progress

Q&A
THANK YOU! jrowda@ebay.com

Machine Translation Quality Estimation - A Linguist's Approach

More Related Content

Viewers also liked (15)

Similar to Machine Translation Quality Estimation - A Linguist's Approach (20)

Recently uploaded (20)

Machine Translation Quality Estimation - A Linguist's Approach

Editor's Notes