SlideShare a Scribd company logo
Wanted: Best Practices for Collaborative
             Translation
                  Alain Désilets
       National Research Council of Canada
          alain.desilets@nrc-cnrc.gc.ca


               With support from:
A (Very) Brief History
                                       of Collaborative
                                       Translation
Circa 2005: “Wikis, what's that?”

Circa 2006: “I know about Wikipedia, but I hear it’s garbage because
   anyone can write anything on it.”

Circa 2007: “You know, I have been to Wikipedia a couple of times and
   was pleasantly surprised by the quality of what I found there.”

Circa 2008: “Actually, this wiki stuff is really interesting. Now, I routinely
   use Wikipedia in my work, and although I am cautious with it, I find it
   useful.

   I have a sense that this wiki/collaborative stuff will have a wider impact
   for translation, but I’m not quite sure how.”
A (Very) Brief History
                                       of Collaborative
                                       Translation (2)

Circa 2009: “Whoa! Translation Crowdsourcing is going to put me out of a
   job!”



Circa 2010: “Well, I guess that was a storm in a teacup. Crowdsourcing
   will be used in some specific and limited contexts, but it won't take
   over. Maybe this is more an opportunity for us than a threat…”



Circa 2011: “Hum… getting this collaborative translation stuff to work
   right is hard and confusing.”
Talk Outline


The different “flavors” of Collaborative Translation

Common issues and Challenges in Collaborative
 Translation

Capturing Collaborative Translation best-practices
 in the form of Design Patterns
The different flavors of
Collaborative Translation
Collaborative
Translation
Definition




Collaborative Translation is the use
 of any open online collaboration
 technology or process, in order to
 help with translation tasks, or tasks
 related to translation (ex:
 terminology).
Available in the
                         following flavors…


•   Translation crowdsourcing
•   Collaborative terminology resources
•   Translation memory sharing
•   Online marketplaces for translators
•   Agile translation teamware
•   Post-editing by the crowd
Translation
                                           Crowdsourcing
Mechanical-Turk-like systems to support the translation of content
  by large crowds of mostly amateurs, through an open-call
  process.

By far the most talked about collaborative translation approach


•   Software user interface (Facebook, Adobe, Symantec, Firefox)
•   Technical documentation (Adobe, Symantec)
•   Transcripts of videos of an “inspirational” nature (TED Talks, Adobe TV)
•   Humanitarian aid content (Translators without Borders, Kiva.org, Haiti
    Earthquake Mission)
•   Large scale collection of linguistic data for research purposes or machine
    translation training (NAACL Workshop on Crowdsourcing).
Collaborative
                                  terminology resources

Wikipedia-like platforms for the creation and maintenance of large
  terminology resources by a crowd of translators, terminologists,
  domain experts, and even general members of the public.

•   Wikipedia
•   Wiktionary
•   ProZ’s Kudoz forum
•   Urban Dictionary
•   TermWiki.com
•   TikiWiki
•   Reverso dictionary
Translation memory
                                     sharing

Platforms for large scale pooling and sharing of multilingual
   parallel corpora between organizations and individuals.

•   TAUS Data Association
•   MyMemory
•   Google Translator Toolkit
•   WeBiText


Often, collaboration is “implicit”, for example, in the case of
   WeBiText.
Online marketplaces
                                  for translators

eBay-like disintermediated environments for connecting customers
  and translators directly, with minimal intervention by a middle
  man.

• ProZ.com
• TranslatorsCafe
• Translated.net

Collaborative aspects comes from things like “open call sourcing”
  and reputation management based on community assessment.
Agile translation
                                   teamware
Wiki-like systems and processes that allow multidisciplinary teams
  of professionals (translators, terminologists, domain experts,
  revisers, managers) to collaborate on large translation projects,
  using an agile, grassroots, parallelized process instead of the
  more top-down, assembly-line approach found in most
  translation workflow systems.

No specific software or site, but many case studies describing how
  to implement this approach, using general purpose
  collaboration tools like wikis, BaseCamp.
• Beninatto & De Palma, 2008,
• Calvert, 2008
• Yahaya, 2008

Some translation workflow systems starting to market themselves
  as being “collaborative”
Post-editing by the
                                 crowd

Systems allowing a large crowd of mostly amateurs to correct the
  output of machine translations systems, often with the aim of
  improving the system’s accuracy.




• Asia Online’s Wikipedia translation project
• Google Translate allows anonymous users to correct the
  outputs produced by the systems
• Likewise for Microsoft’s Bing Translator
Is this REALLY New?



Weren’t Terminology Databases, Translation Memories and
  Translation Workflow Systems already collaborative?
• Yes, but…
• … Collaborative Translation is about using these kinds of
  groupware technologies in the context of much larger groups or
  communities, where people have fewer reasons to trust each
  other a-priori.

It’s one thing to open yourself to collaboration with colleagues and
    customers.

It’s quite another thing to open yourself to the whole world.
Common issues and Challenges
in Collaborative Translation
This is NOT easy


Choosing a flavor and tailoring it to your needs is still somewhat of
  a black art, guided by trial and error.

There are lots of important and poorly understood issues that
  arise, many of which are common to most flavors:
• Alignment with business goals
• Quality control
• Crowd motivation
• Proper role of professionals
Alignment with
                                   business goals

Why are you doing this in the first place? Which flavor can deliver
 what you want?

The actual benefit you get from a given flavor is not necessarily
  what you think!

Translation crowdsourcing
• Reduce cost? – Yes, but not the biggest benefit.
• Decrease lead time? – Definitely.
• Translation more in-tune with target audience’s idiosyncrasies?
  -- Also
• Most importantly: Increase brand loyalty by engaging end-users
  as co-creators of products, instead of passive consumers.
Quality Control


How to control quality when you open yourself to contributions
  from a potential large group of “outsiders”?

Many ways:
• Screen contributors before letting them in (ex: Translators
  Without Border, Kiva.org).
• Have members of the community vote on the quality of each
  other’s work (ex: Facebook, Translated.net).
• Have in-house professionals revise the work done by the
  community (ex: Facebook).
Quality Control (2)


Do not assume that quality of community-produced content will be
  lower.

For instance, Wikipedia provably measures up to professionally
  produced encyclopedia like Britannica (English) and Brockhaus
  (German).

Quality issues tend to iron themselves out provided that you attract
  a sufficient large number of the right people

Wisdom of crowd effects works surprisingly well when the
  following conditions are met:
• Diversity
• Independence
• Aggregation
Crowd motivation


If you are to attract and retain enough of the “right people” you
    need to understand why thy might contribute.
•   Mandated by management (ex: Agile Translation teamware)
•   Emotional bond with the content (ex: Facebook, and surprisingly, Adobe)
•   Prestige of the content (ex: TEDTalks)
•   Wanting to do good (ex: Translators Without Border, Kiva, Haiti Earthquake
    Mission, Data collection for scientific research)
•   Pride in one’s native language (ex: Data collection for R&D in MT for small
    density languages)
•   Trying to perfect second language skills
•   Trying to make a go at professional translation career (ex: Kiva.org)
•   And in some cases, $$$
     – Will this be the dominant scenario?
     – How to set compensation high enough to attract good contributors, but not so
       high that it interferes with more intrinsic motivations, or attracts people out to
       game the system.
Role of professionals


Some flavors of CT are designed specifically for professionals (ex: Agile
  translation teamware, Online translator marketplaces).

But some (e.g. Translation crowdsourcing), tend to de-emphasize their role.

When should professionals be involved, and what should be their role?
• Revise work done by amateurs?
   – Focus on more challenging aspects of translation like terminology,
     style, fluidity?
• Manage and coach the crowd?
• Focus on more mission-critical and hard to translate content?

Translation Crowdsourcing may actually increase the size of the pie, by
   making it possible to tackle content and/or small languages that would
   otherwise not have been dealt with anyway.
Capturing Collaborative
Translation best-practices in the
form of Design Patterns
Wanted: Best-
                                  Practices

Collaborative Translations presents practitioners with a varied and
  complex envelopes of different approaches and technologies.

Selecting a flavor and tuning it to meet your needs is complex.

We need some sort of concise, easy-to-consult repository of best-
 practices for that field.

We propose a way to collaboratively create such a repository a
 community, in the form of a design patterns language.
collaborative-translation-
patterns.com
About Design Patterns




A format for describing a common
   solution to a common problem in
   a given field

Originally used in Architecture, but
   since then adopted in other fields
   such as Software Engineering,
   Education, etc.
Design Patterns
                                            Example
                    Publish Contributions Rapidly

Context
    This pattern is useful for motivating contributors in any collaborative
      translation context, but it is particularly useful in translation crowdsourcing
      scenarios.

Problem
    Contributors are often motivated by a desire to have a positive impact on the
      community they are participating in. However, they cannot achieve this
      sense of being useful, if their contributions do not become available to the
      rest of the community in a reasonable amount of time.

Solution
    Therefore, minimize the delay between the moment when a member of the
       community contributes to the site, and the moment where it becomes
       publicly available to the rest of the community. Ideally, the contribution
       should become visible to the rest of the community as soon as the user
       clicks on the Save button.
Design Patterns
                                           Example (2)

Related patterns
    – Point System is another way for a contributor to get a sense of how useful he
      has been to the community.
    – Campaign Progress Gauge is another practice which allows members of the
      community to see the positive impact of their actions. The main difference is
      that it operates more at a community/project level rather than at a
      individual/contribution level.

Real-life examples
    – At Facebook, translations become available in a matter of hours.
    – In the context of software localization by the crowd, Adobe makes a conscious
      effort to wrap the community's translations into every new releases of the
      product.
TAUS Roundtable on
                                   Collaborative
                                   Translation
Wiki “Barn Raising” workshop held on October 12th, 2011 at
  Localization World in Santa Clara.


12 practitioners
• One third with hands on experience of CT (NRC, Adobe,
  Symantec, Kiva, World Wide Lexicon)
• Two thirds with no experience, but a strong interest in trying it
  (In Every Language, MemSource, Firma 8, SPIL Games)

Talks by the experienced users about what worked and didn’t.

Followed by brainstorming of what the recurring best-practices
  seem to be.
The Best-Practices


End result:
=> 50+ best practices organized into 6 themes

Planning and Scoping
  Translation as User Engagement, Align Stakeholder Expectations, Early and
  Continuous Clarification of Translator Expectations, Backup Plan, Project,
  Check Points, Appoint Initial Community Manager, Clear Objectives, Identify
  Compatible Content

Community Motivation
  Campaign Progress Gauge, Contributor Recognition, Leader Board, Official
  Certificate, Point System, Offer Double Points, Hand-Out Unique Branded,
  Products, Contributor of the Month, Grant Special Access Rights, Playful
  Casual Translation, Campaign, Publish Contributions Rapidly, Playful
  Competition Among Contributors
The Best-Practices (2)


Quality
  Content-Specific Testing, Entry Exam, Peer Review, Automatic Reputation
  Management, Random Spot-Checking, Revision Crowdsourcing, Users as
  Translators, Voting, Transparent Quality Level, Publish then Revise

Contributor Career Path
  Flexible Contributor Career Path, Lurker to Contributor Transition, Anonymous
  Translation, Find the Leaders, Support Variable Levels of Involvement,
  Community Manager, Content Prioritizer

Right Sizing
  Appropriate Chunk Size, Community-Appropriate Project Size, Break Up
  Crowd Into Teams, Require Minimal Involvement Level, Keep the Crowd
  Small, Volunteer Team Leaders
The Best-Practices (3)




Tools and Processes
  Hint at Content Priority, First In, First Out, Task Self Selection, Layered
  Fallbacks, Official Linguistic Resources, Automatic Suggestions, Provide,
  Context, In-Place Translation, Community Forum, Analytics for Content
  Prioritization, Simplicity First, Good Examples of Contributions, Encourage
  Self, Set Deadlines
Some Observations


The bulk of practices relate to Translation Crowdsourcing.
=> We need to spend more time capturing practices for other
  flavors of Collaborative Translation

The bulk of the practices so far are not specific to translation.
• They would be useful in the context of crowdsourcing efforts in
  any domain.
• Maybe all we need is to codify and/or learn about the best
  practices for crowdsourcing in general?

The more similar two organizations are, the more similar their
  practices will be (ex: Kiva and TWB, versus Kiva and Adobe).
Conclusion
Conclusion

Collaborative Translation presents practitioners with a very large
  and varied set of tools and processes.

Choosing a particular flavor of CT and tailoring to meet one’s
  needs can be a daunting task.

We need a concise, easy to consult, modular compendium of
 current best practices in that area.

We have started building such a compendium in the form of a wiki
 site (www.collaborative-translation-patterns.com) which
 captures best practices in the form of design patterns.

We invite every one in this room to contribute to it if they can.
Thank You!
Wanted: Best Practices for Collaborative Translation

More Related Content

PPT
Types of translation
PPTX
Relationship between language, culture, and identity
PPTX
Cooperative principle / grice maxims
PPTX
Macaulay
PPTX
Theory of meaning by Ogden and Richards
PPT
Gender and language
PPTX
Corpus linguistics
PPTX
Origins of language
Types of translation
Relationship between language, culture, and identity
Cooperative principle / grice maxims
Macaulay
Theory of meaning by Ogden and Richards
Gender and language
Corpus linguistics
Origins of language

What's hot (20)

PPTX
Text analysis
PPSX
English as a global language
PPTX
Multilingualism and bilingualism
PPTX
Discourse Analysis ppt
PPTX
Translation and culture
PPTX
Language and gender
PPTX
Communicative and semantic translation
PPTX
Listening to the news
PPTX
Sapir Whorf hypothesis
PPTX
SAPIRWHORF HYPOTHEIS
PPTX
DOC
Linguistic And Social Inequality
PPTX
"Linguistics is a Science"
PPTX
Linguistic conflict
PPTX
Historicism- The school of thought
PPTX
Translation studies
PPTX
Foreignization & domestication
PPTX
Translation methods
PPTX
Lingua francas
PPTX
Translation loss and gain
Text analysis
English as a global language
Multilingualism and bilingualism
Discourse Analysis ppt
Translation and culture
Language and gender
Communicative and semantic translation
Listening to the news
Sapir Whorf hypothesis
SAPIRWHORF HYPOTHEIS
Linguistic And Social Inequality
"Linguistics is a Science"
Linguistic conflict
Historicism- The school of thought
Translation studies
Foreignization & domestication
Translation methods
Lingua francas
Translation loss and gain
Ad

Similar to Wanted: Best Practices for Collaborative Translation (20)

PPT
2010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h44
PPTX
The Rosetta Foundation
PPT
Surviving crowdsourcing
PPTX
Final project oral presentation
PPTX
Jasig-sakai2012-communitytranslation-kajita
PPT
Rico celia
PDF
Gilbane sdl-innovate2010-final
PDF
Crowdsourcing In the Localization Process
PPTX
The Language Economy or the Lang “gig” Economy? Crowdsourcing Comes of Age
PDF
New challenges in interactive media & video game localization projects
PPTX
Lingotek Community Translation Platform June 2009 Public
PPTX
Trends in Localisation
PDF
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
PPTX
Empowering the specialized translator through data and technology, by Kevin D...
PDF
Smartling & AMA Webinar: Translation for the Global Marketplace
PPTX
Collaborative and Workflow Tools
PPTX
Easyling at atc London
PDF
FITT Toolbox: Tools for External Collaboration
PDF
Parlez-vous zwei-null, señor?
PDF
12. Gloria Corpas, Jorge Leiva, Miriam Seghiri (UMA) Human Translation & Tran...
2010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h44
The Rosetta Foundation
Surviving crowdsourcing
Final project oral presentation
Jasig-sakai2012-communitytranslation-kajita
Rico celia
Gilbane sdl-innovate2010-final
Crowdsourcing In the Localization Process
The Language Economy or the Lang “gig” Economy? Crowdsourcing Comes of Age
New challenges in interactive media & video game localization projects
Lingotek Community Translation Platform June 2009 Public
Trends in Localisation
Teaminology - A New Crowdsourcing Application for Term & Translation Governan...
Empowering the specialized translator through data and technology, by Kevin D...
Smartling & AMA Webinar: Translation for the Global Marketplace
Collaborative and Workflow Tools
Easyling at atc London
FITT Toolbox: Tools for External Collaboration
Parlez-vous zwei-null, señor?
12. Gloria Corpas, Jorge Leiva, Miriam Seghiri (UMA) Human Translation & Tran...
Ad

More from Grupo Inmigra i+d (20)

PPTX
Pecole aesla 2012 sml
PPT
Amnesty International : language in action LRC - Language Resource Centre
PPTX
Collaborative translation to help NGOs bridge the language last mile
PPT
Collaborative Translation in not-for-profit organizations
PDF
Inmigración y comunicación intercultural
PDF
Tribuna libre luis guerra
PPTX
Resultados del grupo UEM-Traducción
PPT
Traducción, inmigración e industrias de la lengua
PPT
Presentación del Grupo INMIGRA i+d
PPT
Certificación lingüística de nivel inicial para inmigrantes en contexto labor...
PPT
El uso del vídeo para evaluar la comprensión oral en niveles iniciales”. Encu...
PPT
Español con fines profesionales: necesidades de comprensión de los inmigrante...
PPT
La elaboración de textos y preguntas para evaluar la comprensión oral”. XXI C...
PPTX
El discurso periodístico sobre la inmigración latinoamericana en España: el c...
PPT
• Fases iniciales de la implantación de una certificación lingüística de E/L2
PPT
• Diseño de pruebas de evaluación comunicativas de E/L2
PPT
Análisis de necesidades lingüísticas comunicativas de la población inmigrante
PPT
Baralo programa formación_certificación
PPT
Criterios pragmáticos y socioculturales para la selección de textos y géneros...
PPT
Variables sociculturales y comunicativas para un diseño curricular de español...
Pecole aesla 2012 sml
Amnesty International : language in action LRC - Language Resource Centre
Collaborative translation to help NGOs bridge the language last mile
Collaborative Translation in not-for-profit organizations
Inmigración y comunicación intercultural
Tribuna libre luis guerra
Resultados del grupo UEM-Traducción
Traducción, inmigración e industrias de la lengua
Presentación del Grupo INMIGRA i+d
Certificación lingüística de nivel inicial para inmigrantes en contexto labor...
El uso del vídeo para evaluar la comprensión oral en niveles iniciales”. Encu...
Español con fines profesionales: necesidades de comprensión de los inmigrante...
La elaboración de textos y preguntas para evaluar la comprensión oral”. XXI C...
El discurso periodístico sobre la inmigración latinoamericana en España: el c...
• Fases iniciales de la implantación de una certificación lingüística de E/L2
• Diseño de pruebas de evaluación comunicativas de E/L2
Análisis de necesidades lingüísticas comunicativas de la población inmigrante
Baralo programa formación_certificación
Criterios pragmáticos y socioculturales para la selección de textos y géneros...
Variables sociculturales y comunicativas para un diseño curricular de español...

Recently uploaded (20)

PDF
Machine learning based COVID-19 study performance prediction
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Electronic commerce courselecture one. Pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Machine learning based COVID-19 study performance prediction
Reach Out and Touch Someone: Haptics and Empathic Computing
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Encapsulation_ Review paper, used for researhc scholars
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Unlocking AI with Model Context Protocol (MCP)
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Electronic commerce courselecture one. Pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Understanding_Digital_Forensics_Presentation.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
The Rise and Fall of 3GPP – Time for a Sabbatical?
Network Security Unit 5.pdf for BCA BBA.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
MYSQL Presentation for SQL database connectivity
Dropbox Q2 2025 Financial Results & Investor Presentation
“AI and Expert System Decision Support & Business Intelligence Systems”
The AUB Centre for AI in Media Proposal.docx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

Wanted: Best Practices for Collaborative Translation

  • 1. Wanted: Best Practices for Collaborative Translation Alain Désilets National Research Council of Canada alain.desilets@nrc-cnrc.gc.ca With support from:
  • 2. A (Very) Brief History of Collaborative Translation Circa 2005: “Wikis, what's that?” Circa 2006: “I know about Wikipedia, but I hear it’s garbage because anyone can write anything on it.” Circa 2007: “You know, I have been to Wikipedia a couple of times and was pleasantly surprised by the quality of what I found there.” Circa 2008: “Actually, this wiki stuff is really interesting. Now, I routinely use Wikipedia in my work, and although I am cautious with it, I find it useful. I have a sense that this wiki/collaborative stuff will have a wider impact for translation, but I’m not quite sure how.”
  • 3. A (Very) Brief History of Collaborative Translation (2) Circa 2009: “Whoa! Translation Crowdsourcing is going to put me out of a job!” Circa 2010: “Well, I guess that was a storm in a teacup. Crowdsourcing will be used in some specific and limited contexts, but it won't take over. Maybe this is more an opportunity for us than a threat…” Circa 2011: “Hum… getting this collaborative translation stuff to work right is hard and confusing.”
  • 4. Talk Outline The different “flavors” of Collaborative Translation Common issues and Challenges in Collaborative Translation Capturing Collaborative Translation best-practices in the form of Design Patterns
  • 5. The different flavors of Collaborative Translation
  • 7. Definition Collaborative Translation is the use of any open online collaboration technology or process, in order to help with translation tasks, or tasks related to translation (ex: terminology).
  • 8. Available in the following flavors… • Translation crowdsourcing • Collaborative terminology resources • Translation memory sharing • Online marketplaces for translators • Agile translation teamware • Post-editing by the crowd
  • 9. Translation Crowdsourcing Mechanical-Turk-like systems to support the translation of content by large crowds of mostly amateurs, through an open-call process. By far the most talked about collaborative translation approach • Software user interface (Facebook, Adobe, Symantec, Firefox) • Technical documentation (Adobe, Symantec) • Transcripts of videos of an “inspirational” nature (TED Talks, Adobe TV) • Humanitarian aid content (Translators without Borders, Kiva.org, Haiti Earthquake Mission) • Large scale collection of linguistic data for research purposes or machine translation training (NAACL Workshop on Crowdsourcing).
  • 10. Collaborative terminology resources Wikipedia-like platforms for the creation and maintenance of large terminology resources by a crowd of translators, terminologists, domain experts, and even general members of the public. • Wikipedia • Wiktionary • ProZ’s Kudoz forum • Urban Dictionary • TermWiki.com • TikiWiki • Reverso dictionary
  • 11. Translation memory sharing Platforms for large scale pooling and sharing of multilingual parallel corpora between organizations and individuals. • TAUS Data Association • MyMemory • Google Translator Toolkit • WeBiText Often, collaboration is “implicit”, for example, in the case of WeBiText.
  • 12. Online marketplaces for translators eBay-like disintermediated environments for connecting customers and translators directly, with minimal intervention by a middle man. • ProZ.com • TranslatorsCafe • Translated.net Collaborative aspects comes from things like “open call sourcing” and reputation management based on community assessment.
  • 13. Agile translation teamware Wiki-like systems and processes that allow multidisciplinary teams of professionals (translators, terminologists, domain experts, revisers, managers) to collaborate on large translation projects, using an agile, grassroots, parallelized process instead of the more top-down, assembly-line approach found in most translation workflow systems. No specific software or site, but many case studies describing how to implement this approach, using general purpose collaboration tools like wikis, BaseCamp. • Beninatto & De Palma, 2008, • Calvert, 2008 • Yahaya, 2008 Some translation workflow systems starting to market themselves as being “collaborative”
  • 14. Post-editing by the crowd Systems allowing a large crowd of mostly amateurs to correct the output of machine translations systems, often with the aim of improving the system’s accuracy. • Asia Online’s Wikipedia translation project • Google Translate allows anonymous users to correct the outputs produced by the systems • Likewise for Microsoft’s Bing Translator
  • 15. Is this REALLY New? Weren’t Terminology Databases, Translation Memories and Translation Workflow Systems already collaborative? • Yes, but… • … Collaborative Translation is about using these kinds of groupware technologies in the context of much larger groups or communities, where people have fewer reasons to trust each other a-priori. It’s one thing to open yourself to collaboration with colleagues and customers. It’s quite another thing to open yourself to the whole world.
  • 16. Common issues and Challenges in Collaborative Translation
  • 17. This is NOT easy Choosing a flavor and tailoring it to your needs is still somewhat of a black art, guided by trial and error. There are lots of important and poorly understood issues that arise, many of which are common to most flavors: • Alignment with business goals • Quality control • Crowd motivation • Proper role of professionals
  • 18. Alignment with business goals Why are you doing this in the first place? Which flavor can deliver what you want? The actual benefit you get from a given flavor is not necessarily what you think! Translation crowdsourcing • Reduce cost? – Yes, but not the biggest benefit. • Decrease lead time? – Definitely. • Translation more in-tune with target audience’s idiosyncrasies? -- Also • Most importantly: Increase brand loyalty by engaging end-users as co-creators of products, instead of passive consumers.
  • 19. Quality Control How to control quality when you open yourself to contributions from a potential large group of “outsiders”? Many ways: • Screen contributors before letting them in (ex: Translators Without Border, Kiva.org). • Have members of the community vote on the quality of each other’s work (ex: Facebook, Translated.net). • Have in-house professionals revise the work done by the community (ex: Facebook).
  • 20. Quality Control (2) Do not assume that quality of community-produced content will be lower. For instance, Wikipedia provably measures up to professionally produced encyclopedia like Britannica (English) and Brockhaus (German). Quality issues tend to iron themselves out provided that you attract a sufficient large number of the right people Wisdom of crowd effects works surprisingly well when the following conditions are met: • Diversity • Independence • Aggregation
  • 21. Crowd motivation If you are to attract and retain enough of the “right people” you need to understand why thy might contribute. • Mandated by management (ex: Agile Translation teamware) • Emotional bond with the content (ex: Facebook, and surprisingly, Adobe) • Prestige of the content (ex: TEDTalks) • Wanting to do good (ex: Translators Without Border, Kiva, Haiti Earthquake Mission, Data collection for scientific research) • Pride in one’s native language (ex: Data collection for R&D in MT for small density languages) • Trying to perfect second language skills • Trying to make a go at professional translation career (ex: Kiva.org) • And in some cases, $$$ – Will this be the dominant scenario? – How to set compensation high enough to attract good contributors, but not so high that it interferes with more intrinsic motivations, or attracts people out to game the system.
  • 22. Role of professionals Some flavors of CT are designed specifically for professionals (ex: Agile translation teamware, Online translator marketplaces). But some (e.g. Translation crowdsourcing), tend to de-emphasize their role. When should professionals be involved, and what should be their role? • Revise work done by amateurs? – Focus on more challenging aspects of translation like terminology, style, fluidity? • Manage and coach the crowd? • Focus on more mission-critical and hard to translate content? Translation Crowdsourcing may actually increase the size of the pie, by making it possible to tackle content and/or small languages that would otherwise not have been dealt with anyway.
  • 23. Capturing Collaborative Translation best-practices in the form of Design Patterns
  • 24. Wanted: Best- Practices Collaborative Translations presents practitioners with a varied and complex envelopes of different approaches and technologies. Selecting a flavor and tuning it to meet your needs is complex. We need some sort of concise, easy-to-consult repository of best- practices for that field. We propose a way to collaboratively create such a repository a community, in the form of a design patterns language.
  • 26. About Design Patterns A format for describing a common solution to a common problem in a given field Originally used in Architecture, but since then adopted in other fields such as Software Engineering, Education, etc.
  • 27. Design Patterns Example Publish Contributions Rapidly Context This pattern is useful for motivating contributors in any collaborative translation context, but it is particularly useful in translation crowdsourcing scenarios. Problem Contributors are often motivated by a desire to have a positive impact on the community they are participating in. However, they cannot achieve this sense of being useful, if their contributions do not become available to the rest of the community in a reasonable amount of time. Solution Therefore, minimize the delay between the moment when a member of the community contributes to the site, and the moment where it becomes publicly available to the rest of the community. Ideally, the contribution should become visible to the rest of the community as soon as the user clicks on the Save button.
  • 28. Design Patterns Example (2) Related patterns – Point System is another way for a contributor to get a sense of how useful he has been to the community. – Campaign Progress Gauge is another practice which allows members of the community to see the positive impact of their actions. The main difference is that it operates more at a community/project level rather than at a individual/contribution level. Real-life examples – At Facebook, translations become available in a matter of hours. – In the context of software localization by the crowd, Adobe makes a conscious effort to wrap the community's translations into every new releases of the product.
  • 29. TAUS Roundtable on Collaborative Translation Wiki “Barn Raising” workshop held on October 12th, 2011 at Localization World in Santa Clara. 12 practitioners • One third with hands on experience of CT (NRC, Adobe, Symantec, Kiva, World Wide Lexicon) • Two thirds with no experience, but a strong interest in trying it (In Every Language, MemSource, Firma 8, SPIL Games) Talks by the experienced users about what worked and didn’t. Followed by brainstorming of what the recurring best-practices seem to be.
  • 30. The Best-Practices End result: => 50+ best practices organized into 6 themes Planning and Scoping Translation as User Engagement, Align Stakeholder Expectations, Early and Continuous Clarification of Translator Expectations, Backup Plan, Project, Check Points, Appoint Initial Community Manager, Clear Objectives, Identify Compatible Content Community Motivation Campaign Progress Gauge, Contributor Recognition, Leader Board, Official Certificate, Point System, Offer Double Points, Hand-Out Unique Branded, Products, Contributor of the Month, Grant Special Access Rights, Playful Casual Translation, Campaign, Publish Contributions Rapidly, Playful Competition Among Contributors
  • 31. The Best-Practices (2) Quality Content-Specific Testing, Entry Exam, Peer Review, Automatic Reputation Management, Random Spot-Checking, Revision Crowdsourcing, Users as Translators, Voting, Transparent Quality Level, Publish then Revise Contributor Career Path Flexible Contributor Career Path, Lurker to Contributor Transition, Anonymous Translation, Find the Leaders, Support Variable Levels of Involvement, Community Manager, Content Prioritizer Right Sizing Appropriate Chunk Size, Community-Appropriate Project Size, Break Up Crowd Into Teams, Require Minimal Involvement Level, Keep the Crowd Small, Volunteer Team Leaders
  • 32. The Best-Practices (3) Tools and Processes Hint at Content Priority, First In, First Out, Task Self Selection, Layered Fallbacks, Official Linguistic Resources, Automatic Suggestions, Provide, Context, In-Place Translation, Community Forum, Analytics for Content Prioritization, Simplicity First, Good Examples of Contributions, Encourage Self, Set Deadlines
  • 33. Some Observations The bulk of practices relate to Translation Crowdsourcing. => We need to spend more time capturing practices for other flavors of Collaborative Translation The bulk of the practices so far are not specific to translation. • They would be useful in the context of crowdsourcing efforts in any domain. • Maybe all we need is to codify and/or learn about the best practices for crowdsourcing in general? The more similar two organizations are, the more similar their practices will be (ex: Kiva and TWB, versus Kiva and Adobe).
  • 35. Conclusion Collaborative Translation presents practitioners with a very large and varied set of tools and processes. Choosing a particular flavor of CT and tailoring to meet one’s needs can be a daunting task. We need a concise, easy to consult, modular compendium of current best practices in that area. We have started building such a compendium in the form of a wiki site (www.collaborative-translation-patterns.com) which captures best practices in the form of design patterns. We invite every one in this room to contribute to it if they can.