SlideShare a Scribd company logo
Quantifying the Translator Effect: Identifying authors and machine translation tools in translated text Aylin Caliskan Computer Science PhD Student  at Drexel University
Authorship Attribution Identifying the author of an anonymous text Software JStylo Training corpus Lexical, character, syntactic, semantic features Machine learning tools
Brennan-Greenstadt Adversarial Stylometry Corpus 21 st  century writing Diverse topics 13 authors Native English authors Minimum of 5000 words per author Exclude obfuscation and imitation texts
Translator Effect Original text:  “This paper set out to explore what communities who seek greater local control over their lives face as the world begins the herculean task of reorganizing its energy systems.”   Google's translator:  “In this paper, we explore the world to look for more local control of community life from what it is like to face a very difficult task to begin to rebuild their energy systems.”
Identifying translators
Authorship Attribution in Translated Text
Effective features in authorship attribution and translator attribution
Conclusion
Future Work Different types of translators One way translations Arabic, Russian, Spanish, and Turkish corpus
Thanks for listening. Any questions? Feel free to contact me: aylin.caliskan@drexel.edu

More Related Content

PDF
What you need to put Machine Translation into practice: Tools, People, and Pr...
PDF
Website Localization – Industry Best Practices by TripleInk
PPTX
MidwestJS Zero to Testing
PPTX
Zero to Testing in JavaScript
PPTX
Simple Proxying in Rails
PDF
Selecting a Web Framework
PDF
Displacing Worst Practices in CSS
PDF
Feminism & Open Source Contribution
What you need to put Machine Translation into practice: Tools, People, and Pr...
Website Localization – Industry Best Practices by TripleInk
MidwestJS Zero to Testing
Zero to Testing in JavaScript
Simple Proxying in Rails
Selecting a Web Framework
Displacing Worst Practices in CSS
Feminism & Open Source Contribution

More from pamselle (12)

PDF
WordPress 101 Sunday Session
PDF
WordPress 101 Saturday Session
PDF
Power Spriting With Compass
PDF
Sadia Afroz: Detecting Hoaxes, Frauds, and Deception in Writing Style Online
PDF
Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...
PDF
GDI WordPress 4 January 2012 (white)
PDF
GDI WordPress 4 January 2012
PDF
GDI WordPress 3 January 2012 (white background)
PDF
GDI WordPress 3 January 2012
PDF
GDI WordPress 2 January 2012
PDF
Gdi word press_2
PDF
GDI WordPress 1 January 2012
WordPress 101 Sunday Session
WordPress 101 Saturday Session
Power Spriting With Compass
Sadia Afroz: Detecting Hoaxes, Frauds, and Deception in Writing Style Online
Kamelia Aryafar: Musical Genre Classification Using Sparsity-Eager Support Ve...
GDI WordPress 4 January 2012 (white)
GDI WordPress 4 January 2012
GDI WordPress 3 January 2012 (white background)
GDI WordPress 3 January 2012
GDI WordPress 2 January 2012
Gdi word press_2
GDI WordPress 1 January 2012
Ad

Recently uploaded (20)

PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Network Security Unit 5.pdf for BCA BBA.
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Electronic commerce courselecture one. Pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
KodekX | Application Modernization Development
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Modernizing your data center with Dell and AMD
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Network Security Unit 5.pdf for BCA BBA.
The AUB Centre for AI in Media Proposal.docx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
NewMind AI Weekly Chronicles - August'25 Week I
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
A Presentation on Artificial Intelligence
Electronic commerce courselecture one. Pdf
Encapsulation_ Review paper, used for researhc scholars
Mobile App Security Testing_ A Comprehensive Guide.pdf
KodekX | Application Modernization Development
“AI and Expert System Decision Support & Business Intelligence Systems”
Modernizing your data center with Dell and AMD
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Ad

Aylin Caliskan: Quantifying the Translator Effect: Identifying authors and machine translation tools in translated text

  • 1. Quantifying the Translator Effect: Identifying authors and machine translation tools in translated text Aylin Caliskan Computer Science PhD Student at Drexel University
  • 2. Authorship Attribution Identifying the author of an anonymous text Software JStylo Training corpus Lexical, character, syntactic, semantic features Machine learning tools
  • 3. Brennan-Greenstadt Adversarial Stylometry Corpus 21 st century writing Diverse topics 13 authors Native English authors Minimum of 5000 words per author Exclude obfuscation and imitation texts
  • 4. Translator Effect Original text: “This paper set out to explore what communities who seek greater local control over their lives face as the world begins the herculean task of reorganizing its energy systems.” Google's translator: “In this paper, we explore the world to look for more local control of community life from what it is like to face a very difficult task to begin to rebuild their energy systems.”
  • 6. Authorship Attribution in Translated Text
  • 7. Effective features in authorship attribution and translator attribution
  • 9. Future Work Different types of translators One way translations Arabic, Russian, Spanish, and Turkish corpus
  • 10. Thanks for listening. Any questions? Feel free to contact me: aylin.caliskan@drexel.edu