SlideShare a Scribd company logo
EXTRA
Stuart Myles * Associated Press * 7th March 2016
© 2016 IPTC (www.iptc.org) All rights reserved
https://flic.kr/p/81HXTG
Google DNI
• Google’s €150 million Digital News Initiative fund
– Stimulate innovation among European news organizations
– https://www.digitalnewsinitiative.com/fund/
• Multiple rounds – first in October 2015, next in June 16
– First funding of €27 million to projects in 23 countries
– http://googlepolicyeurope.blogspot.gr/2016/02/digital-news-
initiative-first-funding_24.html
© 2016 IPTC (www.iptc.org) All rights reserved
EXTRA
EXTraction Rules Apparatus
• Open source, rules-based, multilingual news
classification
– Rules in two languages for IPTC Media Topics
• Rules based > Hand tagging
– Consistent, performant and scalable
• Rules based > Statistical tagging
– Statistical approaches require numerous annotated examples
– EXTRA will allow publishers to rapidly adapt to breaking news
and low-frequency topics
– Avoid problems with ambiguity (“Police Can’t Stop Gambling”)
– Precisely distinguish between similar topics, which are more
challenging for statistical approaches.
© 2016 IPTC (www.iptc.org) All rights reserved 3
EXTRA Deliverables
• Rules based engine
• Formal specification of the classification rules language
• Two sets of extraction rules
– Tagging with IPTC Media Topics
– Drive adoption and consistency
© 2010 IPTC (www.iptc.org) All rights reserved 4
EXTRA Budget
• Developer €35,000
• Linguist €10,000
• Project Manager €5,000
• Total - €50,000
© 2010 IPTC (www.iptc.org) All rights reserved 5
EXTRA Milestones
• Recruit EXTRA Team
• Evaluate existing open source projects and frameworks
• Create first EXTRA release
• Develop EXTRA software and rule sets
• First non-core open source contributor
• First production deployment
© 2010 IPTC (www.iptc.org) All rights reserved 6
Apply for next round?
• Apply in May 2016 for June round
• Candidate applications
– EXTRA – but no specific feedback
– Sport Identifiers
© 2016 IPTC (www.iptc.org) All rights reserved 7

More Related Content

PPT
BDE SC2 Workshop 3: DG AGRI R&I activities contributing to the DSM strategy
PPTX
BDE SC2 Workshop 3: Catalyzing the creation of a Data Ecosystem for Agricultu...
PPT
BDE SC2 Workshop 3: AgGate: the French Agricultural Data Platform
PPTX
BDE SC2 Workshop 3: Building a European Data Economy
PPT
It4 ccommunityi tresources
PPTX
BDE SC2 Workshop 3: CAPS: hyperconnectivity engaging citizens
PDF
Cybercrime and Cybersecurity Differences
PPTX
Big Data in Food & Agriculture: Community Perspectives
BDE SC2 Workshop 3: DG AGRI R&I activities contributing to the DSM strategy
BDE SC2 Workshop 3: Catalyzing the creation of a Data Ecosystem for Agricultu...
BDE SC2 Workshop 3: AgGate: the French Agricultural Data Platform
BDE SC2 Workshop 3: Building a European Data Economy
It4 ccommunityi tresources
BDE SC2 Workshop 3: CAPS: hyperconnectivity engaging citizens
Cybercrime and Cybersecurity Differences
Big Data in Food & Agriculture: Community Perspectives

Viewers also liked (14)

PPTX
Welcome To IPTC AGM 2016 Berlin
PPT
IPTC News Exchange Working Group 2013 Autumn Meeting
PPTX
IPTC Semantic Web June 2011
PPT
IPTC News Exchange Formats 2011 Autumn Working Party Report
PPTX
IPTC Rights Expression Working Group Spring 2016
PDF
Final report european union
ODP
Jayson lorenzen iptc_rnews_overview
KEY
Introduction To rNews 1.0
PPTX
Update on IPTC's EXTRA Open Source Classification Engine
PPTX
IPTC Chairman's Welcome June 2016
PPTX
IPTC EXTRA - Open Source Rules Classification
PPTX
Seven rNews Ideas
PPTX
IPTC Rights October 2016
PDF
Things I will tell my kids if they become entrepreneurs
Welcome To IPTC AGM 2016 Berlin
IPTC News Exchange Working Group 2013 Autumn Meeting
IPTC Semantic Web June 2011
IPTC News Exchange Formats 2011 Autumn Working Party Report
IPTC Rights Expression Working Group Spring 2016
Final report european union
Jayson lorenzen iptc_rnews_overview
Introduction To rNews 1.0
Update on IPTC's EXTRA Open Source Classification Engine
IPTC Chairman's Welcome June 2016
IPTC EXTRA - Open Source Rules Classification
Seven rNews Ideas
IPTC Rights October 2016
Things I will tell my kids if they become entrepreneurs
Ad

More from Stuart Myles (20)

PPTX
IPTC Rights Statements For News
PPTX
IPTC New Taxonomies Ideas
PPTX
IPTC Board Spring 2019
PPTX
IPTC Spring 2019 Conference
PPTX
Photomation or Fauxtomation?
PPTX
Image Tagging at the Associated Press
PPTX
IPTC Rights Working Group Toronto October 2018
PPTX
IPTC AGM 2018 Welcome
PPTX
How Can We Make Algorithmic News More Transparent?
PPTX
IPTC EXTRA Spring 2018
PPTX
IPTC Machine Readable Rights for News and Media: Solving Three Challenges wit...
PPTX
Ap Taxonomy Localization Requirements and Challenges
PPTX
IPTC Spring Meeting Welcome To Athens April 2018
PPTX
Sustaining Television News Technical Challenges
PPTX
How to Train Your Classifier: Create a Serverless Machine Learning System wit...
PPTX
The Search for IPTC's Next Managing Director
PPTX
IPTC Approach to News in JSON
PPTX
IPTC News in JSON November 2017
PPTX
IPTC EXTRA and EXTRA+ November 2017
PPTX
Welcome to Barcelona - IPTC November 2017
IPTC Rights Statements For News
IPTC New Taxonomies Ideas
IPTC Board Spring 2019
IPTC Spring 2019 Conference
Photomation or Fauxtomation?
Image Tagging at the Associated Press
IPTC Rights Working Group Toronto October 2018
IPTC AGM 2018 Welcome
How Can We Make Algorithmic News More Transparent?
IPTC EXTRA Spring 2018
IPTC Machine Readable Rights for News and Media: Solving Three Challenges wit...
Ap Taxonomy Localization Requirements and Challenges
IPTC Spring Meeting Welcome To Athens April 2018
Sustaining Television News Technical Challenges
How to Train Your Classifier: Create a Serverless Machine Learning System wit...
The Search for IPTC's Next Managing Director
IPTC Approach to News in JSON
IPTC News in JSON November 2017
IPTC EXTRA and EXTRA+ November 2017
Welcome to Barcelona - IPTC November 2017
Ad

Recently uploaded (20)

PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Electronic commerce courselecture one. Pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Cloud computing and distributed systems.
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Approach and Philosophy of On baking technology
PPTX
A Presentation on Artificial Intelligence
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
cuic standard and advanced reporting.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Spectroscopy.pptx food analysis technology
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPT
Teaching material agriculture food technology
MIND Revenue Release Quarter 2 2025 Press Release
Electronic commerce courselecture one. Pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Cloud computing and distributed systems.
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Approach and Philosophy of On baking technology
A Presentation on Artificial Intelligence
Building Integrated photovoltaic BIPV_UPV.pdf
A comparative analysis of optical character recognition models for extracting...
Unlocking AI with Model Context Protocol (MCP)
cuic standard and advanced reporting.pdf
MYSQL Presentation for SQL database connectivity
sap open course for s4hana steps from ECC to s4
Spectroscopy.pptx food analysis technology
NewMind AI Weekly Chronicles - August'25-Week II
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
gpt5_lecture_notes_comprehensive_20250812015547.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Teaching material agriculture food technology

IPTC EXTRA Rules Based Classification for News

  • 1. EXTRA Stuart Myles * Associated Press * 7th March 2016 © 2016 IPTC (www.iptc.org) All rights reserved https://flic.kr/p/81HXTG
  • 2. Google DNI • Google’s €150 million Digital News Initiative fund – Stimulate innovation among European news organizations – https://www.digitalnewsinitiative.com/fund/ • Multiple rounds – first in October 2015, next in June 16 – First funding of €27 million to projects in 23 countries – http://googlepolicyeurope.blogspot.gr/2016/02/digital-news- initiative-first-funding_24.html © 2016 IPTC (www.iptc.org) All rights reserved
  • 3. EXTRA EXTraction Rules Apparatus • Open source, rules-based, multilingual news classification – Rules in two languages for IPTC Media Topics • Rules based > Hand tagging – Consistent, performant and scalable • Rules based > Statistical tagging – Statistical approaches require numerous annotated examples – EXTRA will allow publishers to rapidly adapt to breaking news and low-frequency topics – Avoid problems with ambiguity (“Police Can’t Stop Gambling”) – Precisely distinguish between similar topics, which are more challenging for statistical approaches. © 2016 IPTC (www.iptc.org) All rights reserved 3
  • 4. EXTRA Deliverables • Rules based engine • Formal specification of the classification rules language • Two sets of extraction rules – Tagging with IPTC Media Topics – Drive adoption and consistency © 2010 IPTC (www.iptc.org) All rights reserved 4
  • 5. EXTRA Budget • Developer €35,000 • Linguist €10,000 • Project Manager €5,000 • Total - €50,000 © 2010 IPTC (www.iptc.org) All rights reserved 5
  • 6. EXTRA Milestones • Recruit EXTRA Team • Evaluate existing open source projects and frameworks • Create first EXTRA release • Develop EXTRA software and rule sets • First non-core open source contributor • First production deployment © 2010 IPTC (www.iptc.org) All rights reserved 6
  • 7. Apply for next round? • Apply in May 2016 for June round • Candidate applications – EXTRA – but no specific feedback – Sport Identifiers © 2016 IPTC (www.iptc.org) All rights reserved 7