Linked Data: A Commercial Outlook
Our LD-Portfolio in 2017 and a perspective on the future
Jan E. Voskuil
SEMANTiCS 2017
Amsterdam
[Chart: Revenue from Linked Data work, 2012 to 2017]
Our LD-Portfolio: A Snapshot
• Kennisnet
• KOOP
• Wolters Kluwer NL
• Alliander
• Beeld en Geluid
• Kadaster
• National Police
• CROW
• OntoPharma

Kennisnet supports schools with IT. Provides a basic IT infrastructure and shares knowledge. Maintains several standards for information exchange.

KOOP: Publication office for Dutch government organisations. Maintains value lists that have a legal status, e.g., the list of municipalities.

Wolters Kluwer Legal & Regulatory offers information, software and tools for legal professionals. Business vocabularies for knowledge management and search.

Alliander is an energy network company that is bringing an open and sustainable energy market closer to the consumer. Maintains several large business vocabularies and large amounts of technical documents.

Beeld en Geluid: The Netherlands Institute for Sound and Vision is a cultural-historical organization. It collects, preserves and opens up the audiovisual heritage. Maintains several large thesauri.

The Netherlands' Cadastre, Land Registry and Mapping Agency (Kadaster for short) collects and registers administrative and spatial data on property and the rights involved. Publishes Linked Data on a large scale.

National Police: The Dutch police use Linked Data technologies to collate data from disparate sources.

CROW is a non-profit knowledge partner in road construction. CROW maintains thesauri and a large knowledge base.

OntoPharma, a spin-off of Taxonic, delivers solutions for a Data First approach in the pharma sector. Solutions include automated data extraction, management of reference data and product data, and structured authoring.

Use Cases

Vocabulary-oriented
• Business vocabularies
• Reference Data Management
• Collating business concepts across vocabularies

Data-oriented
• Publishing interoperable data
• Publishing datamodels
• Managing datasets

Documents and data
• Semantic ECM: Tagging, Extraction, Indexing, Content Classification, Structured Authoring

[Matrix: clients against use cases]
Clients: Kennisnet, KOOP, Wolters Kluwer NL, Alliander, Beeld en Geluid, Kadaster, National Police, CROW, OntoPharma
Use cases: Managing metadata sets, Business vocabularies, Publishing datamodels, Publishing interoperable data, Collating business concepts across vocabularies, Reference Data Management, Managing datasets, Semantic ECM

[Chart: Relative contribution 2017, by use case: Business vocabularies, Publishing interoperable data, Managing datasets, Managing metadata sets, Semantic ECM, Reference Data Management, Collating business concepts across vocabularies, Publishing datamodels]

[Chart: Expectations of Relative Growth, by use case: Business vocabularies, Publishing interoperable data, Managing datasets, Managing metadata sets, Semantic ECM, Reference Data Management, Collating business concepts across vocabularies, Publishing datamodels]

Collating concepts across vocabularies
Kennisnet
• Develops standards for information exchange in the field of education
• Overlapping semantics
• Same concepts in different models
• Specialized tooling
• TopQuadrant Enterprise Vocabulary Network
• Custom concept browsers for visualizing semantic overlap
> The browser shows info about the term “Samengestelde groep”
> This term is defined in EDEXML, a data exchange standard
> Also occurs in UWLR and ECKID
> The browser shows a detailed comparison
> Similarities and differences made visible
> High level of automation
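
The comparison described above can be illustrated with a small sketch. It is not the Kennisnet tooling: the vocabulary contents, the property names and the `compare_concept` helper are hypothetical, chosen only to show how a term that occurs in several standards might be collated so that similarities and differences become visible.

```python
from typing import Dict, Optional

# Hypothetical, hand-written vocabularies: standard -> term -> properties.
# Property names and values are illustrative, not taken from the real standards.
Vocabulary = Dict[str, Dict[str, str]]

VOCABULARIES: Dict[str, Vocabulary] = {
    "EDEXML": {
        "Samengestelde groep": {"type": "Group", "cardinality": "0..n"},
    },
    "ECKID": {
        "Samengestelde groep": {"type": "Group"},
    },
}


def compare_concept(term: str, vocabularies: Dict[str, Vocabulary]) -> dict:
    """Collate one term across vocabularies and split its properties into
    those on which every defining vocabulary agrees and those that differ."""
    # Keep only the vocabularies that define the term at all.
    found = {name: vocab[term] for name, vocab in vocabularies.items() if term in vocab}
    all_props = sorted({prop for props in found.values() for prop in props})

    shared: Dict[str, str] = {}
    differing: Dict[str, Dict[str, Optional[str]]] = {}
    for prop in all_props:
        # A vocabulary that lacks the property contributes None.
        values = {name: props.get(prop) for name, props in found.items()}
        distinct = set(values.values())
        if len(distinct) == 1:
            shared[prop] = next(iter(distinct))
        else:
            differing[prop] = values
    return {"defined_in": sorted(found), "shared": shared, "differing": differing}


report = compare_concept("Samengestelde groep", VOCABULARIES)
print(report["defined_in"])
print(report["shared"])
print(report["differing"])
```

A property that is missing in one vocabulary is reported as a difference rather than silently dropped, which is one possible design choice for making gaps between standards visible.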

The potential for Semantic ECM
Data First
• From documents to data
• Regulatory requirements
• Data extraction as a service
• Future outlook: create data and documents in concert

Advanced algorithms for extracting data

Extracting concepts not literally mentioned in the text

Source Text
The tablets shall be taken with liquid, and should not be crushed or chewed. Ontopharmanax may be taken with or without food

Extracted data
RoutesOfAdministration
{
  "routes": ["OralUse"]
}

Dealing with negations

Source Text
Ontopharmanax is for subcutaneous injection only and shall not be used for intramuscular injection.

Extracted data
RoutesOfAdministration
{
  "routes": ["SubcutaneousInjection"]
}
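
The two extraction examples above can be approximated with a very small rule-based sketch. This is not the algorithm behind the slides, which is not described in the source; it is a simple stand-in under stated assumptions. The cue lists, the clause-splitting rule and the `extract_routes` function are all hypothetical.

```python
import json
import re
from typing import Dict, List

# Hypothetical cue phrases per route. "tablet" implies oral use even though
# the source text never mentions the word "oral".
CUES: Dict[str, List[str]] = {
    "OralUse": ["tablet", "capsule"],
    "SubcutaneousInjection": ["subcutaneous injection"],
    "IntramuscularInjection": ["intramuscular injection"],
}

# Assumption: a cue is negated when a negation word precedes it in the same clause.
NEGATION = re.compile(r"\b(not|never|no)\b")
# Assumption: clauses end at punctuation or at the conjunctions "and" / "but".
CLAUSE_BOUNDARY = re.compile(r"[.;,]|\band\b|\bbut\b")


def extract_routes(text: str) -> Dict[str, List[str]]:
    """Return the routes of administration asserted (not negated) in the text."""
    routes: List[str] = []
    for clause in CLAUSE_BOUNDARY.split(text.lower()):
        for route, cues in CUES.items():
            for cue in cues:
                position = clause.find(cue)
                if position == -1:
                    continue
                # Only negation words before the cue count against it.
                if NEGATION.search(clause[:position]):
                    continue
                if route not in routes:
                    routes.append(route)
    return {"routes": routes}


ORAL_TEXT = (
    "The tablets shall be taken with liquid, and should not be crushed or "
    "chewed. Ontopharmanax may be taken with or without food"
)
INJECTION_TEXT = (
    "Ontopharmanax is for subcutaneous injection only and shall not be used "
    "for intramuscular injection."
)

print(json.dumps(extract_routes(ORAL_TEXT)))
print(json.dumps(extract_routes(INJECTION_TEXT)))
```

Keyword rules like these break down quickly on real product texts, which is consistent with the point made on the slides that such extraction calls for more advanced algorithms.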

The potential for Semantic ECM
Serve information in the field
• Operatives need specific information
• Buried in documents
• Semantic Search
• Auto-tagging, extraction, indexing
• Future outlook: template-based authoring
Semantic ECM
• Semantic ECM is transformational
• Affects many processes in many ways
• Authoring
• Redacting
• Retrieval
• Difficulty is structurally underestimated
• Complexity
• Need for solid know-how and the right tools
• Many projects start and then fail miserably
• Still much evangelizing needed!

Evolving Use Cases
• Business vocabularies
• Thesauri & metadata
• Data publishing & management
• Semantic ECM
• Concept Computing

CARML
• Open Source
• Created by Taxonic, with Kadaster
• RML standard for mapping & transformation
• From relational to RDF
• From any format to RDF
• First release now available from GitHub!
• https://carml.gitlab.io/plain-html/
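
The idea of mapping relational data to RDF can be sketched in a few lines. The code below uses neither the tool's API nor RML syntax; it is a plain-Python stand-in in which the `MAPPING` structure, the `rows_to_ntriples` function, the sample rows and the `example.org` URIs are all hypothetical. It shows a declarative mapping (a subject URI template plus one predicate per column) being applied to rows to produce N-Triples lines.

```python
from typing import Dict, Iterable, List

# Hypothetical mapping, loosely mirroring the shape of a declarative mapping:
# a subject URI template plus one predicate per source column.
MAPPING = {
    "subject_template": "http://example.org/parcel/{id}",
    "predicates": {
        "municipality": "http://example.org/def#municipality",
        "area_m2": "http://example.org/def#area",
    },
}


def escape_literal(value: str) -> str:
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def rows_to_ntriples(rows: Iterable[Dict[str, str]], mapping: dict) -> List[str]:
    """Apply the mapping to each row and return one N-Triples line per value."""
    triples: List[str] = []
    for row in rows:
        # Simplification: key values are inserted as-is, without percent-encoding.
        subject = mapping["subject_template"].format(**row)
        for column, predicate in mapping["predicates"].items():
            value = row.get(column)
            if value is None:
                continue  # A missing column value yields no triple.
            triples.append(f'<{subject}> <{predicate}> "{escape_literal(str(value))}" .')
    return triples


# Hypothetical rows, as they might come out of a relational table.
ROWS = [
    {"id": "1", "municipality": "Amsterdam", "area_m2": "120"},
    {"id": "2", "municipality": "Utrecht"},
]

for line in rows_to_ntriples(ROWS, MAPPING):
    print(line)
```

Keeping the mapping as data rather than code is the point of the sketch: the same function can serve any table once a mapping for it is written.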
Knowledge Works


Session 1.1 linked data applied: a field report from the netherlands
