Semantic-assisted Analysis and Search in Customer Specifications
Martin Voigt, Daniel Hladky
September 2014
[Figure: Ontos Linked Data Information Workbench. Components: Extraction & Analysis, Indexing, Storage, Information & Knowledge Management, Search, Portal. Users: Engineer, Sales. Input: Multilingual Specifications.]
I speak about…
The Problem,
Our Solution,
Insights & Further Work.
The Problem 
AviComp Controls GmbH
- leading engineering contractor for rotating machinery controls
Customers 
Engineers 
Sales 
> 100k Technical Specifications
http://www.avicomp.com/capabilities/turbo-compressor-controls.html
The Problem 
Analysis: 1) task, 2) current solution, 3) ideas 
Problems 
Multiple, inefficient tools 
Heterogeneity 
Knowledge management & transfer 
http://answerhub.com/article/the-cost-of-knowledge-loss/
Our Solution 
[Figure: Ontos Linked Data Information Workbench architecture, as shown on the title slide.]
http://www.ontos.com/products/ontosldiw/
Our Solution 
Extraction & Analysis
Homogenization: PDF conversion (Apache POI) & OCR (CuneiForm) 
Text extraction (Apache Tika) 
Language detection (language-detection API) 
Text preparation, e.g., remove headers & footers 
SKOS-based concept identification 
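
The text preparation and concept identification steps can be sketched in a few lines. This is a minimal illustration, not the workbench's implementation: it assumes the pages are already extracted as lists of lines, treats a line that recurs on most pages as a header or footer, and finds concepts by case-insensitive lookup of SKOS labels. The function names, the 0.6 threshold, the `ex:` concept URIs, and the sample pages are all hypothetical.

```python
import re
from collections import Counter


def strip_repeated_lines(pages, min_share=0.6):
    """Drop lines that recur on at least `min_share` of the pages.

    Assumption: headers and footers are the lines that repeat across pages.
    """
    if len(pages) < 2:
        return pages  # nothing to compare against
    counts = Counter()
    for page in pages:
        # Count each distinct non-empty line once per page.
        counts.update({line.strip() for line in page if line.strip()})
    threshold = min_share * len(pages)
    repeated = {line for line, n in counts.items() if n >= threshold}
    return [[line for line in page if line.strip() not in repeated] for page in pages]


def find_concepts(text, labels_by_concept):
    """Return the concept URIs whose SKOS labels occur in `text`.

    `labels_by_concept` maps a concept URI to its prefLabel/altLabel strings
    in any language; matching is case-insensitive on whole words.
    """
    found = set()
    for concept, labels in labels_by_concept.items():
        for label in labels:
            pattern = r"\b" + re.escape(label) + r"\b"
            if re.search(pattern, text, flags=re.IGNORECASE):
                found.add(concept)
                break
    return found


# Hypothetical two-language vocabulary and a three-page document.
VOCAB = {
    "ex:Compressor": ["compressor", "Verdichter"],
    "ex:SurgeControl": ["surge control", "Pumpgrenzregelung"],
}
PAGES = [
    ["ACME Spec Rev. B", "The compressor shall be protected."],
    ["ACME Spec Rev. B", "Eine Pumpgrenzregelung ist vorzusehen."],
    ["ACME Spec Rev. B", "Delivery within 12 weeks."],
]

clean_pages = strip_repeated_lines(PAGES)
body = " ".join(line for page in clean_pages for line in page)
concepts = find_concepts(body, VOCAB)
```

Because the English and German labels point to the same concept, a specification in either language ends up tagged identically, which is what makes the later search language-independent.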
Our Solution 
Storage via OntoQUAD
- Triple and/or quad store, SPARQL 1.1, …
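
To make the difference between a triple and a quad concrete, here is a toy in-memory sketch. It is not OntoQUAD's API, which the slides do not show; the `Quad` type, the `match` helper, and all `ex:` identifiers are hypothetical. The fourth element is a named graph, used here on the assumption that each specification's annotations live in their own graph.

```python
from typing import NamedTuple


class Quad(NamedTuple):
    subject: str
    predicate: str
    obj: str
    graph: str  # named graph, here recording the source document


def match(quads, subject=None, predicate=None, obj=None, graph=None):
    """Return the quads matching the pattern; None acts as a wildcard."""
    pattern = (subject, predicate, obj, graph)
    return [
        q for q in quads
        if all(want is None or want == have for want, have in zip(pattern, q))
    ]


STORE = [
    Quad("ex:spec-001", "dcterms:language", "en", "ex:graph/spec-001"),
    Quad("ex:spec-001", "dcterms:subject", "ex:Compressor", "ex:graph/spec-001"),
    Quad("ex:spec-002", "dcterms:subject", "ex:Compressor", "ex:graph/spec-002"),
    Quad("ex:spec-002", "dcterms:subject", "ex:SurgeControl", "ex:graph/spec-002"),
]

# Roughly: SELECT ?s WHERE { GRAPH ?g { ?s dcterms:subject ex:Compressor } }
about_compressors = [
    q.subject for q in match(STORE, predicate="dcterms:subject", obj="ex:Compressor")
]
```

Restricting the `graph` argument corresponds to querying a single named graph, for example to list everything that was extracted from one specification.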
Indexing
- Full text search, result grouping, faceted browsing, SKOS-based label expansion, …
- Apache Solr with lucene-skos plugin (https://github.com/behas/lucene-SKOS)
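
The indexing features listed above map onto standard Solr request parameters. The sketch below only builds such a request URL; the core name `specs` and the field names `language`, `concept`, and `customer` are assumptions, not taken from the project. SKOS label expansion is assumed to be configured on the Solr side and is not shown.

```python
from urllib.parse import urlencode


def build_select_url(base_url, query, facet_fields, group_field=None, rows=10):
    """Build a Solr /select URL with faceting and optional result grouping."""
    params = [
        ("q", query),
        ("rows", rows),
        ("wt", "json"),
        ("facet", "true"),
        ("facet.mincount", 1),  # hide facet values with no hits
    ]
    # One facet.field parameter per facet shown in the UI.
    params += [("facet.field", field) for field in facet_fields]
    if group_field:
        params += [("group", "true"), ("group.field", group_field)]
    return base_url.rstrip("/") + "/select?" + urlencode(params)


# Hypothetical core and field names.
url = build_select_url(
    "http://localhost:8983/solr/specs",
    "surge control",
    facet_fields=["language", "concept"],
    group_field="customer",
)
```

Passing the parameters as a list of pairs rather than a dict keeps the repeated `facet.field` entries, which a dict would collapse into one.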
Our Solution 
Knowledge Management 
via OntoDix, but SKOS-only
Our Solution 
Search 
via AJAX Solr (https://github.com/evolvingweb/ajax-solr)
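
AJAX Solr is a JavaScript library, so the following Python is not its API. It only illustrates the data handling any faceted-search client has to do: Solr's default JSON format returns each facet field as a flat list of alternating values and counts, and a clicked facet value becomes a filter query. The sample response and the field names are made up for illustration.

```python
import json


def facet_pairs(response, field):
    """Turn Solr's flat [value, count, value, count, ...] list into pairs."""
    flat = response["facet_counts"]["facet_fields"][field]
    return list(zip(flat[0::2], flat[1::2]))


def filter_query(field, value):
    """Build the fq clause to send when the user clicks a facet value."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'{field}:"{escaped}"'


# Hypothetical, heavily shortened Solr response.
SAMPLE = json.loads("""
{
  "response": {"numFound": 3, "docs": []},
  "facet_counts": {
    "facet_fields": {"language": ["en", 2, "de", 1]}
  }
}
""")

language_facets = facet_pairs(SAMPLE, "language")
clicked = filter_query("concept", "surge control")
```

Quoting the value in the filter query keeps multi-word concept labels together as one phrase instead of being parsed as separate terms.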
Insights & Further Work 
Iterative development with early customer testing lowers usage barrier 
Lessons learned 
Development of a knowledge base 
Faceted search user interface 
Faceted search on RDF 
Multilingual disambiguation mechanisms
Q&A 
Martin Voigt 
Ontos AG / GmbH 
Nidau (CH) / Leipzig (DE)
T: +49 341 21559-10
M: +49 178 40 222 58
E: martin.voigt@ontos.com 
About Ontos 
DoW – CTI Project 
Ontos Group 
Key Facts 
- Established 2001 
- 15+ employees 
- Share in Eventos RU (30 people)
- approx. 5 million CHF turnover
Industry 
- Media/News 
- Law Enforcement 
- Government 
- (Russia)
