SlideShare a Scribd company logo
NLP2RDFIntegration of Data, Tools andApplicationswith RDF/OWL in the Areas of Textmining andLinguisticsPhD Thesis, Sebastian Hellmann
Extensive Topic – Whatisthecore?Features forMachine LearningWhichfeatures do I needfor a certain Textmining task?An introductoryexample :Resources: Face Recognition Tool thatdetectscoloroftheeyes(brown, green, blue) andtype ofhaircut(Vo-ku-hi-la, Mullet, GI Joe)
Database withAgeandOccupationGoal: predictincomeofpersonsYoung studentsearnlessthanoldCEO‘s. => Color ofeyesandhaircutprobably irrelevant!
Basic idea: a benchmarkingframeworkInput: Task specification
Text
Training/testdataOutput:Tools anddatarequiredtosolvethetaskDo I need POS tags toclassifyTourismdocuments?Prerequisites:Tools andapplicationsneed a standardizedinterface
Data needs a standardizedformatBasic idea: a benchmarkingframeworkNLP2RDF stack
Basic idea: a benchmarkingframeworkGoogle Code project was createdStanford parser was integrated
Ontologieswerefoundandintegrated
Pipeline implemented
Pluginsystemimplemented
SomeresultswereachievedBut…Architecture not flexible enough (Pipeline)
Integration boundto Java
Data sourceswere not sufficient
Wikipedia/DBpediatoocourse-grained
Speed ofintegrationtooslowPrerequisitesOnestep back:Creationofdatasets in RDFData integrationandlinkingofdatasetsLicencesStandardizedformatfortoolintegrationAcquisitionof additional knowledge
Why RDF and OWL ?RDF makesdataintegration easy: URIref, LinkedDataOWL isbased on Description Logics (Guarded Fragment)Availabilityof open datasets (accessandlicence)Diverse serializationsforannotations: XML, Turtle, RDFa+XHTMLScalabletoolsupport (Databases, Reasoning)6.     Iftheonlytoolyouhaveis a hammer, everythinglookslike a nail.
LOD Cloud - over 26 Billion FactsDBpediaiscentral:Cross-domain
Crystalizationpoint (earlybird)Linking Open Data clouddiagram, by Richard CyganiakandAnja Jentzsch. http://lod-cloud.net/
Simplified:Circlesare Database Tables
Links areHTTP-Foreign KeysLinkedDatahttp://www4.wiwiss.fu-berlin.de/rdf_browser/?browse_uri=http%3A%2F%2Fdata.nytimes.com%2FN12930380387917339601ResemblesdatabasetableKey-Value  pairsValues canbe:Datatypes (Strings, Integers)
URIs pointingtosubjects in the same table
URIs pointingtosubjects in anyothertableSPARQL – optimizationsfortablejoinsAll soccer players, who played as goalkeeper for a club that has a stadium with more than 40.000 seats and who are born in a country with more than 10 million inhabitantshttp://tinyurl.com/2uhuow9
SPARQL – optimizationsfortablejoins

More Related Content

PDF
Open hpi semweb-06-part2
PDF
Verifying Integrity Constraints of a RDF-based WordNet
PPT
Corrib.org - OpenSource and Research
PDF
Open hpi semweb-06-part4
PDF
SDA2013 Pundit: Creating, Exploring and Consuming Annotations
PDF
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
PPTX
CSHALS 2010 W3C Semanic Web Tutorial
PDF
LOTUS: Adaptive Text Search for Big Linked Data
Open hpi semweb-06-part2
Verifying Integrity Constraints of a RDF-based WordNet
Corrib.org - OpenSource and Research
Open hpi semweb-06-part4
SDA2013 Pundit: Creating, Exploring and Consuming Annotations
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
CSHALS 2010 W3C Semanic Web Tutorial
LOTUS: Adaptive Text Search for Big Linked Data

Viewers also liked (8)

PPT
Danielienė, Renata „Naujojo ECDL diegimo naujienos. ECDL Lietuva ir testavim...
PPTX
Shafarat diploma pgce presentation
DOC
Documento sin título
PPT
HR Breakfast Forum - 24 September 2013
PPTX
Reglamento evaluacion pnf
PPTX
Twitter and Facebook
PPTX
Local self storage_climate_controlled
PDF
Radvision High Quality Experience Over Unmanaged Networks By Face to Face Live
Danielienė, Renata „Naujojo ECDL diegimo naujienos. ECDL Lietuva ir testavim...
Shafarat diploma pgce presentation
Documento sin título
HR Breakfast Forum - 24 September 2013
Reglamento evaluacion pnf
Twitter and Facebook
Local self storage_climate_controlled
Radvision High Quality Experience Over Unmanaged Networks By Face to Face Live
Ad

Similar to NLP2RDF Wortschatz and Linguistic LOD draft (20)

PDF
Vital AI: Big Data Modeling
ODP
State of the Semantic Web
PDF
X api chinese cop monthly meeting feb.2016
PDF
The web of interlinked data and knowledge stripped
PDF
From Linked Data to Semantic Applications
PDF
Semantic Interoperability - grafi della conoscenza
PDF
Innovative methods for data integration: Linked Data and NLP
PPTX
Linked Data efforts for data standards in biopharma and healthcare
PPT
State and future of linked data in learning analytics
PDF
Transform unstructured e&p information
PDF
AHM 2014: OceanLink, Smart Data versus Smart Applications
DOCX
Data science nlp_resume-2018-abridged
PDF
RDF and other linked data standards — how to make use of big localization data
PDF
Semantic web
PPTX
Linked data for Enterprise Data Integration
PDF
Bio2RDF presentation at Combine 2012
PPT
Toward The Semantic Deep Web
PPT
Collaborative Data Analysis with Taverna Workflows
PDF
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
Vital AI: Big Data Modeling
State of the Semantic Web
X api chinese cop monthly meeting feb.2016
The web of interlinked data and knowledge stripped
From Linked Data to Semantic Applications
Semantic Interoperability - grafi della conoscenza
Innovative methods for data integration: Linked Data and NLP
Linked Data efforts for data standards in biopharma and healthcare
State and future of linked data in learning analytics
Transform unstructured e&p information
AHM 2014: OceanLink, Smart Data versus Smart Applications
Data science nlp_resume-2018-abridged
RDF and other linked data standards — how to make use of big localization data
Semantic web
Linked data for Enterprise Data Integration
Bio2RDF presentation at Combine 2012
Toward The Semantic Deep Web
Collaborative Data Analysis with Taverna Workflows
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
Ad

More from Sebastian Hellmann (19)

PDF
KEDL DBpedia 2019
PDF
Linguistic Linked Open Data, Challenges, Approaches, Future Work
PDF
DBpedia/association Introduction The Hague 12.2.2016
PDF
Lider Reference Model ld4lt session March, 3rd, 2015
PDF
LD4LT Roadmap session 19_02_2015
ODP
DBpedia: A Public Data Infrastructure for the Web of Data
ODP
Integrating NLP using Linked Data
ODP
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
ODP
Linked Data for Abbreviations and Segmentation
ODP
NIF 2.0 Phd thesis intermediate report
ODP
Navigation-induced Knowledge Engineering by Example
ODP
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
PDF
NIF 2.0 draft for Pisa
PDF
Linked Data in Linguistics for NLP and Web Annotation
ODP
Introduction to LDL 2012
ODP
Thesis presentation
ODP
NIF - Version 1.0 - 2011/10/23
PDF
NIF - NLP Interchange Format
PPTX
Tool collection as linkeddata
KEDL DBpedia 2019
Linguistic Linked Open Data, Challenges, Approaches, Future Work
DBpedia/association Introduction The Hague 12.2.2016
Lider Reference Model ld4lt session March, 3rd, 2015
LD4LT Roadmap session 19_02_2015
DBpedia: A Public Data Infrastructure for the Web of Data
Integrating NLP using Linked Data
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
Linked Data for Abbreviations and Segmentation
NIF 2.0 Phd thesis intermediate report
Navigation-induced Knowledge Engineering by Example
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
NIF 2.0 draft for Pisa
Linked Data in Linguistics for NLP and Web Annotation
Introduction to LDL 2012
Thesis presentation
NIF - Version 1.0 - 2011/10/23
NIF - NLP Interchange Format
Tool collection as linkeddata

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Electronic commerce courselecture one. Pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
cuic standard and advanced reporting.pdf
PPTX
Machine Learning_overview_presentation.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Programs and apps: productivity, graphics, security and other tools
PPT
Teaching material agriculture food technology
Digital-Transformation-Roadmap-for-Companies.pptx
Electronic commerce courselecture one. Pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Advanced methodologies resolving dimensionality complications for autism neur...
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Encapsulation_ Review paper, used for researhc scholars
The Rise and Fall of 3GPP – Time for a Sabbatical?
“AI and Expert System Decision Support & Business Intelligence Systems”
MIND Revenue Release Quarter 2 2025 Press Release
NewMind AI Weekly Chronicles - August'25-Week II
cuic standard and advanced reporting.pdf
Machine Learning_overview_presentation.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
20250228 LYD VKU AI Blended-Learning.pptx
Spectral efficient network and resource selection model in 5G networks
MYSQL Presentation for SQL database connectivity
Spectroscopy.pptx food analysis technology
Programs and apps: productivity, graphics, security and other tools
Teaching material agriculture food technology

NLP2RDF Wortschatz and Linguistic LOD draft