SlideShare a Scribd company logo
R2R+BCO-DMO – Linked Oceanographic Datasets
Adila Krisnadhi1,5 Robert Arko2 Suzanne Carbotte2 Cynthia
Chandler3 Michelle Cheatham1 Pascal Hitzler1 Yingji Hu4
Krzysztof Janowciz4 Peng Ji2 Nazifa Karima1 Adam Shepherd3
Peter Wiebe3
1Data Semantics Lab, Wright State University
2Lamont-Doherty Observatory, Columbia University
3Woods Hole Oceanographic Institution
4Geography Department, University of California, Santa Barbara
5Faculty of Computer Science, Universitas Indonesia
Diversity++ 2015
Krisnadhi, et al Diversity++ 2015 1 / 13
Why Linked Data for Oceanography
Data proliferation
Increased number of repositories ⇒ increased heterogeneity.
Need to discover, access, and integrate data cross repositories
R2R & BCO-DMO are repositories.
Both hold datasets of field observations.
Linked data is for metadata of those datasets.
Linked data objective: starting point to enable dataset discovery.
Additional benefit: attribution of datasets to contributors in the form
of links.
Krisnadhi, et al Diversity++ 2015 2 / 13
Rolling Deck Repository (R2R)
Screen shot (10/10/2015) from: http://www.rvdata.us/catalog/Kilo_Moana
Krisnadhi, et al Diversity++ 2015 3 / 13
R2R
http://www.rvdata.us
Every NSF-funded cruise on a vessel in the
academic fleet creates an R2R record.
Environmental sensor data on-board vessels.
Catalog of vessels, instrumentation systems, expeditions, datasets,
investigators, organizations, funding awards, cruise reports, and
navigation tracks.
>530k triples, 25 in-service vessels, >4.3k cruises, >18 mil. archived
files
60,000 page views per month.
Krisnadhi, et al Diversity++ 2015 4 / 13
R2R: Architecture
Original picture from: http://www.rvdata.us/system/files/overview.png as displayed on (10/10/2015) at
http://www.rvdata.us/overview
Krisnadhi, et al Diversity++ 2015 5 / 13
R2R - http://data.rvdata.us
Krisnadhi, et al Diversity++ 2015 6 / 13
Biological and Chemical Oceanography Data Management
Office (BCO-DMO)
Screen shot (10/10/2015) from: http://mapservice.bco-dmo.org/mapserver/maps-ol/index.php
Krisnadhi, et al Diversity++ 2015 7 / 13
BCO-DMO: Architecture
BCO-DMO Data Management Architectural Overview
Metadata
Database
and Web
Content
Data ServerData ServerData Server
Geospatial Access
MapServer-cartography
OpenLayers-interface;
interrogate and draw
features
ExtJS and other JavaScript
libraries-environment
MySQL-metadata
BCO-DMO
Website
Public access
via Drupal
Web content and
metadata
JGOFS/GLOBEC
Backend Data
Storage and
Retrieval
Supporting Software
- Drupal
- PHP, Perl
- Load navigation and date
information into Location
table (Perl)
- Report modules
- NSF Tracker subsystem
Data Manager access
Metadata and web
content insert, update,
delete and display
Perl Library
Perl code calling REST
API via Drupal
November 21, 2013
Highlights
Text based interface; Geospatial
(MapServer) interface; Metadata
database stored in Drupal CMS;
Distributed backend data
management system; Fitness for
purpose tools in MapServer and
JGOFS/GLOBEC; Browser clients,
also distributed; Ability to support
other data management backends;
Semantic elements (contributed vs
standard names); Advanced
search using triple stores from
several sources; No login required;
Access to metadata; Access to
actual data; Data manager
interface via Drupal; Direct transfer
of data and metadata to
appropriate national archive, such
as NODC, when data are final.
Original picture from: http://www.bco-dmo.org/sites/default/files/BCO-DMO_System_Architecture.pdf as displayed on
10/10/2015
Krisnadhi, et al Diversity++ 2015 8 / 13
BCO-DMO
http://bco-dmo.org
PI of NSF-funded research expedition must
submit data from their expedition to
BCO-DMO.
PI may bring own instruments.
Catalog of datasets, instrumentation systems, measurement
parameters, investigators, organizations, funding awards, projects,
programs, and deployments.
Deployments involve than just vessels (i.e., not just cruises).
>2.1 mil triples, 7,500 datasets including information about >1.7k
researchers, >2.1k deployments, 500 projects.
6.5k page views per month.
Krisnadhi, et al Diversity++ 2015 9 / 13
BCO-DMO
Krisnadhi, et al Diversity++ 2015 10 / 13
Overlaps
Only a few dozens oceanographic research vessels being deployed.
R2R is vessel-centric. BCO-DMO is PI-centric and has more than just
cruise.
Overlapping set of people, cruise identifiers (linked between each
other).
341 person instances (exact match)
External links
R2R organization to dbpedia: 288/520
BCO-DMO instruments to dbpedia: 42/409
BCO-DMO organization to dbpedia: 81/488
Krisnadhi, et al Diversity++ 2015 11 / 13
Acknowledgements
GeoLink Project (NSF)
ISWC 2015 Travel Award
Krisnadhi, et al Diversity++ 2015 12 / 13
Thank you!
Krisnadhi, et al Diversity++ 2015 13 / 13

More Related Content

PDF
20160818 Semantics and Linkage of Archived Catalogs
PDF
Data Sharing via Globus in the NIH Intramural Program
PDF
The RDF Report Card: Beyond the Triple Count
PDF
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
PPTX
Dataset Descriptions in Open PHACTS and HCLS
PPTX
The CIARD RINGValeri
PDF
Mapping the Repository Landscape
PPTX
05 SPARQL queries over Open Land Use, Open Transport Net and Smart Points Of ...
20160818 Semantics and Linkage of Archived Catalogs
Data Sharing via Globus in the NIH Intramural Program
The RDF Report Card: Beyond the Triple Count
20161004 “Open Data Web” – A Linked Open Data Repository Built with CKAN
Dataset Descriptions in Open PHACTS and HCLS
The CIARD RINGValeri
Mapping the Repository Landscape
05 SPARQL queries over Open Land Use, Open Transport Net and Smart Points Of ...

What's hot (19)

PDF
Organising principles
PPTX
Organising principles
PPTX
RDA data, linked data, and benefits for users / Gordon Dunsire
PPTX
2015 09 rda-pre-meeting_jk
PPTX
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
PDF
How to clean data less through Linked (Open Data) approach?
PPT
Talis Platform: A Linked Data Engine
PPT
SomeSlides
PPTX
The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...
PPT
GFDC and GFIS
PPTX
March 2013 Bioinformatics Working Group
PDF
Semantic Markup
PDF
Data Mesh-up and Mapping using Semantic Wiki
PDF
Going for GOLD - Adventures in Open Linked Metadata
PDF
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
PDF
2013 open analytics-meetup-mortar
PDF
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
PDF
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
PPTX
Data exchange alternatives, GIGA TAG (2009)
Organising principles
Organising principles
RDA data, linked data, and benefits for users / Gordon Dunsire
2015 09 rda-pre-meeting_jk
Exposing Bibliographic Information as Linked Open Data using Standards-based ...
How to clean data less through Linked (Open Data) approach?
Talis Platform: A Linked Data Engine
SomeSlides
The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...
GFDC and GFIS
March 2013 Bioinformatics Working Group
Semantic Markup
Data Mesh-up and Mapping using Semantic Wiki
Going for GOLD - Adventures in Open Linked Metadata
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
2013 open analytics-meetup-mortar
Relations for Reusing (R4R) in A Shared Context: An Exploration on Research P...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Data exchange alternatives, GIGA TAG (2009)
Ad

Similar to Diversity++2015 talk: R2R+BCO-DMO - Linked Oceanographic Datasets (20)

PDF
Big Data, Beyond the Data Center
PPTX
FAIR Workflows and Research Objects get a Workout
PPTX
ElN - repository integration at the University of Goettingen
PPTX
Modeling Data Life Cycles with PROV
PPTX
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
PPT
An On-line Collaborative Data Management System
PDF
Linking Open Government Data at Scale
PPTX
NIH Data Summit - The NIH Data Commons
PPT
Going for GOLD - Adventures in Open Linked Geospatial Metadata
PPTX
IDRAC RADAR PRESENTATION MATERIAL IS UPLOADED
PPTX
IDRAC EXPLAINATION ON RADAR SYSTEMS SPECIALLY USING WEATHER INFORMATION
PDF
A Gen3 Perspective of Disparate Data
PDF
Making Data Dynamic: Views from UC3, CDL
PPTX
D4Science Data infrastructure: a facilitator for a FAIR data management
PPTX
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
PDF
Tese phd
PPTX
From data portal to knowledge portal: Leveraging semantic technologies to sup...
PPTX
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
PDF
How to Create the Google for Earth Data (XLDB 2015, Stanford)
PPT
2011linked science4mccuskermcguinnessfinal
Big Data, Beyond the Data Center
FAIR Workflows and Research Objects get a Workout
ElN - repository integration at the University of Goettingen
Modeling Data Life Cycles with PROV
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
An On-line Collaborative Data Management System
Linking Open Government Data at Scale
NIH Data Summit - The NIH Data Commons
Going for GOLD - Adventures in Open Linked Geospatial Metadata
IDRAC RADAR PRESENTATION MATERIAL IS UPLOADED
IDRAC EXPLAINATION ON RADAR SYSTEMS SPECIALLY USING WEATHER INFORMATION
A Gen3 Perspective of Disparate Data
Making Data Dynamic: Views from UC3, CDL
D4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
Tese phd
From data portal to knowledge portal: Leveraging semantic technologies to sup...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
How to Create the Google for Earth Data (XLDB 2015, Stanford)
2011linked science4mccuskermcguinnessfinal
Ad

Recently uploaded (20)

PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PDF
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
PDF
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PDF
Placing the Near-Earth Object Impact Probability in Context
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPT
6.1 High Risk New Born. Padetric health ppt
PPTX
Pharmacology of Autonomic nervous system
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
famous lake in india and its disturibution and importance
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PDF
Sciences of Europe No 170 (2025)
PPTX
Introduction to Cardiovascular system_structure and functions-1
PPTX
2. Earth - The Living Planet earth and life
PPT
protein biochemistry.ppt for university classes
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
TOTAL hIP ARTHROPLASTY Presentation.pptx
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Placing the Near-Earth Object Impact Probability in Context
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
Classification Systems_TAXONOMY_SCIENCE8.pptx
6.1 High Risk New Born. Padetric health ppt
Pharmacology of Autonomic nervous system
2. Earth - The Living Planet Module 2ELS
famous lake in india and its disturibution and importance
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
Sciences of Europe No 170 (2025)
Introduction to Cardiovascular system_structure and functions-1
2. Earth - The Living Planet earth and life
protein biochemistry.ppt for university classes
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud

Diversity++2015 talk: R2R+BCO-DMO - Linked Oceanographic Datasets

  • 1. R2R+BCO-DMO – Linked Oceanographic Datasets Adila Krisnadhi1,5 Robert Arko2 Suzanne Carbotte2 Cynthia Chandler3 Michelle Cheatham1 Pascal Hitzler1 Yingji Hu4 Krzysztof Janowciz4 Peng Ji2 Nazifa Karima1 Adam Shepherd3 Peter Wiebe3 1Data Semantics Lab, Wright State University 2Lamont-Doherty Observatory, Columbia University 3Woods Hole Oceanographic Institution 4Geography Department, University of California, Santa Barbara 5Faculty of Computer Science, Universitas Indonesia Diversity++ 2015 Krisnadhi, et al Diversity++ 2015 1 / 13
  • 2. Why Linked Data for Oceanography Data proliferation Increased number of repositories ⇒ increased heterogeneity. Need to discover, access, and integrate data cross repositories R2R & BCO-DMO are repositories. Both hold datasets of field observations. Linked data is for metadata of those datasets. Linked data objective: starting point to enable dataset discovery. Additional benefit: attribution of datasets to contributors in the form of links. Krisnadhi, et al Diversity++ 2015 2 / 13
  • 3. Rolling Deck Repository (R2R) Screen shot (10/10/2015) from: http://www.rvdata.us/catalog/Kilo_Moana Krisnadhi, et al Diversity++ 2015 3 / 13
  • 4. R2R http://www.rvdata.us Every NSF-funded cruise on a vessel in the academic fleet creates an R2R record. Environmental sensor data on-board vessels. Catalog of vessels, instrumentation systems, expeditions, datasets, investigators, organizations, funding awards, cruise reports, and navigation tracks. >530k triples, 25 in-service vessels, >4.3k cruises, >18 mil. archived files 60,000 page views per month. Krisnadhi, et al Diversity++ 2015 4 / 13
  • 5. R2R: Architecture Original picture from: http://www.rvdata.us/system/files/overview.png as displayed on (10/10/2015) at http://www.rvdata.us/overview Krisnadhi, et al Diversity++ 2015 5 / 13
  • 7. Biological and Chemical Oceanography Data Management Office (BCO-DMO) Screen shot (10/10/2015) from: http://mapservice.bco-dmo.org/mapserver/maps-ol/index.php Krisnadhi, et al Diversity++ 2015 7 / 13
  • 8. BCO-DMO: Architecture BCO-DMO Data Management Architectural Overview Metadata Database and Web Content Data ServerData ServerData Server Geospatial Access MapServer-cartography OpenLayers-interface; interrogate and draw features ExtJS and other JavaScript libraries-environment MySQL-metadata BCO-DMO Website Public access via Drupal Web content and metadata JGOFS/GLOBEC Backend Data Storage and Retrieval Supporting Software - Drupal - PHP, Perl - Load navigation and date information into Location table (Perl) - Report modules - NSF Tracker subsystem Data Manager access Metadata and web content insert, update, delete and display Perl Library Perl code calling REST API via Drupal November 21, 2013 Highlights Text based interface; Geospatial (MapServer) interface; Metadata database stored in Drupal CMS; Distributed backend data management system; Fitness for purpose tools in MapServer and JGOFS/GLOBEC; Browser clients, also distributed; Ability to support other data management backends; Semantic elements (contributed vs standard names); Advanced search using triple stores from several sources; No login required; Access to metadata; Access to actual data; Data manager interface via Drupal; Direct transfer of data and metadata to appropriate national archive, such as NODC, when data are final. Original picture from: http://www.bco-dmo.org/sites/default/files/BCO-DMO_System_Architecture.pdf as displayed on 10/10/2015 Krisnadhi, et al Diversity++ 2015 8 / 13
  • 9. BCO-DMO http://bco-dmo.org PI of NSF-funded research expedition must submit data from their expedition to BCO-DMO. PI may bring own instruments. Catalog of datasets, instrumentation systems, measurement parameters, investigators, organizations, funding awards, projects, programs, and deployments. Deployments involve than just vessels (i.e., not just cruises). >2.1 mil triples, 7,500 datasets including information about >1.7k researchers, >2.1k deployments, 500 projects. 6.5k page views per month. Krisnadhi, et al Diversity++ 2015 9 / 13
  • 10. BCO-DMO Krisnadhi, et al Diversity++ 2015 10 / 13
  • 11. Overlaps Only a few dozens oceanographic research vessels being deployed. R2R is vessel-centric. BCO-DMO is PI-centric and has more than just cruise. Overlapping set of people, cruise identifiers (linked between each other). 341 person instances (exact match) External links R2R organization to dbpedia: 288/520 BCO-DMO instruments to dbpedia: 42/409 BCO-DMO organization to dbpedia: 81/488 Krisnadhi, et al Diversity++ 2015 11 / 13
  • 12. Acknowledgements GeoLink Project (NSF) ISWC 2015 Travel Award Krisnadhi, et al Diversity++ 2015 12 / 13
  • 13. Thank you! Krisnadhi, et al Diversity++ 2015 13 / 13