SlideShare a Scribd company logo
Federated Architecture with Provenance and Access Control
to realize Open Digital Data for MGI
Amit Sheth and the team
Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing
Wright State University, Dayton, OH-45435
Kno.e.sis’ MGI related projects
• Federated Semantic Services Platform for Material
Sciences (funded via AFRL/Rome)
• Materials Database Knowledge Discovery and Data
Mining (funded from AFRL/RX)
Faculty: A. Sheth (PI), K. Thirunarayan (coPI), R.
Srinivasan (coPI), Clare Paul (expert)
Students: K. Gunaratna, M. Panahiazar, S.
Lalithsena, V. Nguyen, N. Bryant, A. Shiveley, N.
Jaykumar
2
3
Databases

Single
Access
Personal desktops

Lab notebooks

4
Public-Private Data Sharing
• Enhance publicly available datasets while
retaining intellectual property data for businesses

Private data and metadata
(eg. ongoing experimental processes, intellectual property data)

Selectively shared data and metadata
(eg. with ongoing collaborators, licensed data)

Public data and metadata
(released products, material specifications)
5
Federated Architecture
1. User
Authentication

2. Federated Semantic
Query Processor

3. Semantics
Mappings

Federal Endpoint

Private
Shared
Public

AC
Processor
Semantic
Query
Processor

Research Lab A

Private
Shared
Public

AC
Processor
Semantic
Query
Processor

Industry Lab B

Private
Shared
Public

AC
Processor
Semantic
Query
Processor

Organization C
6
Principles of a Federation
• Each component controls access to its local data
independently (local autonomy)
• A query is decomposed to multiple sub
queries, each sub query is executed at one
component
• Results from sub queries are combined by the
federated query processor (control global access)
Provenance Metadata
• Explains the origins of an artifact, such as
– How was it created?
– Who created it?
– When was it created?

• Example: for a given material X
– Which processes and properties involved?
– Input and output values of those processes?
– Which research/engineering team performed the
experiments?
Why Data Provenance?
•
•
•
•
•
•

Verification
Reproducibility
Trust
Testing
Quality
…
Product – Process – Product
Output
Input

Processes

Capturing provenance: Sufficient + Accurate
=> Reproduce the same output
A Unified Provenance Framework
• Capturing domain-specific provenance
– in addition to the W3C PROV ontology

• Representing in standard RDF
• Query engine for processing provenance queries
• Operators for comparing artifacts’ provenance
Can we choose any part of our
Semantic Web data
to share with public community,
or with selective collaborators ?
Semantic Web Data

Subject

Predicate

Object
A triple is in the format (Subject, Predicate, Object)
An RDF Dataset is a set of triples
Linked Data Story So Far?

Non-open data?
Not there yet!
Can we choose any part of our
Semantic Web data
to share with public community,
or with selective collaborators ?
Different levels of granularity
– Individual resources
• Example: a material product, a manufacturing process

– Individual triples
• Example: properties of a product, or process

– Entire datasets

Enable flexible selection of any data pieces to
be shared at anytime
Can we choose any part of our
Semantic Web data
to share with public community,
or with selective collaborators ?
Federal
Endpoint
User X of either
Public group or Collaborators

Manager Y
of component A

1. Query Rewriting
2. AC-embeded Query Execution
AC
Processes

Creating
Resources

Granting
Permissions

Local Component A

Inferring
Permissions
Various Policies
•
•
•
•

Role-based Access Control (RBAC)
Mandatory Access Control (MAC)
Attribute-based Access Control (ABAC)
Discretionary Access Control (DAC)
1. Which policy? Depend on the organization’s
needs!
2. Our AC mechanism can be extended to
support any of these policies
Summary
• Semantic Federated Architecture enables us to
–
–
–
–

Enhance the open data access
Protect the confidential information
Improve the communication between collaborating teams
Support the reproducibility of material products with
confidence and trust
– Utilize the power of Semantic Web standards and
technologies to do so more easily, effectively and flexibly
Kno.e.sis
Thank you, and please visit us at

http://knoesis.org/

Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing
Wright State University, Dayton, Ohio, USA

21

More Related Content

PPTX
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
PDF
FAIR Data Management and FAIR Data Sharing
PPTX
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
PPTX
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
PPT
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
PPT
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
PPT
Identifiers for Researchers and Data: Increasing Attribution and Discovery– J...
PPTX
Reading Group: From Database to Dataspaces
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
FAIR Data Management and FAIR Data Sharing
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
Identifiers for Researchers and Data: Increasing Attribution and Discovery– J...
Reading Group: From Database to Dataspaces

What's hot (20)

PDF
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
PPTX
VALA 2016 L-Plate session on Linked Open Data
PPTX
Report of the second FAIRDOM foundry
PPTX
FAIRy stories: tales from building the FAIR Research Commons
PPTX
Reproducible Research: how could Research Objects help
PPTX
Creating impact with accessible data in agriculture and nutrition: sharing da...
PDF
pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)
PDF
Role of libraries in research and scholarly communication
PPTX
Making your data good enough for sharing.
PPTX
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
PPTX
RMG Open Data Policy Readiness
PPTX
Let’s go on a FAIR safari!
PPTX
Reflections on a (slightly unusual) multi-disciplinary academic career
PDF
NIH BD2K bioCADDIE DataMed: Data Discovery Index
PPTX
FAIR data and model management for systems biology.
PPTX
Scratchpads: the Virtual Research Environment for biodiversity data
PDF
ICIC 2013 New Product Introductions ChemAxon
PPTX
CETIS09 OER Technical Roundtable
PPTX
Reproducible and citable data and models: an introduction.
PDF
Data Repositories Impact
ICIC 2013 Conference Proceedings Uwe Rosemann TIB
VALA 2016 L-Plate session on Linked Open Data
Report of the second FAIRDOM foundry
FAIRy stories: tales from building the FAIR Research Commons
Reproducible Research: how could Research Objects help
Creating impact with accessible data in agriculture and nutrition: sharing da...
pro-iBiosphere 2013-05 Linked Open Data (Gregor Hagedorn)
Role of libraries in research and scholarly communication
Making your data good enough for sharing.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.
RMG Open Data Policy Readiness
Let’s go on a FAIR safari!
Reflections on a (slightly unusual) multi-disciplinary academic career
NIH BD2K bioCADDIE DataMed: Data Discovery Index
FAIR data and model management for systems biology.
Scratchpads: the Virtual Research Environment for biodiversity data
ICIC 2013 New Product Introductions ChemAxon
CETIS09 OER Technical Roundtable
Reproducible and citable data and models: an introduction.
Data Repositories Impact
Ad

Viewers also liked (17)

PDF
Meena Nagarajan Ph.D. Dissertation Defense
PPTX
Introduction to Kno.e.sis Center - March 2011
PPTX
Citizen Sensor Data Mining, Social Media Analytics and Development Centric ...
PPTX
Semantic Technologies for Big Sciences including Astrophysics
PPTX
Role of Semantic Web in Health Informatics
KEY
Detecting Signals from Real-time Social Web
PPTX
Computing for Human Experience [v3, Aug-Oct 2010]
PPTX
Active Perception over Machine and Citizen Sensing
PPTX
Computing for Human Experience [v4]: Keynote @ OnTheMove Federated Conferences
PPTX
Domain case study: successful application of Semantic Web technologies and to...
PPTX
Realizing Semantic Web - Light Weight semantics and beyond
PPT
User Experiences of Enterprise Semantic Content Management
PPT
Kino : Making Semantic Annotations Easier
PDF
How to Leverage Social Media Communities for Crisis Response Coordination
PPTX
PhD thesis defense of Ajith Ranabahu
PPT
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Meena Nagarajan Ph.D. Dissertation Defense
Introduction to Kno.e.sis Center - March 2011
Citizen Sensor Data Mining, Social Media Analytics and Development Centric ...
Semantic Technologies for Big Sciences including Astrophysics
Role of Semantic Web in Health Informatics
Detecting Signals from Real-time Social Web
Computing for Human Experience [v3, Aug-Oct 2010]
Active Perception over Machine and Citizen Sensing
Computing for Human Experience [v4]: Keynote @ OnTheMove Federated Conferences
Domain case study: successful application of Semantic Web technologies and to...
Realizing Semantic Web - Light Weight semantics and beyond
User Experiences of Enterprise Semantic Content Management
Kino : Making Semantic Annotations Easier
How to Leverage Social Media Communities for Crisis Response Coordination
PhD thesis defense of Ajith Ranabahu
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Ad

Similar to Federated Architecture with Provenance and Access Control to realize Open Digital Data for MGI (20)

PPT
Gt ea2009
PPT
The Social Data Web
PDF
Domain Semantics
PPT
Michael Lang Sr. Presentation
PPTX
Why I don't use Semantic Web technologies anymore, event if they still influe...
PPT
Pragmatic Approaches to the Semantic Web
PPT
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
PPT
Structured Dynamics' Semantic Technologies Product Stack
PPTX
Lotico oct 2010
PDF
Linked Open Data - State of the Art, Challenges and Applications
PDF
Some news about the SW
PPTX
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
PPT
F E A D R M A K M 2005 03 28
PPT
Fea Drm Akm 2005 03 28
ODT
Riding The Semantic Wave
PDF
Semantic web browser
PPT
Semantic Web in Action
PDF
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
PPTX
Publishing "5 star" data: the case for RDF
PPTX
Van de droom van het Semantic Web naar de realiteit van Linked Open
Gt ea2009
The Social Data Web
Domain Semantics
Michael Lang Sr. Presentation
Why I don't use Semantic Web technologies anymore, event if they still influe...
Pragmatic Approaches to the Semantic Web
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
Structured Dynamics' Semantic Technologies Product Stack
Lotico oct 2010
Linked Open Data - State of the Art, Challenges and Applications
Some news about the SW
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
F E A D R M A K M 2005 03 28
Fea Drm Akm 2005 03 28
Riding The Semantic Wave
Semantic web browser
Semantic Web in Action
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
Publishing "5 star" data: the case for RDF
Van de droom van het Semantic Web naar de realiteit van Linked Open

Recently uploaded (20)

PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
RMMM.pdf make it easy to upload and study
PDF
Classroom Observation Tools for Teachers
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Computing-Curriculum for Schools in Ghana
PDF
Trump Administration's workforce development strategy
PDF
01-Introduction-to-Information-Management.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
A systematic review of self-coping strategies used by university students to ...
PDF
Weekly quiz Compilation Jan -July 25.pdf
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Chinmaya Tiranga quiz Grand Finale.pdf
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Microbial diseases, their pathogenesis and prophylaxis
RMMM.pdf make it easy to upload and study
Classroom Observation Tools for Teachers
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
Microbial disease of the cardiovascular and lymphatic systems
Computing-Curriculum for Schools in Ghana
Trump Administration's workforce development strategy
01-Introduction-to-Information-Management.pdf
Supply Chain Operations Speaking Notes -ICLT Program
A systematic review of self-coping strategies used by university students to ...
Weekly quiz Compilation Jan -July 25.pdf
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Final Presentation General Medicine 03-08-2024.pptx
human mycosis Human fungal infections are called human mycosis..pptx
2.FourierTransform-ShortQuestionswithAnswers.pdf

Federated Architecture with Provenance and Access Control to realize Open Digital Data for MGI

  • 1. Federated Architecture with Provenance and Access Control to realize Open Digital Data for MGI Amit Sheth and the team Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing Wright State University, Dayton, OH-45435
  • 2. Kno.e.sis’ MGI related projects • Federated Semantic Services Platform for Material Sciences (funded via AFRL/Rome) • Materials Database Knowledge Discovery and Data Mining (funded from AFRL/RX) Faculty: A. Sheth (PI), K. Thirunarayan (coPI), R. Srinivasan (coPI), Clare Paul (expert) Students: K. Gunaratna, M. Panahiazar, S. Lalithsena, V. Nguyen, N. Bryant, A. Shiveley, N. Jaykumar 2
  • 3. 3
  • 5. Public-Private Data Sharing • Enhance publicly available datasets while retaining intellectual property data for businesses Private data and metadata (eg. ongoing experimental processes, intellectual property data) Selectively shared data and metadata (eg. with ongoing collaborators, licensed data) Public data and metadata (released products, material specifications) 5
  • 6. Federated Architecture 1. User Authentication 2. Federated Semantic Query Processor 3. Semantics Mappings Federal Endpoint Private Shared Public AC Processor Semantic Query Processor Research Lab A Private Shared Public AC Processor Semantic Query Processor Industry Lab B Private Shared Public AC Processor Semantic Query Processor Organization C 6
  • 7. Principles of a Federation • Each component controls access to its local data independently (local autonomy) • A query is decomposed to multiple sub queries, each sub query is executed at one component • Results from sub queries are combined by the federated query processor (control global access)
  • 8. Provenance Metadata • Explains the origins of an artifact, such as – How was it created? – Who created it? – When was it created? • Example: for a given material X – Which processes and properties involved? – Input and output values of those processes? – Which research/engineering team performed the experiments?
  • 10. Product – Process – Product Output Input Processes Capturing provenance: Sufficient + Accurate => Reproduce the same output
  • 11. A Unified Provenance Framework • Capturing domain-specific provenance – in addition to the W3C PROV ontology • Representing in standard RDF • Query engine for processing provenance queries • Operators for comparing artifacts’ provenance
  • 12. Can we choose any part of our Semantic Web data to share with public community, or with selective collaborators ?
  • 13. Semantic Web Data Subject Predicate Object A triple is in the format (Subject, Predicate, Object) An RDF Dataset is a set of triples
  • 14. Linked Data Story So Far? Non-open data? Not there yet!
  • 15. Can we choose any part of our Semantic Web data to share with public community, or with selective collaborators ?
  • 16. Different levels of granularity – Individual resources • Example: a material product, a manufacturing process – Individual triples • Example: properties of a product, or process – Entire datasets Enable flexible selection of any data pieces to be shared at anytime
  • 17. Can we choose any part of our Semantic Web data to share with public community, or with selective collaborators ?
  • 18. Federal Endpoint User X of either Public group or Collaborators Manager Y of component A 1. Query Rewriting 2. AC-embeded Query Execution AC Processes Creating Resources Granting Permissions Local Component A Inferring Permissions
  • 19. Various Policies • • • • Role-based Access Control (RBAC) Mandatory Access Control (MAC) Attribute-based Access Control (ABAC) Discretionary Access Control (DAC) 1. Which policy? Depend on the organization’s needs! 2. Our AC mechanism can be extended to support any of these policies
  • 20. Summary • Semantic Federated Architecture enables us to – – – – Enhance the open data access Protect the confidential information Improve the communication between collaborating teams Support the reproducibility of material products with confidence and trust – Utilize the power of Semantic Web standards and technologies to do so more easily, effectively and flexibly
  • 21. Kno.e.sis Thank you, and please visit us at http://knoesis.org/ Kno.e.sis – Ohio Center of Excellence in Knowledge-enabled Computing Wright State University, Dayton, Ohio, USA 21

Editor's Notes

  • #4: A picture with 7 distinct stages to bring a material from the discovery research to product deployment.Discovery stage needs maximum coverage of data as much as possibleFor example, a designer may want to find all the property information about a material available
  • #5: Data is spread all over the heterogeneous sources but inaccessbile to researchers and engineers: private lab info, a desktop, notebook, firewall To make it easy for everyone, a single access point to search for all publicly available information about materials?
  • #6: For each organization like research lab, industry company, three kinds of data: private, selectively shared and public
  • #7: Semantics Mappings
  • #9: DefinitionExample
  • #10: Data provenance is useful for many purposes.Example in the next slide
  • #11: It’s crucial to capture various partial-ordered processes and their detail parameters (properties, compositions, predicted response, etc) as provenance of the output material product.Missing or inaccurate information of any important factor in the processes may result a different product, which may affect the verification, reproducibility, testing and trust.
  • #14: RecapRDF Datasets can be intuitively represented as a graph with a set of resources connected by edges.This graph maybe replaced by another graph which describes an example in the material science project
  • #17: To meet customized needs of different organizations
  • #19: By capturing the access control primitive operators in processes1) A manager Y of a local component can grant access to individual users or a group of users.The Public group is dedicated to the entired federated system. Any resources granted to this Public group is available for everyone.2) Meanwhile, we are also able to track any access rights in the system.One important scenario may be, one manager Y suspects can ask why a suspectious user has access to an important resource.