1
Unlocking the Value of Health Data
Sandeep Purao, Ph.D.
Research Director, Center for Enterprise Architecture
Associate Professor, College of Information Sciences and Technology
Penn State University, University Park, PA
Congressional Luncheon Series, 5 October 2011
© Sandeep Purao. spurao@ist.psu.edu
My Emphasis is on ‘Data’
2
© Sandeep Purao. spurao@ist.psu.edu
I have three Goals today
• I want to add structure to the large and intractable
problem of how to deal with data in healthcare
• I would like to point out ongoing research in other
domains based on this structure
• I would like to suggest how we may leverage this
research or identify opportunities for enhancement
3
© Sandeep Purao. spurao@ist.psu.edu
A Data (Life) Cycle helps
4
Generate /
Capture
Store /
Categorize
Adding structure to the problem of dealing with data in healthcare
Share / Make
Available
Use / Make
Sense
Destroy /
Shred
A data life cycle brings to the foreground
the phases through which data must flow
© Sandeep Purao. spurao@ist.psu.edu
A Layered view of Data helps
5
Data for the support of Clinical tasks and systems
Adding structure to the problem of dealing with data in healthcare
Data for the support of Administrative tasks and systems
Data that is Public or is available in Public sources
Primary Data from Institutions and Researchers
© Sandeep Purao. spurao@ist.psu.edu
Here is a Simple Structure
6
Generate /
Capture
Store /
Categorize
Share / Make
Available
Use / Make
Sense
Destroy /
Shred
Data for the support of Clinical tasks and systems
Data for the support of Administrative tasks and systems
Data that is Public or is available in Public sources
Primary Data from Institutions and Researchers
Adding structure to the problem of dealing with data in healthcare
© Sandeep Purao. spurao@ist.psu.edu
Adding Roles to the Structure
7
Adding structure to the problem of dealing with data in healthcare
Physicians
Patients
Policy Makers
Lawyers
Researchers
© Sandeep Purao. spurao@ist.psu.edu
There are some Key Problems
8
Scale
Inter-Operability
Using the structure to work with key problems related to data in healthcare
Security
Sense-
Making
© Sandeep Purao. spurao@ist.psu.edu
There is Research Elsewhere
• Scale
– Big Data – moving Giga to Tera to Peta
– Clouds, Hadoop and Map-Reduce
– Extracting data from information **
• Inter-operability
• Security
• Sense-making
9
Pointing to Ongoing Research in Other Domains
© Sandeep Purao. spurao@ist.psu.edu
There is Research Elsewhere
• Scale
• Inter-operability
– Voluntary and Consensus standards including HL7 **
– Heterogeneity, ontology and semantics (NLM) **
– Regional health IT partnerships **
• Security
• Sense-making
10
Pointing to Ongoing Research in Other Domains
© Sandeep Purao. spurao@ist.psu.edu
There is Research Elsewhere
11
Pointing to Ongoing Research in Other Domains
• Scale
• Inter-operability
• Security
– Dealing with legal co-existence of malicious users **
– Measures such as Role-based Access
– Laws to prevent access to EHR **
• Sense-making
© Sandeep Purao. spurao@ist.psu.edu
There is Research Elsewhere
12
Pointing to Ongoing Research in Other Domains
• Scale
• Inter-operability
• Security
• Sense-Making
– Measuring data quality in crowd-based forums **
– Search patterns and user behaviors **
– Data delivery / use for e-health
© Sandeep Purao. spurao@ist.psu.edu
Opportunities
• It is possible to leverage / enhance research from other
domains to add to what we know about the Data Puzzle
in the Healthcare context
13
Leveraging or Enhancing Existing Research
© Sandeep Purao. spurao@ist.psu.edu
Some Examples - 1
• Example 1: Regional health partnerships
– A study of regional health IT partnerships extending ideas and
theories about outsourcing
– Problem addressed: Data storage and Data sharing
• Example 2: Extracting action knowledge
– Studies of work in refineries and with health professionals to
extract and represent action knowledge
– Problem addressed: Data use and sense-making
14
Leveraging or Enhancing Existing Research
© Sandeep Purao. spurao@ist.psu.edu
Some Examples - 2
• Example 3: Changing models for data governance with clouds
– A study of healthcare organizations to understand how data
stewardship and governance models are changing with cloud
– Problem addressed: Data storage and exchange
• Example 4: Using context to overcome data heterogeneity
– Modeling context to understand how data may be exchanged
across different user communities
– Problem addressed: Data Sharing and Data use
15
Leveraging or Enhancing Existing Research
© Sandeep Purao. spurao@ist.psu.edu
Some Examples - 3
• Example 5: Data use in collaborative healthcare teams
– A study to understand how teams of healthcare professionals
access and use data
– Problem addressed: Data sharing and Data use
• Example 6: Search patterns for specialized information on the web
– Empirical analyses of how user communities look for health-
related and other information on web sources
– Problem addressed: Data Access and Data use
16
Leveraging or Enhancing Existing Research
© Sandeep Purao. spurao@ist.psu.edu
What I said I wanted to do
• I want to add structure to the large and intractable problem of how
to deal with data in healthcare
• I would like to point out ongoing research in other domains based
on this structure
• I would like to suggest how we may leverage this research or
identify opportunities for enhancement
17
© Sandeep Purao. spurao@ist.psu.edu
Summary
18
For further dialog: spurao@ist.psu.edu

More Related Content

PPT
Implementing the Maternity information system in South Canterbury
PPTX
Enhancing Our Capacity for Large Health Dataset Analysis
PPTX
Medical Writing
PPTX
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
PPTX
Making Informed Collection Decisions in a Research Environment
PDF
Research Data Management Services at UWA (July 2015)
PPT
Baljeet ppt(1)2
PDF
Data Management Lab: Data mapping exercise instructions
Implementing the Maternity information system in South Canterbury
Enhancing Our Capacity for Large Health Dataset Analysis
Medical Writing
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
Making Informed Collection Decisions in a Research Environment
Research Data Management Services at UWA (July 2015)
Baljeet ppt(1)2
Data Management Lab: Data mapping exercise instructions

What's hot (20)

PPT
Data management planning in the Australian funding landscape by Sarah Olesen
PPTX
Agile Curation: 2015 AGU Presentation
PPTX
PFL data collection – hands on session
PPTX
PEI Research Grants
PDF
Practical challenges for researchers in data sharing
PDF
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
PPTX
PEI Research Initiative
PDF
Tasmanian data linkage unit
DOCX
MHA 616 Exceptional Education - snaptutorial.com
PPT
Survey of research data management practices up2010digschol2011
DOCX
MHA 616 Effective Communication - snaptutorial.com
PPTX
ANDS health and medical data webinar 9 May. Review of the National Statement ...
PPTX
Open Access as a Means to Produce High Quality Data
DOC
مقاله للترجمه
PDF
Mha 616 Believe Possibilities / snaptutorial.com
PPTX
Lecture 9C
PPT
The use of ‘colloquial evidence’ in HTA: the experience of NICE
PPT
Challenges in commissioning research on what works in integrated care
PPTX
Investigator-initiated clinical trials: a community perspective
PDF
Abstract: Perceptions of the Nursing Faculty Towards the Development of eTest...
Data management planning in the Australian funding landscape by Sarah Olesen
Agile Curation: 2015 AGU Presentation
PFL data collection – hands on session
PEI Research Grants
Practical challenges for researchers in data sharing
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
PEI Research Initiative
Tasmanian data linkage unit
MHA 616 Exceptional Education - snaptutorial.com
Survey of research data management practices up2010digschol2011
MHA 616 Effective Communication - snaptutorial.com
ANDS health and medical data webinar 9 May. Review of the National Statement ...
Open Access as a Means to Produce High Quality Data
مقاله للترجمه
Mha 616 Believe Possibilities / snaptutorial.com
Lecture 9C
The use of ‘colloquial evidence’ in HTA: the experience of NICE
Challenges in commissioning research on what works in integrated care
Investigator-initiated clinical trials: a community perspective
Abstract: Perceptions of the Nursing Faculty Towards the Development of eTest...
Ad

Similar to Unlocking the value of health data - Presentation at the Congressional Luncheon Series - Oct 2011 (20)

PPTX
Access to health data and information – challenges
PPTX
The Role of Data Lakes in Healthcare
PDF
Trusted! Quest for data-driven and fair health solutions
PDF
IRJET- A Survey on Big Data Frameworks and Approaches in Health Care Sector
PPTX
AMDIS CHIME Fall Symposium
PDF
Salus.Coop Informe Final
PDF
Understanding the Need of Data Integration in E Healthcare
PDF
Sun==big data analytics for health care
PDF
Future of patient data global summary - 29 may 2018
PDF
Data Analytics Action Figures
DOCX
Big Data in He
PPTX
PSB2014 A Vision for Biomedical Research
PDF
ESR10 chess orientation poster casandra grundstrom
PDF
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
PPTX
Applications of Data Science in Healthcare
PDF
Big implications of Big Data in healthcare
PPTX
Health Data Sharing Scene Setting
PDF
Digital transformation to enable a FAIR approach for health data science
PPTX
Healthcare Data Analytics.pptx
PPTX
Clinical Data Models - The Hyve - Bio IT World April 2019
Access to health data and information – challenges
The Role of Data Lakes in Healthcare
Trusted! Quest for data-driven and fair health solutions
IRJET- A Survey on Big Data Frameworks and Approaches in Health Care Sector
AMDIS CHIME Fall Symposium
Salus.Coop Informe Final
Understanding the Need of Data Integration in E Healthcare
Sun==big data analytics for health care
Future of patient data global summary - 29 may 2018
Data Analytics Action Figures
Big Data in He
PSB2014 A Vision for Biomedical Research
ESR10 chess orientation poster casandra grundstrom
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
Applications of Data Science in Healthcare
Big implications of Big Data in healthcare
Health Data Sharing Scene Setting
Digital transformation to enable a FAIR approach for health data science
Healthcare Data Analytics.pptx
Clinical Data Models - The Hyve - Bio IT World April 2019
Ad

More from Sandeep Purao (13)

PPT
Keynote at Doctoral Consortium - CAiSE 2013 - Valencia Spain
PPT
A Personal View on Research and Writing
PPT
Re-using Integration Patterns as Design Knowledge
PPT
Problem Solving Process
PPT
Technology Choices for Enterprise Integration
PPT
Introduction to a Course in Advanced Enterprise Integration
PPT
Systems of Systems - Design and Management
PPT
Standards and Standardization - A Research Project
PPT
SOA Methodologies in Practice
PPT
Standardization: Overcoming Design by Committee
PPT
Using Problems to learn Service-oriented Computing
PPT
DESRIST 2008 Doctoral Consortium Report
PPT
The overlaps between Action Research and Design Research
Keynote at Doctoral Consortium - CAiSE 2013 - Valencia Spain
A Personal View on Research and Writing
Re-using Integration Patterns as Design Knowledge
Problem Solving Process
Technology Choices for Enterprise Integration
Introduction to a Course in Advanced Enterprise Integration
Systems of Systems - Design and Management
Standards and Standardization - A Research Project
SOA Methodologies in Practice
Standardization: Overcoming Design by Committee
Using Problems to learn Service-oriented Computing
DESRIST 2008 Doctoral Consortium Report
The overlaps between Action Research and Design Research

Recently uploaded (20)

PDF
Kishore Vora - Best CFO in India to watch in 2025.pdf
PDF
Cross-Cultural Leadership Practices in Education (www.kiu.ac.ug)
DOCX
Center Enamel Powering Innovation and Resilience in the Italian Chemical Indu...
PDF
Engaging Stakeholders in Policy Discussions: A Legal Framework (www.kiu.ac.ug)
PDF
Susan Semmelmann: Enriching the Lives of others through her Talents and Bless...
PDF
533158074-Saudi-Arabia-Companies-List-Contact.pdf
PPTX
Portfolio Example- Market & Consumer Insights – Strategic Entry for BYD UK.pptx
PDF
HQ #118 / 'Building Resilience While Climbing the Event Mountain
DOCX
Emerging Dubai Investment Opportunities in 2025.docx
PPTX
Transportation in Logistics management.pptx
PDF
Sustainable Digital Finance in Asia_FINAL_22.pdf
PDF
1911 Gold Corporate Presentation Aug 2025.pdf
PPTX
chapter 2 entrepreneurship full lecture ppt
PPTX
2 - Self & Personality 587689213yiuedhwejbmansbeakjrk
PPTX
TRAINNING, DEVELOPMENT AND APPRAISAL.pptx
PDF
Chapter 2 - AI chatbots and prompt engineering.pdf
PDF
#1 Safe and Secure Verified Cash App Accounts for Purchase.pdf
PDF
Vinod Bhatt - Most Inspiring Supply Chain Leader in India 2025.pdf
PDF
income tax laws notes important pakistan
PDF
Second Hand Fashion Call to Action March 2025
Kishore Vora - Best CFO in India to watch in 2025.pdf
Cross-Cultural Leadership Practices in Education (www.kiu.ac.ug)
Center Enamel Powering Innovation and Resilience in the Italian Chemical Indu...
Engaging Stakeholders in Policy Discussions: A Legal Framework (www.kiu.ac.ug)
Susan Semmelmann: Enriching the Lives of others through her Talents and Bless...
533158074-Saudi-Arabia-Companies-List-Contact.pdf
Portfolio Example- Market & Consumer Insights – Strategic Entry for BYD UK.pptx
HQ #118 / 'Building Resilience While Climbing the Event Mountain
Emerging Dubai Investment Opportunities in 2025.docx
Transportation in Logistics management.pptx
Sustainable Digital Finance in Asia_FINAL_22.pdf
1911 Gold Corporate Presentation Aug 2025.pdf
chapter 2 entrepreneurship full lecture ppt
2 - Self & Personality 587689213yiuedhwejbmansbeakjrk
TRAINNING, DEVELOPMENT AND APPRAISAL.pptx
Chapter 2 - AI chatbots and prompt engineering.pdf
#1 Safe and Secure Verified Cash App Accounts for Purchase.pdf
Vinod Bhatt - Most Inspiring Supply Chain Leader in India 2025.pdf
income tax laws notes important pakistan
Second Hand Fashion Call to Action March 2025

Unlocking the value of health data - Presentation at the Congressional Luncheon Series - Oct 2011

  • 1. 1 Unlocking the Value of Health Data Sandeep Purao, Ph.D. Research Director, Center for Enterprise Architecture Associate Professor, College of Information Sciences and Technology Penn State University, University Park, PA Congressional Luncheon Series, 5 October 2011
  • 2. © Sandeep Purao. spurao@ist.psu.edu My Emphasis is on ‘Data’ 2
  • 3. © Sandeep Purao. spurao@ist.psu.edu I have three Goals today • I want to add structure to the large and intractable problem of how to deal with data in healthcare • I would like to point out ongoing research in other domains based on this structure • I would like to suggest how we may leverage this research or identify opportunities for enhancement 3
  • 4. © Sandeep Purao. spurao@ist.psu.edu A Data (Life) Cycle helps 4 Generate / Capture Store / Categorize Adding structure to the problem of dealing with data in healthcare Share / Make Available Use / Make Sense Destroy / Shred A data life cycle brings to the foreground the phases through which data must flow
  • 5. © Sandeep Purao. spurao@ist.psu.edu A Layered view of Data helps 5 Data for the support of Clinical tasks and systems Adding structure to the problem of dealing with data in healthcare Data for the support of Administrative tasks and systems Data that is Public or is available in Public sources Primary Data from Institutions and Researchers
  • 6. © Sandeep Purao. spurao@ist.psu.edu Here is a Simple Structure 6 Generate / Capture Store / Categorize Share / Make Available Use / Make Sense Destroy / Shred Data for the support of Clinical tasks and systems Data for the support of Administrative tasks and systems Data that is Public or is available in Public sources Primary Data from Institutions and Researchers Adding structure to the problem of dealing with data in healthcare
  • 7. © Sandeep Purao. spurao@ist.psu.edu Adding Roles to the Structure 7 Adding structure to the problem of dealing with data in healthcare Physicians Patients Policy Makers Lawyers Researchers
  • 8. © Sandeep Purao. spurao@ist.psu.edu There are some Key Problems 8 Scale Inter-Operability Using the structure to work with key problems related to data in healthcare Security Sense- Making
  • 9. © Sandeep Purao. spurao@ist.psu.edu There is Research Elsewhere • Scale – Big Data – moving Giga to Tera to Peta – Clouds, Hadoop and Map-Reduce – Extracting data from information ** • Inter-operability • Security • Sense-making 9 Pointing to Ongoing Research in Other Domains
  • 10. © Sandeep Purao. spurao@ist.psu.edu There is Research Elsewhere • Scale • Inter-operability – Voluntary and Consensus standards including HL7 ** – Heterogeneity, ontology and semantics (NLM) ** – Regional health IT partnerships ** • Security • Sense-making 10 Pointing to Ongoing Research in Other Domains
  • 11. © Sandeep Purao. spurao@ist.psu.edu There is Research Elsewhere 11 Pointing to Ongoing Research in Other Domains • Scale • Inter-operability • Security – Dealing with legal co-existence of malicious users ** – Measures such as Role-based Access – Laws to prevent access to EHR ** • Sense-making
  • 12. © Sandeep Purao. spurao@ist.psu.edu There is Research Elsewhere 12 Pointing to Ongoing Research in Other Domains • Scale • Inter-operability • Security • Sense-Making – Measuring data quality in crowd-based forums ** – Search patterns and user behaviors ** – Data delivery / use for e-health
  • 13. © Sandeep Purao. spurao@ist.psu.edu Opportunities • It is possible to leverage / enhance research from other domains to add to what we know about the Data Puzzle in the Healthcare context 13 Leveraging or Enhancing Existing Research
  • 14. © Sandeep Purao. spurao@ist.psu.edu Some Examples - 1 • Example 1: Regional health partnerships – A study of regional health IT partnerships extending ideas and theories about outsourcing – Problem addressed: Data storage and Data sharing • Example 2: Extracting action knowledge – Studies of work in refineries and with health professionals to extract and represent action knowledge – Problem addressed: Data use and sense-making 14 Leveraging or Enhancing Existing Research
  • 15. © Sandeep Purao. spurao@ist.psu.edu Some Examples - 2 • Example 3: Changing models for data governance with clouds – A study of healthcare organizations to understand how data stewardship and governance models are changing with cloud – Problem addressed: Data storage and exchange • Example 4: Using context to overcome data heterogeneity – Modeling context to understand how data may be exchanged across different user communities – Problem addressed: Data Sharing and Data use 15 Leveraging or Enhancing Existing Research
  • 16. © Sandeep Purao. spurao@ist.psu.edu Some Examples - 3 • Example 5: Data use in collaborative healthcare teams – A study to understand how teams of healthcare professionals access and use data – Problem addressed: Data sharing and Data use • Example 6: Search patterns for specialized information on the web – Empirical analyses of how user communities look for health- related and other information on web sources – Problem addressed: Data Access and Data use 16 Leveraging or Enhancing Existing Research
  • 17. © Sandeep Purao. spurao@ist.psu.edu What I said I wanted to do • I want to add structure to the large and intractable problem of how to deal with data in healthcare • I would like to point out ongoing research in other domains based on this structure • I would like to suggest how we may leverage this research or identify opportunities for enhancement 17
  • 18. © Sandeep Purao. spurao@ist.psu.edu Summary 18 For further dialog: spurao@ist.psu.edu

Editor's Notes

  • #2: I am Sandeep – Sun–Deep Purao – like Perot but spelled differently as you can see. I am on the faculty of the College of Information at Penn State.   Would like to begin by thanking Neal for inviting me to a part of this panel of presenters. I am glad to be here and am glad to see a large turnout.
  • #3: My mandate today is to talk fairly broadly about the role of data in healthcare. My expertise here is on the word “data” instead of the word “healthcare.” So, with that in mind, I will try to accomplish three goals in the short talk today.
  • #4: First – I would like to add some structure to this large and intractable problem of how to deal with data in the context of healthcare. Structure is important because it allows us the ability to identify different components and focus on the one that we deem are critical.   Second – I would like to point out some ongoing research in select areas that are suggested by this structure. My intent here would be to share with you opportunities where the healthcare community may be able to leverage some of these research results.   Third – I will point out areas where existing research in allied disciplines may not be sufficient or specific for healthcare. This may be one way to distill the arguments in the talk so that efforts and resources may be brought to bear on these areas of research.
  • #5: So, the structure I propose is simple. - The first dimension of the structure is the Data Cycle – A data life cycle acknowledges the phase through which data must flow
  • #6: The second dimension of the structure are the different categories of data one sees in healthcare settings The first two layers are – clinical and administrative software and adds two others – publicly available data and primary data. These are the horizontal layers.   I know this audience knows it well but I will spend a precious minute from the time allocated to me in explicating each. The clinical layer refers to actual patient data such as ailments, medications, vitals etc. The administrative layer refers to data about managing the healthcare delivery system including payments, insurance, schedules etc.   The third layer is public data. Consider, for example, webMD, books – increasingly, digitized, physician ratings etc. We can also include publicly available research outcomes such as those in Pubmed or Medline.   The final layer is primary data. This is data such as clinical trials (see clinicaltrials.gov), data that researchers collect about obesity and exercise patterns, and data about what makes regional health networks work – collected with surveys and interviews.  
  • #7: In each cell, I see several opportunities. In each cell, there are also known problems. I would not be here if we did not know the opportunities. I am here because we realize that there are problems that we must overcome to realize the promise of “data in healthcare.”   So, let me address some key problems – before I do that I cannot resist the temptation to quickly show you how the simple structure can be seen from different perspectives such as - .
  • #8: So a physician may see things differently – as would a patient and a lawyer and a policy maker – These are often implicit – making them open allows us to be clear about where we are focusing
  • #9: So here are some key problems then – I have selected four from a long list