SlideShare a Scribd company logo
Nicole Vasilevsky1,2, Shannon McWeeney1, William Hersh1, David A. Dorr1,3, Ted Laderas1,
Jackie Wirz4, Bjorn Pederson1, Melissa Haendel1,3
1Department of Medical Informatics and Clinical Epidemiology, 2Ontology Development Group, Library, 3Department of Medicine, 4Graduate Student Affairs,
Oregon Health & Science University, Portland, OR
Students felt the course
Acknowledgements
This work is supported by NIH Grants 1R25EB020379-01
and 1R25GM114820-01.
Skills Course TopicsWhy?
Course Offerings
Defining The Problem Wrangling Data
Methods, Tools And
Analysis
Scientific Communication
Data Identification And Resources
…
?
v Problems amenable to
analytics
v Importance of question
v Team definitions
v Scope
v When we do this wrong:
methods don't match
v Finding the right data
v Search methods
v Use of metadata
v Data management
v Exploratory Data
Analysis
v Data Dictionary
v As you touch data, what
can go wrong?
v Visualization
v Matching algorithms to
problems
v Reporting Findings and Limitations
v Giving “Elevator Speech” on ideas
of how to approach problem
v Critique of related problem
Major challenge: how to manage, analyze and interpret
vast amounts of data being generated in biomedical
research
One goal of NIH Big Data to Knowledge (BD2K) initiative:
provide training for students and researchers to address
this
Research team in the Department of Medical Informatics
and Clinical Epidemiology (DMICE) is developing Open
Educational Resources and Skills Courses
Approach
OERs and courses connect the dots that help
researchers understand how to apply data science
techniques in the context of their whole research life
cycle
§ Skills course and OER topics are aimed to fill
specific gaps
§ Teaching students how to ask the question and
follow through
① Develop Open Educational Resources
(OERS)
② Teach Skills Courses
OER Modules
Challenges
Scope
Images
Style
Dissemination
How to scope generic
curricula for different levels of
users
How to translate diverse
teaching styles into general
materials
How to maximize
dissemination while protecting
intellectual property
How to incorporate images
and other copyrighted
materials into open resources
Available at:
http://dmice.ohsu.edu/bd2k
Improving Knowledge Discovery Through Development of
Big Data to Knowledge Skills Courses and Open Educational Resources
Intro Course
Data After
Dark
§ Week long course in Summer 2015
§ Offered to interns and undergraduates
§ Taught basics of data science in the context of
the research life cycle
§ Two-evening course in January 2016
§ Offered to OHSU students, staff and faculty
§ Taught basics of data science
Advanced
Course
§ Four-evening course in May 2016
§ Offered to OHSU students, staff and faculty
§ Taught advanced topics in of data science
Data and
Donuts
§ Two courses were offered in June and July 2016
§ Offered to OHSU summer interns
§ Taught basics of data science
01 | Biomedical Big Data Science
02 | Introduction to Big Data in Biology
and Medicine
03 | Ethical Issues in Use of Big Data
04 | Clinical Standards Related to Big Data
05 | Basic Research Data Standards
06 | Public Health and Big Data
07 | Team Science
08 | Secondary Use (Reuse) of Clinical Data
09 | Publication and Peer Review
10 | Information Retrieval
11 | Version Control and Identifiers
12 | Data annotation and curation
13 | Data Tools and Landscape
14 | Ontologies 101
15 | Data metadata and provenance
16 | Semantic data interoperability
17 | Choice of Algorithms and Algorithm
Dynamics
18 | Visualization and Interpretation
19 | Replication, Validation and the
spectrum of Reproducibility Semantic
data interoperability
20 | Regulatory Issues in Big Data for
Genomics and Health
Semantic Web data
21 | Hosting data dissemination and data
stewardship workshops
22 | Hosting data dissemination and data
stewardship workshops
23 | Terminology of Biomedical, Clinical,
and Translational Research
24 | Computing Concepts for Big Data
25 | Data modeling
26 | Semantic Web data
27 | Context-based selection of data
28 | Translating the Question
29 | Implications of Provenance and Pre-
processing
30 | Data tells a story
31 | Statistical Significance, P-hacking and
Multiple-testing
32 | Displaying Confidence and
Uncertainty
Grey = still under development

More Related Content

PPTX
Data science education resources for everyone
PDF
Couture Curricula - BD2K Data Science Tailored to Your Needs
PDF
Practical challenges for researchers in data sharing
PPTX
EUA questionnaire on Open Access: 2016/17 Survey Results
PPT
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
PDF
Principles, key responsibilities, and their intersection
PDF
Research Integrity Advisor and Data Management
PPTX
Research Data Mgt
Data science education resources for everyone
Couture Curricula - BD2K Data Science Tailored to Your Needs
Practical challenges for researchers in data sharing
EUA questionnaire on Open Access: 2016/17 Survey Results
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
Principles, key responsibilities, and their intersection
Research Integrity Advisor and Data Management
Research Data Mgt

What's hot (20)

PPTX
Scientific information retrieval: Challenges and opportunities
PPTX
Data peer review workshop
PPTX
Clinical Microbiology - searching for information
PPTX
Midwest Medical Library Association 2015 Big Data Panel
PPTX
Comparing scientific performance across disciplines: Methodological and conce...
PPTX
Research evaluation in iraq from 1996 to 2014
PDF
Gaining credit for sharing research data
PPTX
MSc transneuro & gastro 2013-14
PDF
Henning Müller et Michael Schumacher pour la journée e-health 2013
PPTX
Case studies for open science
PDF
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
PPTX
Embedding ORCID across researcher career paths
PPTX
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
PDF
Research Data Management Planning: problems and solutions
PPTX
Enhancing Our Capacity for Large Health Dataset Analysis
PDF
eHealth unit HES-SO in Sierre
PPTX
Comparing bibliographic data sources
PPTX
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
DOCX
The Alzheimer's Disease Research Network and the Uniform Data Set
Scientific information retrieval: Challenges and opportunities
Data peer review workshop
Clinical Microbiology - searching for information
Midwest Medical Library Association 2015 Big Data Panel
Comparing scientific performance across disciplines: Methodological and conce...
Research evaluation in iraq from 1996 to 2014
Gaining credit for sharing research data
MSc transneuro & gastro 2013-14
Henning Müller et Michael Schumacher pour la journée e-health 2013
Case studies for open science
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
Embedding ORCID across researcher career paths
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Research Data Management Planning: problems and solutions
Enhancing Our Capacity for Large Health Dataset Analysis
eHealth unit HES-SO in Sierre
Comparing bibliographic data sources
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
The Alzheimer's Disease Research Network and the Uniform Data Set
Ad

Viewers also liked (20)

PPT
Changes In Teaching
PPTX
Empowering patients by increasing accessibility to clinical terminology
PPT
Education in the past and nowadays
PPT
Schools in the past
PPTX
Enhancing the Human Phenotype Ontology for Use by the Layperson
PPTX
School 100 years ago expo
PPTX
Education - Past Vs Present
PPT
SCHOOLS 50 YEARS AGO
PPT
Education in the past powerpoint
KEY
Schools then and now
PPS
America 100 Years Ago
PPTX
Families 100 years ago 1
PPTX
On the Reproducibility of Science: Unique Identification of Research Resourc...
PPTX
Acrl march2015 final
PPTX
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
PDF
Research resources: curating the new eagle-i discovery system
PPTX
The Role of Libraries in Data Management and Curation
PPTX
Deep phenotyping for everyone
PPTX
Why the world needs phenopacketeers, and how to be one
PPT
Technology in the classroom ppt
Changes In Teaching
Empowering patients by increasing accessibility to clinical terminology
Education in the past and nowadays
Schools in the past
Enhancing the Human Phenotype Ontology for Use by the Layperson
School 100 years ago expo
Education - Past Vs Present
SCHOOLS 50 YEARS AGO
Education in the past powerpoint
Schools then and now
America 100 Years Ago
Families 100 years ago 1
On the Reproducibility of Science: Unique Identification of Research Resourc...
Acrl march2015 final
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Research resources: curating the new eagle-i discovery system
The Role of Libraries in Data Management and Curation
Deep phenotyping for everyone
Why the world needs phenopacketeers, and how to be one
Technology in the classroom ppt
Ad

Similar to Improving Knowledge Discovery Through Development of Big Data to Knowledge Skills Courses and Open Educational Resources (20)

PDF
Open Educational Resources for Big Data Science
PPTX
Teaching Data Science to Undergraduate Students
PPTX
Denise Esserman MedicReS World Congress 2015
PDF
Intro big data.pdf
PDF
Anthony J brookes
PDF
Big data for development
PPTX
Big Data and Analytics Across the Interdisciplinary Divide
PPT
The Thinking Behind Big Data at the NIH
PPTX
Will Biomedical Research Fundamentally Change in the Era of Big Data?
PPT
Human Genome and Big Data Challenges
PDF
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
PPTX
Data Science Meets Biomedicine, Does Anything Change
PPTX
Data Science for the Win
PPTX
Reproducible research: theory
PPT
AMIA 2014
PPTX
Big Data as a Catalyst for Collaboration & Innovation
PPTX
(Big) Data (Science) Skills
PDF
Medicine as data science
PPTX
NIH Big Data to Knowledge (BD2K)
PPT
data science ppt of emngineering studnets
Open Educational Resources for Big Data Science
Teaching Data Science to Undergraduate Students
Denise Esserman MedicReS World Congress 2015
Intro big data.pdf
Anthony J brookes
Big data for development
Big Data and Analytics Across the Interdisciplinary Divide
The Thinking Behind Big Data at the NIH
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Human Genome and Big Data Challenges
G. Poste. Managing the Data Deluge: Critical Issues in the Integration and An...
Data Science Meets Biomedicine, Does Anything Change
Data Science for the Win
Reproducible research: theory
AMIA 2014
Big Data as a Catalyst for Collaboration & Innovation
(Big) Data (Science) Skills
Medicine as data science
NIH Big Data to Knowledge (BD2K)
data science ppt of emngineering studnets

Recently uploaded (20)

PPTX
Pharmacology of Autonomic nervous system
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
Sciences of Europe No 170 (2025)
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PPTX
Fluid dynamics vivavoce presentation of prakash
PPTX
Hypertension_Training_materials_English_2024[1] (1).pptx
PDF
The Land of Punt — A research by Dhani Irwanto
PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PDF
The scientific heritage No 166 (166) (2025)
PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PPTX
Introcution to Microbes Burton's Biology for the Health
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PDF
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
PPTX
BIOMOLECULES PPT........................
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
Pharmacology of Autonomic nervous system
Biophysics 2.pdffffffffffffffffffffffffff
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
Sciences of Europe No 170 (2025)
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
Fluid dynamics vivavoce presentation of prakash
Hypertension_Training_materials_English_2024[1] (1).pptx
The Land of Punt — A research by Dhani Irwanto
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
The scientific heritage No 166 (166) (2025)
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
Introcution to Microbes Burton's Biology for the Health
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
BIOMOLECULES PPT........................
TOTAL hIP ARTHROPLASTY Presentation.pptx

Improving Knowledge Discovery Through Development of Big Data to Knowledge Skills Courses and Open Educational Resources

  • 1. Nicole Vasilevsky1,2, Shannon McWeeney1, William Hersh1, David A. Dorr1,3, Ted Laderas1, Jackie Wirz4, Bjorn Pederson1, Melissa Haendel1,3 1Department of Medical Informatics and Clinical Epidemiology, 2Ontology Development Group, Library, 3Department of Medicine, 4Graduate Student Affairs, Oregon Health & Science University, Portland, OR Students felt the course Acknowledgements This work is supported by NIH Grants 1R25EB020379-01 and 1R25GM114820-01. Skills Course TopicsWhy? Course Offerings Defining The Problem Wrangling Data Methods, Tools And Analysis Scientific Communication Data Identification And Resources … ? v Problems amenable to analytics v Importance of question v Team definitions v Scope v When we do this wrong: methods don't match v Finding the right data v Search methods v Use of metadata v Data management v Exploratory Data Analysis v Data Dictionary v As you touch data, what can go wrong? v Visualization v Matching algorithms to problems v Reporting Findings and Limitations v Giving “Elevator Speech” on ideas of how to approach problem v Critique of related problem Major challenge: how to manage, analyze and interpret vast amounts of data being generated in biomedical research One goal of NIH Big Data to Knowledge (BD2K) initiative: provide training for students and researchers to address this Research team in the Department of Medical Informatics and Clinical Epidemiology (DMICE) is developing Open Educational Resources and Skills Courses Approach OERs and courses connect the dots that help researchers understand how to apply data science techniques in the context of their whole research life cycle § Skills course and OER topics are aimed to fill specific gaps § Teaching students how to ask the question and follow through ① Develop Open Educational Resources (OERS) ② Teach Skills Courses OER Modules Challenges Scope Images Style Dissemination How to scope generic curricula for different levels of users How to translate diverse teaching styles into general materials How to maximize dissemination while protecting intellectual property How to incorporate images and other copyrighted materials into open resources Available at: http://dmice.ohsu.edu/bd2k Improving Knowledge Discovery Through Development of Big Data to Knowledge Skills Courses and Open Educational Resources Intro Course Data After Dark § Week long course in Summer 2015 § Offered to interns and undergraduates § Taught basics of data science in the context of the research life cycle § Two-evening course in January 2016 § Offered to OHSU students, staff and faculty § Taught basics of data science Advanced Course § Four-evening course in May 2016 § Offered to OHSU students, staff and faculty § Taught advanced topics in of data science Data and Donuts § Two courses were offered in June and July 2016 § Offered to OHSU summer interns § Taught basics of data science 01 | Biomedical Big Data Science 02 | Introduction to Big Data in Biology and Medicine 03 | Ethical Issues in Use of Big Data 04 | Clinical Standards Related to Big Data 05 | Basic Research Data Standards 06 | Public Health and Big Data 07 | Team Science 08 | Secondary Use (Reuse) of Clinical Data 09 | Publication and Peer Review 10 | Information Retrieval 11 | Version Control and Identifiers 12 | Data annotation and curation 13 | Data Tools and Landscape 14 | Ontologies 101 15 | Data metadata and provenance 16 | Semantic data interoperability 17 | Choice of Algorithms and Algorithm Dynamics 18 | Visualization and Interpretation 19 | Replication, Validation and the spectrum of Reproducibility Semantic data interoperability 20 | Regulatory Issues in Big Data for Genomics and Health Semantic Web data 21 | Hosting data dissemination and data stewardship workshops 22 | Hosting data dissemination and data stewardship workshops 23 | Terminology of Biomedical, Clinical, and Translational Research 24 | Computing Concepts for Big Data 25 | Data modeling 26 | Semantic Web data 27 | Context-based selection of data 28 | Translating the Question 29 | Implications of Provenance and Pre- processing 30 | Data tells a story 31 | Statistical Significance, P-hacking and Multiple-testing 32 | Displaying Confidence and Uncertainty Grey = still under development