SlideShare a Scribd company logo
Big data and statisticians
Statistician
Teaching
 Blogging
 Research
Teaching
 Blogging
 Research
jhudatascience.org
Teaching
 Blogging
 Research
simplystatistics.org
@simplystats
Teaching
 Blogging
 Research
jtleek.com
Big data and statisticians
N =
 SAMPLE SIZE
N =
($ YOU HAVE)
($ PER SAMPLE)
rna-seq
2008 N≈2 
2010 N≈70
2013 N≈900
PMIDS: 19056941,	
  20220758,	
  24092820	
  
Big data and statisticians
Moore Data Science Environments
0/3 directors, 1/25 speakers statisticians
NAS Big Data Workshop
2/13 speakers statisticians
NIH BD2K Proposal Workshop
0/18 participants
Big Data Rollout from White House
0/4 thought leaders in statistics
1/n reasons:
Infrastructure
Big data and statisticians
Big data and statisticians
Big data and statisticians
Big data and statisticians
Big data and statisticians
http://gking.harvard.edu/files/gking/files/0314policyforumff.pdf
Random Forests
Multiple Testing
Smoothing
Exploratory Data Analysis
h+p://simplysta6s6cs.org/2014/05/22/10-­‐things-­‐sta6s6cs-­‐taught-­‐us-­‐about-­‐big-­‐data-­‐analysis/	
  
9 classes
1 month long
Every month
Cumulative Enrollment
The Team
jtleek.com/talks

More Related Content

PDF
The Largest Data Science Program in the World: The Johns Hopkins Data Science...
PDF
Data Science Education at JHSPH
PDF
JHU Data Science MOOCs - Behind the Scenes
PDF
10 things statistics taught us about big data
PDF
Dm ml study_roadmap
PPTX
How the Web can change social science research (including yours)
PPT
And the survey says
PPTX
Research Data Sharing: A Basic Framework
The Largest Data Science Program in the World: The Johns Hopkins Data Science...
Data Science Education at JHSPH
JHU Data Science MOOCs - Behind the Scenes
10 things statistics taught us about big data
Dm ml study_roadmap
How the Web can change social science research (including yours)
And the survey says
Research Data Sharing: A Basic Framework

What's hot (10)

PPTX
Ariadne's Thread -- Exploring a world of networked information built from fre...
PDF
Changing The Way We Discover Research
PPTX
Data Provenance and its role in Data Science
PPTX
Crowdsourcing the Quality of Knowledge Graphs: A DBpedia Study
PPTX
Extracting and analyzing discussion data with google sheets and google analytics
PPT
Searching Deeply for Data, Results and Tools- What is Stopping Us?
PPTX
Welcome to PROMISE 2011
PDF
"Data mining и информационный поиск проблемы, алгоритмы, решения"_Краковецкий...
PPTX
How much does $1.7 billion buy?
PPT
Elsevier - Labs on Line
Ariadne's Thread -- Exploring a world of networked information built from fre...
Changing The Way We Discover Research
Data Provenance and its role in Data Science
Crowdsourcing the Quality of Knowledge Graphs: A DBpedia Study
Extracting and analyzing discussion data with google sheets and google analytics
Searching Deeply for Data, Results and Tools- What is Stopping Us?
Welcome to PROMISE 2011
"Data mining и информационный поиск проблемы, алгоритмы, решения"_Краковецкий...
How much does $1.7 billion buy?
Elsevier - Labs on Line
Ad

Similar to Big data and statisticians (20)

PPT
Data_Mining.ppt
PPTX
UNIT_1-BD.pptx
PDF
IICT-Big Data.pdf slideshow information to communication
PDF
IICT-Big Data.pdf slideshow Information to communication technology
PDF
Strata Conference NYC 2013
PDF
RuleML 2015: When Processes Rule Events
PPTX
UW Libraries Data Services Forum
PPTX
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
PPTX
Open Data Policy & Legislation
PPTX
Big Data analytics
PDF
Knowledge Discovery in Environmental Management
PDF
Speeding Up Data Science: From a Data Management Perspective
PPTX
Big Data in NATO and Your Role
PDF
姜俊宇/從資料到知識:從零開始的資料探勘
PDF
Big Data Benchmarking Tutorial
PDF
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
PDF
Data_Analytics_inroductionand data preprocessing
PDF
Data Management Lab: Session 1 Slides
PDF
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
PDF
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Data_Mining.ppt
UNIT_1-BD.pptx
IICT-Big Data.pdf slideshow information to communication
IICT-Big Data.pdf slideshow Information to communication technology
Strata Conference NYC 2013
RuleML 2015: When Processes Rule Events
UW Libraries Data Services Forum
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
Open Data Policy & Legislation
Big Data analytics
Knowledge Discovery in Environmental Management
Speeding Up Data Science: From a Data Management Perspective
Big Data in NATO and Your Role
姜俊宇/從資料到知識:從零開始的資料探勘
Big Data Benchmarking Tutorial
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
Data_Analytics_inroductionand data preprocessing
Data Management Lab: Session 1 Slides
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Ad

More from jtleek (7)

PPTX
Data science as a science
PDF
Fixing the leaks in the pipeline from public genomics data to the clinic
PDF
JHU Job Talk
PDF
Evidence based data analysis
PDF
Evidence based data analysis
PDF
Leek romesf-2015
PDF
Flash talk about Johns Hopkins Biostatistics Genomics Group
Data science as a science
Fixing the leaks in the pipeline from public genomics data to the clinic
JHU Job Talk
Evidence based data analysis
Evidence based data analysis
Leek romesf-2015
Flash talk about Johns Hopkins Biostatistics Genomics Group

Recently uploaded (20)

PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Introduction to machine learning and Linear Models
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Database Infoormation System (DBIS).pptx
PPTX
1_Introduction to advance data techniques.pptx
Clinical guidelines as a resource for EBP(1).pdf
IBA_Chapter_11_Slides_Final_Accessible.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
ISS -ESG Data flows What is ESG and HowHow
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Fluorescence-microscope_Botany_detailed content
IB Computer Science - Internal Assessment.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Business Acumen Training GuidePresentation.pptx
climate analysis of Dhaka ,Banglades.pptx
Introduction to machine learning and Linear Models
Qualitative Qantitative and Mixed Methods.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Acceptance and paychological effects of mandatory extra coach I classes.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Database Infoormation System (DBIS).pptx
1_Introduction to advance data techniques.pptx

Big data and statisticians