SlideShare a Scribd company logo
gannon@Indiana.edu 
Dennis.gannon@outlook.com
Cloud hpc-bigdata-challenges
Cloud hpc-bigdata-challenges
IT PAC
Cloud hpc-bigdata-challenges
Melbourne 
Sydney 
Brazil 
Beijing
Programming tools: Scala, IPython, Azure ML, … 
Frameworks: Spark, Hadoop, Yarn, HDInsight, Reef, Twister, Brisk 
Software Defined Storage 
Software Defined Networks 
Hardware Abstraction/Virtualization
http://tce.technion.ac.il/files/2012/06/Scott-shenker.pdfwww.opennetsummit.org/pdf/2013/presentations/albert_greenberg.pdfhttp://www.cs.princeton.edu/~jrex/papers/pyretic-login13.pdf
Cloud hpc-bigdata-challenges
The Science Perspective
Every research field is now a data science field
Last 
few decades 
Thousand 
years ago 
Last few Today and the Future 
hundred years 
2 
2 
2 
. 
3 
4 
a 
G c 
a 
a 
  
  
 
 
 
  
 
 
 
  
Simulation of 
complex phenomena 
Newton’s laws, 
Maxwell’s equations… 
Description of natural 
phenomena 
Unify theory, experiment and 
simulation with large 
multidisciplinary Data 
Using data exploration and 
data mining 
(from instruments, sensors, 
humans…) 
Distributed Communities
Video Link
Cloud hpc-bigdata-challenges
Cloud hpc-bigdata-challenges
Cloud hpc-bigdata-challenges
Cloud hpc-bigdata-challenges
Cloud hpc-bigdata-challenges
Inputs (training data) 
Labels 
Hidden layers 
Input data 
Detected features 
Mona Lisa
Cloud hpc-bigdata-challenges
•The Genetic Causes of Disease (David Heckerman) 
•WellcomeTrust for a GWAS for a large population 
•Looking for causes for seven common diseases (bipolar, r. arthritis, coronary, hypertension, ….) 
•Confounding is a problem. Needed a new algorithm. 
•Ran on Azure cloud using 35,000 cores in 3 weeks.
Cloud hpc-bigdata-challenges
Cloud hpc-bigdata-challenges
Chameleon Cloud 
SDN 
NIH data commons
Mesos 
Tachyon 
Docker 
Spark 
Data Analytics and ML programming tools 
Reef 
Twister
Cloud hpc-bigdata-challenges
Cloud hpc-bigdata-challenges
Cloud hpc-bigdata-challenges
•Many Examples 
•The Challenge: sustainability 
Data 
Acquisition & modelling 
Collaboration and visualisation 
Analysis & data mining 
Dissemination & sharing 
Archiving and preserving

More Related Content

PDF
Cytoscape Cyberinfrastructure
PPTX
Dynamic module deployment in a fog computing platform
PPTX
MoreLab - Mobility Research Lab
PPT
42 walter wp2 bgbm-vi brant-lighteningtalk
PPTX
PhD Projects in Fog Computing Research Ideas
PDF
Machine Learning @ NECST
PDF
Azure Brain: 4th paradigm, scientific discovery & (really) big data
Cytoscape Cyberinfrastructure
Dynamic module deployment in a fog computing platform
MoreLab - Mobility Research Lab
42 walter wp2 bgbm-vi brant-lighteningtalk
PhD Projects in Fog Computing Research Ideas
Machine Learning @ NECST
Azure Brain: 4th paradigm, scientific discovery & (really) big data

What's hot (18)

PPT
Io t technologies_ppt-2
PPTX
Research, the Cloud, and the IRB
PPT
Introduction
PDF
Optimized Algorithm for Hiding Digital Text in a Colour Image Using FPGA
PDF
Callforpapersinternationaljournalofcomputerijc volume11issue1-april2013
PDF
Deep Slicing and Loops in a Loop: Multi-Tenancy and Smart Closed-Loop Control...
DOCX
Fire col a collaborative protection network
PDF
Lakesh_resume_02-07
PDF
PERICLES workshop (London 15 October 2015) - Art & Media Domain Ontologies
DOCX
Grid computing assiment
PPTX
Provenance based presentation on cloud computing security
PDF
Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...
PDF
ENVIROFI, FI-PPP and big data
PDF
Cassowary: Middleware Platform for Context-Aware Smart Buildings with Softwar...
PPTX
Что такое Data Science
PDF
K luo bera_poster
PDF
Everything about Internet of Things: An Overview of Related Ontologies
PPTX
Cytoscape ci chapter 1
Io t technologies_ppt-2
Research, the Cloud, and the IRB
Introduction
Optimized Algorithm for Hiding Digital Text in a Colour Image Using FPGA
Callforpapersinternationaljournalofcomputerijc volume11issue1-april2013
Deep Slicing and Loops in a Loop: Multi-Tenancy and Smart Closed-Loop Control...
Fire col a collaborative protection network
Lakesh_resume_02-07
PERICLES workshop (London 15 October 2015) - Art & Media Domain Ontologies
Grid computing assiment
Provenance based presentation on cloud computing security
Assisting IoT Projects and Developers in Designing Interoperable Semantic Web...
ENVIROFI, FI-PPP and big data
Cassowary: Middleware Platform for Context-Aware Smart Buildings with Softwar...
Что такое Data Science
K luo bera_poster
Everything about Internet of Things: An Overview of Related Ontologies
Cytoscape ci chapter 1
Ad

Viewers also liked (10)

PPTX
The big data ready bank
PPTX
Big Data Maturity Model
PDF
ABBYY Capture solutions
PDF
Big data trends challenges opportunities
PPTX
BigData in Banking
PPS
Big data hadoop rdbms
PPTX
Hadoop vs. RDBMS for Advanced Analytics
PDF
Hadoop World 2011: Hadoop vs. RDBMS for Big Data Analytics...Why Choose?
PDF
Hadoop World 2011: Replacing RDB/DW with Hadoop and Hive for Telco Big Data -...
PPTX
Relational databases vs Non-relational databases
The big data ready bank
Big Data Maturity Model
ABBYY Capture solutions
Big data trends challenges opportunities
BigData in Banking
Big data hadoop rdbms
Hadoop vs. RDBMS for Advanced Analytics
Hadoop World 2011: Hadoop vs. RDBMS for Big Data Analytics...Why Choose?
Hadoop World 2011: Replacing RDB/DW with Hadoop and Hive for Telco Big Data -...
Relational databases vs Non-relational databases
Ad

Similar to Cloud hpc-bigdata-challenges (20)

PDF
ieee cloud 2015 keynote talk
PPTX
Philosophy of Deep Learning
PPT
Physical-Cyber-Social Data Analytics & Smart City Applications
PPT
AI Science
PPT
Cyberinfrastructure and Applications Overview: Howard University June22
PPTX
Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...
PPTX
Ingredients for Semantic Sensor Networks
PDF
How Can AI and IoT Power the Chemical Industry?
PPTX
Data science
PPT
Sinnott Paper
PPT
grid computing
PPT
Cyberinfrastructure for Einstein's Equations and Beyond
PDF
new_kitching_cv
PPTX
BrightTALK - Semantic AI
PPTX
MDM-2013, Milan, Italy, 6 June, 2013
PDF
Introduction to the Artificial Intelligence and Computer Vision revolution
PPTX
Semantic Sensor Networks and Linked Stream Data
PPTX
Ed Fox on Learning Technologies
PDF
Fog Computing - DEV.BG 2018
PPT
The OptIPuter Project: From the Grid to the LambdaGrid
ieee cloud 2015 keynote talk
Philosophy of Deep Learning
Physical-Cyber-Social Data Analytics & Smart City Applications
AI Science
Cyberinfrastructure and Applications Overview: Howard University June22
Deep Learning for Data Scientists - Data Science ATL Meetup Presentation, 201...
Ingredients for Semantic Sensor Networks
How Can AI and IoT Power the Chemical Industry?
Data science
Sinnott Paper
grid computing
Cyberinfrastructure for Einstein's Equations and Beyond
new_kitching_cv
BrightTALK - Semantic AI
MDM-2013, Milan, Italy, 6 June, 2013
Introduction to the Artificial Intelligence and Computer Vision revolution
Semantic Sensor Networks and Linked Stream Data
Ed Fox on Learning Technologies
Fog Computing - DEV.BG 2018
The OptIPuter Project: From the Grid to the LambdaGrid

More from Microsoft Azure for Research (15)

PDF
Accelerating your Research with Microsoft Azure (June 2015)
PDF
Parallel asynchronous inference of word senses with Microsoft Azure
PDF
Accelerating your research with Microsoft Azure
PPTX
A4 r overview deck_1.7
PDF
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
PDF
Environmental Science, Big Data and the Cloud
PDF
Keynote IEEE International Workshop on Cloud Analytics. Dennis Gannon
PDF
Doing Research in the Cloud - NIH Workshop Dennis Gannon
PDF
Big data - from consumers and patients, to the sea and stars
PDF
Reproducible Research and the Cloud
PPTX
Living Outside the Comfort Zone - Daron green florianopolis 5-7-2014
PPTX
Keynote Presentation at Moscow State University.
Accelerating your Research with Microsoft Azure (June 2015)
Parallel asynchronous inference of word senses with Microsoft Azure
Accelerating your research with Microsoft Azure
A4 r overview deck_1.7
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
Environmental Science, Big Data and the Cloud
Keynote IEEE International Workshop on Cloud Analytics. Dennis Gannon
Doing Research in the Cloud - NIH Workshop Dennis Gannon
Big data - from consumers and patients, to the sea and stars
Reproducible Research and the Cloud
Living Outside the Comfort Zone - Daron green florianopolis 5-7-2014
Keynote Presentation at Moscow State University.

Recently uploaded (20)

PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Computer network topology notes for revision
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
annual-report-2024-2025 original latest.
PDF
Mega Projects Data Mega Projects Data
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Introduction to machine learning and Linear Models
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
[EN] Industrial Machine Downtime Prediction
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Computer network topology notes for revision
Introduction-to-Cloud-ComputingFinal.pptx
annual-report-2024-2025 original latest.
Mega Projects Data Mega Projects Data
IB Computer Science - Internal Assessment.pptx
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
SAP 2 completion done . PRESENTATION.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Business Analytics and business intelligence.pdf
Data_Analytics_and_PowerBI_Presentation.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu
STERILIZATION AND DISINFECTION-1.ppthhhbx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Fluorescence-microscope_Botany_detailed content
Introduction to machine learning and Linear Models
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
[EN] Industrial Machine Downtime Prediction
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf

Cloud hpc-bigdata-challenges