SlideShare a Scribd company logo
gannon@Indiana.edu
Dennis.gannon@outlook.com
gannon@Indiana.edu
Dennis.gannon@outlook.com
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
Every research field is now a data
science field
Last
few decades
Thousand
years ago
Today and the FutureLast few
hundred years
2
2
2.
3
4
a
cG
a
a










Simulation of
complex phenomena
Newton’s laws,
Maxwell’s equations…
Description of natural
phenomena
Unify theory, experiment and
simulation with large
multidisciplinary Data
Using data exploration and
data mining
(from instruments, sensors,
humans…)
Distributed Communities
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
The Long Tail of Science
Programming tools: Scala, IPython, Azure ML, …
Frameworks: Spark, Hadoop, Yarn, HDInsight, Reef, Twister, Brisk
Software Defined Storage
Software Defined Networks
Hardware Abstraction/Virtualization
Container and Distributed Cluster OS (Docker, Mesosphere, K8S)
http://tce.technion.ac.il/files/2012/06/Scott-shenker.pdf
www.opennetsummit.org/pdf/2013/presentations/albert_greenberg.pdf
http://www.cs.princeton.edu/~jrex/papers/pyretic-login13.pdf
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
The IPython notebook deployed on a VM.
1. Create a coreos VM and open the https port 443
2. Login and issue this one command
$docker run -d -p 443:8888 -e "PASSWORD=****" ipython/scipyserver
3. Go to https://yourVMaddress and log in.
compute
compute
compute
compute
compute
compute
compute
compute
compute
compute
compute
compute
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
Vowpal Wabbit? Datumbox?
ieee cloud 2015 keynote talk
Marathon
Master
node
Master
Backup
Worker
node
Worker
node
Worker
node
Mesos
Zookeeper
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
Cloud
SDN
NIH data
commons
• Many Examples
• Challenges:
• I only talked about the Analysis … but it there is more
• Sustainability
• Sharing
• Reproducible Science
Data
Acquisition &
modelling
Collaboration
and
visualisation
Analysis &
data mining
Dissemination
& sharing
Archiving and
preserving
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk
ieee cloud 2015 keynote talk

More Related Content

PDF
Accelerating your research with Microsoft Azure
PDF
Keynote IEEE International Workshop on Cloud Analytics. Dennis Gannon
PDF
Accelerating your Research with Microsoft Azure (June 2015)
PDF
Reproducible Research and the Cloud
PDF
Doing Research in the Cloud - NIH Workshop Dennis Gannon
PPTX
A4 r overview deck_1.7
PPTX
Open Science Data Cloud (IEEE Cloud 2011)
Accelerating your research with Microsoft Azure
Keynote IEEE International Workshop on Cloud Analytics. Dennis Gannon
Accelerating your Research with Microsoft Azure (June 2015)
Reproducible Research and the Cloud
Doing Research in the Cloud - NIH Workshop Dennis Gannon
A4 r overview deck_1.7
Open Science Data Cloud (IEEE Cloud 2011)

What's hot (18)

PPT
Foss4G 2009 Scenz Grid
PPTX
Scaling collaborative data science with Globus and Jupyter
PPTX
Data Tribology: Overcoming Data Friction with Cloud Automation
PPTX
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
PPTX
PhD Projects in Geoscience Remote Sensing Research Guidance
PDF
Research Objects in Wf4Ever
PPTX
Research Automation for Data-Driven Discovery
PPTX
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
PPTX
PhD Projects in Green Cloud Computing Research Guidance
PPTX
Big Data, Big Computing, AI, and Environmental Science
PDF
Cloud Dataverse
PDF
MeDiCI - How to Withstand a Research Data Tsunami
PDF
The pulse of cloud computing with bioinformatics as an example
PDF
San diego-supercomputing-sc17-user-group
PPTX
Accelerating data-intensive science by outsourcing the mundane
PPTX
Planet lab : cloud vs grid computing
PDF
Big data caching for networking : Moving from cloud to edge
PPTX
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Foss4G 2009 Scenz Grid
Scaling collaborative data science with Globus and Jupyter
Data Tribology: Overcoming Data Friction with Cloud Automation
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
PhD Projects in Geoscience Remote Sensing Research Guidance
Research Objects in Wf4Ever
Research Automation for Data-Driven Discovery
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
PhD Projects in Green Cloud Computing Research Guidance
Big Data, Big Computing, AI, and Environmental Science
Cloud Dataverse
MeDiCI - How to Withstand a Research Data Tsunami
The pulse of cloud computing with bioinformatics as an example
San diego-supercomputing-sc17-user-group
Accelerating data-intensive science by outsourcing the mundane
Planet lab : cloud vs grid computing
Big data caching for networking : Moving from cloud to edge
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Ad

Similar to ieee cloud 2015 keynote talk (20)

PDF
Cloud hpc-bigdata-challenges
PPT
The Concurrent Constraint Programming Research Programmes -- Redux
PDF
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
PDF
Prediction of Wireless Sensor Network and Attack using Machine Learning Techn...
PPTX
Big Data HPC Convergence and a bunch of other things
PPTX
Software engineering practices for the data science and machine learning life...
PPT
current trends in digital era - Recent Technologies and Techniques
PDF
How HPC and large-scale data analytics are transforming experimental science
PPTX
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPT
Aaas Data Intensive Science And Grid
PPTX
Adarsh_Masekar(2GP19CS003).pptx
PPT
Trends in digital era-Programming Knowledge
PPTX
Architecting an Open Source AI Platform 2018 edition
PDF
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
PPTX
CHASE-CI: A Distributed Big Data Machine Learning Platform
PDF
04 open source_tools
PPTX
Virtual Science in the Cloud
PPTX
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
PDF
Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...
Cloud hpc-bigdata-challenges
The Concurrent Constraint Programming Research Programmes -- Redux
The Hitchhiker's Guide to Machine Learning with Python & Apache Spark
Prediction of Wireless Sensor Network and Attack using Machine Learning Techn...
Big Data HPC Convergence and a bunch of other things
Software engineering practices for the data science and machine learning life...
current trends in digital era - Recent Technologies and Techniques
How HPC and large-scale data analytics are transforming experimental science
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Aaas Data Intensive Science And Grid
Adarsh_Masekar(2GP19CS003).pptx
Trends in digital era-Programming Knowledge
Architecting an Open Source AI Platform 2018 edition
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
CHASE-CI: A Distributed Big Data Machine Learning Platform
04 open source_tools
Virtual Science in the Cloud
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...
Ad

More from Microsoft Azure for Research (8)

PDF
Parallel asynchronous inference of word senses with Microsoft Azure
PDF
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
PDF
Environmental Science, Big Data and the Cloud
PDF
Big data - from consumers and patients, to the sea and stars
PPTX
Living Outside the Comfort Zone - Daron green florianopolis 5-7-2014
PPTX
Keynote Presentation at Moscow State University.
Parallel asynchronous inference of word senses with Microsoft Azure
The Fourth Paradigm - Deltares Data Science Day, 31 October 2014
Environmental Science, Big Data and the Cloud
Big data - from consumers and patients, to the sea and stars
Living Outside the Comfort Zone - Daron green florianopolis 5-7-2014
Keynote Presentation at Moscow State University.

Recently uploaded (20)

PDF
Mushroom cultivation and it's methods.pdf
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PPTX
Tartificialntelligence_presentation.pptx
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
August Patch Tuesday
PDF
Approach and Philosophy of On baking technology
Mushroom cultivation and it's methods.pdf
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Group 1 Presentation -Planning and Decision Making .pptx
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Chapter 5: Probability Theory and Statistics
Web App vs Mobile App What Should You Build First.pdf
Enhancing emotion recognition model for a student engagement use case through...
1 - Historical Antecedents, Social Consideration.pdf
Encapsulation theory and applications.pdf
Univ-Connecticut-ChatGPT-Presentaion.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
A Presentation on Artificial Intelligence
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Heart disease approach using modified random forest and particle swarm optimi...
Tartificialntelligence_presentation.pptx
Zenith AI: Advanced Artificial Intelligence
August Patch Tuesday
Approach and Philosophy of On baking technology

ieee cloud 2015 keynote talk