The Evolution of Data Science
Kenny Daniel
CTO, Algorithmia
July 24, 2015
Kenny Daniel - CTO, Algorithmia
• Graduate research in Artificial Intelligence and Mechanism Design
• Multiple published algorithms and papers in Machine Learning
• Received $1 million from DOT “Engineering Tomorrow’s Transportation Market”
• B.S. Carnegie Mellon University, M.S., Ph.D. (on leave) USC
• Data Scientist and Computer Vision specialist for Delectable, Inc
• Initial and current overall architect of Algorithmia Platform
Make state-of-the-art algorithms
accessible and discoverable by
everyone.
Evolution of Data Science
● History of data science
● Modern data science
● Future speculation
Pre-cloud
● Mainframes
● Universities
● Research Facilities
● Finance
● PhD researchers, highly specialized
More pre-planning, less exploratory
Source and Inspiration: http://www.slideshare.net/AlbertWenger/the-no-stackstartup
1990s
● Connectivity: $10,000 per month
● Servers: $20,000 per box
● Storage: $1,000/GB
2000s
● Connectivity: $1,000 per month
● Servers: $1,000 per box
● Storage: $10/GB
2010s
● Connectivity: 10 cents/GB
● Servers: 20 cents/hour
● Storage: 12 cents/GB
NOW
● Backend using Parse
● Search using Algolia
● Synchronization using Firebase
● Video calls and SMS using Twilio
● Payments using Stripe
● Video recording using Ziggeo
● Send and track emails using Mailgun
● Customer service using Intercom
● Ship product using Shyp
“no one got fired for using AWS”
cost, security, convenience
“We used to leak memory.
Now we leak instances.
Soon we will leak entire data centers.”
- Dan Kaminsky
Previously, data analysis was done by domain experts
Now, shift toward data science as its own field
A new field is born
“Hi, I’m a Data Scientist”
Lots of Data
Little Intelligence
“Data is inherently dumb. It doesn’t actually do anything unless
you know how to use it...
The next digital gold rush will be focused on how you do
something with data.”
- Peter Sondergaard (Gartner Research)
1990s
● Technology: HPC, Mainframes
● Users: Researchers, hw engineers, committees
2000s
● Technology: In-house clusters
● Users: Corporations, tech startups
2010s
● Technologies: Cloud, Hadoop, Spark
● Users: Individual data scientists
NOW
● Generalist Big Data such as Amazon EMR
● Large Data Processing such as Databricks
● Real Time Processing such as Amazon Kinesis
● Data Repositories such as Socrata
● Data Collectors such as Kimono
● DSaaS for Customer Analytics such as Captricity
● DSaaS for Marketing such as Acxiom
● DSaaS for Security such as Fortscale
● Hosted Machine Learning such as BigML, Dato
● Algorithms-as-a-Service such as Algorithmia
Behold...
Data Science
in a Spreadsheet
Future of Data Science
● How will these trends continue?
● What will future tools look like?
● What is the role of data scientists going forward?
Future… new data sources
Data is less structured, and less amenable to traditional data analysis without pre-processing
● Unstructured text
● Images
● Video
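To make the pre-processing point concrete, here is a minimal sketch of turning unstructured text into a structured table of word counts (a "bag of words"). It is an illustration only, not something shown in the talk: the function name `bag_of_words`, the tokenization rule, and the two sample sentences are all assumptions made for this example.

```python
import re
from collections import Counter

def bag_of_words(text):
    """Lowercase the text, split it into word tokens, and count each token."""
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(tokens)

# Hypothetical sample documents, invented for this sketch.
docs = [
    "Data is inherently dumb.",
    "The next gold rush is data!",
]

# One column per distinct word, one row per document.
counts = [bag_of_words(d) for d in docs]
vocab = sorted(set().union(*counts))
rows = [[c[word] for word in vocab] for c in counts]

if __name__ == "__main__":
    print(vocab)
    for row in rows:
        print(row)
```

Once the text is in this tabular form, traditional analysis tools can work with it; images and video need analogous (and heavier) feature extraction steps.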
Future… building blocks
Topic Analysis
Twitter, YouTube, Satellite Imagery
Computer Vision
Artificial Neural Networks
Future… more autonomous
AutoML
Ensemble learning
Hyperparameter optimization
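As a rough illustration of two of the terms above, the sketch below runs a tiny grid search (one simple form of hyperparameter optimization) over the `k` of a k-nearest-neighbours classifier, then combines several settings by majority vote (one simple form of ensemble learning). Everything here is an assumption for the example: the toy data, the function names, and the choice of k-NN. It is not the method used by any particular AutoML tool.

```python
from collections import Counter

# Toy 1-D data, invented for this sketch: small values are class 0,
# large values are class 1, and (3.5, 1) is a deliberately noisy label.
train = [(1.0, 0), (2.0, 0), (3.0, 0), (3.5, 1), (4.0, 0),
         (6.0, 1), (7.0, 1), (8.0, 1), (9.0, 1)]
valid = [(2.5, 0), (3.4, 0), (5.5, 1), (7.5, 1)]

def knn_predict(k, x):
    """Classify x by majority vote among its k nearest training points."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

def accuracy(k, data):
    """Fraction of points in data that k-NN labels correctly."""
    return sum(knn_predict(k, x) == y for x, y in data) / len(data)

def grid_search(candidates, data):
    """Score every candidate k on held-out data and return (best_k, score)."""
    scores = {k: accuracy(k, data) for k in candidates}
    best_k = max(scores, key=scores.get)
    return best_k, scores[best_k]

def ensemble_predict(ks, x):
    """Majority vote across several k-NN models, one per value of k."""
    votes = Counter(knn_predict(k, x) for k in ks)
    return votes.most_common(1)[0][0]

if __name__ == "__main__":
    # Odd values of k avoid tied votes with two classes.
    print(grid_search([1, 3, 5], valid))
    print(ensemble_predict([1, 3, 5], 3.4))
```

On this toy data the single-neighbour model is misled by the noisy point near 3.5, while larger `k` values and the voted ensemble are not, which is the kind of choice an automated tool would make on the user's behalf.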
JOIN:
algorithmia.com/signup?invite=SeattleDS
(will post to meetup group)
REACH OUT:
kenny@algorithmia.com
