SlideShare a Scribd company logo
Applications of
Data Science in
Microsoft Cloud
Products
Lisa Cohen
MSFT Cloud Data Sciences
linkedin.com/in/cohenlisa
About me
Applied Math
Bachelor &
Masters
VS Languages & IDE PM
Shipped 3 major releases
VS Telemetry Mgr
Product & business
analytics
Principal Data
Science Leader
Cross-functional org leader
Experimentation, Machine
Learning, Data Science,
Data Vis, Prod Mgmt
E2E Azure business,
Help customers succeed
on the cloud
Sr Community PM
Strategy, Communications,
Keynote speaker
UW Data Science
Board member
Author & Speaker
Data science and
career topics
linkedin.com/in/cohenlisa
medium.com/@lisa_cohen
Community leader: Data Science,
Women, Mentors, Managers, Book club
3
Agenda
• Ensuring success from the project kickoff
• Data Science in action:
• Analytics
• Experiments
• ML models
• Enterprise considerations
Data Science at Microsoft
Data Scientist
Analytics & Inference
Statistical analysis &
experiments
Machine Learning
Scientist/Engineer
Production Models
Develop predictive &
prescriptive models, MLOps
Data Engineer
Data Platform & Pipelines
Build the data platform
Product Manager
Planning & Stakeholder
Engagement
Manage the data science
process
Data Science
Lifecycle
Problem &
Hypothesis
Design
Approach
Data
Acquisition
&
Exploration
Analysis &
Predictive
Modeling
Evaluation &
Reviews
Deployment
&
Socialization
Data Science project kickoff:
Analysis, Experiment, Model
What new capability will this enable?
What decision/action will you take?
What’s the expected impact?
Deliver what users want (vs what ask for)
Leverage the power of data science
Analytics problems
 State of the business
 Exploration, distributions, data insights
 Define funnel, calculate LTV
 Establish goals (OKRs)
 Trends and root cause analysis
 Retention curves
 Cohort segmentation
48.3%
29.5%
22.0% 19.5%
15.0%
6.7%
<1 1 2 3 4 5-6
Tenure (in years)
Cohorting (Churn sample data)
Confidence intervals (CSAT sample data)
Funnel stages (Free account signup)
May 2019
Mar 2019
Jan 2019
50%
60%
70%
80%
90%
100%
0 1 2 3 4 5 6 7 8 9 10 11 12
Retention
Month
Retention curves by cohort start month
Experimentation & Causal Inference
 Accelerate learning and innovation
 Online, application and program experiments
 Designing experiments
 Outcome metrics, control group, power analysis, duration
 Analyzing experiments
 Tracking primary and secondary metrics
 Considering bias, ethics and fairness
 Causal inference scenarios
Usage
Time
User Engagement
Control
Treatment
EXP Platform
MS Research
Hypothesis
Design
Experiment
Run
Experiment
Analyze
Experiment
Act on
Results
Understanding the
customer
 Customer
Segmentation
 Customer Journey &
Service Combinations
Awareness
 Doc Recommender,
Attribution, Search
 Marketplace Search
& Recommender
Buy
 Lifetime value
 Azure Cost
Management
Engagement
 Service/Solution
Recommender
 Anomaly Detection
Nurture
 Program
Recommender
 Support ticket
classification &
prioritization
Retention
 Predict churn risk &
support needs
Acquisition
 First workload
 Propensity models
Customer-centric approach, aligned with lifecycle stages:
Machine Learning Models
 Predictive and prescriptive models, with actionable insights & recommendations
 Internal and external facing
 Production vs learning
The data science process for machine learning
Business
understanding
Modeling
Data
acquisition
&
understanding
Deployment
Scoring,
performance
monitoring, etc.
Data source
On premises vs. cloud
Database vs. files
Pipeline
Streaming vs. batch
Low vs. high frequency
Environment
On premises vs. cloud
Database vs. data lake vs. …
Small vs. medium vs. big data
Wrangling,
exploration,
& cleaning
Structured vs. unstructured
Data validation & cleanup
Visualization
Feature
engineering
Transform, binning
Temporal, text, image
Feature selection
Model
training
Algorithms, ensemble
Parameter tuning
Retraining
Model management
Model
evaluation
Cross-validation
Model reporting
A/B testing
Intelligent
applications
Web
services
Model
store
Customer
acceptance
https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/lifecycle
Data Science in the real world
 Value of data quality, data cleaning and data set improvements
 Skewed populations
 Model reliability, SLAs, data contracts, maintenance
 Model explain-ability and simplicity
 Done is better than perfect (consider time to market)
 Tuning models for business use cases
 Socialize and integrate into existing tools
 Causation vs correlation
11
Summary
Project Kickoff
• Clarify business goals
• Determine the
decision or action
Data Science
Products
• Analytics
• Experiments
• ML Models
Data Science in the
Enterprise
• Data quality
• Model explain-ability
Keep in Touch
https://www.linkedin.com/in/cohenlisa
https://medium.com/@lisa_cohen
lisa.cohen.h@gmail.com
© Copyright Microsoft Corporation. All rights reserved.

More Related Content

PPTX
Hyf azure ml_1
PPTX
Tips for Effective Data Science in the Enterprise
PPTX
Webinar: Question Answering and Virtual Assistants with Deep Learning
PDF
H2O World - Machine Learning for non-data scientists
PPTX
Data science 101 Masterclass
PDF
Data Driven Engineering 2014
PDF
Barga ACM DEBS 2013 Keynote
PDF
Data science vs. Data scientist by Jothi Periasamy
Hyf azure ml_1
Tips for Effective Data Science in the Enterprise
Webinar: Question Answering and Virtual Assistants with Deep Learning
H2O World - Machine Learning for non-data scientists
Data science 101 Masterclass
Data Driven Engineering 2014
Barga ACM DEBS 2013 Keynote
Data science vs. Data scientist by Jothi Periasamy

What's hot (20)

PDF
CRISP-DM: a data science project methodology
PPTX
Leveraging Data Science in the Automotive Industry
PDF
H2O World - Advanced Analytics at Macys.com - Daqing Zhao
PPTX
PDF
From Raw Data to Deployed Product. Fast & Agile with CRISP-DM
PDF
CRISP-DM - Agile Approach To Data Mining Projects
PDF
H2O World - What you need before doing predictive analysis - Keen.io
PDF
Putting data science in your business a first utility feedback
PPT
IBM Watson - Introduction to IBM Watson
PDF
Barga Galvanize Sept 2015
PPT
PPTX
Max diff scaling for research access(4)
PDF
H2O World - Intro to Data Science with Erin Ledell
PPTX
The Other 99% of a Data Science Project
PPTX
Data analytics
PDF
20151016 Data Science For Project Managers
PPTX
Domino and AWS: collaborative analytics and model governance at financial ser...
PPT
Micropanel Webinar 8 24-11
PPTX
Choosing the Right Document Processing Solution for Healthcare Organizations
PDF
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
CRISP-DM: a data science project methodology
Leveraging Data Science in the Automotive Industry
H2O World - Advanced Analytics at Macys.com - Daqing Zhao
From Raw Data to Deployed Product. Fast & Agile with CRISP-DM
CRISP-DM - Agile Approach To Data Mining Projects
H2O World - What you need before doing predictive analysis - Keen.io
Putting data science in your business a first utility feedback
IBM Watson - Introduction to IBM Watson
Barga Galvanize Sept 2015
Max diff scaling for research access(4)
H2O World - Intro to Data Science with Erin Ledell
The Other 99% of a Data Science Project
Data analytics
20151016 Data Science For Project Managers
Domino and AWS: collaborative analytics and model governance at financial ser...
Micropanel Webinar 8 24-11
Choosing the Right Document Processing Solution for Healthcare Organizations
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Ad

Similar to Applications of Data Science in Microsoft Cloud Products (20)

PPTX
Tips and Tricks to be an Effective Data Scientist
PPT
Institutional Data Management Blueprint
PPTX
Impact of Data Science
PDF
Azure Machine Learning
PPT
UK Digital Curation Centre: enabling research data management at the coalface
PDF
Unlock the power of information: Data Science Course In Kerala
PDF
Understanding-the-Data-Science-Lifecycle
PDF
Open Source Framework for Deploying Data Science Models and Cloud Based Appli...
PDF
UXPA 2023: Data science and UX: Smarter together
PPTX
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
PPTX
Introduction-FODS-fundamantals of data science
PDF
Data science mastery course in pitampura
PDF
Mastering Predictive Analytics with R 2nd edition Edition Forte
PDF
Data-Driven Organisation
PPTX
Data Science Mastery Course in Pitampura
PPTX
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
PPTX
A Practical-ish Introduction to Data Science
PPT
Delivering Value Through Business Analytics
PDF
Mastering Predictive Analytics with R 2nd edition Edition Forte
PPTX
Text Analytics for Legal work
Tips and Tricks to be an Effective Data Scientist
Institutional Data Management Blueprint
Impact of Data Science
Azure Machine Learning
UK Digital Curation Centre: enabling research data management at the coalface
Unlock the power of information: Data Science Course In Kerala
Understanding-the-Data-Science-Lifecycle
Open Source Framework for Deploying Data Science Models and Cloud Based Appli...
UXPA 2023: Data science and UX: Smarter together
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Introduction-FODS-fundamantals of data science
Data science mastery course in pitampura
Mastering Predictive Analytics with R 2nd edition Edition Forte
Data-Driven Organisation
Data Science Mastery Course in Pitampura
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data Science
Delivering Value Through Business Analytics
Mastering Predictive Analytics with R 2nd edition Edition Forte
Text Analytics for Legal work
Ad

Recently uploaded (20)

PDF
Fluorescence-microscope_Botany_detailed content
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
Business Analytics and business intelligence.pdf
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
Lecture1 pattern recognition............
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPT
Quality review (1)_presentation of this 21
Fluorescence-microscope_Botany_detailed content
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Qualitative Qantitative and Mixed Methods.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
IB Computer Science - Internal Assessment.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Business Analytics and business intelligence.pdf
Business Ppt On Nestle.pptx huunnnhhgfvu
Lecture1 pattern recognition............
climate analysis of Dhaka ,Banglades.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Data_Analytics_and_PowerBI_Presentation.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
STUDY DESIGN details- Lt Col Maksud (21).pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Quality review (1)_presentation of this 21

Applications of Data Science in Microsoft Cloud Products

  • 1. Applications of Data Science in Microsoft Cloud Products Lisa Cohen MSFT Cloud Data Sciences linkedin.com/in/cohenlisa
  • 2. About me Applied Math Bachelor & Masters VS Languages & IDE PM Shipped 3 major releases VS Telemetry Mgr Product & business analytics Principal Data Science Leader Cross-functional org leader Experimentation, Machine Learning, Data Science, Data Vis, Prod Mgmt E2E Azure business, Help customers succeed on the cloud Sr Community PM Strategy, Communications, Keynote speaker UW Data Science Board member Author & Speaker Data science and career topics linkedin.com/in/cohenlisa medium.com/@lisa_cohen Community leader: Data Science, Women, Mentors, Managers, Book club
  • 3. 3 Agenda • Ensuring success from the project kickoff • Data Science in action: • Analytics • Experiments • ML models • Enterprise considerations
  • 4. Data Science at Microsoft Data Scientist Analytics & Inference Statistical analysis & experiments Machine Learning Scientist/Engineer Production Models Develop predictive & prescriptive models, MLOps Data Engineer Data Platform & Pipelines Build the data platform Product Manager Planning & Stakeholder Engagement Manage the data science process
  • 5. Data Science Lifecycle Problem & Hypothesis Design Approach Data Acquisition & Exploration Analysis & Predictive Modeling Evaluation & Reviews Deployment & Socialization Data Science project kickoff: Analysis, Experiment, Model What new capability will this enable? What decision/action will you take? What’s the expected impact? Deliver what users want (vs what ask for) Leverage the power of data science
  • 6. Analytics problems  State of the business  Exploration, distributions, data insights  Define funnel, calculate LTV  Establish goals (OKRs)  Trends and root cause analysis  Retention curves  Cohort segmentation 48.3% 29.5% 22.0% 19.5% 15.0% 6.7% <1 1 2 3 4 5-6 Tenure (in years) Cohorting (Churn sample data) Confidence intervals (CSAT sample data) Funnel stages (Free account signup) May 2019 Mar 2019 Jan 2019 50% 60% 70% 80% 90% 100% 0 1 2 3 4 5 6 7 8 9 10 11 12 Retention Month Retention curves by cohort start month
  • 7. Experimentation & Causal Inference  Accelerate learning and innovation  Online, application and program experiments  Designing experiments  Outcome metrics, control group, power analysis, duration  Analyzing experiments  Tracking primary and secondary metrics  Considering bias, ethics and fairness  Causal inference scenarios Usage Time User Engagement Control Treatment EXP Platform MS Research Hypothesis Design Experiment Run Experiment Analyze Experiment Act on Results
  • 8. Understanding the customer  Customer Segmentation  Customer Journey & Service Combinations Awareness  Doc Recommender, Attribution, Search  Marketplace Search & Recommender Buy  Lifetime value  Azure Cost Management Engagement  Service/Solution Recommender  Anomaly Detection Nurture  Program Recommender  Support ticket classification & prioritization Retention  Predict churn risk & support needs Acquisition  First workload  Propensity models Customer-centric approach, aligned with lifecycle stages: Machine Learning Models  Predictive and prescriptive models, with actionable insights & recommendations  Internal and external facing  Production vs learning
  • 9. The data science process for machine learning Business understanding Modeling Data acquisition & understanding Deployment Scoring, performance monitoring, etc. Data source On premises vs. cloud Database vs. files Pipeline Streaming vs. batch Low vs. high frequency Environment On premises vs. cloud Database vs. data lake vs. … Small vs. medium vs. big data Wrangling, exploration, & cleaning Structured vs. unstructured Data validation & cleanup Visualization Feature engineering Transform, binning Temporal, text, image Feature selection Model training Algorithms, ensemble Parameter tuning Retraining Model management Model evaluation Cross-validation Model reporting A/B testing Intelligent applications Web services Model store Customer acceptance https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/lifecycle
  • 10. Data Science in the real world  Value of data quality, data cleaning and data set improvements  Skewed populations  Model reliability, SLAs, data contracts, maintenance  Model explain-ability and simplicity  Done is better than perfect (consider time to market)  Tuning models for business use cases  Socialize and integrate into existing tools  Causation vs correlation
  • 11. 11 Summary Project Kickoff • Clarify business goals • Determine the decision or action Data Science Products • Analytics • Experiments • ML Models Data Science in the Enterprise • Data quality • Model explain-ability
  • 13. © Copyright Microsoft Corporation. All rights reserved.

Editor's Notes

  • #7: Telling the state of the business in numbers Illustrating the business scenario and modeling in the funnel
  • #12: https://pixabay.com/photos/raise-challenge-landscape-mountain-3338589/
  • #13: https://www.flaticon.com/free-icon/linkedin_174857 https://medium.com/@Medium