A Day in the Life of a Data Scientist in an AI Company
Francesca Lazzeri & Jaya Mathew
@frlazzeri @mathew_jaya
Agenda
What is AI and Why is it so important?
Emerging AI technologies
AI Technologies: Computer Vision, Audio Processing, Natural Language Processing, Knowledge Representation, Machine Learning, Expert Systems
Illustrative Solutions: Virtual Agents, Identity Analytics, Cognitive Robotics, Speech Analytics, Recommendation Systems, Data Visualization, …
What is AI? To sense, comprehend and act
Sense: Computer vision and audio processing, for example, can actively perceive the world around them by acquiring and processing images, sounds and speech. The use of facial recognition at border control kiosks is one practical example of how this can improve productivity.
Comprehend: Natural language processing and inference engines can enable AI systems to analyse and understand the information collected. This technology is used to power the language translation feature of search engine results.
Act: An AI system can take action through technologies such as expert systems and inference engines, or undertake actions in the physical world. Auto-pilot features and assisted-braking capabilities in cars are examples of this.
Source: Accenture, Why artificial intelligence is the future of growth, April 2016
Intelligence, Cloud, Data
Insight by type of analytics:
Descriptive [Reports]: What happened?
Diagnostic [Interactive Dashboards]: Why did it happen?
Predictive [Machine Learning]: What will happen?
Prescriptive [Recommendations & Automation]: What should I do?
Systems of Intelligence
Engage your customers
Empower your employees
Optimize your operations
Transform your products
Data, Questions and Metrics
Example: Predict whether component X will fail in the next Y days
Example: Identifiers at the level you are predicting, relevant data collected & feature engineering using domain knowledge
Data is accurate. Example: Failures are really failures, human labels on root causes
Data is connected. Example: Machine information linkable to usage information
A lot of data. Example: Will be difficult to predict failure accurately with few examples
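The failure-prediction example above can be turned into training labels once usage records and failure records share an identifier. The following is a minimal Python sketch, not the authors' method: the machine IDs, dates, the helper name `label_failure_within`, and the 7-day value chosen for Y are all hypothetical.

```python
from datetime import date, timedelta


def label_failure_within(observation_day, failure_days, horizon_days):
    """Return 1 if any failure falls within horizon_days after observation_day, else 0."""
    window_end = observation_day + timedelta(days=horizon_days)
    return int(any(observation_day < day <= window_end for day in failure_days))


# Hypothetical usage records: (machine identifier, observation date).
usage_log = [
    ("machine-1", date(2019, 1, 1)),
    ("machine-1", date(2019, 1, 20)),
    ("machine-2", date(2019, 1, 1)),
]

# Hypothetical failure records, keyed by the same identifier so the two
# sources can be linked ("data is connected").
failures = {"machine-1": [date(2019, 1, 25)], "machine-2": []}

horizon_days = 7  # the "Y" in "fail in the next Y days" (assumed value)

labels = [
    (machine, day, label_failure_within(day, failures.get(machine, []), horizon_days))
    for machine, day in usage_log
]

for machine, day, label in labels:
    print(machine, day.isoformat(), label)
```

Only one of the three rows is a positive example here, which also illustrates why a small data set makes failures hard to predict.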
Business scenario | Key decision | Data Science question
Energy forecasting | Should I buy or sell energy contracts? | What will be the long/short-term demand for energy in a region?
Customer churn | Which customers should I prioritize to reduce churn? | What is the probability of churn within X days for each customer?
Personalized marketing | What product should I offer first? | What is the probability that a customer will purchase each product?
Product feedback | Which service/product needs attention? | What is the social media sentiment for each service/product?
Defining Performance Metrics
Establish a Qualitative Objective. Example: Reduce user churn
Translate into a Quantifiable Business Metric. Example: Reduce the fraction of users with 4-week inactivity
Quantify the Metric Value Improvement. Example: Reduce the fraction of users with 4-week inactivity by 20%
Establish a Baseline. Example: Current fraction of users with 4-week inactivity = 60%
Correlation with the Data Science Metric. Example: A statistically significant A/B test is a clean way to show this. If that is difficult, compare the values of the metric before and after the solution
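The example numbers above can be worked through directly. The sketch below is an illustration under stated assumptions: the slides do not say whether "by 20%" is a relative or an absolute reduction, so both readings are computed, and the A/B counts are invented. The significance check is a pooled two-proportion z-test, which is one common choice and not necessarily what the authors used.

```python
import math


def target_fraction(baseline, reduction, relative=True):
    """Target metric value after a relative or absolute reduction."""
    if relative:
        return baseline * (1.0 - reduction)
    return baseline - reduction


def two_proportion_z_test(count_a, n_a, count_b, n_b):
    """Two-sided pooled z-test for a difference between two proportions."""
    p_a = count_a / n_a
    p_b = count_b / n_b
    pooled = (count_a + count_b) / (n_a + n_b)
    std_error = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n_a + 1.0 / n_b))
    z = (p_a - p_b) / std_error
    # For a standard normal, the two-sided tail probability is erfc(|z| / sqrt(2)).
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_value


baseline = 0.60  # current fraction of users with 4-week inactivity (from the slide)
relative_target = target_fraction(baseline, 0.20, relative=True)   # 20% of the baseline
absolute_target = target_fraction(baseline, 0.20, relative=False)  # 20 percentage points

# Hypothetical A/B test: inactive users out of 1000 in control (A) and treatment (B).
z, p_value = two_proportion_z_test(600, 1000, 480, 1000)

print(f"relative target: {relative_target:.2f}, absolute target: {absolute_target:.2f}")
print(f"z = {z:.2f}, significant at 5%: {p_value < 0.05}")
```

Stating which reading of "20%" is intended matters: the two targets differ by eight percentage points.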
Understanding the ML workflow & the Team Data Science Process
Workflow stages: Understand Business Goals, Understand Data, Discover/Gather Data, Ingest Data, Transform Data, Create Model, Deploy Model, Monitor/Maintain Model, Share Results with Business Owners
Throughout: Collaboration & Version Control, Documentation
Feedback: Respond to changes/lessons (Debug, Fix, Enhance, etc.)
Model lifecycle: Build a model, Publish a model, Consume a model
Apps, Services, Data Engines
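The build, publish and consume steps can be illustrated with a toy example. This is a sketch using only the Python standard library, not the tooling the deck describes: the one-feature threshold classifier, the JSON file format, the function names, and the training data are all assumptions made for illustration.

```python
import json
import os
import tempfile


def build_model(examples):
    """Build: fit a one-feature threshold classifier.

    The threshold is the midpoint between the mean feature value of each class.
    """
    positives = [value for value, label in examples if label == 1]
    negatives = [value for value, label in examples if label == 0]
    mean_pos = sum(positives) / len(positives)
    mean_neg = sum(negatives) / len(negatives)
    return {
        "threshold": (mean_pos + mean_neg) / 2.0,
        "positive_above": mean_pos > mean_neg,
    }


def publish_model(model, path):
    """Publish: persist the model so that another process can load it."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(model, handle)


def consume_model(path, value):
    """Consume: load the published model and score one new value."""
    with open(path, encoding="utf-8") as handle:
        model = json.load(handle)
    above = value > model["threshold"]
    return int(above == model["positive_above"])


# Hypothetical training data: (feature value, label).
training_examples = [(10.0, 0), (12.0, 0), (30.0, 1), (34.0, 1)]

with tempfile.TemporaryDirectory() as workdir:
    model_path = os.path.join(workdir, "model.json")
    model = build_model(training_examples)
    publish_model(model, model_path)
    prediction_high = consume_model(model_path, 40.0)
    prediction_low = consume_model(model_path, 5.0)

print(model, prediction_high, prediction_low)
```

Keeping the three steps as separate functions mirrors the separation in the workflow: the consumer only needs the published artifact, not the training code.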
aka.ms/TeamDataScience
Domain expert
Solution Architect
Data Scientist
Visualization Expert
Project Manager
Executive Sponsorship (IT & Business)
Data Engineer
Integration Engineer
Azure AI
AI apps & agents: Azure Bot Service, Azure Cognitive Services
Knowledge mining: Azure Cognitive Search
Machine learning: Azure Databricks, Azure Machine Learning
Sophisticated pretrained models: To simplify solution development (Cognitive Services: Vision, Speech, Language, Azure Search, …)
Popular frameworks: To build advanced deep learning solutions (TensorFlow, Keras, PyTorch, ONNX)
Productive services: To empower data science and development teams (Azure Databricks, Azure Machine Learning, Machine Learning VMs)
Powerful infrastructure: To accelerate deep learning
Flexible deployment: To deploy and manage models on intelligent cloud and edge (On-premises, Cloud, Edge)
The Team Workspace
Model Deployment
MNIST Dataset
http://ufldl.stanford.edu/wiki/index.php/Using_the_MNIST_Dataset
Deployment inputs: scoring file (.py), Python environment
Deployment targets: Azure Kubernetes Service (AKS) or Azure Container Instances (ACI)
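The slides do not show the contents of the scoring file, so the following is only a sketch of its usual shape. Azure Machine Learning entry scripts conventionally define an `init()` function that loads the model once and a `run()` function that scores each request. The file name `model.json`, the `"data"` request key, and the stand-in linear model (which simply averages the 784 pixel values of a 28x28 MNIST image) are assumptions made so the example runs locally; it is not a digit classifier.

```python
import json
import os

model = None  # set once by init()


def init():
    """Load the model once, when the service container starts."""
    global model
    # Azure Machine Learning sets AZUREML_MODEL_DIR inside a deployed container.
    # "model.json" is a hypothetical file name used only for this sketch.
    model_dir = os.environ.get("AZUREML_MODEL_DIR")
    if model_dir is not None:
        with open(os.path.join(model_dir, "model.json"), encoding="utf-8") as handle:
            model = json.load(handle)
    else:
        # Local stand-in: averages the 784 pixel values of a 28x28 image.
        model = {"weights": [1.0 / 784] * 784, "bias": 0.0}


def run(raw_data):
    """Score one request: parse the JSON payload and return a JSON-serializable result."""
    try:
        rows = json.loads(raw_data)["data"]
        scores = [
            sum(w * x for w, x in zip(model["weights"], row)) + model["bias"]
            for row in rows
        ]
        return {"scores": scores}
    except Exception as error:
        # Report the failure to the caller instead of crashing the service.
        return {"error": str(error)}


# Local smoke test; in a deployed service the platform calls init() and run().
init()
response = run(json.dumps({"data": [[0.0] * 784, [1.0] * 784]}))
print(len(response["scores"]))
```

Loading the model in `init()` instead of `run()` avoids re-reading it on every request, which is the main reason for the two-function split.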
Learn more
• Azure Machine Learning Services: https://aka.ms/AMLServices
• Visual Studio Code Tools for AI: https://aka.ms/VSCodeToolsAI
• Data Science Virtual Machine: https://aka.ms/AzureDSVM
• Team Data Science Process: https://aka.ms/TeamDataScience
• MNIST Dataset: http://ufldl.stanford.edu/wiki/index.php/Using_the_MNIST_Dataset
Thank you!
Francesca Lazzeri & Jaya Mathew
