SlideShare a Scribd company logo
Explore data
Code one or
multiple
models
Experiment
and train the
model
1.Evaluation
& error
analysis
Balance
between
underfitting
& overfitting
Improve the
model
performance
1.Clean
incorrectly
labeled data
1.Get more
data or
synthesize
artificial data
•Define a
Neural
Network
strategy
•NN, CNN,
RNN, …
Business
problem
•Clean
•Format
•Label
•Pre-process
Get Data
•Training
•Cross-
validation
•Test
Split
Datasets
with as many cycles
(epochs) as needed
Underfitting (high bias)
Overfitting (high variance
Multiple if the
infrastructure
allows it(if needed)
(if needed)
Expose
model to
real world
data
Real World
data
Insert
model into
production
system
Production
Tune
model’s
hyper-
parameters
Keep
tuning
Start
Customer
Acceptance
Many sources
Streaming vs Static
Structured vs Unstructured
Model accuracy is high?
Start iterate here
as many time as needed
Regularization,
Dropouts,
Hyper-parameters,
Network architecture,
Optimizers &
Loss function, ...
Transfer learning from
existing model?
Training loss
Validation loss
Accuracy
Vincent Pommier, Sept 2018

More Related Content

PDF
Approach AI assurance
PDF
H2O World - Ensembles with Erin LeDell
PPTX
Top 10 Data Science Practioner Pitfalls - Mark Landry
PDF
Top 10 Data Science Practitioner Pitfalls
PPTX
Supervised Machine Learning
PPTX
H2O World - Top 10 Data Science Pitfalls - Mark Landry
PPTX
LinkedIn talk at Netflix ML Platform meetup Sep 2019
PDF
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
Approach AI assurance
H2O World - Ensembles with Erin LeDell
Top 10 Data Science Practioner Pitfalls - Mark Landry
Top 10 Data Science Practitioner Pitfalls
Supervised Machine Learning
H2O World - Top 10 Data Science Pitfalls - Mark Landry
LinkedIn talk at Netflix ML Platform meetup Sep 2019
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...

Similar to Data science neural network project life cycle (20)

PDF
Measuring Data Quality with DataOps
PDF
Machine learning systems for engineers
PPTX
AWS Forcecast: DeepAR Predictor Time-series
PDF
Mind Map Test Data Management Overview
PDF
Making Netflix Machine Learning Algorithms Reliable
PPTX
Techniques for effective test data management in test automation.pptx
PPTX
Supervised learning
PPTX
An Agile Approach to Machine Learning
PPTX
MACHINE LEARNING for BCA students .pptx
PPTX
Fine_Tuning_Datasets_and_Techniques.pptx
PDF
Guiding through a typical Machine Learning Pipeline
PPTX
Ml2 production
PDF
Big Data Analytics for connected home
PPT
Automated Testing with Databases
PDF
Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...
PDF
BI Chapter 04.pdf business business business business
PPTX
Deliveinrg explainable AI
PDF
Machine_Learning_Overview_Presentation_1.pdf
PPTX
Machine_Learning_Overview_Presentation_1.pptx
Measuring Data Quality with DataOps
Machine learning systems for engineers
AWS Forcecast: DeepAR Predictor Time-series
Mind Map Test Data Management Overview
Making Netflix Machine Learning Algorithms Reliable
Techniques for effective test data management in test automation.pptx
Supervised learning
An Agile Approach to Machine Learning
MACHINE LEARNING for BCA students .pptx
Fine_Tuning_Datasets_and_Techniques.pptx
Guiding through a typical Machine Learning Pipeline
Ml2 production
Big Data Analytics for connected home
Automated Testing with Databases
Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...
BI Chapter 04.pdf business business business business
Deliveinrg explainable AI
Machine_Learning_Overview_Presentation_1.pdf
Machine_Learning_Overview_Presentation_1.pptx
Ad

Recently uploaded (20)

PDF
Getting Started with Data Integration: FME Form 101
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Mushroom cultivation and it's methods.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Encapsulation theory and applications.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Enhancing emotion recognition model for a student engagement use case through...
PPTX
A Presentation on Touch Screen Technology
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPTX
1. Introduction to Computer Programming.pptx
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
August Patch Tuesday
Getting Started with Data Integration: FME Form 101
Unlocking AI with Model Context Protocol (MCP)
Mushroom cultivation and it's methods.pdf
Approach and Philosophy of On baking technology
Univ-Connecticut-ChatGPT-Presentaion.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Encapsulation theory and applications.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Enhancing emotion recognition model for a student engagement use case through...
A Presentation on Touch Screen Technology
Group 1 Presentation -Planning and Decision Making .pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Zenith AI: Advanced Artificial Intelligence
gpt5_lecture_notes_comprehensive_20250812015547.pdf
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
1. Introduction to Computer Programming.pptx
WOOl fibre morphology and structure.pdf for textiles
MIND Revenue Release Quarter 2 2025 Press Release
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
August Patch Tuesday
Ad

Data science neural network project life cycle

  • 1. Explore data Code one or multiple models Experiment and train the model 1.Evaluation & error analysis Balance between underfitting & overfitting Improve the model performance 1.Clean incorrectly labeled data 1.Get more data or synthesize artificial data •Define a Neural Network strategy •NN, CNN, RNN, … Business problem •Clean •Format •Label •Pre-process Get Data •Training •Cross- validation •Test Split Datasets with as many cycles (epochs) as needed Underfitting (high bias) Overfitting (high variance Multiple if the infrastructure allows it(if needed) (if needed) Expose model to real world data Real World data Insert model into production system Production Tune model’s hyper- parameters Keep tuning Start Customer Acceptance Many sources Streaming vs Static Structured vs Unstructured Model accuracy is high? Start iterate here as many time as needed Regularization, Dropouts, Hyper-parameters, Network architecture, Optimizers & Loss function, ... Transfer learning from existing model? Training loss Validation loss Accuracy Vincent Pommier, Sept 2018