SlideShare a Scribd company logo
Machine Learning
Algorithm
Novan Parmonangan Simanjuntak
Head of Machine Learning and AI Strategy
novanps
novan.p.simanjuntak@glair.ai
glair glair.ai hi@glair.ai
Contact Us
01
02
03
04
05
glair.ai
Intro to Machine Learning
Machine Learning Workflow
01
02
OUTLINE
2
glair.ai
Intro to
Machine
Learning 01
3
3
AI
ML
A program that can sense, reason,
act, and adapt
Algorithms learns from data
Artificial neural networks (inspired by
brain) learn from data
ARTIFICIAL INTELLIGENCE
MACHINE LEARNING
DL
AI, ML & DL
DEEP LEARNING
4
towardsdatascience.com
Neural Network
towardsdatascience.com
5
Why ML: Example
Output/Label:
Do I want to go to outside on the weekend given data? (Answer: Yes or No)
Input:
● Weather: Rainy, Sunny
● Distance to destination (km)
● Invited by friends: yes, no
Rules:
1. Initially, score = 0
2. If the weather is rainy then (score + 10), else
if the weather is sunny then (score - 10)
3. score + distance
4. If invited by friends then (score -10), else if
not invited by friends then (score + 10)
5. If score < 10 then Yes, else No
No Weather Distance Invited by Friends Score Go on Weekend? (Answers)
1 Rainy 8 Yes 10+8-10 = 8 Yes
2 Sunny 20 No -10+20+10 = 20 No
3 Rainy 5 No 10+5+10 = 25 No
Data:
6
Why ML: Example
Let’s model it using AI (Model: Artificial Neural Network)
7
Why ML: Example
Let’s model it using Artificial Neural Network
8
>=10
Example:
● Input: weather = Rainy, Distance = 8, Invited by Friends = Yes
● Score = 1 * 10 + 0 * -10 + 8 * 1 + 1 * -10 + 0 * 10 = 8
● Score < 10, so the Output is Yes
Why ML?: A New Paradigm
9
Traditional Programming
Input
Rules
Output
Machine Learning
Input
Output
Rules
Why ML?
No Weather Distance (km) Invited by Friends Go on Weekend? (Answers)
1 Rainy 8 Yes Yes
2 Sunny 20 No No
3 Rainy 5 No No
ML: Find the Rules
10
Optimization : Finding Rules
11
● Determine Loss Function
● Update Rules/Weight to minimize loss
Why ML?
If you know the Rules Do Not Use ML
If not Use ML
12
Why DL?
13
14
AI Approach
Learning Type
● Supervised Learning
● Unsupervised Learning
● Reinforcement Learning
15
● Input and Output is Given
● Finding a rules/function f that maps a set of points X (input/predictor) to a set of labels Y
(output), based on given data (xi
, yi
).
● Categorized to:
○ Classification (categorical labels)
■ Credit Scoring
■ Fraud Detection
■ Recommendation System
■ Object Recognition
■ Spam Filtering
○ Regression (numerical labels)
■ Home Price Prediction
■ Stock Market Prediction
■ Demand Forecasting
Supervised Learning
16
Unsupervised Learning
● Only Input is given
● Find hidden patterns or underlying
structure in the given data
Document Clustering 17
search.carrotsearch.com/pertamina
Unsupervised Learning
18
Reinforcement Learning
19
20
youtube.com
21
youtube.com
glair.ai
Reinforcement Learning in Action
22
glair.ai
Machine
Learning
Workflow 02
23
23
glair.ai
Machine Learning Workflow
Business Problem
ML Problem Framing
Data Ingestion
24
24
• Transportation of data from varied sources
to a storage where it can be accessed, used, and analyzed
by an organization.
• The destination is typically a data warehouse, data mart, or database.
GDP LABS CONFIDENTIAL
Data Ingestion
01
25
glair.ai
Machine Learning Workflow
Business Problem
ML Problem Framing
Data Preparation
Data Ingestion
26
● Make it easier for ML algorithm to interpret the data.
● Categories:
○ Data cleaning:
■ Garbage In Garbage Out
■ identifying incomplete, incorrect, inaccurate or irrelevant parts of the
data and then replacing, modifying, or deleting the dirty data
○ Data encoding, normalization, resampling
○ Data Splitting (Training, Validation, Test)
GDP LABS CONFIDENTIAL
Data Preparation
02
27
glair.ai
Machine Learning Workflow
Business Problem
ML Problem Framing
Data Preparation
EDA
Data Ingestion
28
● Exploratory Data Analysis
Analyze data to summarize main characteristics, often using statistical
techniques or data visualization.
● Objectives:
○ Suggest hypotheses
○ Assess assumptions
○ Support the selection of appropriate tools, techniques, and features
GDP LABS CONFIDENTIAL
Exploratory Data Analysis (EDA)
03
29
glair.ai
Machine Learning Workflow
Business Problem
ML Problem Framing
Data Preparation
EDA
Feature Engineering
Data Ingestion
30
● Feature engineering can use domain knowledge to extract new features from
raw data
● Objectives:
○ Improving the performance of machine learning models.
● Examples:
○ Grouping operations over a window (e.g, average)
○ Binning
○ Log transformation
GDP LABS CONFIDENTIAL
Feature Engineering
04
31
glair.ai
Machine Learning Workflow
Business Problem
ML Problem Framing
Data Preparation
EDA
Feature Engineering
Model Training and
Tuning
Data Ingestion
32
● Choose which model to try
● Simpler model is better
● Iteratively using a more complex model and features if needed
● Use a baseline model
● Maximize model performance using hyperparameter tuning
● Consider constraints:
○ cost, explainability, and speed
GDP LABS CONFIDENTIAL
Model Training and Tuning
05
33
glair.ai
Machine Learning Workflow
Business Problem
ML Problem Framing
Data Preparation
EDA
Feature Engineering
Model Training and
Tuning
Model Evaluation
Data Ingestion
34
● Evaluate the model with the test data
● Pick a suitable metrics for the problem.
○ There must be a Business and ML metrics for the problem
● If the business metrics achieved then continue with model deployment, if not
reiterate the model creation with data augmentation (adding more data) or feature
augmentation (adding other features)
GDP LABS CONFIDENTIAL
Model Evaluation
06
35
glair.ai
Machine Learning Workflow
Business Problem
ML Problem Framing
Data Preparation
EDA
Feature Engineering
Model Training and
Tuning
Model Evaluation Model Deployment
Data Ingestion
Are
Business
Goals
Met?
Data
Augmentation
Feature Augmentation
No Yes
36
● Model Serving/Inference
● Considerations:
○ How to wrap the prediction code as a production-ready service?
○ Which API / Protocol to use?
○ Scalability, Throughput, Latency.
○ Deployments
■ Model Versioning
■ Choose appropriate deployment strategy (e.g, A/B Testing)
GDP LABS CONFIDENTIAL
Model Deployment
07
37
glair.ai
Machine Learning Workflow
Business Problem
ML Problem Framing
Data Preparation
EDA
Feature Engineering
Model Training and
Tuning
Model Evaluation Model Deployment
Model Monitoring
Data Ingestion
Are
Business
Goals
Met?
Predictions
Data
Augmentation
No Yes
38
● Maintain model performance
● Things to be monitored:
○ Service Health
○ Data Quality & Integrity
○ Data & Target Drift
○ Bias/fairness
GDP LABS CONFIDENTIAL
Model Monitoring
08
39
Novan Parmonangan Simanjuntak
Head of Machine Learning and AI Strategy
novanps
novan.p.simanjuntak@glair.ai
glair glair.ai hi@glair.ai
Contact Us
Thank You!

More Related Content

PPTX
2024-02-24_Session 1 - PMLE_UPDATED.pptx
PDF
Choosing a Machine Learning technique to solve your need
PDF
Machine Learning: Past, Present and Future - by Tom Dietterich
PPTX
recent.pptx
PDF
Getting started with Machine Learning
PDF
Machine learning for IoT - unpacking the blackbox
PDF
Intro to machine learning
PDF
The Machine Learning Workflow with Azure
2024-02-24_Session 1 - PMLE_UPDATED.pptx
Choosing a Machine Learning technique to solve your need
Machine Learning: Past, Present and Future - by Tom Dietterich
recent.pptx
Getting started with Machine Learning
Machine learning for IoT - unpacking the blackbox
Intro to machine learning
The Machine Learning Workflow with Azure

Similar to D7 MarkPlus - Machine Learning Algorithm.pdf (20)

PDF
AI/ML Fundamentals to advanced Slides by GDG Amrita Mysuru.pdf
PPTX
A leap around AI
PDF
Main principles of Data Science and Machine Learning
PPTX
Artificial intelligence: Simulation of Intelligence
PPTX
Introduction.pptx about the mechine Learning
PDF
Debugging AI
PDF
Machine Learning: The First Salvo of the AI Business Revolution
PPTX
Artificial intelligence ( AI ) | Guide
PDF
Artificial Intelligence Masterclass for managers and business heads
PDF
10 more lessons learned from building Machine Learning systems - MLConf
PDF
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
PDF
10 more lessons learned from building Machine Learning systems
PPTX
Model Development And Evaluation in ML.pptx
PPTX
Machine Learning AND Deep Learning for OpenPOWER
PDF
AI Orange Belt - Session 2
PDF
DN18 | Technical Debt in Machine Learning | Jaroslaw Szymczak | OLX
PDF
Technical debt in ML | Jaroslaw Szymczak | DN18
PPTX
Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python”
PDF
Machine Learning and its Applications
PDF
Efficient Learning Machines Theories Concepts And Applications For Engineers ...
AI/ML Fundamentals to advanced Slides by GDG Amrita Mysuru.pdf
A leap around AI
Main principles of Data Science and Machine Learning
Artificial intelligence: Simulation of Intelligence
Introduction.pptx about the mechine Learning
Debugging AI
Machine Learning: The First Salvo of the AI Business Revolution
Artificial intelligence ( AI ) | Guide
Artificial Intelligence Masterclass for managers and business heads
10 more lessons learned from building Machine Learning systems - MLConf
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
10 more lessons learned from building Machine Learning systems
Model Development And Evaluation in ML.pptx
Machine Learning AND Deep Learning for OpenPOWER
AI Orange Belt - Session 2
DN18 | Technical Debt in Machine Learning | Jaroslaw Szymczak | OLX
Technical debt in ML | Jaroslaw Szymczak | DN18
Borys Rybak “Azure Machine Learning Studio & Azure Workbench & R + Python”
Machine Learning and its Applications
Efficient Learning Machines Theories Concepts And Applications For Engineers ...
Ad

Recently uploaded (20)

PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPT
Teaching material agriculture food technology
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
cuic standard and advanced reporting.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Spectroscopy.pptx food analysis technology
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
KodekX | Application Modernization Development
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
MYSQL Presentation for SQL database connectivity
Building Integrated photovoltaic BIPV_UPV.pdf
Encapsulation_ Review paper, used for researhc scholars
Teaching material agriculture food technology
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
cuic standard and advanced reporting.pdf
Spectral efficient network and resource selection model in 5G networks
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Review of recent advances in non-invasive hemoglobin estimation
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Spectroscopy.pptx food analysis technology
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Per capita expenditure prediction using model stacking based on satellite ima...
KodekX | Application Modernization Development
“AI and Expert System Decision Support & Business Intelligence Systems”
The Rise and Fall of 3GPP – Time for a Sabbatical?
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
MYSQL Presentation for SQL database connectivity
Ad

D7 MarkPlus - Machine Learning Algorithm.pdf

  • 1. Machine Learning Algorithm Novan Parmonangan Simanjuntak Head of Machine Learning and AI Strategy novanps novan.p.simanjuntak@glair.ai glair glair.ai hi@glair.ai Contact Us
  • 2. 01 02 03 04 05 glair.ai Intro to Machine Learning Machine Learning Workflow 01 02 OUTLINE 2
  • 4. AI ML A program that can sense, reason, act, and adapt Algorithms learns from data Artificial neural networks (inspired by brain) learn from data ARTIFICIAL INTELLIGENCE MACHINE LEARNING DL AI, ML & DL DEEP LEARNING 4 towardsdatascience.com
  • 6. Why ML: Example Output/Label: Do I want to go to outside on the weekend given data? (Answer: Yes or No) Input: ● Weather: Rainy, Sunny ● Distance to destination (km) ● Invited by friends: yes, no Rules: 1. Initially, score = 0 2. If the weather is rainy then (score + 10), else if the weather is sunny then (score - 10) 3. score + distance 4. If invited by friends then (score -10), else if not invited by friends then (score + 10) 5. If score < 10 then Yes, else No No Weather Distance Invited by Friends Score Go on Weekend? (Answers) 1 Rainy 8 Yes 10+8-10 = 8 Yes 2 Sunny 20 No -10+20+10 = 20 No 3 Rainy 5 No 10+5+10 = 25 No Data: 6
  • 7. Why ML: Example Let’s model it using AI (Model: Artificial Neural Network) 7
  • 8. Why ML: Example Let’s model it using Artificial Neural Network 8 >=10 Example: ● Input: weather = Rainy, Distance = 8, Invited by Friends = Yes ● Score = 1 * 10 + 0 * -10 + 8 * 1 + 1 * -10 + 0 * 10 = 8 ● Score < 10, so the Output is Yes
  • 9. Why ML?: A New Paradigm 9 Traditional Programming Input Rules Output Machine Learning Input Output Rules
  • 10. Why ML? No Weather Distance (km) Invited by Friends Go on Weekend? (Answers) 1 Rainy 8 Yes Yes 2 Sunny 20 No No 3 Rainy 5 No No ML: Find the Rules 10
  • 11. Optimization : Finding Rules 11 ● Determine Loss Function ● Update Rules/Weight to minimize loss
  • 12. Why ML? If you know the Rules Do Not Use ML If not Use ML 12
  • 14. 14
  • 15. AI Approach Learning Type ● Supervised Learning ● Unsupervised Learning ● Reinforcement Learning 15
  • 16. ● Input and Output is Given ● Finding a rules/function f that maps a set of points X (input/predictor) to a set of labels Y (output), based on given data (xi , yi ). ● Categorized to: ○ Classification (categorical labels) ■ Credit Scoring ■ Fraud Detection ■ Recommendation System ■ Object Recognition ■ Spam Filtering ○ Regression (numerical labels) ■ Home Price Prediction ■ Stock Market Prediction ■ Demand Forecasting Supervised Learning 16
  • 17. Unsupervised Learning ● Only Input is given ● Find hidden patterns or underlying structure in the given data Document Clustering 17 search.carrotsearch.com/pertamina
  • 24. glair.ai Machine Learning Workflow Business Problem ML Problem Framing Data Ingestion 24 24
  • 25. • Transportation of data from varied sources to a storage where it can be accessed, used, and analyzed by an organization. • The destination is typically a data warehouse, data mart, or database. GDP LABS CONFIDENTIAL Data Ingestion 01 25
  • 26. glair.ai Machine Learning Workflow Business Problem ML Problem Framing Data Preparation Data Ingestion 26
  • 27. ● Make it easier for ML algorithm to interpret the data. ● Categories: ○ Data cleaning: ■ Garbage In Garbage Out ■ identifying incomplete, incorrect, inaccurate or irrelevant parts of the data and then replacing, modifying, or deleting the dirty data ○ Data encoding, normalization, resampling ○ Data Splitting (Training, Validation, Test) GDP LABS CONFIDENTIAL Data Preparation 02 27
  • 28. glair.ai Machine Learning Workflow Business Problem ML Problem Framing Data Preparation EDA Data Ingestion 28
  • 29. ● Exploratory Data Analysis Analyze data to summarize main characteristics, often using statistical techniques or data visualization. ● Objectives: ○ Suggest hypotheses ○ Assess assumptions ○ Support the selection of appropriate tools, techniques, and features GDP LABS CONFIDENTIAL Exploratory Data Analysis (EDA) 03 29
  • 30. glair.ai Machine Learning Workflow Business Problem ML Problem Framing Data Preparation EDA Feature Engineering Data Ingestion 30
  • 31. ● Feature engineering can use domain knowledge to extract new features from raw data ● Objectives: ○ Improving the performance of machine learning models. ● Examples: ○ Grouping operations over a window (e.g, average) ○ Binning ○ Log transformation GDP LABS CONFIDENTIAL Feature Engineering 04 31
  • 32. glair.ai Machine Learning Workflow Business Problem ML Problem Framing Data Preparation EDA Feature Engineering Model Training and Tuning Data Ingestion 32
  • 33. ● Choose which model to try ● Simpler model is better ● Iteratively using a more complex model and features if needed ● Use a baseline model ● Maximize model performance using hyperparameter tuning ● Consider constraints: ○ cost, explainability, and speed GDP LABS CONFIDENTIAL Model Training and Tuning 05 33
  • 34. glair.ai Machine Learning Workflow Business Problem ML Problem Framing Data Preparation EDA Feature Engineering Model Training and Tuning Model Evaluation Data Ingestion 34
  • 35. ● Evaluate the model with the test data ● Pick a suitable metrics for the problem. ○ There must be a Business and ML metrics for the problem ● If the business metrics achieved then continue with model deployment, if not reiterate the model creation with data augmentation (adding more data) or feature augmentation (adding other features) GDP LABS CONFIDENTIAL Model Evaluation 06 35
  • 36. glair.ai Machine Learning Workflow Business Problem ML Problem Framing Data Preparation EDA Feature Engineering Model Training and Tuning Model Evaluation Model Deployment Data Ingestion Are Business Goals Met? Data Augmentation Feature Augmentation No Yes 36
  • 37. ● Model Serving/Inference ● Considerations: ○ How to wrap the prediction code as a production-ready service? ○ Which API / Protocol to use? ○ Scalability, Throughput, Latency. ○ Deployments ■ Model Versioning ■ Choose appropriate deployment strategy (e.g, A/B Testing) GDP LABS CONFIDENTIAL Model Deployment 07 37
  • 38. glair.ai Machine Learning Workflow Business Problem ML Problem Framing Data Preparation EDA Feature Engineering Model Training and Tuning Model Evaluation Model Deployment Model Monitoring Data Ingestion Are Business Goals Met? Predictions Data Augmentation No Yes 38
  • 39. ● Maintain model performance ● Things to be monitored: ○ Service Health ○ Data Quality & Integrity ○ Data & Target Drift ○ Bias/fairness GDP LABS CONFIDENTIAL Model Monitoring 08 39
  • 40. Novan Parmonangan Simanjuntak Head of Machine Learning and AI Strategy novanps novan.p.simanjuntak@glair.ai glair glair.ai hi@glair.ai Contact Us Thank You!