SlideShare a Scribd company logo
Introduction to
Machine Learning &
Data Analytics
Shishir Choudhary
Why ML ?
What happened
with user at
meeting today ?
What user has
bought in past
What happened
at user house
today morning ?
What people
with similar
interest
bought
What has user
recently shown
interest in by
search
ML Prediction System
You want to take a good decision
• You can’t ask a human to take the decision due to time and dollar cost involved at
the scale of problem 

• You are presently anyway not taking good decisions because decision is presently
random , hard coded or complex rules based

• You want to reduce cost or verify your own decision
E.g. Taking good decision is neither easy, nor perfect, but ML makes it less error prone than
simpler old alternatives
Why Data Analytics ?
Understanding your business/industry/users better
You want to improve your business, industry or users, e.g. What
segments of users you have and how well each segment likes your
product. How well are different players in industry doing in different
segments of products they well ? Which products are growing fast ?
Helps to remove bias or opinions and gives more factual insights. Also
called data driven decision making. Typically involves visualisation,
clustering and at times regression(ML). 

Sometimes this involves analysing “big” amount of data typically from
world or “small” amount of data from your company itself
Why deep learning ?
• Better results with large amount of data

• Very high dimensions in data or time series / sequence
data - E.g. Image, Voice, Text, Time series 

• End to end Vs Part of workflow
What is ML = Optimisation Problem

Given Past Data
Parameter that
can’t be
obtained
What user has
bought in past
Parameter that
can’t be
obtained
What people
with similar
interest
bought
What has user
recently shown
interest in by
search
What product should be recommended to user ? Imagine 20 inputs
Prediction System
Weight for decision
= 0.2 ?
Weight for decision
= 0.3 ?
Weight for
decision
= 0.5 ?
ML = 

Optimize the Weights
Past Purchase Data
What is ML = 

Inductive Reasoning Problem
• Guess the function 

• f(1) = 1 

• f(2) = 4

• f(3) = 9 

• f(4) = 16 

• ?? what is f(x) ?
Involves leap of faith. Does training generalise well is the crucial question ?
Alternative = 

Deductive Reasoning 

= Rules Based System
Simple attempt at predicting gender by rules 

• Men => Short Hair 

• Women => Medium/Long Hair 

• Arun is a Man => Arun has short hair
Quite quickly, rules and their exceptions can become too complex to manage
Most Popular ML
Problem Statements
• Classification 

• Given lot of examples, learn how to predict 

function(Input) -> Output one of Fixed Set Of Categories 

• E.g. Spam or Not 

• Good vs bad investment

• Image classification / labelling 

• Regression 

• Given lot of examples, learn how to predict 

function(Input) -> Continuous infinite values function 

• House Price estimation

• Pricing a stock 

• Time for food to arrive for food delivery app
Classification or Regression
?
$100-120k $120-140k $140-160k $160-180k
100
120
140
160
80 sqm 100 sqm 120 sqm 140 sqm
House Pricing - Classification
Class 1 Class 2 Class 3 Class 4
House Pricing - Regression
Size ->
Size ->
Crucial challenges
• Bias Error & Variance Error & Irreducible Error
ML Models try to balance between bias and variance. Some models by default have
more bias and some have by default more variance. Some some parameters their
default leaning can be adjusted.
DL can reduce both bias and variance error with huge amount of data and high number
of features/parameters
Crucial Challenges
• Curse of dimensionality 

• Amount of training data needed increases exponentially with more features
used for prediction

• Hence dimensionality reduction techniques

• Bad and missing data 

• Lot of real world data often is bad quality, with either wrong or missing
values. Thus good data becomes quite useful asset today. 

• Feature Engineering 

• In ML you need to select right features to use for prediction. This is a tough
problem and partly solved by domain knowledge and partly by data analysis
ML vs DL
0
25
50
75
100
Untitled 1
ML DL Small DL Large
Larger Training Data
Accuracy
At lower quantum of training data, ML , Small DL models and Large DL models all perform
quite similarly. At larger quantum DL is better able to utilise the additional training data. Latest
Image Recognition DL Models perform better than human error rates.
Models !
Models are fancy words for 

mathematical functions which are used to generalise from training data
Prediction Optimisation
Most learning algorithm work in some sort of derivative of
following steps
1. Based on inputs and initial model parameters, make a
prediction
2. Check what was actual answer and calculate error
3. Try to minimise the error by adjusting the models’s
parameters in direction which reduces error
4. Repeat till convergence (error % stops reducing and is
low)
Regression
Takes simplifying assumption that prediction
is linear (or binomial or trinomial) relation to input(s)
E.g. Height of plant by age , 

time required to deliver food in food app
Decision Trees
Perceptron
Neural Network
Good resources
• Udacity (completely) Free Machine Learning Course 

• Josh Gordon’s Youtube Videos on Introduction to ML 

• Libraries 

• Scikit Learn in Python - Machine Learning 

• Tensorflow - Deep Learning 

• Gensim - Natural Language Processing

• http://learney.sg/#machine learning

• http://learney.sg/#data science

• http://learney.sg/#deep learning

• http://learney.sg/#natural language processing
Summary &
Q &A

More Related Content

PPTX
Machine Learning
ODP
Introduction to Machine learning
PDF
Managing machine learning
PDF
Module 1 introduction to machine learning
PDF
Machine Learning for Dummies
PPTX
Introduction to machine learning
PPTX
Primer to Machine Learning
PDF
Introduction to machine learning
Machine Learning
Introduction to Machine learning
Managing machine learning
Module 1 introduction to machine learning
Machine Learning for Dummies
Introduction to machine learning
Primer to Machine Learning
Introduction to machine learning

What's hot (20)

PDF
Machine Learning Algorithms (Part 1)
PPTX
What is Machine Learning
PPTX
Machine learning
PDF
Lecture 1: What is Machine Learning?
PPTX
Machine Learning and Real-World Applications
PPTX
Introduction to Machine Learning
PDF
Module 9: Natural Language Processing Part 2
PPT
Machine learning with Big Data power point presentation
PPTX
End-to-End Machine Learning Project
PDF
Barga Data Science lecture 9
PDF
Intro to machine learning
PDF
Fairly Measuring Fairness In Machine Learning
PDF
Machine Learning: Understanding the Invisible Force Changing Our World
PDF
Machine Learning: Applications, Process and Techniques
PPTX
Machine learning_ Replicating Human Brain
PDF
Applications in Machine Learning
PDF
Module 4: Model Selection and Evaluation
PDF
Barga Data Science lecture 1
PDF
ML Basics
PPTX
Data Science Training | Data Science For Beginners | Data Science With Python...
Machine Learning Algorithms (Part 1)
What is Machine Learning
Machine learning
Lecture 1: What is Machine Learning?
Machine Learning and Real-World Applications
Introduction to Machine Learning
Module 9: Natural Language Processing Part 2
Machine learning with Big Data power point presentation
End-to-End Machine Learning Project
Barga Data Science lecture 9
Intro to machine learning
Fairly Measuring Fairness In Machine Learning
Machine Learning: Understanding the Invisible Force Changing Our World
Machine Learning: Applications, Process and Techniques
Machine learning_ Replicating Human Brain
Applications in Machine Learning
Module 4: Model Selection and Evaluation
Barga Data Science lecture 1
ML Basics
Data Science Training | Data Science For Beginners | Data Science With Python...
Ad

Similar to Introduction to machine learning and deep learning (20)

PPTX
Machine Learning Essentials Demystified part1 | Big Data Demystified
PDF
S3_ML Introduction.pdf
PPTX
Ml - A shallow dive
PPTX
Intro to ML for product school meetup
PPTX
Essential of ML 1st Lecture IIT Kharagpur
PPTX
Machine Learning Contents.pptx
PPTX
Introduction.pptx about the mechine Learning
PPTX
artificial intelligence.pptx
PPTX
L15.pptx
PDF
Efficient Learning Machines Theories Concepts And Applications For Engineers ...
PDF
Prepare your data for machine learning
PDF
The Machine Learning Workflow with Azure
PPTX
Selected Topics in CS-CHapter-twooo.pptx
PPTX
machine learning introduction notes foRr
PDF
General introduction to AI ML DL DS
DOC
Lecture #1: Introduction to machine learning (ML)
PDF
AI.pdf
PDF
Machine learning for IoT - unpacking the blackbox
PDF
Fundementals of Machine Learning and Deep Learning
PDF
林守德/Practical Issues in Machine Learning
Machine Learning Essentials Demystified part1 | Big Data Demystified
S3_ML Introduction.pdf
Ml - A shallow dive
Intro to ML for product school meetup
Essential of ML 1st Lecture IIT Kharagpur
Machine Learning Contents.pptx
Introduction.pptx about the mechine Learning
artificial intelligence.pptx
L15.pptx
Efficient Learning Machines Theories Concepts And Applications For Engineers ...
Prepare your data for machine learning
The Machine Learning Workflow with Azure
Selected Topics in CS-CHapter-twooo.pptx
machine learning introduction notes foRr
General introduction to AI ML DL DS
Lecture #1: Introduction to machine learning (ML)
AI.pdf
Machine learning for IoT - unpacking the blackbox
Fundementals of Machine Learning and Deep Learning
林守德/Practical Issues in Machine Learning
Ad

More from Shishir Choudhary (9)

PDF
Learncool - Angel Round Pitch
PDF
Adopting innovation
PDF
Adopting innovation
PDF
Adopting innovation
PDF
Building innovative products
PDF
Building innovative products
PDF
Building innovative products
PDF
Building innovative products
PDF
Deep learning for NLP
Learncool - Angel Round Pitch
Adopting innovation
Adopting innovation
Adopting innovation
Building innovative products
Building innovative products
Building innovative products
Building innovative products
Deep learning for NLP

Recently uploaded (20)

PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PPTX
1_Introduction to advance data techniques.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
Lecture1 pattern recognition............
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
Database Infoormation System (DBIS).pptx
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Global journeys: estimating international migration
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Computer network topology notes for revision
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Major-Components-ofNKJNNKNKNKNKronment.pptx
1_Introduction to advance data techniques.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Business Ppt On Nestle.pptx huunnnhhgfvu
Lecture1 pattern recognition............
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
IBA_Chapter_11_Slides_Final_Accessible.pptx
Foundation of Data Science unit number two notes
Database Infoormation System (DBIS).pptx
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Supervised vs unsupervised machine learning algorithms
climate analysis of Dhaka ,Banglades.pptx
Global journeys: estimating international migration
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Computer network topology notes for revision

Introduction to machine learning and deep learning

  • 1. Introduction to Machine Learning & Data Analytics Shishir Choudhary
  • 2. Why ML ? What happened with user at meeting today ? What user has bought in past What happened at user house today morning ? What people with similar interest bought What has user recently shown interest in by search ML Prediction System You want to take a good decision • You can’t ask a human to take the decision due to time and dollar cost involved at the scale of problem • You are presently anyway not taking good decisions because decision is presently random , hard coded or complex rules based • You want to reduce cost or verify your own decision E.g. Taking good decision is neither easy, nor perfect, but ML makes it less error prone than simpler old alternatives
  • 3. Why Data Analytics ? Understanding your business/industry/users better You want to improve your business, industry or users, e.g. What segments of users you have and how well each segment likes your product. How well are different players in industry doing in different segments of products they well ? Which products are growing fast ? Helps to remove bias or opinions and gives more factual insights. Also called data driven decision making. Typically involves visualisation, clustering and at times regression(ML). Sometimes this involves analysing “big” amount of data typically from world or “small” amount of data from your company itself
  • 4. Why deep learning ? • Better results with large amount of data • Very high dimensions in data or time series / sequence data - E.g. Image, Voice, Text, Time series • End to end Vs Part of workflow
  • 5. What is ML = Optimisation Problem
 Given Past Data Parameter that can’t be obtained What user has bought in past Parameter that can’t be obtained What people with similar interest bought What has user recently shown interest in by search What product should be recommended to user ? Imagine 20 inputs Prediction System Weight for decision = 0.2 ? Weight for decision = 0.3 ? Weight for decision = 0.5 ? ML = 
 Optimize the Weights Past Purchase Data
  • 6. What is ML = 
 Inductive Reasoning Problem • Guess the function • f(1) = 1 • f(2) = 4 • f(3) = 9 • f(4) = 16 • ?? what is f(x) ? Involves leap of faith. Does training generalise well is the crucial question ?
  • 7. Alternative = 
 Deductive Reasoning 
 = Rules Based System Simple attempt at predicting gender by rules • Men => Short Hair • Women => Medium/Long Hair • Arun is a Man => Arun has short hair Quite quickly, rules and their exceptions can become too complex to manage
  • 8. Most Popular ML Problem Statements • Classification • Given lot of examples, learn how to predict 
 function(Input) -> Output one of Fixed Set Of Categories • E.g. Spam or Not • Good vs bad investment • Image classification / labelling • Regression • Given lot of examples, learn how to predict 
 function(Input) -> Continuous infinite values function • House Price estimation • Pricing a stock • Time for food to arrive for food delivery app
  • 9. Classification or Regression ? $100-120k $120-140k $140-160k $160-180k 100 120 140 160 80 sqm 100 sqm 120 sqm 140 sqm House Pricing - Classification Class 1 Class 2 Class 3 Class 4 House Pricing - Regression Size -> Size ->
  • 10. Crucial challenges • Bias Error & Variance Error & Irreducible Error ML Models try to balance between bias and variance. Some models by default have more bias and some have by default more variance. Some some parameters their default leaning can be adjusted. DL can reduce both bias and variance error with huge amount of data and high number of features/parameters
  • 11. Crucial Challenges • Curse of dimensionality • Amount of training data needed increases exponentially with more features used for prediction • Hence dimensionality reduction techniques • Bad and missing data • Lot of real world data often is bad quality, with either wrong or missing values. Thus good data becomes quite useful asset today. • Feature Engineering • In ML you need to select right features to use for prediction. This is a tough problem and partly solved by domain knowledge and partly by data analysis
  • 12. ML vs DL 0 25 50 75 100 Untitled 1 ML DL Small DL Large Larger Training Data Accuracy At lower quantum of training data, ML , Small DL models and Large DL models all perform quite similarly. At larger quantum DL is better able to utilise the additional training data. Latest Image Recognition DL Models perform better than human error rates.
  • 13. Models ! Models are fancy words for 
 mathematical functions which are used to generalise from training data
  • 14. Prediction Optimisation Most learning algorithm work in some sort of derivative of following steps 1. Based on inputs and initial model parameters, make a prediction 2. Check what was actual answer and calculate error 3. Try to minimise the error by adjusting the models’s parameters in direction which reduces error 4. Repeat till convergence (error % stops reducing and is low)
  • 15. Regression Takes simplifying assumption that prediction is linear (or binomial or trinomial) relation to input(s) E.g. Height of plant by age , 
 time required to deliver food in food app
  • 19. Good resources • Udacity (completely) Free Machine Learning Course • Josh Gordon’s Youtube Videos on Introduction to ML • Libraries • Scikit Learn in Python - Machine Learning • Tensorflow - Deep Learning • Gensim - Natural Language Processing • http://learney.sg/#machine learning • http://learney.sg/#data science • http://learney.sg/#deep learning • http://learney.sg/#natural language processing