SlideShare a Scribd company logo
Introduction to Machine learning
Explore ML
Welcome to
Explore ML!
Day 1
Evolution of
Machines
What is a machine?
What is learning?
Learning is any process by
which a system improves
performance from
experience.
Herbert Alexander Simon
Explore ML day 1
1. How does the game work?
2. How is it recognising your drawings?
3. How could we program this?
Quickdraw Game - Discussion
Explore ML day 1
Explore ML day 1
What is Machine
Learning?
Machine Learning is concerned with computer
programs that automatically improve their
performance through experience.
AI
ML
DL
Explore ML day 1
Explore ML day 1
Explore ML day 1
Supervised Learning:
Regression
Regression analysis is a
statistical method that
helps us to analyze and
understand the relationship
between two or more
variables of interest.
Classification
Program learns from
the given dataset or
observations and then
classifies new
observation into a
number of classes
or groups.
Unsupervised Learning
Association
It is a machine learning
and data mining
technique that finds
important relations
between variables or
features in a data set.
Clustering
A way of grouping the
data points into
different clusters,
consisting of similar
data points.
Anomaly Detection
It is the process of
identifying unexpected
items or events in data
sets, which differ from
the norm.
Reinforcement Learning:
Learn from mistakes
Reinforcement learning is a machine learning training method based on
rewarding desired behaviors and/or punishing undesired ones
Reinforcement at work
Explore ML day 1
Explore ML day 1
How do I start solving a
problem with ML?
First, familiarise yourself with what data is
available.
Feature Handling
Preparing the proper input dataset, compatible with Machine learning
algorithm requirements.
Goal of Feature Handling
According to survey, data scientists spend 60% of their
time on data preparation
In Feature Handling, you will learn...
Handling categorical data
● Nominal variables
● Ordinal variables
● One hot encoding
● Label/ordinal/integer encoding
Missing invalid values
● Mean method
● Median method
● Mode method
A variable whose values are one or more categories.
Categorical Variables
Before we move further,
Variable comprises a finite set of discrete values with no relationship between
those values.
These are variables which are not related to each other in any order
Nominal Variables
Ordinal variables
Variable comprises a finite set of discrete values with a ranked
ordering between values.
These are variables where we can find a certain order or relation or
rank between those variables.
Explore ML day 1
One Hot Encoding
Forcing an ordinal relationship via
an ordinal encoding and allowing
the model to assume a natural
ordering between categories may
result in poor performance or
unexpected results
In ordinal encoding, each
unique category value is
assigned an integer value.
Ordinal Encoding
Unfortunately, data in real life usually has
issues
Consider a dataset that gives you information
about multiple people aboard the Titanic like
their ages, sexes, sibling counts, embarkment
points and whether or not they survived the
disaster.
Based on this, you have to predict if an
arbitrary passenger on Titanic would survive
the sinking.
Looking at a real-life dataset
What will happen if we directly jump into
solving the problem?
Real life datasets almost always have
missing values
For example, not all passengers’ age will be recorded.
There are multiple reasons why this could happen.
Reasons
● Simply put, it’s difficult to collect data.
● Sometimes data is lost.
● Data can also be corrupted.
● People may not be comfortable with sharing data.
Handling missing values
Mean, Median, Mode
Statistical approach to handle the missing values
Mean
Handling missing values
Mean
In this method, any missing values in a column are replaced with the mean
of that column.
Assume that we have a dataset of a some patients and in that the age
attribute has some missing values, we have to overcome this or else it will
be a good recipe for a disaster.
Explore ML day 1
Cons of using this method
● This method is heavily dependent and extremely sensitive for the outliers
present in a data set.
● Value influenced by outlier is a major threat to any machine learning model
and it may make model catastrophic.
Statistical approach to handle the missing values
Handling missing values
Median
Another technique is median imputation in which the missing values are
replaced with the median value of the entire feature column.
● Doesn’t factor the correlations between features. It only works on the
column level.
● Will give poor results on encoded categorical features (do NOT use it
on categorical features).
Cons of using this method
Statistical approach to handle the missing values
Mode
Handling missing values
Another technique is mode imputation in which the missing values are
replaced with the mode value or most frequent value of the entire
feature column.
● It also doesn’t factor the correlations between features.
● It can introduce bias in the data.
Cons of using this method
Quick Recap!
Suppose you have a Basket
Its filled with some fresh fruits
Arrange different fruits in different places
-- TASK --
How did we learn?
Things you can expect
tomorrow
Introduction to advanced ML topics used to
solve real-life problems
Intuition behind each concept, not just the
high-level understanding
Applying these concepts on a custom dataset
and experimenting with the results in a
hands-on session
Lots of fun, learning and exclusive Google
goodies!

More Related Content

PPT
Artificial inteligence
PPTX
Artificial Intelligence (A.I.) || Introduction of A.I. || HELPFUL FOR STUDENT...
PPTX
Quantum machine learning basics
PDF
Blue Brain Seminar Report
PPTX
What is quantum computing
PDF
Introduction to AI
PPTX
Machine Learning vs Deep Learning vs Artificial Intelligence | ML vs DL vs AI...
PPTX
Artificial intelligence
Artificial inteligence
Artificial Intelligence (A.I.) || Introduction of A.I. || HELPFUL FOR STUDENT...
Quantum machine learning basics
Blue Brain Seminar Report
What is quantum computing
Introduction to AI
Machine Learning vs Deep Learning vs Artificial Intelligence | ML vs DL vs AI...
Artificial intelligence

What's hot (20)

PPTX
Brain Chips_main.pptx
PDF
Introduction to ChatGPT and Overview of its capabilities and functionality.pdf
PDF
Machine Learning for dummies!
PPTX
Riddles(57)
PPTX
Internet of things
PDF
BLUE BRAIN SEMINAR REPORT
PPTX
Digital Twin
PPTX
Digital twin ppt
PDF
IBM Watson Internet of Things: Introducing Digital Twin
PPTX
Presentation on Artificial Intelligence
PPTX
AI IN SPACE EXPLORATION
PDF
The Future is in Responsible Generative AI
PPTX
Digital twin
PPTX
Quantum Computer
PDF
Using the power of Generative AI at scale
PDF
Machine Learning for Dummies
PPTX
Artifical Intelligence
PDF
Introduction To Artificial Intelligence PowerPoint Presentation Slides
PPTX
Quantum computing ppt.pptx
PPTX
Digital twin
Brain Chips_main.pptx
Introduction to ChatGPT and Overview of its capabilities and functionality.pdf
Machine Learning for dummies!
Riddles(57)
Internet of things
BLUE BRAIN SEMINAR REPORT
Digital Twin
Digital twin ppt
IBM Watson Internet of Things: Introducing Digital Twin
Presentation on Artificial Intelligence
AI IN SPACE EXPLORATION
The Future is in Responsible Generative AI
Digital twin
Quantum Computer
Using the power of Generative AI at scale
Machine Learning for Dummies
Artifical Intelligence
Introduction To Artificial Intelligence PowerPoint Presentation Slides
Quantum computing ppt.pptx
Digital twin
Ad

Similar to Explore ML day 1 (20)

PDF
Machine Learning.pdf
PDF
Machine learning Mind Map
PPTX
Supervised learning
PPTX
Exploratory Data Analysis Unit 1 ppt presentation.pptx
PDF
Foundations of Machine Learning - StampedeCon AI Summit 2017
PDF
Data science using python, Data Preprocessing
PDF
Data Analytics ,Data Preprocessing What is Data Preprocessing?
PPTX
Intro to Machine Learning for non-Data Scientists
PDF
Lecture 8 - Feature Engineering and Optimization, a lecture in subject module...
PPTX
data_preprocessingknnnaiveandothera.pptx
PDF
Dirty data science machine learning on non-curated data
PDF
13_Data Preprocessing in Python.pptx (1).pdf
PPTX
Data_Preparation.pptx
PDF
Exploratory Data Analysis - Satyajit.pdf
PPTX
Unit 3 Data Quality and Preprocessing .pptx
PPTX
Machine learning introduction
PPTX
Internshipppt.pptx
PPTX
Machine Learning: A Fast Review
PPTX
Machine learning module 2
PDF
CM NCCU Class1
Machine Learning.pdf
Machine learning Mind Map
Supervised learning
Exploratory Data Analysis Unit 1 ppt presentation.pptx
Foundations of Machine Learning - StampedeCon AI Summit 2017
Data science using python, Data Preprocessing
Data Analytics ,Data Preprocessing What is Data Preprocessing?
Intro to Machine Learning for non-Data Scientists
Lecture 8 - Feature Engineering and Optimization, a lecture in subject module...
data_preprocessingknnnaiveandothera.pptx
Dirty data science machine learning on non-curated data
13_Data Preprocessing in Python.pptx (1).pdf
Data_Preparation.pptx
Exploratory Data Analysis - Satyajit.pdf
Unit 3 Data Quality and Preprocessing .pptx
Machine learning introduction
Internshipppt.pptx
Machine Learning: A Fast Review
Machine learning module 2
CM NCCU Class1
Ad

Recently uploaded (20)

PPTX
Lesson notes of climatology university.
PPTX
Cell Types and Its function , kingdom of life
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Digestion and Absorption of Carbohydrates, Proteina and Fats
PPTX
UNIT III MENTAL HEALTH NURSING ASSESSMENT
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
RMMM.pdf make it easy to upload and study
PPTX
Unit 4 Skeletal System.ppt.pptxopresentatiom
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Computing-Curriculum for Schools in Ghana
PDF
Complications of Minimal Access Surgery at WLH
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
IGGE1 Understanding the Self1234567891011
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
Lesson notes of climatology university.
Cell Types and Its function , kingdom of life
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Orientation - ARALprogram of Deped to the Parents.pptx
Supply Chain Operations Speaking Notes -ICLT Program
Digestion and Absorption of Carbohydrates, Proteina and Fats
UNIT III MENTAL HEALTH NURSING ASSESSMENT
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
Chinmaya Tiranga quiz Grand Finale.pdf
RMMM.pdf make it easy to upload and study
Unit 4 Skeletal System.ppt.pptxopresentatiom
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Computing-Curriculum for Schools in Ghana
Complications of Minimal Access Surgery at WLH
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
IGGE1 Understanding the Self1234567891011
Paper A Mock Exam 9_ Attempt review.pdf.

Explore ML day 1