1. Supervised vs. Unsupervised Learning
Supervised learning (classification)
◦ Supervision: The training data (observations,
measurements, etc.) are accompanied by labels indicating
the class of the observations
◦ New data is classified based on the training set
Unsupervised learning (clustering)
◦ The class labels of the training data are unknown
◦ Given a set of measurements, observations, etc., the
aim is to establish the existence of classes or clusters in
the data
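As a minimal sketch of the two settings, the snippet below fits a classifier on labeled points and runs a clustering algorithm on the same points with the labels withheld. It assumes scikit-learn is available; the toy data, class names, and choice of algorithms are illustrative only.

```python
# Supervised: the training data comes with class labels; new data is then
# classified based on the training set.
from sklearn.neighbors import KNeighborsClassifier
from sklearn.cluster import KMeans

X = [[1, 1], [1, 2], [8, 8], [9, 8]]     # observations (feature vectors)
y = ["A", "A", "B", "B"]                 # class labels = the supervision

clf = KNeighborsClassifier(n_neighbors=1).fit(X, y)
print(clf.predict([[9, 9]]))             # -> ['B']

# Unsupervised: no labels are given; the algorithm tries to establish the
# existence of clusters in the data.
km = KMeans(n_clusters=2, n_init=10).fit(X)
print(km.labels_)                        # cluster assignments discovered from X alone
```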
4. Classification vs. Prediction
Classification
◦ predicts categorical class labels (discrete or nominal)
◦ constructs a model from the training set and the values
(class labels) of a classifying attribute, and uses it to
classify new data
Prediction
◦ models continuous-valued functions, i.e., predicts
unknown or missing values
Typical applications
◦ Credit approval
◦ Target marketing
◦ Medical diagnosis
◦ Fraud detection
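To make the distinction concrete, here is a minimal sketch contrasting the two: a classifier predicting a categorical label and a regressor modelling a continuous value. scikit-learn and the toy data are assumptions for illustration only.

```python
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LinearRegression

X = [[1], [2], [3], [4]]

# Classification: predicts a categorical (discrete) class label.
clf = DecisionTreeClassifier().fit(X, ["no", "no", "yes", "yes"])
print(clf.predict([[3.5]]))      # -> a class label such as 'yes'

# Prediction: models a continuous-valued function.
reg = LinearRegression().fit(X, [10.0, 20.0, 30.0, 40.0])
print(reg.predict([[3.5]]))      # -> a number, about 35.0
```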
5. Classification: Definition
Given a collection of records (the training set), where
each record contains a set of attributes, one of which is
the class attribute.
Find a model for the class attribute as a function of
the values of the other attributes.
Goal: previously unseen records should be
assigned a class as accurately as possible.
A test set is used to determine the accuracy of the model.
Usually, the given data set is divided into training and test sets,
with the training set used to build the model and the test set used
to validate it.
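A minimal sketch of that split, assuming scikit-learn; the records are made up.

```python
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X = [[0], [1], [2], [3], [4], [5], [6], [7]]
y = ["no", "no", "no", "no", "yes", "yes", "yes", "yes"]

# Divide the given data set into training and test sets.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)

model = DecisionTreeClassifier().fit(X_train, y_train)   # build the model on the training set
print(model.score(X_test, y_test))                       # validate it on the test set (accuracy)
```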
6. Classification—A Two-Step Process
Model construction: describing a set of
predetermined classes
Each tuple/sample is assumed to belong to a
predefined class, as determined by the class
label attribute
The set of tuples used for model
construction is the training set
The model is represented as classification
rules, decision trees, or mathematical
formulae
7. Classification—A Two-Step Process
Model usage: for classifying future or unknown
objects
Estimate accuracy of the model
The known label of a test sample is compared
with the classified result from the model
The accuracy rate is the percentage of test set
samples that are correctly classified by the
model
The test set is independent of the training set;
otherwise over-fitting will occur
If the accuracy is acceptable, use the model to
classify data tuples whose class labels are not
known
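A minimal sketch of the accuracy rate defined above: the fraction of test samples whose known label matches the model's output. The labels here are made up.

```python
def accuracy(true_labels, predicted_labels):
    # Percentage of test set samples that are correctly classified by the model.
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)

print(accuracy(["yes", "no", "yes", "no"], ["yes", "no", "no", "no"]))  # -> 0.75
```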
8. Classification Process (1): Model Construction
Training data:
NAME   RANK            YEARS  TENURED
Mike   Assistant Prof  3      no
Mary   Assistant Prof  7      yes
Bill   Professor       2      yes
Jim    Associate Prof  7      yes
Dave   Assistant Prof  6      no
Anne   Associate Prof  3      no
A classification algorithm builds the classifier (model) from the training data, for example the rule:
IF rank = ‘professor’ OR years > 6
THEN tenured = ‘yes’
9. Classification Process (2): Use the Model in Prediction
The classifier is applied to the testing data:
NAME     RANK            YEARS  TENURED
Tom      Assistant Prof  2      no
Merlisa  Associate Prof  7      no
George   Professor       5      yes
Joseph   Assistant Prof  7      yes
Unseen data: (Jeff, Professor, 4) → Tenured?
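A minimal sketch re-implementing the rule learned on the previous slide and applying it to the test records above and to the unseen record; the tenured() helper is purely illustrative, not code from the slides.

```python
def tenured(rank, years):
    # The rule learned in step (1): IF rank = 'Professor' OR years > 6 THEN tenured = 'yes'
    return "yes" if rank == "Professor" or years > 6 else "no"

test_data = [
    ("Tom", "Assistant Prof", 2, "no"),
    ("Merlisa", "Associate Prof", 7, "no"),
    ("George", "Professor", 5, "yes"),
    ("Joseph", "Assistant Prof", 7, "yes"),
]
correct = sum(tenured(rank, years) == label for _, rank, years, label in test_data)
print("accuracy:", correct / len(test_data))   # Merlisa is misclassified -> 0.75
print("Jeff:", tenured("Professor", 4))        # unseen data -> 'yes'
```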
10. The Learning Process in Spam Mail Example
Features computed for each message at the email server:
● Number of recipients
● Size of message
● Number of attachments
● Number of "re"s in the subject line
…
These feature vectors feed model learning to produce a model, which is then used in testing.
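A minimal sketch of computing the listed features for one message; the function name, arguments, and sample email are assumptions for illustration only.

```python
def email_features(recipients, body, attachments, subject):
    return {
        "num_recipients": len(recipients),
        "message_size": len(body),
        "num_attachments": len(attachments),
        "num_re_in_subject": subject.lower().count("re:"),
    }

print(email_features(["a@example.com", "b@example.com"],
                     "You have won a prize...", [], "Re: Re: your account"))
```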
11. An Example
A fish-packing plant wants to automate the
process of sorting incoming fish according to
species
As a pilot project, it is decided to try to
separate sea bass from salmon using optical
sensing
12. An Example (continued)
Features/attributes:
Length
Lightness
Width
Position of mouth
13. An Example (continued)
Preprocessing: Images of different
fish are isolated from one another
and from the background;
Feature extraction: The information
of a single fish is then sent to a feature
extractor that measures certain
“features” or “properties”;
Classification: The values of these
features are passed to a classifier that
evaluates the evidence presented and
builds a model to discriminate between
the two species
14. An Example (continued)
Domain knowledge:
◦ A sea bass is generally longer than a salmon
Related feature (or attribute):
◦ Length
Training the classifier:
◦ Some examples are provided to the classifier in this
form: <fish_length, fish_name>
◦ These examples are called training examples
◦ From the training examples, the classifier learns
how to distinguish salmon from sea bass based on
fish_length
15. An Example (continued)
Classification model (hypothesis):
◦ The classifier generates a model from the training data to classify
future examples (test examples)
◦ An example of the model is a rule like this:
◦ If Length >= l* then sea bass otherwise salmon
◦ Here the value of l* is determined by the classifier
Testing the model
◦ Once we get a model out of the classifier, we may use the
classifier to test future examples
◦ The test data is provided in the form <fish_length>
◦ The classifier outputs <fish_type> by checking fish_length against
the model
16. An Example (continued)
So the overall classification process goes like this:
◦ Training: training data → preprocessing and feature extraction → feature vector → training → model
◦ Classification: test/unlabeled data → preprocessing and feature extraction → feature vector → testing against the model / classification → prediction/evaluation
17. An Example (continued)
Training (labeled data → preprocessing, feature extraction → feature vectors):
12, salmon
15, sea bass
8, salmon
5, sea bass
Learned model: If len > 12, then sea bass else salmon
Testing/classification (feature vectors from test and unlabeled data, checked against the model):
15, salmon → sea bass (error!)
10, salmon → salmon (correct)
18, ? → sea bass
8, ? → salmon
(Evaluation on the labeled test data; prediction on the unlabeled data.)
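A minimal sketch of this one-feature classifier: the threshold rule from the slide applied to the training and test feature vectors shown above.

```python
train = [(12, "salmon"), (15, "sea bass"), (8, "salmon"), (5, "sea bass")]
test = [(15, "salmon"), (10, "salmon"), (18, None), (8, None)]

def classify(length, threshold=12):
    # The model from the slide: if len > 12 then sea bass else salmon.
    return "sea bass" if length > threshold else "salmon"

for length, label in test:
    pred = classify(length)
    note = "" if label is None else ("(correct)" if pred == label else "(error!)")
    print(length, "->", pred, note)
```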
18. An Example (continued)
Why error?
Insufficient training data
Too few features
Too many/irrelevant features
Overfitting / specialization
19. An Example (continued)
Training (labeled data → preprocessing, feature extraction → two-feature vectors, e.g. len and ltns):
12, 4, salmon
15, 8, sea bass
8, 2, salmon
5, 10, sea bass
Learned model: If ltns > 6 or len*5 + ltns*2 > 100 then sea bass else salmon
Testing/classification:
15, 2, salmon → salmon (correct)
10, 7, salmon → salmon (correct)
18, 7, ? → sea bass
8, 5, ? → salmon
(Evaluation on the labeled test data; prediction on the unlabeled data.)
21. Linear Classification
A linear classifier achieves this by making
a classification decision based on the value of
a linear combination of the characteristics.
A classification algorithm (Classifier) that makes its
classification based on a linear predictor function
combining a set of weights with the feature vector
Decision boundaries are flat
◦ Line, plane, ….
May involve non-linear operations
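A minimal sketch of such a linear predictor function: a set of weights combined with the feature vector and compared against a threshold. The weights and threshold mirror the len*5 + ltns*2 > 100 part of the fish rule in slide 19 and are purely illustrative.

```python
weights = [5, 2]      # one weight per feature: (len, ltns)
threshold = 100

def linear_classify(features):
    # Decision based on the value of a linear combination of the characteristics.
    score = sum(w * x for w, x in zip(weights, features))
    return "sea bass" if score > threshold else "salmon"

print(linear_classify([18, 7]))  # score 104 -> sea bass
print(linear_classify([8, 5]))   # score 50  -> salmon
```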
28. Classifier Margin
Define the margin of a linear classifier as the width that
the boundary could be increased by before hitting a
datapoint.
(Figure: emails plotted by Email Length vs. New Recipients, with the margin band around the linear boundary.)
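A minimal sketch of measuring that margin for a linear boundary w·x + b = 0: the distance from the boundary to the nearest datapoint, i.e. how far the boundary could be widened before hitting a point. The boundary and points are made up.

```python
import math

w, b = [1.0, -1.0], 0.0                        # hypothetical boundary: x1 - x2 = 0
points = [[3.0, 1.0], [1.0, 4.0], [5.0, 2.0]]

def distance_to_boundary(x):
    # Perpendicular distance from point x to the hyperplane w.x + b = 0.
    return abs(sum(wi * xi for wi, xi in zip(w, x)) + b) / math.sqrt(sum(wi * wi for wi in w))

margin = min(distance_to_boundary(x) for x in points)
print(margin)   # width the boundary could be increased by before hitting a datapoint
```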
34. No Linear Classifier can cover all instances
How would you classify this data?
(Figure: emails plotted by Email Length vs. New Recipients that no single linear boundary separates.)
35. Ideally, the best decision boundary should be
the one which provides optimal performance,
such as in the following figure
37. What is multiclass?
Output
◦ In some cases, the output space can be very large
(i.e., the number of classes K is very large)
Each input belongs to exactly one class
(cf. multi-label classification, where an input can belong to many classes)
38. Multi-Class Classification
Multi-class classification is simply
classifying objects into exactly one
of multiple categories, such as
classifying an image as either a dog
or a cat from the dataset.
1. When there are more than two
categories into which the images can
be classified, and
2. an image does not belong to
more than one class.
If both of the above conditions are
satisfied, it is referred to as a
multi-class image classification problem.
40. Multi-label Classification
When we can classify an image into
more than one class (as in the image
beside), it is known as a multi-label
image classification problem.
Multi-label classification is a type
of classification in which an object
can be categorized into more than
one class.
For example, in the image dataset,
we will classify a picture as
the image of a dog or a cat, and
also classify the same image based
on the breed of the dog or cat.
These are all labels of the given images. Each
image here belongs to more than one
class, and hence it is a multi-label image
classification problem.
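A minimal sketch of how the targets differ in the two settings, using made-up dog/cat examples like those in the text: one label per image for multi-class, a set of labels (often encoded as a binary indicator vector) per image for multi-label.

```python
# Multi-class: each image belongs to exactly one class.
multiclass_labels = ["dog", "cat", "dog"]

# Multi-label: each image can belong to more than one class (e.g. species and breed).
all_labels = ["dog", "cat", "labrador", "persian"]
multilabel_sets = [{"dog", "labrador"}, {"cat", "persian"}, {"dog"}]

indicator = [[1 if lbl in labels else 0 for lbl in all_labels] for labels in multilabel_sets]
print(indicator)   # [[1, 0, 1, 0], [0, 1, 0, 1], [1, 0, 0, 0]]
```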