SlideShare a Scribd company logo
CH 17
GOING A STEP BEYOND USING
SUPPORT VECTOR
MACHINES
10766012陳遠任 Jason
Revisiting the Separation
Problem
• Nonseparability of classes
• There is no straight line that traces a precise border
between different examples.
• Other options
• K-Nearest Neighbors : Ch14
• Logistic regression : Ch15
• Transforming the features : Solves the problem by
employing both feature creation
• Decision trees、Neural networks
Characteristics of
Support Vector Machines
• Binary and multiclass classification, regression, and
detection of anomalous or novelty data
• Robust handling of overfitting, noisy data, and outliers
• A capability to handle situations with many variables
• Easy and timely handling of up to about 10,000
training examples
• Automatic detection of nonlinearity in data
Explaining the Algorithm
SVM - Linear
Negative hyperplane
Positive hyperplane
Support Vector
Applying Nonlinearity
• Nonlinearly
separable points
requiring feature
transformation
(left) to be fit by a
line (right).
• Make the existing
features onto a
feature space of
higher
dimensionality
Applying Nonlinearity
• Problems and limits :
• The number of features increases exponentially, making
computations cumbersome 計算繁複
• The expansion creates many redundant features,
causing overfitting. 創造冗餘特徵
• Difficult to determine becoming linearly or not, requiring
many iterations of expansion and test
Kernel functions
• kernel functions project the original features into
a higher dimensional space by combining them
in a nonlinear way
• rely on algebra calculations
Discovering the different
kernels
• Linear: Suitable for linear
• No extra parameters
• Radial Basis Function: Suitable for non-linear
• parameters: gamma
• Polynomial: suitable for non-linear
• parameters: gamma, degree, and coef0
• Sigmoid: Binary classification like Logistic Regression
• parameters: gamma and coef0
• Custom-made kernels: Depends upon the kernel
Radial Basis Function
• An RBF kernel
that uses
diverse hyper-
parameters to
create unique
SVM solutions.
• The RBF kernel can adapt itself to different
learning strategies
• the error cost is high -> bended hyperplane
• the error cost is low -> smoother curve line
Kernels
• The polynomial and sigmoid kernels aren’t as
adaptable as RBF, thus showing more bias
• Most data problems are easily solved using the
RBF
sigmoid polynomial
Classifying and Estimating
with SVM
• handwritten recognition task
• the digits dataset (from Scikit-learn)
• nonlinear kernel, using the RBF
• a series of 8-x-8 grayscale pixel images of
handwritten numbers ranging from 0 to 9.

More Related Content

PPTX
Real-Time Voice Actuation
PDF
Callgraph analysis
PPTX
Reactive Programming with RxJava
PPTX
Lecture Slides - SVM.pptx
PPTX
SVM_and_Kernels_presentation_with_code.pptx
PPTX
Statistical Machine Learning unit4 lecture notes
PPTX
Quantum Machine Learning for IBM AI
Real-Time Voice Actuation
Callgraph analysis
Reactive Programming with RxJava
Lecture Slides - SVM.pptx
SVM_and_Kernels_presentation_with_code.pptx
Statistical Machine Learning unit4 lecture notes
Quantum Machine Learning for IBM AI

Similar to Ml ch17 (20)

PDF
General Tips for participating Kaggle Competitions
PDF
Why quants love NAG & NAG from Excel
PPTX
SVMs.pptx support vector machines machine learning
PPTX
The Case for Learned Index Structures
PDF
Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models (DB T...
PPTX
Week5-Faster R-CNN.pptx
PPTX
How Machine Learning Helps Organizations to Work More Efficiently?
PDF
201907 AutoML and Neural Architecture Search
PDF
Faster R-CNN - PR012
PPTX
Day17.pptx department of computer science and eng
PPTX
[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...
PDF
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
PDF
Towards Detecting Performance Anti-patterns Using Classification Techniques
PPTX
Welch Verolog 2013
PDF
Introduction to deep learning
PPTX
SVM_and_Kernels_with_Visuals_and_Code.pptx
PPTX
Machine Learning Workshop
PPTX
Improving region based CNN object detector using bayesian optimization
PDF
Performance Benchmarking of the R Programming Environment on the Stampede 1.5...
PDF
AMBER presentation
General Tips for participating Kaggle Competitions
Why quants love NAG & NAG from Excel
SVMs.pptx support vector machines machine learning
The Case for Learned Index Structures
Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models (DB T...
Week5-Faster R-CNN.pptx
How Machine Learning Helps Organizations to Work More Efficiently?
201907 AutoML and Neural Architecture Search
Faster R-CNN - PR012
Day17.pptx department of computer science and eng
[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Towards Detecting Performance Anti-patterns Using Classification Techniques
Welch Verolog 2013
Introduction to deep learning
SVM_and_Kernels_with_Visuals_and_Code.pptx
Machine Learning Workshop
Improving region based CNN object detector using bayesian optimization
Performance Benchmarking of the R Programming Environment on the Stampede 1.5...
AMBER presentation
Ad

More from Jason Chen (7)

PDF
Optimize0530v 3
PPTX
Ai robot
PDF
Outlier detection in audit logs for application systems
PDF
PHILIPPINE JASON 0530
PDF
JASON - Data mining
PDF
10766012 ranalitics
PDF
Post Big Data 0530
Optimize0530v 3
Ai robot
Outlier detection in audit logs for application systems
PHILIPPINE JASON 0530
JASON - Data mining
10766012 ranalitics
Post Big Data 0530
Ad

Recently uploaded (20)

PDF
Mega Projects Data Mega Projects Data
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Managing Community Partner Relationships
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPT
Predictive modeling basics in data cleaning process
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Introduction to Knowledge Engineering Part 1
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
Business Analytics and business intelligence.pdf
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
Transcultural that can help you someday.
PDF
[EN] Industrial Machine Downtime Prediction
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Mega Projects Data Mega Projects Data
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Qualitative Qantitative and Mixed Methods.pptx
Managing Community Partner Relationships
oil_refinery_comprehensive_20250804084928 (1).pptx
Predictive modeling basics in data cleaning process
Galatica Smart Energy Infrastructure Startup Pitch Deck
Optimise Shopper Experiences with a Strong Data Estate.pdf
.pdf is not working space design for the following data for the following dat...
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Database Infoormation System (DBIS).pptx
Introduction to Knowledge Engineering Part 1
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Supervised vs unsupervised machine learning algorithms
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Business Analytics and business intelligence.pdf
IBA_Chapter_11_Slides_Final_Accessible.pptx
Transcultural that can help you someday.
[EN] Industrial Machine Downtime Prediction
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...

Ml ch17

  • 1. CH 17 GOING A STEP BEYOND USING SUPPORT VECTOR MACHINES 10766012陳遠任 Jason
  • 2. Revisiting the Separation Problem • Nonseparability of classes • There is no straight line that traces a precise border between different examples. • Other options • K-Nearest Neighbors : Ch14 • Logistic regression : Ch15 • Transforming the features : Solves the problem by employing both feature creation • Decision trees、Neural networks
  • 3. Characteristics of Support Vector Machines • Binary and multiclass classification, regression, and detection of anomalous or novelty data • Robust handling of overfitting, noisy data, and outliers • A capability to handle situations with many variables • Easy and timely handling of up to about 10,000 training examples • Automatic detection of nonlinearity in data
  • 5. SVM - Linear Negative hyperplane Positive hyperplane Support Vector
  • 6. Applying Nonlinearity • Nonlinearly separable points requiring feature transformation (left) to be fit by a line (right). • Make the existing features onto a feature space of higher dimensionality
  • 7. Applying Nonlinearity • Problems and limits : • The number of features increases exponentially, making computations cumbersome 計算繁複 • The expansion creates many redundant features, causing overfitting. 創造冗餘特徵 • Difficult to determine becoming linearly or not, requiring many iterations of expansion and test
  • 8. Kernel functions • kernel functions project the original features into a higher dimensional space by combining them in a nonlinear way • rely on algebra calculations
  • 9. Discovering the different kernels • Linear: Suitable for linear • No extra parameters • Radial Basis Function: Suitable for non-linear • parameters: gamma • Polynomial: suitable for non-linear • parameters: gamma, degree, and coef0 • Sigmoid: Binary classification like Logistic Regression • parameters: gamma and coef0 • Custom-made kernels: Depends upon the kernel
  • 10. Radial Basis Function • An RBF kernel that uses diverse hyper- parameters to create unique SVM solutions. • The RBF kernel can adapt itself to different learning strategies • the error cost is high -> bended hyperplane • the error cost is low -> smoother curve line
  • 11. Kernels • The polynomial and sigmoid kernels aren’t as adaptable as RBF, thus showing more bias • Most data problems are easily solved using the RBF sigmoid polynomial
  • 12. Classifying and Estimating with SVM • handwritten recognition task • the digits dataset (from Scikit-learn) • nonlinear kernel, using the RBF • a series of 8-x-8 grayscale pixel images of handwritten numbers ranging from 0 to 9.