SlideShare a Scribd company logo
INTRODUCTION
TO
MACHINE
LEARNING
3RD EDITION
ETHEM ALPAYDIN
© The MIT Press, 2014
alpaydin@boun.edu.tr
http://www.cmpe.boun.edu.tr/~ethem/i2ml3e
Lecture Slides for
CHAPTER 1:
INTRODUCTION
Big Data
3
 The definition of big data is data that contains greater variety, arriving in increasing volumes
and with more velocity. This is also known as the three Vs.
 Put simply, big data is larger, more complex data sets, especially from new data sources.
These data sets are so voluminous that traditional data processing software just can’t
manage them. But these massive volumes of data can be used to address business
problems you wouldn’t have been able to tackle before.
Big Data
4
 Widespread use of personal computers and wireless communicationwe all
became producers of data that leads to “big data”
 Every time we buy a product, every time we rent a movie, visit a web page, write
a blog, or post on the social media, even when we just walk or drive around, we
are generating data.
 We are both producers and consumers of data (Products and Services )
 Think, for example, of a supermarket that is selling thousands of goods to millions
of customers all over a country or through a virtual store over the web.
 What the supermarket wants is to be able to predict which customer is likely to
buy which product, to maximize sales and profit.
 Similarly each customer wants to find the set of products best matching his/her
needs.
 Customer behavior changes in time and by geographic location, Data is not
random, it has structure(Pattern), e.g., customer behavior
 We need “big theory” to extract that structure from data for
(a) Understanding the process
Why “Learn” ?
5
 Machine learning is a growing technology which enables computers to
learn automatically from past data or past experience.
 Machine learning uses various algorithms for building mathematical
models and making predictions using historical data or information.
 Currently, it is being used for various tasks such as image recognition,
speech recognition, email filtering, Facebook auto-tagging, recommender
system, and many more.
 There is no need to “learn” to calculate payroll
 Learning is used when:
 Human expertise does not exist (navigating on Mars),
 Humans are unable to explain their expertise (speech recognition)
 Solution changes in time (routing on a computer network)
 Solution needs to be adapted to particular cases (user biometrics)
What We Talk About When We
Talk About “Learning”
6
 Learning general models from a data of
particular examples
 Data is cheap and abundant (data
warehouses, data marts); knowledge is
expensive and scarce.
 Example in retail: Customer transactions to
consumer behavior:
People who bought “Blink” also bought “Outliers”
(www.amazon.com)
 Build a model that is a good and useful
approximation to the data.
Data Mining
7
 Retail: Market basket analysis, Customer
relationship management (CRM)
 Finance: Credit scoring, fraud detection
 Manufacturing: Control, robotics,
troubleshooting
 Medicine: Medical diagnosis
 Telecommunications: Spam filters, intrusion
detection
 Bioinformatics: Motifs, alignment
 Web mining: Search engines
 ...
What is Machine Learning?
8
 Optimize a performance criterion using
example data or past experience.
 Role of Statistics: Inference from a sample
 Role of Computer science: Efficient algorithms
to
 Solve the optimization problem
 Representing and evaluating the model for inference
Applications
9
 Association
 Supervised Learning
 Classification
 Regression
 Unsupervised Learning
 Reinforcement Learning
Learning Associations
10
 Basket analysis:
P (Y | X ) probability that somebody who buys
X also buys Y where X and Y are
products/services.
Example: P ( chips | beer ) = 0.7
Classification
11
 Example: Credit
scoring
 Differentiating
between low-risk
and high-risk
customers from
their income and
savings
Discriminant: IF income > θ1 AND savings > θ2
THEN low-risk ELSE high-risk
Classification: Applications
12
 Aka Pattern recognition
 Face recognition: Pose, lighting, occlusion
(glasses, beard), make-up, hair style
 Character recognition: Different handwriting
styles.
 Speech recognition: Temporal dependency.
 Medical diagnosis: From symptoms to
illnesses
 Biometrics: Recognition/authentication using
physical and/or behavioral characteristics:
Face, iris, signature, etc
 Outlier/novelty detection:
Face Recognition
13
Training examples of a person
Test images
ORL dataset,
AT&T Laboratories, Cambridge UK
Regression
 Example: Price of a
used car
 x : car attributes
y : price
y = g (x | q )
g ( ) model,
q parameters
14
y = wx+w0
Regression Applications
15
 Navigating a car: Angle of the steering
 Kinematics of a robot arm
α1= g1(x,y)
α2= g2(x,y)
α1
α2
(x,y)
 Response surface design
Supervised Learning: Uses
16
 Prediction of future cases: Use the rule to
predict the output for future inputs
 Knowledge extraction: The rule is easy to
understand
 Compression: The rule is simpler than the data
it explains
 Outlier detection: Exceptions that are not
covered by the rule, e.g., fraud
Unsupervised Learning
17
 Learning “what normally happens”
 No output
 Clustering: Grouping similar instances
 Example applications
 Customer segmentation in CRM
 Image compression: Color quantization
 Bioinformatics: Learning motifs
Reinforcement Learning
18
 Learning a policy: A sequence of outputs
 No supervised output but delayed reward
 Credit assignment problem
 Game playing
 Robot in a maze
 Multiple agents, partial observability, ...
Resources: Datasets
19
 UCI Repository:
http://www.ics.uci.edu/~mlearn/MLRepository.html
 Statlib: http://lib.stat.cmu.edu/
Resources: Journals
20
 Journal of Machine Learning Research
www.jmlr.org
 Machine Learning
 Neural Computation
 Neural Networks
 IEEE Trans on Neural Networks and Learning
Systems
 IEEE Trans on Pattern Analysis and Machine
Intelligence
 Journals on Statistics/Data Mining/Signal
Processing/Natural Language
Resources: Conferences
21
 International Conference on Machine Learning (ICML)
 European Conference on Machine Learning (ECML)
 Neural Information Processing Systems (NIPS)
 Uncertainty in Artificial Intelligence (UAI)
 Computational Learning Theory (COLT)
 International Conference on Artificial Neural Networks
(ICANN)
 International Conference on AI & Statistics (AISTATS)
 International Conference on Pattern Recognition (ICPR)
 ...

More Related Content

PPTX
introduction to machin learning
PPTX
i2ml3e-chap1.pptx
PPT
Machine Learning
PPT
Machine Learning basics with simple .ppt
PPT
ML-Topic1A.ppteeweqeqeqeqeqeqwewqqwwqeeqeqw
PPT
Ml topic1 a
PPT
i2ml-chap1-v1-1.ppt
introduction to machin learning
i2ml3e-chap1.pptx
Machine Learning
Machine Learning basics with simple .ppt
ML-Topic1A.ppteeweqeqeqeqeqeqwewqqwwqeeqeqw
Ml topic1 a
i2ml-chap1-v1-1.ppt

Similar to Introduction_to_MAchine_Learning_Advance.pptx (20)

PPT
intro to ML by the way m toh phasee movie Punjabi
PDF
DSCI 552 machine learning for data science
PPTX
Introduction to Machine Learning
PDF
Machine Learning an Research Overview
PPTX
Basics of machine learning
PPTX
Machine learning
PPTX
Unit - 1 - Introduction of the machine learning
PPT
Machine Learning Techniques all units .ppt
PPTX
Machine learning
PPTX
BIG DATA AND MACHINE LEARNING
PDF
MLT unit 1- Introduction To Machine Learning And types Of ML , Cross Validation
PPT
Introduction to Machine Learning and different types of Learning
PPTX
introduction to machine learning .pptx
PPTX
Machine Learning DR PRKRao-PPT UNIT-I.pptx
PPT
Introduction to Machine Learning
PDF
Machine Learning Deep Learning AI and Data Science
PPTX
machine learning
PPT
Machine learning introduction to unit 1.ppt
PDF
Lect 7 intro to M.L..pdf
PPTX
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
intro to ML by the way m toh phasee movie Punjabi
DSCI 552 machine learning for data science
Introduction to Machine Learning
Machine Learning an Research Overview
Basics of machine learning
Machine learning
Unit - 1 - Introduction of the machine learning
Machine Learning Techniques all units .ppt
Machine learning
BIG DATA AND MACHINE LEARNING
MLT unit 1- Introduction To Machine Learning And types Of ML , Cross Validation
Introduction to Machine Learning and different types of Learning
introduction to machine learning .pptx
Machine Learning DR PRKRao-PPT UNIT-I.pptx
Introduction to Machine Learning
Machine Learning Deep Learning AI and Data Science
machine learning
Machine learning introduction to unit 1.ppt
Lect 7 intro to M.L..pdf
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Ad

Recently uploaded (20)

PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPT
Project quality management in manufacturing
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
web development for engineering and engineering
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPT
Mechanical Engineering MATERIALS Selection
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
Welding lecture in detail for understanding
PDF
Digital Logic Computer Design lecture notes
PPTX
Lecture Notes Electrical Wiring System Components
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Project quality management in manufacturing
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
bas. eng. economics group 4 presentation 1.pptx
web development for engineering and engineering
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Mechanical Engineering MATERIALS Selection
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
CYBER-CRIMES AND SECURITY A guide to understanding
Model Code of Practice - Construction Work - 21102022 .pdf
Welding lecture in detail for understanding
Digital Logic Computer Design lecture notes
Lecture Notes Electrical Wiring System Components
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Ad

Introduction_to_MAchine_Learning_Advance.pptx

  • 1. INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN © The MIT Press, 2014 alpaydin@boun.edu.tr http://www.cmpe.boun.edu.tr/~ethem/i2ml3e Lecture Slides for
  • 3. Big Data 3  The definition of big data is data that contains greater variety, arriving in increasing volumes and with more velocity. This is also known as the three Vs.  Put simply, big data is larger, more complex data sets, especially from new data sources. These data sets are so voluminous that traditional data processing software just can’t manage them. But these massive volumes of data can be used to address business problems you wouldn’t have been able to tackle before.
  • 4. Big Data 4  Widespread use of personal computers and wireless communicationwe all became producers of data that leads to “big data”  Every time we buy a product, every time we rent a movie, visit a web page, write a blog, or post on the social media, even when we just walk or drive around, we are generating data.  We are both producers and consumers of data (Products and Services )  Think, for example, of a supermarket that is selling thousands of goods to millions of customers all over a country or through a virtual store over the web.  What the supermarket wants is to be able to predict which customer is likely to buy which product, to maximize sales and profit.  Similarly each customer wants to find the set of products best matching his/her needs.  Customer behavior changes in time and by geographic location, Data is not random, it has structure(Pattern), e.g., customer behavior  We need “big theory” to extract that structure from data for (a) Understanding the process
  • 5. Why “Learn” ? 5  Machine learning is a growing technology which enables computers to learn automatically from past data or past experience.  Machine learning uses various algorithms for building mathematical models and making predictions using historical data or information.  Currently, it is being used for various tasks such as image recognition, speech recognition, email filtering, Facebook auto-tagging, recommender system, and many more.  There is no need to “learn” to calculate payroll  Learning is used when:  Human expertise does not exist (navigating on Mars),  Humans are unable to explain their expertise (speech recognition)  Solution changes in time (routing on a computer network)  Solution needs to be adapted to particular cases (user biometrics)
  • 6. What We Talk About When We Talk About “Learning” 6  Learning general models from a data of particular examples  Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce.  Example in retail: Customer transactions to consumer behavior: People who bought “Blink” also bought “Outliers” (www.amazon.com)  Build a model that is a good and useful approximation to the data.
  • 7. Data Mining 7  Retail: Market basket analysis, Customer relationship management (CRM)  Finance: Credit scoring, fraud detection  Manufacturing: Control, robotics, troubleshooting  Medicine: Medical diagnosis  Telecommunications: Spam filters, intrusion detection  Bioinformatics: Motifs, alignment  Web mining: Search engines  ...
  • 8. What is Machine Learning? 8  Optimize a performance criterion using example data or past experience.  Role of Statistics: Inference from a sample  Role of Computer science: Efficient algorithms to  Solve the optimization problem  Representing and evaluating the model for inference
  • 9. Applications 9  Association  Supervised Learning  Classification  Regression  Unsupervised Learning  Reinforcement Learning
  • 10. Learning Associations 10  Basket analysis: P (Y | X ) probability that somebody who buys X also buys Y where X and Y are products/services. Example: P ( chips | beer ) = 0.7
  • 11. Classification 11  Example: Credit scoring  Differentiating between low-risk and high-risk customers from their income and savings Discriminant: IF income > θ1 AND savings > θ2 THEN low-risk ELSE high-risk
  • 12. Classification: Applications 12  Aka Pattern recognition  Face recognition: Pose, lighting, occlusion (glasses, beard), make-up, hair style  Character recognition: Different handwriting styles.  Speech recognition: Temporal dependency.  Medical diagnosis: From symptoms to illnesses  Biometrics: Recognition/authentication using physical and/or behavioral characteristics: Face, iris, signature, etc  Outlier/novelty detection:
  • 13. Face Recognition 13 Training examples of a person Test images ORL dataset, AT&T Laboratories, Cambridge UK
  • 14. Regression  Example: Price of a used car  x : car attributes y : price y = g (x | q ) g ( ) model, q parameters 14 y = wx+w0
  • 15. Regression Applications 15  Navigating a car: Angle of the steering  Kinematics of a robot arm α1= g1(x,y) α2= g2(x,y) α1 α2 (x,y)  Response surface design
  • 16. Supervised Learning: Uses 16  Prediction of future cases: Use the rule to predict the output for future inputs  Knowledge extraction: The rule is easy to understand  Compression: The rule is simpler than the data it explains  Outlier detection: Exceptions that are not covered by the rule, e.g., fraud
  • 17. Unsupervised Learning 17  Learning “what normally happens”  No output  Clustering: Grouping similar instances  Example applications  Customer segmentation in CRM  Image compression: Color quantization  Bioinformatics: Learning motifs
  • 18. Reinforcement Learning 18  Learning a policy: A sequence of outputs  No supervised output but delayed reward  Credit assignment problem  Game playing  Robot in a maze  Multiple agents, partial observability, ...
  • 19. Resources: Datasets 19  UCI Repository: http://www.ics.uci.edu/~mlearn/MLRepository.html  Statlib: http://lib.stat.cmu.edu/
  • 20. Resources: Journals 20  Journal of Machine Learning Research www.jmlr.org  Machine Learning  Neural Computation  Neural Networks  IEEE Trans on Neural Networks and Learning Systems  IEEE Trans on Pattern Analysis and Machine Intelligence  Journals on Statistics/Data Mining/Signal Processing/Natural Language
  • 21. Resources: Conferences 21  International Conference on Machine Learning (ICML)  European Conference on Machine Learning (ECML)  Neural Information Processing Systems (NIPS)  Uncertainty in Artificial Intelligence (UAI)  Computational Learning Theory (COLT)  International Conference on Artificial Neural Networks (ICANN)  International Conference on AI & Statistics (AISTATS)  International Conference on Pattern Recognition (ICPR)  ...