SlideShare a Scribd company logo
COMPUTER VISION, MACHINE, AND DEEP
LEARNING WITH PYTHON
Dr.Eng. Igi Ardiyanto
PROFILE
Igi Ardiyanto
Field of Interest :
Robotics
Computer Vision
Intelligent Transportation System
Embedded System
Parallel Computing
Deep Learning
More Information ??
http://te.ugm.ac.id/~igi
What is Computer
Vision?
Computer Vision, Machine, and Deep Learning with Python
COMPUTER VISION
Make computers understand images and video
What kind of scene?
Where are the people?
How far is the
building?
Where is Waldo?
Like when human “sees” something …..
VISION IS REALLY HARD
 Vision is an amazing feat of natural
intelligence
 Visual cortex occupies about 50%
of Macaque brain
 More human brain devoted to
vision than anything else
Sik…sik…. Iki
dolanan opo
panganan, cuk?
OPTICAL CHARACTER RECOGNITION (OCR)
Digit recognition, AT&T labs
http://www.research.att.com/~yann/
Technology to convert scanned docs to text
• If you have a scanner, it probably came with OCR software
License plate readers
http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
FACE DETECTION
 Many new digital cameras now detect faces
 Canon, Sony, Fuji, …
SMILE DETECTION
Sony Cyber-shot® T70 Digital Still Camera
What is Machine
Learning?
Computer Vision, Machine, and Deep Learning with Python
MACHINE LEARNING
 Machine learning is programming computers to optimize a
performance criterion using example data or past experience.
 There is no need to “learn” to calculate payroll
 Learning is used when:
 Human expertise does not exist (navigating on Mars),
 Humans are unable to explain their expertise (speech
recognition)
 Solution changes in time (routing on a computer network)
 Solution needs to be adapted to particular cases (user biometrics)
COMPUTER VISION MEETS MACHINE LEARNING
Dog
Cat
Raccoon
Dog
Train:
Deploy:
Training
Labels
Training
Image
Features
Prediction
Image
Features
Learned
model
IMAGE FEATURES ??
 Color
 Histograms
 Shape
 …
Slide credit: L. Lazebnik
VERY BRIEF TOUR OF SOME CLASSIFIERS
 K-nearest neighbor
 SVM
 Boosted Decision Trees
 Neural networks
 Naïve Bayes
 Bayesian network
 Gaussian Logistic regression
 Random Forests
 RBMs
 Etc.
FACIAL ATTRACTIVENESS PREDICTION
Yoona: Score 3.6 Yuri: Score 3.4 Tiffany: Score 3.8
FACIAL ATTRACTIVENESS PREDICTION
https://github.com/avisingh599/face-rating
Yoona:
Score 3.6
Yuri:
Score 3.4
Tiffany:
Score 3.8
What is Deep
Learning?
Computer Vision, Machine, and Deep Learning with Python
1) A host of statistical machine
learning techniques
2) Enables the automatic learning
of feature hierarchies
3) Generally based on artificial
neural networks
DEEP LEARNING
 English and Mandarin speech recognition
 Transition from English to Mandarin made simpler by end-to-end
DL
 No feature engineering or Mandarin-specificsrequired
 More accurate than humans
 Error rate 3.7% vs. 4% for human tests
http://arxiv.org/abs/1512.02595
END-TO-END DEEP LEARNING FOR ENGLISH AND MANDARIN SPEECH
RECOGNITION
BAIDU DEEP SPEECH 2
FIRST COMPUTER PROGRAM TO BEAT A HUMAN GO PROFESSIONAL
Training DNNs : 3 weeks, 340 million training steps on 50 GPUs
Play : Asynchronousmulti-threadedsearch
Simulations on CPUs, policy and value DNNs in parallel on
GPUs Single machine: 40 search threads, 48 CPUs, and 8
GPUs
Distributed version: 40 search threads, 1202 CPUs and
176 GPUs
Outcome: Beat both European and World Go champions in
best of 5 matches
ALPHA-GO
DEEP LEARNING EVERYWHERE
INTERNET & CLOUD
Image Classification
Speech Recognition
Language Translation
Language Processing
Sentiment Analysis
Recommendation
MEDIA &
ENTERTAINMENT
Video Captioning
Video Search
Real Time
Translation
AUTONOMOUS MACHINES
Pedestrian Detection
Lane Tracking
Recognize Traffic Sign
SECURITY &
DEFENSE
Face Detection
Video Surveillance
Satellite Imagery
MEDICINE & BIOLOGY
Cancer Cell
Detection Diabetic
Grading Drug
Discovery
So what’s the f*** there
for Python?
Computer Vision, Machine, and Deep Learning with Python
WHAT IS PYTHON?
 General purpose interpreted programming language
 Widely used by scientists and programmers of all stripes
 Supported by many 3rd-party libraries (currently 21,054 on the
main python package website)
 Free!
WHY IS IT WELL-SUITED TO SCIENCE?
 NumPy
 Numerical library for python
 Written in C, wrapped by python
 Fast
 Scipy
 Built on top of NumPy (i.e. Also fast!)
 Common maths, science, engineering routines
 Matplotlib
 Hugely flexible plotting library
 Similar syntax to Matlab
 Produces publication-quality output
WHY IS PYTHON BETTER THAN WHAT I USE NOW?
 It can do everything
 Fast mathematical operations
 Easy file manipulation
 Format conversion
 Plotting
 Scripting
 Command line
 OK, not everything
 Write thesis for you
Python has a wide range of deep learning-related libraries available
Low level
High level
(efficient gpu-powered math)
(theano-wrapper, models in python code,
abstracts theano away)
(wrapper for theano, yaml, experiment-oriented)
(computer-vision oriented DL framework,
model-zoo, prototxt model definitions)
pythonification ongoing!
(theano-extension, models in python code,
theano not hidden)
and of course:
HOW EASY TO PROGRAM??
HOW EASY TO PROGRAM??
DEMO

More Related Content

PPTX
Deep learning
PPTX
Image Classification using deep learning
PDF
Deepfake detection
PPTX
Computer vision introduction
PDF
Convolutional Neural Network Models - Deep Learning
PDF
Deep learning - A Visual Introduction
PPTX
Computer vision
PPTX
AI Computer vision
Deep learning
Image Classification using deep learning
Deepfake detection
Computer vision introduction
Convolutional Neural Network Models - Deep Learning
Deep learning - A Visual Introduction
Computer vision
AI Computer vision

What's hot (20)

PPTX
Computer vision
PPTX
Introduction to CNN
PPTX
Image classification with Deep Neural Networks
PDF
Vector Search for Data Scientists.pdf
PPTX
Introduction to Deep learning
PPTX
Computer Vision
PPTX
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
PPTX
What is Deep Learning?
PPTX
Object detection with Tensorflow Api
PDF
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
PDF
Image analysis using python
PDF
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
PPTX
What is computer vision?
PPTX
Transfer Learning and Fine-tuning Deep Neural Networks
PDF
HRNET : Deep High-Resolution Representation Learning for Human Pose Estimation
PDF
Deepfakes - How they work and what it means for the future
PDF
CIFAR-10
PPTX
Resnet.pptx
PDF
Python for Image Understanding: Deep Learning with Convolutional Neural Nets
PDF
OpenCV Introduction
Computer vision
Introduction to CNN
Image classification with Deep Neural Networks
Vector Search for Data Scientists.pdf
Introduction to Deep learning
Computer Vision
Transfer Learning and Fine Tuning for Cross Domain Image Classification with ...
What is Deep Learning?
Object detection with Tensorflow Api
SSII2021 [SS2] Deepfake Generation and Detection – An Overview (ディープフェイクの生成と検出)
Image analysis using python
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
What is computer vision?
Transfer Learning and Fine-tuning Deep Neural Networks
HRNET : Deep High-Resolution Representation Learning for Human Pose Estimation
Deepfakes - How they work and what it means for the future
CIFAR-10
Resnet.pptx
Python for Image Understanding: Deep Learning with Convolutional Neural Nets
OpenCV Introduction
Ad

Viewers also liked (20)

PPTX
Structure Learning of Bayesian Networks with p Nodes from n Samples when n&lt...
PDF
Face Recognition Based on Deep Learning (Yurii Pashchenko Technology Stream)
PPTX
Pattern Recognition and Machine Learning : Graphical Models
PPT
Muzammil Abdulrahman PPT On Gabor Wavelet Transform (GWT) Based Facial Expres...
PDF
Using Gradient Descent for Optimization and Learning
PPTX
DIY Deep Learning with Caffe Workshop
PDF
Face recognition and deep learning โดย ดร. สรรพฤทธิ์ มฤคทัต NECTEC
PDF
Facebook Deep face
PPTX
Caffe framework tutorial
PDF
Caffe - A deep learning framework (Ramin Fahimi)
PPTX
Semi fragile watermarking
PPTX
Caffe framework tutorial2
PDF
Processor, Compiler and Python Programming Language
PDF
[AI07] Revolutionizing Image Processing with Cognitive Toolkit
PPTX
Optimization in deep learning
PPT
портфоліо Бабич О.А.
PDF
Center loss for Face Recognition
PDF
Rattani - Ph.D. Defense Slides
PDF
怖くない誤差逆伝播法 Chainerを添えて
PDF
Pattern Recognition and Machine Learning: Section 3.3
Structure Learning of Bayesian Networks with p Nodes from n Samples when n&lt...
Face Recognition Based on Deep Learning (Yurii Pashchenko Technology Stream)
Pattern Recognition and Machine Learning : Graphical Models
Muzammil Abdulrahman PPT On Gabor Wavelet Transform (GWT) Based Facial Expres...
Using Gradient Descent for Optimization and Learning
DIY Deep Learning with Caffe Workshop
Face recognition and deep learning โดย ดร. สรรพฤทธิ์ มฤคทัต NECTEC
Facebook Deep face
Caffe framework tutorial
Caffe - A deep learning framework (Ramin Fahimi)
Semi fragile watermarking
Caffe framework tutorial2
Processor, Compiler and Python Programming Language
[AI07] Revolutionizing Image Processing with Cognitive Toolkit
Optimization in deep learning
портфоліо Бабич О.А.
Center loss for Face Recognition
Rattani - Ph.D. Defense Slides
怖くない誤差逆伝播法 Chainerを添えて
Pattern Recognition and Machine Learning: Section 3.3
Ad

Similar to Computer vision, machine, and deep learning (20)

PPTX
Machine learningfor computervision_ashutoshupadhyay
PDF
IRJET- Python Libraries and Packages for Deep Learning-A Survey
PPTX
Welcome-to-AI-Focused-CourseLast.pptx
PDF
Utilization of Machine Learning in Computer Vision
PDF
PDF
Face Detection Using Artificial Intelligence and Machine Learning with Python
PPTX
Strata London - Deep Learning 05-2015
PPTX
Automatic Attendace using convolutional neural network Face Recognition
PDF
Machine Learning Overview: How did we get here ?
PPTX
Image classification using convolutional neural network
PDF
Introduction to Deep Learning: Concepts, Architectures, and Applications
PDF
Fontys - Demystify AI. Wat is er mogelijk met AI en wat niet?
PPTX
Unit 4 Object Recognition and Classification.pptx
PDF
Everything You Need to Know About Computer Vision
PDF
Presentation v3
PDF
Image Classification and Annotation Using Deep Learning
PDF
20 interesting-applications-of-deep-learning-with-python
PPTX
B4UConference_machine learning_deeplearning
PDF
Ai=ml academic-institutions-Webinar
Machine learningfor computervision_ashutoshupadhyay
IRJET- Python Libraries and Packages for Deep Learning-A Survey
Welcome-to-AI-Focused-CourseLast.pptx
Utilization of Machine Learning in Computer Vision
Face Detection Using Artificial Intelligence and Machine Learning with Python
Strata London - Deep Learning 05-2015
Automatic Attendace using convolutional neural network Face Recognition
Machine Learning Overview: How did we get here ?
Image classification using convolutional neural network
Introduction to Deep Learning: Concepts, Architectures, and Applications
Fontys - Demystify AI. Wat is er mogelijk met AI en wat niet?
Unit 4 Object Recognition and Classification.pptx
Everything You Need to Know About Computer Vision
Presentation v3
Image Classification and Annotation Using Deep Learning
20 interesting-applications-of-deep-learning-with-python
B4UConference_machine learning_deeplearning
Ai=ml academic-institutions-Webinar

Recently uploaded (20)

PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPTX
OOP with Java - Java Introduction (Basics)
DOCX
573137875-Attendance-Management-System-original
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
Welding lecture in detail for understanding
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Well-logging-methods_new................
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPTX
Geodesy 1.pptx...............................................
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
UNIT 4 Total Quality Management .pptx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
OOP with Java - Java Introduction (Basics)
573137875-Attendance-Management-System-original
R24 SURVEYING LAB MANUAL for civil enggi
Welding lecture in detail for understanding
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Well-logging-methods_new................
CYBER-CRIMES AND SECURITY A guide to understanding
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Geodesy 1.pptx...............................................
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Internet of Things (IOT) - A guide to understanding
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
UNIT 4 Total Quality Management .pptx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Model Code of Practice - Construction Work - 21102022 .pdf

Computer vision, machine, and deep learning

  • 1. COMPUTER VISION, MACHINE, AND DEEP LEARNING WITH PYTHON Dr.Eng. Igi Ardiyanto
  • 2. PROFILE Igi Ardiyanto Field of Interest : Robotics Computer Vision Intelligent Transportation System Embedded System Parallel Computing Deep Learning More Information ?? http://te.ugm.ac.id/~igi
  • 3. What is Computer Vision? Computer Vision, Machine, and Deep Learning with Python
  • 4. COMPUTER VISION Make computers understand images and video What kind of scene? Where are the people? How far is the building? Where is Waldo? Like when human “sees” something …..
  • 5. VISION IS REALLY HARD  Vision is an amazing feat of natural intelligence  Visual cortex occupies about 50% of Macaque brain  More human brain devoted to vision than anything else Sik…sik…. Iki dolanan opo panganan, cuk?
  • 6. OPTICAL CHARACTER RECOGNITION (OCR) Digit recognition, AT&T labs http://www.research.att.com/~yann/ Technology to convert scanned docs to text • If you have a scanner, it probably came with OCR software License plate readers http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
  • 7. FACE DETECTION  Many new digital cameras now detect faces  Canon, Sony, Fuji, …
  • 8. SMILE DETECTION Sony Cyber-shot® T70 Digital Still Camera
  • 9. What is Machine Learning? Computer Vision, Machine, and Deep Learning with Python
  • 10. MACHINE LEARNING  Machine learning is programming computers to optimize a performance criterion using example data or past experience.  There is no need to “learn” to calculate payroll  Learning is used when:  Human expertise does not exist (navigating on Mars),  Humans are unable to explain their expertise (speech recognition)  Solution changes in time (routing on a computer network)  Solution needs to be adapted to particular cases (user biometrics)
  • 11. COMPUTER VISION MEETS MACHINE LEARNING Dog Cat Raccoon Dog Train: Deploy: Training Labels Training Image Features Prediction Image Features Learned model
  • 12. IMAGE FEATURES ??  Color  Histograms  Shape  … Slide credit: L. Lazebnik
  • 13. VERY BRIEF TOUR OF SOME CLASSIFIERS  K-nearest neighbor  SVM  Boosted Decision Trees  Neural networks  Naïve Bayes  Bayesian network  Gaussian Logistic regression  Random Forests  RBMs  Etc.
  • 14. FACIAL ATTRACTIVENESS PREDICTION Yoona: Score 3.6 Yuri: Score 3.4 Tiffany: Score 3.8
  • 16. What is Deep Learning? Computer Vision, Machine, and Deep Learning with Python
  • 17. 1) A host of statistical machine learning techniques 2) Enables the automatic learning of feature hierarchies 3) Generally based on artificial neural networks DEEP LEARNING
  • 18.  English and Mandarin speech recognition  Transition from English to Mandarin made simpler by end-to-end DL  No feature engineering or Mandarin-specificsrequired  More accurate than humans  Error rate 3.7% vs. 4% for human tests http://arxiv.org/abs/1512.02595 END-TO-END DEEP LEARNING FOR ENGLISH AND MANDARIN SPEECH RECOGNITION BAIDU DEEP SPEECH 2
  • 19. FIRST COMPUTER PROGRAM TO BEAT A HUMAN GO PROFESSIONAL Training DNNs : 3 weeks, 340 million training steps on 50 GPUs Play : Asynchronousmulti-threadedsearch Simulations on CPUs, policy and value DNNs in parallel on GPUs Single machine: 40 search threads, 48 CPUs, and 8 GPUs Distributed version: 40 search threads, 1202 CPUs and 176 GPUs Outcome: Beat both European and World Go champions in best of 5 matches ALPHA-GO
  • 20. DEEP LEARNING EVERYWHERE INTERNET & CLOUD Image Classification Speech Recognition Language Translation Language Processing Sentiment Analysis Recommendation MEDIA & ENTERTAINMENT Video Captioning Video Search Real Time Translation AUTONOMOUS MACHINES Pedestrian Detection Lane Tracking Recognize Traffic Sign SECURITY & DEFENSE Face Detection Video Surveillance Satellite Imagery MEDICINE & BIOLOGY Cancer Cell Detection Diabetic Grading Drug Discovery
  • 21. So what’s the f*** there for Python? Computer Vision, Machine, and Deep Learning with Python
  • 22. WHAT IS PYTHON?  General purpose interpreted programming language  Widely used by scientists and programmers of all stripes  Supported by many 3rd-party libraries (currently 21,054 on the main python package website)  Free!
  • 23. WHY IS IT WELL-SUITED TO SCIENCE?  NumPy  Numerical library for python  Written in C, wrapped by python  Fast  Scipy  Built on top of NumPy (i.e. Also fast!)  Common maths, science, engineering routines  Matplotlib  Hugely flexible plotting library  Similar syntax to Matlab  Produces publication-quality output
  • 24. WHY IS PYTHON BETTER THAN WHAT I USE NOW?  It can do everything  Fast mathematical operations  Easy file manipulation  Format conversion  Plotting  Scripting  Command line  OK, not everything  Write thesis for you
  • 25. Python has a wide range of deep learning-related libraries available Low level High level (efficient gpu-powered math) (theano-wrapper, models in python code, abstracts theano away) (wrapper for theano, yaml, experiment-oriented) (computer-vision oriented DL framework, model-zoo, prototxt model definitions) pythonification ongoing! (theano-extension, models in python code, theano not hidden) and of course:
  • 26. HOW EASY TO PROGRAM??
  • 27. HOW EASY TO PROGRAM??
  • 28. DEMO