SlideShare a Scribd company logo
OPEN CV – AN INTRODUCTION TO THE VISION LIBRARY AND ITS APPLICATIONS Hemanth Haridas Co founder  VidPulp Technologies
WHAT IS THIS TALK ABOUT? An basic introduction to Open CV, a cross platform vision library. To get students interested in Vision and related fields. Introduction to the capabilities of Open CV. Applications of the library. Limitations of the library
WHAT IS THIS TALK NOT ABOUT? Not a programmatic tutorial on Open CV. Not a detailed explaination about Open CV. No math behind Open CV covered.
 
COMPUTER VISION Rapidly growing field because of cheaper and more capable cameras and affordable processing power. Vision algorithms are starting to mature. Open CV has helped computer vision grow as a field Open CV helps jump-start research by providing them with a computer vision and machine learning infrastructure
CONTENTS Introduction Who Uses OpenCV? What is Computer Vision? How complex is the problem? Details about the Library. Summary
INTRODUCTION OpenCV  ( Open  Source  C omputer  V ision)  library of programming functions for real time computer vision.  (computer efficiency) Written in optimized C and C++ Runs in windows, Linux and MAC OS. Can develop in C, C++, python, ruby, matlab
INTRODUCTION Simple to use infra  Helps to build fairly complicated applications. 500 functions spanning Factory product inspection  medical imaging Security user interface  camera calibration stereo vision robotics contains a full, general-purpose Machine Learning Library (MLL)
WHO USES OPEN CV? Surveillance images and video on the Web. (Flickr, picasa  face recognition and tagging) aerial and street-map images (such as in Google’s Street View) make heavy use of camera calibration and image stitching techniques safety monitoring, unmanned flying vehicles, or biomedical analysis. manufacturing: virtually everything that is mass-produced has been automatically  inspected at some point using computer vision.
WHO USES OPEN CV? license for OpenCV has been structured such that you can build a commercial product using all or part of OpenCV. You are under no obligation to opensource large user community that includes people from major companies (IBM, Microsoft , Intel, SONY, Siemens, and Google, to name only a few) and research centers (such as Stanford, MIT, CMU, Cambridge, and INRIA).
WHO USES OPEN CV? http://groups.yahoo.com/group/OpenCV   -  20,000 members OpenCV was a key part of the vision system in the robot from Stanford, “Stanley”, which won the $2M DARPA Grand Challenge desert robot race web maps, image scan alignment, medical image noise reduction, object analysis, security and intrusion detection systems, automatic monitoring and safety systems, manufacturing inspection systems, camera calibration, military applications, and unmanned aerial, ground, and underwater vehicles
WHAT IS COMPUTER VISION? transformation of data from a still or video camera into either a decision or a new representation. Data - > “the camera is mounted in a car” or “laser range fi nder indicates an object is 1 meter away”. Decision -> “there is a person in this scene” or “there are 14 tumor cells on this slide” new representation -> turning a color image into a grayscale image
HOW HARD CAN THAT BE? Human  brain divides the vision signals to many channels. Identifies important parts Complex feedback mechanism that is little understood. draw on cross-associations made from years of living in the world. Controls lighting through the iris.
HOW A MACHINE SEES IT? A 2d Image of a 3d object. No definite way to reconstruct the 3d image.
HOW A MACHINE SEES IT? Images are corrupted by noise and distortions. (weather, lighting, reflections, movements) Additional contextual knowledge is used  Which is helpful in matching
NOISE Edge Detection -> impossible to detect edges by comparing a point to its neighbours If the comparison is made over a localized area its easier. Compensating noise by using statistics over time. Explicit models learnt from available data. (lens distortions)
CONTEXTUAL INFORMATION The decision taken by vision algorithms depend on the application it is used for. Security system that alerts if a person tries to cross a fence. monitoring system that counts how many people cross through an area in an amusement park. Strategy for vision algos in security cameras different from that of in robots. The more constrained our context , the better the solution will be.
ABOUT THE LIBRARY aimed at providing the basic tools needed to solve computer vision problems high-level functionalities in the library will be sufficient to solve the more complex problems in computer vision the basic components in the library are complete enough to enable creation of a complete solution After you develop a first draft solution, check for weakness and fix it.
OPEN CV TIMELINE
COMPONENTS OF OPENCV
RESOURCES Download Open CV -  http://sourceforge.net/projects/opencvlibrary/ Install guide and tutorials -  http://opencv.willowgarage.com/wiki/ IDE – eclipse, .net , VC++
SUMMARY Open CV is a open source library to implement Computer vision algorithms Computer vision is a complex problem. Made easy with enough context information. Computer vision is a interesting and a fast growing field and skills in this field is niche , in demand
OCR ALGORITHMS Optical character recognition, to recognize text in scanned documents. Useful in detecting text in videos For extracting contextual information in videos. Tesseract and GOCR – open source OCRs available
SPEECH TO TEXT ALGOS Algorithms to convert from Speech to text. Text to speech conversion algos are available. Language translation research is underway.
QUESTIONS?

More Related Content

PPTX
MongoDB on AWSクラウドという選択
PDF
AWS Lambda 내부 동작 방식 및 활용 방법 자세히 살펴 보기 - 김일호 솔루션즈 아키텍트 매니저, AWS :: AWS Summit ...
PDF
현대백화점 리테일테크랩과 AWS Prototyping 팀 개발자가 들려주는 인공 지능 무인 스토어 개발 여정 - 최권열 AWS 프로토타이핑...
PPTX
Memória virtual
PDF
[Retail & CPG Day 2019] 유통 고객의 AWS 도입 동향 - 박동국, AWS 어카운트 매니저, 김준성, AWS어카운트 매니저
PDF
AWS Summit Seoul 2023 | HL Mando가 AWS IoT Fleetwise로 그리는 미래 커넥티드 모빌리티 기술
PPTX
Hvd and blu ray
PDF
Past ielts essays
MongoDB on AWSクラウドという選択
AWS Lambda 내부 동작 방식 및 활용 방법 자세히 살펴 보기 - 김일호 솔루션즈 아키텍트 매니저, AWS :: AWS Summit ...
현대백화점 리테일테크랩과 AWS Prototyping 팀 개발자가 들려주는 인공 지능 무인 스토어 개발 여정 - 최권열 AWS 프로토타이핑...
Memória virtual
[Retail & CPG Day 2019] 유통 고객의 AWS 도입 동향 - 박동국, AWS 어카운트 매니저, 김준성, AWS어카운트 매니저
AWS Summit Seoul 2023 | HL Mando가 AWS IoT Fleetwise로 그리는 미래 커넥티드 모빌리티 기술
Hvd and blu ray
Past ielts essays

Similar to Open Cv – An Introduction To The Vision (20)

DOCX
Open cv
PDF
PDF
بینایی ماشین
PPTX
OpenCV+Android.pptx
PPTX
PDF
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
DOCX
Open cv python tutorial for beginners 1
ODP
Computer vision for your projects
PDF
Draw in Air | Open CV Project
DOCX
16 OpenCV Functions to Start your Computer Vision journey.docx
PPTX
Presentation on the topic of computer vision.pptx
PDF
"The OpenCV Open Source Computer Vision Library: Latest Developments," a Pres...
PDF
Computer Vision Workshop
PPTX
20110220 computer vision_eruhimov_lecture02
PDF
Introduction to OpenCV 2.3.1
PPTX
PPTX
AI UNIT 4 - SRCAS JOC.pptx enjoy this ppt
PPTX
Computer vision
PDF
Image Processing In Open CV. Image Processing In Open CV. Image Processing In...
PDF
Image Detection and Count Using Open Computer Vision (Opencv)
Open cv
بینایی ماشین
OpenCV+Android.pptx
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
Open cv python tutorial for beginners 1
Computer vision for your projects
Draw in Air | Open CV Project
16 OpenCV Functions to Start your Computer Vision journey.docx
Presentation on the topic of computer vision.pptx
"The OpenCV Open Source Computer Vision Library: Latest Developments," a Pres...
Computer Vision Workshop
20110220 computer vision_eruhimov_lecture02
Introduction to OpenCV 2.3.1
AI UNIT 4 - SRCAS JOC.pptx enjoy this ppt
Computer vision
Image Processing In Open CV. Image Processing In Open CV. Image Processing In...
Image Detection and Count Using Open Computer Vision (Opencv)
Ad

Recently uploaded (20)

PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Pre independence Education in Inndia.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
master seminar digital applications in india
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
RMMM.pdf make it easy to upload and study
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Computing-Curriculum for Schools in Ghana
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
O5-L3 Freight Transport Ops (International) V1.pdf
Microbial diseases, their pathogenesis and prophylaxis
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Pre independence Education in Inndia.pdf
Supply Chain Operations Speaking Notes -ICLT Program
master seminar digital applications in india
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
TR - Agricultural Crops Production NC III.pdf
RMMM.pdf make it easy to upload and study
VCE English Exam - Section C Student Revision Booklet
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
102 student loan defaulters named and shamed – Is someone you know on the list?
Computing-Curriculum for Schools in Ghana
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Final Presentation General Medicine 03-08-2024.pptx
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Ad

Open Cv – An Introduction To The Vision

  • 1. OPEN CV – AN INTRODUCTION TO THE VISION LIBRARY AND ITS APPLICATIONS Hemanth Haridas Co founder VidPulp Technologies
  • 2. WHAT IS THIS TALK ABOUT? An basic introduction to Open CV, a cross platform vision library. To get students interested in Vision and related fields. Introduction to the capabilities of Open CV. Applications of the library. Limitations of the library
  • 3. WHAT IS THIS TALK NOT ABOUT? Not a programmatic tutorial on Open CV. Not a detailed explaination about Open CV. No math behind Open CV covered.
  • 4.  
  • 5. COMPUTER VISION Rapidly growing field because of cheaper and more capable cameras and affordable processing power. Vision algorithms are starting to mature. Open CV has helped computer vision grow as a field Open CV helps jump-start research by providing them with a computer vision and machine learning infrastructure
  • 6. CONTENTS Introduction Who Uses OpenCV? What is Computer Vision? How complex is the problem? Details about the Library. Summary
  • 7. INTRODUCTION OpenCV ( Open Source C omputer V ision) library of programming functions for real time computer vision. (computer efficiency) Written in optimized C and C++ Runs in windows, Linux and MAC OS. Can develop in C, C++, python, ruby, matlab
  • 8. INTRODUCTION Simple to use infra Helps to build fairly complicated applications. 500 functions spanning Factory product inspection medical imaging Security user interface camera calibration stereo vision robotics contains a full, general-purpose Machine Learning Library (MLL)
  • 9. WHO USES OPEN CV? Surveillance images and video on the Web. (Flickr, picasa face recognition and tagging) aerial and street-map images (such as in Google’s Street View) make heavy use of camera calibration and image stitching techniques safety monitoring, unmanned flying vehicles, or biomedical analysis. manufacturing: virtually everything that is mass-produced has been automatically inspected at some point using computer vision.
  • 10. WHO USES OPEN CV? license for OpenCV has been structured such that you can build a commercial product using all or part of OpenCV. You are under no obligation to opensource large user community that includes people from major companies (IBM, Microsoft , Intel, SONY, Siemens, and Google, to name only a few) and research centers (such as Stanford, MIT, CMU, Cambridge, and INRIA).
  • 11. WHO USES OPEN CV? http://groups.yahoo.com/group/OpenCV - 20,000 members OpenCV was a key part of the vision system in the robot from Stanford, “Stanley”, which won the $2M DARPA Grand Challenge desert robot race web maps, image scan alignment, medical image noise reduction, object analysis, security and intrusion detection systems, automatic monitoring and safety systems, manufacturing inspection systems, camera calibration, military applications, and unmanned aerial, ground, and underwater vehicles
  • 12. WHAT IS COMPUTER VISION? transformation of data from a still or video camera into either a decision or a new representation. Data - > “the camera is mounted in a car” or “laser range fi nder indicates an object is 1 meter away”. Decision -> “there is a person in this scene” or “there are 14 tumor cells on this slide” new representation -> turning a color image into a grayscale image
  • 13. HOW HARD CAN THAT BE? Human brain divides the vision signals to many channels. Identifies important parts Complex feedback mechanism that is little understood. draw on cross-associations made from years of living in the world. Controls lighting through the iris.
  • 14. HOW A MACHINE SEES IT? A 2d Image of a 3d object. No definite way to reconstruct the 3d image.
  • 15. HOW A MACHINE SEES IT? Images are corrupted by noise and distortions. (weather, lighting, reflections, movements) Additional contextual knowledge is used Which is helpful in matching
  • 16. NOISE Edge Detection -> impossible to detect edges by comparing a point to its neighbours If the comparison is made over a localized area its easier. Compensating noise by using statistics over time. Explicit models learnt from available data. (lens distortions)
  • 17. CONTEXTUAL INFORMATION The decision taken by vision algorithms depend on the application it is used for. Security system that alerts if a person tries to cross a fence. monitoring system that counts how many people cross through an area in an amusement park. Strategy for vision algos in security cameras different from that of in robots. The more constrained our context , the better the solution will be.
  • 18. ABOUT THE LIBRARY aimed at providing the basic tools needed to solve computer vision problems high-level functionalities in the library will be sufficient to solve the more complex problems in computer vision the basic components in the library are complete enough to enable creation of a complete solution After you develop a first draft solution, check for weakness and fix it.
  • 21. RESOURCES Download Open CV - http://sourceforge.net/projects/opencvlibrary/ Install guide and tutorials - http://opencv.willowgarage.com/wiki/ IDE – eclipse, .net , VC++
  • 22. SUMMARY Open CV is a open source library to implement Computer vision algorithms Computer vision is a complex problem. Made easy with enough context information. Computer vision is a interesting and a fast growing field and skills in this field is niche , in demand
  • 23. OCR ALGORITHMS Optical character recognition, to recognize text in scanned documents. Useful in detecting text in videos For extracting contextual information in videos. Tesseract and GOCR – open source OCRs available
  • 24. SPEECH TO TEXT ALGOS Algorithms to convert from Speech to text. Text to speech conversion algos are available. Language translation research is underway.