Disp2Flow: Optical flow synthesis using
pre-trained disparity estimation networks
전기정보공학부 김준호
Disp2Flow: Optical flow synthesis using
pre-trained disparity estimation networks
전기정보공학부 김준호
Deep learning trends + My Research
전기정보공학부 김준호
Contents
• What is deep learning?
• Growing trends in deep learning
• Big players in deep learning
• Deep learning @Korea
• My research
What is deep learning?
What is deep learning?
What is machine learning?
• Data -> Learn -> Knowledge (Make predictions on unseen data)
• Examples
Support Vector Machines Gaussian Process Regression
What is machine learning?(cont.)
FAQ: How does unsupervised learning work?
Unsupervised learning is cool but…
• In many cases some form of supervision is necessary
• “Fully unsupervised” learning is impossible -> Some assumptions
about input data should be given!
• No free lunch theorem
• Let’s only focus on supervised learning for now!
Supervised learning
• Goal: Find a function “f” that maps input data to output domain
• Input & output domain could be anything
• Examples
• Input: Images -> Output: Label
• Input: Korean -> Output: English
What is deep learning?
• Consider a supervised learning problem where we want to predict
whether an image is a cat or not
• Again, we want to find “f”
• The relationship between an image and its label will be very complex
• Probably not “y = ax + b”
• We can build “complex” functions using composition!
• Stack “a lot” of functions -> deep learning
What is deep learning?
Y = f(a*f(c*f(e*f….(x)+g)+d) + b)
Growing(Bad) trends in deep learning
Hmmm…
Major AI research areas
Major AI research areas
Top-tier machine learning conferences
• Computer vision
• CVPR
• ICCV
• ECCV
• Natural language processing
• ACL
• EMNLP
• NAACL-HLT
• Machine learning
• NIPS
• ICML
• ICLR
Number of submissions
• CVPR – top tier conference in computer vision
• Acceptance rate: approx. 23%
Number of submissions
• NIPS – top tier machine learning conference
• Acceptance rate: approx. 20%
A typical review process
• Most machine learning conferences use a peer-review system
• Papers are submitted to different “areas”
• Ex) If one submits a paper on classifying cats, it will go under the category of
“image classification”
• Each “area” has an “area chair” – a person who is in charge of all
submission to that “area”
• “Area chairs” distribute the submitted papers to reviewers, who will
eventually decide accept/reject
More papers are good but…
• It means we need more reviewers
• However, the total number of reviewers is limited
• A lot of bad quality reviewers are present
Deep learning trends
Big players in deep learning
= Diamond sponsors of top-tier conferences
CVPR
ACL
Ilya Sutskever
• Research scientist @OpenAI
• Salary - $1.9 million
Ian Goodfellow
• Research scientist @ Google brain
• Salary - $800,000
Deep learning@Korea
Number of top-tier conference papers are
increasing!
• CVPR 2017, 2018 -> More than 20 papers accepted
• NIPS 2018 -> 20 papers accepted (~5 in NIPS 2017)
• SNU ECE: 3
• SNU CS: 1
• SNU 융기원: 2
• KAIST EE: 5
• KAIST CS: 3
• Yonsei EE: 1
• Yonsei CS: 1
• Kakao brain: 1
• NAVER: 1
• SK T-brain: 1
• Lunit: 1
“Four” companies
My research
Disparity
• Finding disparity leads to depth estimation in stereo vision
• In order to find disparity, we need rectified stereo pairs
• Matching points should be on the ‘same’ horizontal line
Rectified images
Optical flow
• Optical flow is a more general problem, where we want to find
individual movements of pixels
Optical flow -> Disparity
• A 90s paper
Disparity -> Optical flow
• Optical flow training data is hard to find
• In many cases synthetic data is used
• Could we use pre-trained models for disparity estimation to find
optical flow?
Disparity -> Optical flow
• Input: Image pair
• Output: Optical flow
• Novelty: Obtain optical flow estimation using networks trained on
stereo disparity estimation
• Method overview
• Pre-train a neural network(DispNet-Horizontal) on stereo disparity data
• Using basic rotation operations, obtain another neural network(DispNet-
Vertical) on rotated stereo disparity data
• Train a domain transfer network to map optical flow target images to stereo
images(automatic rectification)
• Fine-tune domain transfer network using optical flow ground truth
Disparity -> Optical flow (initial version)
Disparity -> Optical flow (new version)
Disparity -> Optical flow(TODO)
• Pre-train dispnet
• Download optical flow dataset
• Test adequate volume for discriminator
• Implement filter rotation
Questions?

More Related Content

PPTX
Utilizing additional information in factorization methods (research overview,...
PDF
Context-aware preference modeling with factorization
PPTX
Human pose estimation with deep learning
PDF
李俊良/Feature Engineering in Machine Learning
PDF
Jeff Dean at AI Frontiers: Trends and Developments in Deep Learning Research
PDF
Infrastructure for the work of Data Scientists
PDF
Quoc le, slides MLconf 11/15/13
PDF
PR-278: RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
Utilizing additional information in factorization methods (research overview,...
Context-aware preference modeling with factorization
Human pose estimation with deep learning
李俊良/Feature Engineering in Machine Learning
Jeff Dean at AI Frontiers: Trends and Developments in Deep Learning Research
Infrastructure for the work of Data Scientists
Quoc le, slides MLconf 11/15/13
PR-278: RAFT: Recurrent All-Pairs Field Transforms for Optical Flow

Similar to Deep learning trends (20)

PDF
Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
PDF
Intro to Neural Networks
PDF
The (very) basics of AI for the Radiology resident
PDF
MLIP - Chapter 3 - Introduction to deep learning
PDF
Deep Learning, an interactive introduction for NLP-ers
PPTX
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
PDF
Introduction to Recurrent Neural Network
PDF
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
PPTX
Deep Learning with Python (PyData Seattle 2015)
PPTX
Deep Learning in Recommender Systems - RecSys Summer School 2017
PPTX
Object extraction from satellite imagery using deep learning
PPTX
Computer vision introduction
PDF
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
PPTX
Introduction to deep learning
PPTX
Deep learning introduction
PDF
Sean Kandel - Data profiling: Assessing the overall content and quality of a ...
PDF
Introduction talk to Computer Vision
PPTX
Promises of Deep Learning
PDF
Introduction to Neural Network
PDF
Approximate nearest neighbor methods and vector models – NYC ML meetup
Wastian, Brunmeir - Data Analyses in Industrial Applications: From Predictive...
Intro to Neural Networks
The (very) basics of AI for the Radiology resident
MLIP - Chapter 3 - Introduction to deep learning
Deep Learning, an interactive introduction for NLP-ers
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Recurrent Neural Network
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
Deep Learning with Python (PyData Seattle 2015)
Deep Learning in Recommender Systems - RecSys Summer School 2017
Object extraction from satellite imagery using deep learning
Computer vision introduction
MDEC Data Matters Series: machine learning and Deep Learning, A Primer
Introduction to deep learning
Deep learning introduction
Sean Kandel - Data profiling: Assessing the overall content and quality of a ...
Introduction talk to Computer Vision
Promises of Deep Learning
Introduction to Neural Network
Approximate nearest neighbor methods and vector models – NYC ML meetup
Ad

Recently uploaded (20)

PPTX
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
PDF
Accra-Kumasi Expressway - Prefeasibility Report Volume 1 of 7.11.2018.pdf
PPTX
Module 8- Technological and Communication Skills.pptx
PPTX
wireless networks, mobile computing.pptx
PDF
20250617 - IR - Global Guide for HR - 51 pages.pdf
PDF
Computer organization and architecuture Digital Notes....pdf
PDF
August -2025_Top10 Read_Articles_ijait.pdf
PDF
Applications of Equal_Area_Criterion.pdf
PPTX
ASME PCC-02 TRAINING -DESKTOP-NLE5HNP.pptx
PPTX
Measurement Uncertainty and Measurement System analysis
PPTX
tack Data Structure with Array and Linked List Implementation, Push and Pop O...
PPTX
Chapter 2 -Technology and Enginerring Materials + Composites.pptx
PDF
Computer System Architecture 3rd Edition-M Morris Mano.pdf
PPTX
Feature types and data preprocessing steps
PPTX
Principal presentation for NAAC (1).pptx
PDF
UEFA_Embodied_Carbon_Emissions_Football_Infrastructure.pdf
PPTX
PRASUNET_20240614003_231416_0000[1].pptx
PDF
Java Basics-Introduction and program control
PPTX
Chemical Technological Processes, Feasibility Study and Chemical Process Indu...
PDF
UEFA_Carbon_Footprint_Calculator_Methology_2.0.pdf
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
Accra-Kumasi Expressway - Prefeasibility Report Volume 1 of 7.11.2018.pdf
Module 8- Technological and Communication Skills.pptx
wireless networks, mobile computing.pptx
20250617 - IR - Global Guide for HR - 51 pages.pdf
Computer organization and architecuture Digital Notes....pdf
August -2025_Top10 Read_Articles_ijait.pdf
Applications of Equal_Area_Criterion.pdf
ASME PCC-02 TRAINING -DESKTOP-NLE5HNP.pptx
Measurement Uncertainty and Measurement System analysis
tack Data Structure with Array and Linked List Implementation, Push and Pop O...
Chapter 2 -Technology and Enginerring Materials + Composites.pptx
Computer System Architecture 3rd Edition-M Morris Mano.pdf
Feature types and data preprocessing steps
Principal presentation for NAAC (1).pptx
UEFA_Embodied_Carbon_Emissions_Football_Infrastructure.pdf
PRASUNET_20240614003_231416_0000[1].pptx
Java Basics-Introduction and program control
Chemical Technological Processes, Feasibility Study and Chemical Process Indu...
UEFA_Carbon_Footprint_Calculator_Methology_2.0.pdf
Ad

Deep learning trends

  • 1. Disp2Flow: Optical flow synthesis using pre-trained disparity estimation networks 전기정보공학부 김준호
  • 2. Disp2Flow: Optical flow synthesis using pre-trained disparity estimation networks 전기정보공학부 김준호
  • 3. Deep learning trends + My Research 전기정보공학부 김준호
  • 4. Contents • What is deep learning? • Growing trends in deep learning • Big players in deep learning • Deep learning @Korea • My research
  • 5. What is deep learning?
  • 6. What is deep learning?
  • 7. What is machine learning? • Data -> Learn -> Knowledge (Make predictions on unseen data) • Examples Support Vector Machines Gaussian Process Regression
  • 8. What is machine learning?(cont.)
  • 9. FAQ: How does unsupervised learning work?
  • 10. Unsupervised learning is cool but… • In many cases some form of supervision is necessary • “Fully unsupervised” learning is impossible -> Some assumptions about input data should be given! • No free lunch theorem • Let’s only focus on supervised learning for now!
  • 11. Supervised learning • Goal: Find a function “f” that maps input data to output domain • Input & output domain could be anything • Examples • Input: Images -> Output: Label • Input: Korean -> Output: English
  • 12. What is deep learning? • Consider a supervised learning problem where we want to predict whether an image is a cat or not • Again, we want to find “f” • The relationship between an image and its label will be very complex • Probably not “y = ax + b” • We can build “complex” functions using composition! • Stack “a lot” of functions -> deep learning
  • 13. What is deep learning? Y = f(a*f(c*f(e*f….(x)+g)+d) + b)
  • 14. Growing(Bad) trends in deep learning
  • 18. Top-tier machine learning conferences • Computer vision • CVPR • ICCV • ECCV • Natural language processing • ACL • EMNLP • NAACL-HLT • Machine learning • NIPS • ICML • ICLR
  • 19. Number of submissions • CVPR – top tier conference in computer vision • Acceptance rate: approx. 23%
  • 20. Number of submissions • NIPS – top tier machine learning conference • Acceptance rate: approx. 20%
  • 21. A typical review process • Most machine learning conferences use a peer-review system • Papers are submitted to different “areas” • Ex) If one submits a paper on classifying cats, it will go under the category of “image classification” • Each “area” has an “area chair” – a person who is in charge of all submission to that “area” • “Area chairs” distribute the submitted papers to reviewers, who will eventually decide accept/reject
  • 22. More papers are good but… • It means we need more reviewers • However, the total number of reviewers is limited • A lot of bad quality reviewers are present
  • 24. Big players in deep learning
  • 25. = Diamond sponsors of top-tier conferences
  • 26. CVPR
  • 27. ACL
  • 28. Ilya Sutskever • Research scientist @OpenAI • Salary - $1.9 million
  • 29. Ian Goodfellow • Research scientist @ Google brain • Salary - $800,000
  • 31. Number of top-tier conference papers are increasing! • CVPR 2017, 2018 -> More than 20 papers accepted • NIPS 2018 -> 20 papers accepted (~5 in NIPS 2017) • SNU ECE: 3 • SNU CS: 1 • SNU 융기원: 2 • KAIST EE: 5 • KAIST CS: 3 • Yonsei EE: 1 • Yonsei CS: 1 • Kakao brain: 1 • NAVER: 1 • SK T-brain: 1 • Lunit: 1
  • 34. Disparity • Finding disparity leads to depth estimation in stereo vision • In order to find disparity, we need rectified stereo pairs • Matching points should be on the ‘same’ horizontal line
  • 36. Optical flow • Optical flow is a more general problem, where we want to find individual movements of pixels
  • 37. Optical flow -> Disparity • A 90s paper
  • 38. Disparity -> Optical flow • Optical flow training data is hard to find • In many cases synthetic data is used • Could we use pre-trained models for disparity estimation to find optical flow?
  • 39. Disparity -> Optical flow • Input: Image pair • Output: Optical flow • Novelty: Obtain optical flow estimation using networks trained on stereo disparity estimation • Method overview • Pre-train a neural network(DispNet-Horizontal) on stereo disparity data • Using basic rotation operations, obtain another neural network(DispNet- Vertical) on rotated stereo disparity data • Train a domain transfer network to map optical flow target images to stereo images(automatic rectification) • Fine-tune domain transfer network using optical flow ground truth
  • 40. Disparity -> Optical flow (initial version)
  • 41. Disparity -> Optical flow (new version)
  • 42. Disparity -> Optical flow(TODO) • Pre-train dispnet • Download optical flow dataset • Test adequate volume for discriminator • Implement filter rotation