Deep learning trends

Disp2Flow: Optical flow synthesis using
pre-trained disparity estimation networks
전기정보공학부 김준호

Deep learning trends + My Research
전기정보공학부 김준호

Contents
• What is deep learning?
• Growing trends in deep learning
• Big players in deep learning
• Deep learning @Korea
• My research

What is machine learning?
• Data -> Learn -> Knowledge (Make predictions on unseen data)
• Examples
Support Vector Machines Gaussian Process Regression

What is machine learning?(cont.)

FAQ: How does unsupervised learning work?

Unsupervised learning is cool but…
• In many cases some form of supervision is necessary
• “Fully unsupervised” learning is impossible -> Some assumptions
about input data should be given!
• No free lunch theorem
• Let’s only focus on supervised learning for now!

Supervised learning
• Goal: Find a function “f” that maps input data to output domain
• Input & output domain could be anything
• Examples
• Input: Images -> Output: Label
• Input: Korean -> Output: English

What is deep learning?
• Consider a supervised learning problem where we want to predict
whether an image is a cat or not
• Again, we want to find “f”
• The relationship between an image and its label will be very complex
• Probably not “y = ax + b”
• We can build “complex” functions using composition!
• Stack “a lot” of functions -> deep learning

What is deep learning?
Y = f(a*f(c*f(e*f….(x)+g)+d) + b)

Growing(Bad) trends in deep learning

Top-tier machine learning conferences
• Computer vision
• CVPR
• ICCV
• ECCV
• Natural language processing
• ACL
• EMNLP
• NAACL-HLT
• Machine learning
• NIPS
• ICML
• ICLR

Number of submissions
• CVPR – top tier conference in computer vision
• Acceptance rate: approx. 23%

Number of submissions
• NIPS – top tier machine learning conference
• Acceptance rate: approx. 20%

A typical review process
• Most machine learning conferences use a peer-review system
• Papers are submitted to different “areas”
• Ex) If one submits a paper on classifying cats, it will go under the category of
“image classification”
• Each “area” has an “area chair” – a person who is in charge of all
submission to that “area”
• “Area chairs” distribute the submitted papers to reviewers, who will
eventually decide accept/reject

More papers are good but…
• It means we need more reviewers
• However, the total number of reviewers is limited
• A lot of bad quality reviewers are present

= Diamond sponsors of top-tier conferences

Ilya Sutskever
• Research scientist @OpenAI
• Salary - $1.9 million

Ian Goodfellow
• Research scientist @ Google brain
• Salary - $800,000

Number of top-tier conference papers are
increasing!
• CVPR 2017, 2018 -> More than 20 papers accepted
• NIPS 2018 -> 20 papers accepted (~5 in NIPS 2017)
• SNU ECE: 3
• SNU CS: 1
• SNU 융기원: 2
• KAIST EE: 5
• KAIST CS: 3
• Yonsei EE: 1
• Yonsei CS: 1
• Kakao brain: 1
• NAVER: 1
• SK T-brain: 1
• Lunit: 1

Disparity
• Finding disparity leads to depth estimation in stereo vision
• In order to find disparity, we need rectified stereo pairs
• Matching points should be on the ‘same’ horizontal line

Optical flow
• Optical flow is a more general problem, where we want to find
individual movements of pixels

Optical flow -> Disparity
• A 90s paper

Disparity -> Optical flow
• Optical flow training data is hard to find
• In many cases synthetic data is used
• Could we use pre-trained models for disparity estimation to find
optical flow?

Disparity -> Optical flow
• Input: Image pair
• Output: Optical flow
• Novelty: Obtain optical flow estimation using networks trained on
stereo disparity estimation
• Method overview
• Pre-train a neural network(DispNet-Horizontal) on stereo disparity data
• Using basic rotation operations, obtain another neural network(DispNet-
Vertical) on rotated stereo disparity data
• Train a domain transfer network to map optical flow target images to stereo
images(automatic rectification)
• Fine-tune domain transfer network using optical flow ground truth

Disparity -> Optical flow (initial version)

Disparity -> Optical flow (new version)

Disparity -> Optical flow(TODO)
• Pre-train dispnet
• Download optical flow dataset
• Test adequate volume for discriminator
• Implement filter rotation

Deep learning trends

More Related Content

Similar to Deep learning trends (20)

Recently uploaded (20)

Deep learning trends