Region-of-Interest
Advanced Video Coding
      IMEC
     Inventor: Jiangbo Lu
     Presenter: Gauthier Lafruit
RoI-AVC = Region-of-Interest Advanced Video Coding
For stationary camera video applications:
(e.g., video conference, video surveillance, news broadcast)
• Foreground moving objects of crucial interest      RoI for smart video processing
• RoiAVC straddling computer vision & video coding            a joint optimized design
• Battery powered cameras for low bandwidth scenarios             encoding efficiency
                                                                  & complexity crucial
              Frame-based                         Object -based




RoiAVC                           RoI-based


  A practical semantic video codec with the coding efficiency
  and complexity advantages over state-of-art H.264/AVC
  Striking a sweet spot between frame-based video coding paradigm
  and object-based video coding paradigm
  Powered by our key competence in fast reliable RoI detection and
  coding schemes
                                                                                     2
Outline: RoI-AVC framework and strength
        A joint optimized design bridging the two worlds




             Vision world                                                    Video world

             Multi-scale motion             RoI bounding -           H.264 video            H.264 video
              RoI detection                 box generation            encoder                decoder

                                                     Metadata of
                                                    Bounding-boxes




   Previous frame           Current frame                                                  Reconstructed frame


• Avoiding the initial background                            • Up to 34% bit-rate saving @
training and online updating                                 similar quality over H.264/AVC
• Reliable motion RoI detection                              • 2.x to 3.x faster (including RoI
• 20 fps @ 352x288 w/o manual                                overhead) than H.264 reference
optimization on Intel Pentium 4                              encoder, similar for the decoder
                                                                                                                 3
                                       To appear in IEEE ICASSP 2007
Vision world: multi-scale motion RoI detection


                         Multi-scale motion RoI detection

     Previous frame     Pixel-
                        Pixel - level        Region-
                                             Region-level
                        processing           processing


                                                             Detected motion RoI




     Current frame




Multi-scale motion RoI detection
 Multi-scale structural change aggregation as the key contribution
 An integrated fast and reliable motion RoI detection approach
 Directly applied to two successive video frames w/o a BG model
 Robust to flicking lighting and camera noise, and less sensitive to the
 thresholds


                                                                                   4
Multi-scale motion RoI detection: flowchart
                                                                                  Region-
                                                                                  Region-level processing


Pixel-
Pixel-level processing




      Median filter          Multi-scale decomp .       Multi-scale structural
                                                        change aggregation

                 Laplacian operator                                                  Bounding-box
                                       Morphological closing                     generation & extending
                                                                                                          Fast motion pixel
                                                                                   Size-based noisy          clustering
                                                                                   changes culling
                                                                                                  Optimized connected
                                                                                                  component analysis



                                                                                                                              5
Video world: flexible MB-based H.264/AVC coding
                                                          Flexible MB-based H.264/AVC codec

                                                       Flexible MB-based          Flexible MB-based
                                                         H.264 encoder              H.264 decoder

          Detected motion RoI                                                                          Reconstructed frame




                                                               Metadata of
                                                              Bounding-boxes    MB-based RoI coding

Flexible organization of MBs

                 16 17 18
                                                                               Flexible MB-based codec:
  1   2   3      19 20 21                     1   2   3   4                     Largely reduced coding bit-rate and
  4   5   6      22 23 24                     5   6   7   8
  7   8   9                                   9 10 11 12                        complexity
 10 11 12
 13 14 15
                                             13 14 15 16 21 22
                                             17 18 19 20 23 24
                                                                                Data locality-preserving ordering
                                                      25 26 27 28
                                                                                w/o changing MB-based pipeline
m MB of 1st Motion RoI      n MB of 2 nd Motion RoI        MB of background
                                                                                Could be fully compliant to AVC
                                                                                                                             6
Compared to the prior methods from different worlds



Video
world



         Current input frame     Results of [CSVT’01]      Results of [MM’01]           Our results




Vision
world




         Current input frame   Gaussian hypothesis test The single-scale variant   Our multi-scale scheme
                                                                                                            7
Video demo 1: multi-scale motion RoI detection
Ballet @ 1024 x 768 from MSR camera-4




  Detected motion blobs           Bounding-boxes superimposed
                                    upon the original frames


                                                                8
Video demo 2: perceptual quality of RoI-AVC



 Indoor
monitoring




  News
broadcast




             Original video sequences   Reconstructed video sequences   9

More Related Content

PPTX
Scalable Video Coding in Content-Aware Networks
PDF
Was ist neu in Exchange 2013?
PDF
Storage Performance Takes Off
PPTX
Extraction of region of interest in an image
PPT
Region Of Interest Extraction
PPTX
Dmitry Stepanov - Detector of interest point from region of interest on NBI ...
PDF
MetaAnalysis of Multimedia Transmission quality improvements in Wireless Netw...
PPTX
Robust region of interest determination based on user attention model through...
Scalable Video Coding in Content-Aware Networks
Was ist neu in Exchange 2013?
Storage Performance Takes Off
Extraction of region of interest in an image
Region Of Interest Extraction
Dmitry Stepanov - Detector of interest point from region of interest on NBI ...
MetaAnalysis of Multimedia Transmission quality improvements in Wireless Netw...
Robust region of interest determination based on user attention model through...

Viewers also liked (20)

PPT
How Does Multimedia Enhance The Use Of Information System In Organisations
PDF
DYNAMIC REGION OF INTEREST TRANSCODING FOR MULTIPOINT VIDEO ...
KEY
Privacy protection of visual information
PPTX
Mw2012 eyetracking
PDF
Scrambling For Video Surveillance
PPTX
Pros and Cons of Eyetracking
PDF
ICME 2016 - Tutorial on Interactive Search in Video & Lifelog Repositories
PDF
Video Forgery Detection: Literature review
PDF
Eye Tracking & Consumer Behavior
PDF
Eye tracking and its economic feasibility
PPTX
An eye tracker analysis of the influence of applicant attractiveness on emplo...
PPTX
Interactive Video Search - Tutorial at ACM Multimedia 2015
PPTX
Image Processing Based Signature Recognition and Verification Technique Using...
PDF
Immersive Telepresence
 
PPT
Multimedia networking
PPT
Eye-tracking presentation
PPT
Theeye tribe, it s a eye tracking device which makes the usage of PC, laptops...
PPTX
Eye Tracking & Design
PDF
Svcc12 designparternship
PDF
Break out: Collaboration tools - Kris Naessens
How Does Multimedia Enhance The Use Of Information System In Organisations
DYNAMIC REGION OF INTEREST TRANSCODING FOR MULTIPOINT VIDEO ...
Privacy protection of visual information
Mw2012 eyetracking
Scrambling For Video Surveillance
Pros and Cons of Eyetracking
ICME 2016 - Tutorial on Interactive Search in Video & Lifelog Repositories
Video Forgery Detection: Literature review
Eye Tracking & Consumer Behavior
Eye tracking and its economic feasibility
An eye tracker analysis of the influence of applicant attractiveness on emplo...
Interactive Video Search - Tutorial at ACM Multimedia 2015
Image Processing Based Signature Recognition and Verification Technique Using...
Immersive Telepresence
 
Multimedia networking
Eye-tracking presentation
Theeye tribe, it s a eye tracking device which makes the usage of PC, laptops...
Eye Tracking & Design
Svcc12 designparternship
Break out: Collaboration tools - Kris Naessens
Ad

Similar to Workshopvin4 Region Of Interest Advanced Video Coding (20)

PDF
Emerging H.264 Standard: Overview and TMS320DM642- Based ...
PPT
H 264 in cuda presentation
PDF
Emerging H.264 Standard:
PDF
The H.264 Video Compression Standard
PDF
Ijctt v7 p110
PDF
ORUSSI: Optimal Road sUrveillance based on Scalable vIdeo
PDF
H.264 Library
PDF
Video Compression Algorithm Based on Frame Difference Approaches
PPT
Introduction to Video Compression Techniques - Anurag Jain
PDF
2008 brokerage 04 smart vision system [compatibility mode]
PDF
2008 brokerage 04 smart vision system [compatibility mode]
PDF
PDF
Jpeg2000
PDF
Gv2512441247
PDF
Gv2512441247
PPTX
Multimedia basic video compression techniques
PDF
Gd3111841188
PPTX
Generic Video Adaptation Framework Towards Content – and Context Awareness in...
PDF
Scanned document compression using block based hybrid video codec
PDF
Scanned document compression using block based hybrid video codec
Emerging H.264 Standard: Overview and TMS320DM642- Based ...
H 264 in cuda presentation
Emerging H.264 Standard:
The H.264 Video Compression Standard
Ijctt v7 p110
ORUSSI: Optimal Road sUrveillance based on Scalable vIdeo
H.264 Library
Video Compression Algorithm Based on Frame Difference Approaches
Introduction to Video Compression Techniques - Anurag Jain
2008 brokerage 04 smart vision system [compatibility mode]
2008 brokerage 04 smart vision system [compatibility mode]
Jpeg2000
Gv2512441247
Gv2512441247
Multimedia basic video compression techniques
Gd3111841188
Generic Video Adaptation Framework Towards Content – and Context Awareness in...
Scanned document compression using block based hybrid video codec
Scanned document compression using block based hybrid video codec
Ad

More from imec.archive (20)

PDF
iMinds-iLab.o, Open Innovation in ICT
PDF
Accio presentation closing event
PPTX
PRoF+ Patient Room of the Future
PPTX
Results of the Apollon pilot in homecare and independent living
PPTX
Delivery of feedback on Health, Home Security and Home Energy in Aware Homes ...
PDF
NMMU-Emmanuel Haven Living Lab
PDF
The Humanicité workshops
PPTX
A Real-World Experimentation Platform
PDF
ENoLL @ AAL Forum 2012
PDF
ENoLL 6th Wave Results Ceremony (Jesse Marsh)
PDF
The Connected Smart Cities Network and Living Labs - Towards Horizon 2020 - K...
PDF
Apollon-23/05/2012-9u30- Parallell session: Living Labs added value
PPT
Apollon - 22/5/12 - 11:30 - Local SME's - Innovating Across borders
PPT
Apollon - 22/5/12 - 16:00 - Smart Open Cities and the Future Internet
PPT
Apollon - 22/5/12 - 16:00 - Smart Open Cities and the Future Internet
PPT
Apollon - 22/5/12 - 16:00 - Smart Open Cities and the Future Internet
PPTX
Apollon - 22/5/12 - 16:00 - Smart Open Cities and the Future Internet
PPTX
Apollon - 22/5/12 - 11:30 - Local SME's - Innovating Across borders
PPTX
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems
PPT
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems
iMinds-iLab.o, Open Innovation in ICT
Accio presentation closing event
PRoF+ Patient Room of the Future
Results of the Apollon pilot in homecare and independent living
Delivery of feedback on Health, Home Security and Home Energy in Aware Homes ...
NMMU-Emmanuel Haven Living Lab
The Humanicité workshops
A Real-World Experimentation Platform
ENoLL @ AAL Forum 2012
ENoLL 6th Wave Results Ceremony (Jesse Marsh)
The Connected Smart Cities Network and Living Labs - Towards Horizon 2020 - K...
Apollon-23/05/2012-9u30- Parallell session: Living Labs added value
Apollon - 22/5/12 - 11:30 - Local SME's - Innovating Across borders
Apollon - 22/5/12 - 16:00 - Smart Open Cities and the Future Internet
Apollon - 22/5/12 - 16:00 - Smart Open Cities and the Future Internet
Apollon - 22/5/12 - 16:00 - Smart Open Cities and the Future Internet
Apollon - 22/5/12 - 16:00 - Smart Open Cities and the Future Internet
Apollon - 22/5/12 - 11:30 - Local SME's - Innovating Across borders
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems

Workshopvin4 Region Of Interest Advanced Video Coding

  • 1. Region-of-Interest Advanced Video Coding IMEC Inventor: Jiangbo Lu Presenter: Gauthier Lafruit
  • 2. RoI-AVC = Region-of-Interest Advanced Video Coding For stationary camera video applications: (e.g., video conference, video surveillance, news broadcast) • Foreground moving objects of crucial interest RoI for smart video processing • RoiAVC straddling computer vision & video coding a joint optimized design • Battery powered cameras for low bandwidth scenarios encoding efficiency & complexity crucial Frame-based Object -based RoiAVC RoI-based A practical semantic video codec with the coding efficiency and complexity advantages over state-of-art H.264/AVC Striking a sweet spot between frame-based video coding paradigm and object-based video coding paradigm Powered by our key competence in fast reliable RoI detection and coding schemes 2
  • 3. Outline: RoI-AVC framework and strength A joint optimized design bridging the two worlds Vision world Video world Multi-scale motion RoI bounding - H.264 video H.264 video RoI detection box generation encoder decoder Metadata of Bounding-boxes Previous frame Current frame Reconstructed frame • Avoiding the initial background • Up to 34% bit-rate saving @ training and online updating similar quality over H.264/AVC • Reliable motion RoI detection • 2.x to 3.x faster (including RoI • 20 fps @ 352x288 w/o manual overhead) than H.264 reference optimization on Intel Pentium 4 encoder, similar for the decoder 3 To appear in IEEE ICASSP 2007
  • 4. Vision world: multi-scale motion RoI detection Multi-scale motion RoI detection Previous frame Pixel- Pixel - level Region- Region-level processing processing Detected motion RoI Current frame Multi-scale motion RoI detection Multi-scale structural change aggregation as the key contribution An integrated fast and reliable motion RoI detection approach Directly applied to two successive video frames w/o a BG model Robust to flicking lighting and camera noise, and less sensitive to the thresholds 4
  • 5. Multi-scale motion RoI detection: flowchart Region- Region-level processing Pixel- Pixel-level processing Median filter Multi-scale decomp . Multi-scale structural change aggregation Laplacian operator Bounding-box Morphological closing generation & extending Fast motion pixel Size-based noisy clustering changes culling Optimized connected component analysis 5
  • 6. Video world: flexible MB-based H.264/AVC coding Flexible MB-based H.264/AVC codec Flexible MB-based Flexible MB-based H.264 encoder H.264 decoder Detected motion RoI Reconstructed frame Metadata of Bounding-boxes MB-based RoI coding Flexible organization of MBs 16 17 18 Flexible MB-based codec: 1 2 3 19 20 21 1 2 3 4 Largely reduced coding bit-rate and 4 5 6 22 23 24 5 6 7 8 7 8 9 9 10 11 12 complexity 10 11 12 13 14 15 13 14 15 16 21 22 17 18 19 20 23 24 Data locality-preserving ordering 25 26 27 28 w/o changing MB-based pipeline m MB of 1st Motion RoI n MB of 2 nd Motion RoI MB of background Could be fully compliant to AVC 6
  • 7. Compared to the prior methods from different worlds Video world Current input frame Results of [CSVT’01] Results of [MM’01] Our results Vision world Current input frame Gaussian hypothesis test The single-scale variant Our multi-scale scheme 7
  • 8. Video demo 1: multi-scale motion RoI detection Ballet @ 1024 x 768 from MSR camera-4 Detected motion blobs Bounding-boxes superimposed upon the original frames 8
  • 9. Video demo 2: perceptual quality of RoI-AVC Indoor monitoring News broadcast Original video sequences Reconstructed video sequences 9