SlideShare a Scribd company logo
Visual Communications and
                        Image Processing 2006
                          SPIE paper 6077-8




        Resolution enhancement of
        low-quality videos using a
          high-resolution frame
                            Tuan Pham 1
                           Lucas van Vliet 1
                           Klamer Schutte 2

1. Quantitative Imaging Group, Delft University of Technology, The Netherlands
        2. TNO Physics and Electronics Laboratory, The Netherlands
Contents


   1. Influence of compression on Super-Resolution


   2. Example-based SR in the DCT domain


   3. Application in video compression




© Tuan Pham (pham@tnw.tudelft.nl)                    2
Super-Resolution reconstruction




 •Problems with compressed inputs:
   – Registration: less accurate motion vectors due to compression error

   – Fusion: outlier intensities due to ringing and blocking artifacts

   – Deblur: distorted and truncated frequency spectrum due to
   quantization

© Tuan Pham (pham@tnw.tudelft.nl)                                          3
Discrete Cosine Transform

 •Each 8x8 image block is projected onto 64 orthogonal bases
   → 64 DCT coefficients per block




     Input image                64 DCT basis functions   DCT coefficients


 •The coefficients are then quantized (lossy) & compressed (lossless)

© Tuan Pham (pham@tnw.tudelft.nl)                                           4
Quantization noise and blur




              JPEG quality 80                                                 Quantization noise

                                                                  8




                                          MSE of all 8x8 blocks
                                                                  7

                                                                  6

                                                                  5

                                                                  4
                                                                  8
                                                                      6                                 8
                                                                               4                    6
                                                                          y                 4
                                                                                   2   2        x

       Average DCT attenuation factor   Average error variance of 8x8 blocks
© Tuan Pham (pham@tnw.tudelft.nl)                                                                           5
SR by texture synthesis

   •Example-based SR1 aims to fill-in the missing frequencies:



                                                            Hi-Res texture
                                                                source

                       Interpolation
                           or
                       Multi-frame                             Texture
                         fusion                                transfer
     Lo-Res compressed                   Hi-Res blurred                      Super-Resolved
         input video                   intermediate video                     output video


  •We propose a DCT-domain SR synthesis algorithm which
can be applied directly on the compressed video stream.

© Tuan Pham al., “Example-based Super-Resolution”, IEEE Comp.Graph. & App., 2001
1. Freeman et (pham@tnw.tudelft.nl)                                                           6
Resize versus SR in DCT domain

   •Image resize1 in DCT domain:
    – zero filling of high-frequency content followed by transcoding


                                     8x8                                8x8        8x8
                                     DCT         0                      DCT        DCT
                       enlarge                           transcode
            8x8
            DCT
                       0 fill-in
                                                                        8x8        8x8
                                       0         0                      DCT        DCT
         LR DCT

                                       16x16 DCT                          2x HR DCT


   •Super-resolution in DCT domain:
    – fill-in plausible high-frequency coefficients instead of zero-ing


© Tuan Phamand Mitra, “Image resizing in the compressed domain using subband DCT”, CSVT, 2002 7
1. Mukherjee (pham@tnw.tudelft.nl)
Example-based SR in DCT domain

•Block-wise SR synthesis in a raster-scan order:
 – Correspondence constraint: low-freq. coefficients of LR - HR blocks should match
 – Spatial continuity constraint: boundary pixels of adjacent HR blocks should match

     8x8 quantized DCT                                                           SR decoded image

                             AC


                      DC      MeanAbs + ε
                                                                   ×α


                                            15 LR DCTs   33 overlapping pixels

                                                                                       +
      Training data
                                                                                       ×
                                                         ÷
                                                   Best Match



© Tuan Pham (pham@tnw.tudelft.nl)                                                                   8
DCT-domain SR vs spatial domain SR



                                    +
     Texture source: CIF frame 0        Input QCIF frame 20 (JPEG quality 80)




     Example-based SR (Freeman)
© Tuan Pham (pham@tnw.tudelft.nl)             DCT-domain SR synthesis      9
DCT-domain SR vs spatial domain SR



                                       +
     Texture source: CIF frame 0           Input QCIF frame 50 (JPEG quality 80)




© Tuan Linear Embedding SR (CVPR’04)
Local Pham (pham@tnw.tudelft.nl)                      DCT-domain SR          10
Importance of a good texture source



                           +
 input (JPEG quality 50)      source from 30-frame away     wrong source




       QCIF input
© Tuan Pham (pham@tnw.tudelft.nl)   good but not striking   poor SR result   11
SR using updated HR frames

 •LR compressed video + a regularly updated HR source




     QCIF input at JPEG quality 80   CIF output using 1 HR frame / sec

 •Application: video coding for devices with limited resources
© Tuan Pham (pham@tnw.tudelft.nl)                                        12
Conclusions

 •SR reconstruction of compressed image is difficult because of:
   – truncated frequency spectrum due to heavy quantization

   – space-variant quantization noise


 •DCT-domain SR is better than spatial domain SR because it:
   – produces better results

   – is more efficient


 •Application of SR synthesis using a HR frame:
   – video compression for systems with limited resources (phones, cameras)


© Tuan Pham (pham@tnw.tudelft.nl)                                     13
Thank you !


                               Tuan Pham


                      pham@tnw.tudelft.nl


© Tuan Pham (pham@tnw.tudelft.nl)           14

More Related Content

PPTX
Broadcast in Hypercube
PDF
Case Study: PTP over WAN
PPT
Motion estimation overview
PDF
Available network bandwidth schema to improve performance in tcp protocols
PDF
Solution(1)
PPTX
Communication costs in parallel machines
PPT
Chap4 slides
PDF
Basic communication operations - One to all Broadcast
Broadcast in Hypercube
Case Study: PTP over WAN
Motion estimation overview
Available network bandwidth schema to improve performance in tcp protocols
Solution(1)
Communication costs in parallel machines
Chap4 slides
Basic communication operations - One to all Broadcast

What's hot (12)

PDF
Asilomar 2011
PPT
Week5 chap6
PPTX
CloudMC: A cloud computing map-reduce implementation for radiotherapy. RUBEN ...
PPTX
Dcp project
PDF
SpeedIT : GPU-based acceleration of sparse linear algebra
PDF
Wilmott Nyc Jul2012 Nag Talk John Holden
PDF
Ofdm sim-matlab-code-tutorial web for EE students
PPT
Simulating communication systems with MATLAB: An introduction
PPTX
Modern processors
PDF
ALGORITHMS FOR PACKET ROUTING IN SWITCHING NETWORKS WITH RECONFIGURATION OVER...
PDF
Cloudy with a Touch of Cheminformatics
Asilomar 2011
Week5 chap6
CloudMC: A cloud computing map-reduce implementation for radiotherapy. RUBEN ...
Dcp project
SpeedIT : GPU-based acceleration of sparse linear algebra
Wilmott Nyc Jul2012 Nag Talk John Holden
Ofdm sim-matlab-code-tutorial web for EE students
Simulating communication systems with MATLAB: An introduction
Modern processors
ALGORITHMS FOR PACKET ROUTING IN SWITCHING NETWORKS WITH RECONFIGURATION OVER...
Cloudy with a Touch of Cheminformatics
Ad

Viewers also liked (10)

PDF
Improved license plate recognition for low resolution cctv forensics by integ...
PPTX
Face Recognition
ODP
Human Recognition in Video
PPTX
Face recognition tech1
PDF
Image transforms
PPT
Image trnsformations
PDF
03 image transform
PPTX
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
PPTX
Face Recognition
PPT
Face recognition ppt
Improved license plate recognition for low resolution cctv forensics by integ...
Face Recognition
Human Recognition in Video
Face recognition tech1
Image transforms
Image trnsformations
03 image transform
Comparison between JPEG(DCT) and JPEG 2000(DWT) compression standards
Face Recognition
Face recognition ppt
Ad

Similar to Resolution enhancement of low-quality videos using a high-resolution frame (20)

PDF
CyberSec_JPEGcompressionForensics.pdf
PDF
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
PPTX
Cross-Layer Design of Raptor Codes for Video Multicast over 802.11n MIMO Chan...
PPT
robust image watermarking
PDF
“Meeting the Critical Needs of Accuracy, Performance and Adaptability in Embe...
PDF
Using Many-Core Processors to Improve the Performance of Space Computing Plat...
PPTX
Design and implementation of DADCT
PDF
Online learning for low-latency streaming
PPT
Bsspar1 s14 chapter 02_radio_resource_administration_v1.1
PPTX
Video Compression Basics by sahil jain
PPT
Fast Resilient Jumbo Frames in Wireless LANs
PDF
Slides of talk
PDF
PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
PDF
Discrete cosine transform
PDF
Discrete cosine transform
PDF
Discrete cosine transform
PDF
In datacenter performance analysis of a tensor processing unit
PPT
Introduction to Video Compression Techniques - Anurag Jain
PPTX
Jeff Johnson, Research Engineer, Facebook at MLconf NYC
CyberSec_JPEGcompressionForensics.pdf
Yufeng Guo - Tensor Processing Units: how TPUs enable the next generation of ...
Cross-Layer Design of Raptor Codes for Video Multicast over 802.11n MIMO Chan...
robust image watermarking
“Meeting the Critical Needs of Accuracy, Performance and Adaptability in Embe...
Using Many-Core Processors to Improve the Performance of Space Computing Plat...
Design and implementation of DADCT
Online learning for low-latency streaming
Bsspar1 s14 chapter 02_radio_resource_administration_v1.1
Video Compression Basics by sahil jain
Fast Resilient Jumbo Frames in Wireless LANs
Slides of talk
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
Discrete cosine transform
Discrete cosine transform
Discrete cosine transform
In datacenter performance analysis of a tensor processing unit
Introduction to Video Compression Techniques - Anurag Jain
Jeff Johnson, Research Engineer, Facebook at MLconf NYC

More from Tuan Q. Pham (15)

PDF
Oral presentation on Asymmetric recursive Gaussian filtering for space-varia...
PDF
Asymmetric recursive Gaussian filtering for space-variant artificial bokeh
PPTX
Parallel implementation of geodesic distance transform with application in su...
PDF
Parallel implementation of geodesic distance transform with application in su...
PDF
Multi-hypothesis projection-based shift estimation for sweeping panorama reco...
PDF
Multi-hypothesis projection-based shift estimation for sweeping panorama reco...
PPT
Non-maximum suppression using fewer than two comparison per pixels
PDF
Paper fingerprinting using alpha-masked image matching
PDF
Paper fingerprinting using alpha-masked image matching
PDF
Bidirectional bias correction for gradient-based shift estimation
PDF
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
PPT
Separable bilateral filtering for fast video preprocessing
PDF
Performance of Optimal Registration Estimator
PDF
Influence of Signal-to-Noise Ratio and Point Spread Function on Limits of Sup...
PPT
Normalized averaging using adaptive applicability functions with applications...
Oral presentation on Asymmetric recursive Gaussian filtering for space-varia...
Asymmetric recursive Gaussian filtering for space-variant artificial bokeh
Parallel implementation of geodesic distance transform with application in su...
Parallel implementation of geodesic distance transform with application in su...
Multi-hypothesis projection-based shift estimation for sweeping panorama reco...
Multi-hypothesis projection-based shift estimation for sweeping panorama reco...
Non-maximum suppression using fewer than two comparison per pixels
Paper fingerprinting using alpha-masked image matching
Paper fingerprinting using alpha-masked image matching
Bidirectional bias correction for gradient-based shift estimation
Robust Super-Resolution by minimizing a Gaussian-weighted L2 error norm
Separable bilateral filtering for fast video preprocessing
Performance of Optimal Registration Estimator
Influence of Signal-to-Noise Ratio and Point Spread Function on Limits of Sup...
Normalized averaging using adaptive applicability functions with applications...

Recently uploaded (20)

PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
KodekX | Application Modernization Development
PDF
Electronic commerce courselecture one. Pdf
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
MIND Revenue Release Quarter 2 2025 Press Release
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
The AUB Centre for AI in Media Proposal.docx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
KodekX | Application Modernization Development
Electronic commerce courselecture one. Pdf
sap open course for s4hana steps from ECC to s4
Per capita expenditure prediction using model stacking based on satellite ima...
Reach Out and Touch Someone: Haptics and Empathic Computing
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Programs and apps: productivity, graphics, security and other tools
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
Review of recent advances in non-invasive hemoglobin estimation
Network Security Unit 5.pdf for BCA BBA.
Understanding_Digital_Forensics_Presentation.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Encapsulation_ Review paper, used for researhc scholars

Resolution enhancement of low-quality videos using a high-resolution frame

  • 1. Visual Communications and Image Processing 2006 SPIE paper 6077-8 Resolution enhancement of low-quality videos using a high-resolution frame Tuan Pham 1 Lucas van Vliet 1 Klamer Schutte 2 1. Quantitative Imaging Group, Delft University of Technology, The Netherlands 2. TNO Physics and Electronics Laboratory, The Netherlands
  • 2. Contents 1. Influence of compression on Super-Resolution 2. Example-based SR in the DCT domain 3. Application in video compression © Tuan Pham (pham@tnw.tudelft.nl) 2
  • 3. Super-Resolution reconstruction •Problems with compressed inputs: – Registration: less accurate motion vectors due to compression error – Fusion: outlier intensities due to ringing and blocking artifacts – Deblur: distorted and truncated frequency spectrum due to quantization © Tuan Pham (pham@tnw.tudelft.nl) 3
  • 4. Discrete Cosine Transform •Each 8x8 image block is projected onto 64 orthogonal bases → 64 DCT coefficients per block Input image 64 DCT basis functions DCT coefficients •The coefficients are then quantized (lossy) & compressed (lossless) © Tuan Pham (pham@tnw.tudelft.nl) 4
  • 5. Quantization noise and blur JPEG quality 80 Quantization noise 8 MSE of all 8x8 blocks 7 6 5 4 8 6 8 4 6 y 4 2 2 x Average DCT attenuation factor Average error variance of 8x8 blocks © Tuan Pham (pham@tnw.tudelft.nl) 5
  • 6. SR by texture synthesis •Example-based SR1 aims to fill-in the missing frequencies: Hi-Res texture source Interpolation or Multi-frame Texture fusion transfer Lo-Res compressed Hi-Res blurred Super-Resolved input video intermediate video output video •We propose a DCT-domain SR synthesis algorithm which can be applied directly on the compressed video stream. © Tuan Pham al., “Example-based Super-Resolution”, IEEE Comp.Graph. & App., 2001 1. Freeman et (pham@tnw.tudelft.nl) 6
  • 7. Resize versus SR in DCT domain •Image resize1 in DCT domain: – zero filling of high-frequency content followed by transcoding 8x8 8x8 8x8 DCT 0 DCT DCT enlarge transcode 8x8 DCT 0 fill-in 8x8 8x8 0 0 DCT DCT LR DCT 16x16 DCT 2x HR DCT •Super-resolution in DCT domain: – fill-in plausible high-frequency coefficients instead of zero-ing © Tuan Phamand Mitra, “Image resizing in the compressed domain using subband DCT”, CSVT, 2002 7 1. Mukherjee (pham@tnw.tudelft.nl)
  • 8. Example-based SR in DCT domain •Block-wise SR synthesis in a raster-scan order: – Correspondence constraint: low-freq. coefficients of LR - HR blocks should match – Spatial continuity constraint: boundary pixels of adjacent HR blocks should match 8x8 quantized DCT SR decoded image AC DC MeanAbs + ε ×α 15 LR DCTs 33 overlapping pixels + Training data × ÷ Best Match © Tuan Pham (pham@tnw.tudelft.nl) 8
  • 9. DCT-domain SR vs spatial domain SR + Texture source: CIF frame 0 Input QCIF frame 20 (JPEG quality 80) Example-based SR (Freeman) © Tuan Pham (pham@tnw.tudelft.nl) DCT-domain SR synthesis 9
  • 10. DCT-domain SR vs spatial domain SR + Texture source: CIF frame 0 Input QCIF frame 50 (JPEG quality 80) © Tuan Linear Embedding SR (CVPR’04) Local Pham (pham@tnw.tudelft.nl) DCT-domain SR 10
  • 11. Importance of a good texture source + input (JPEG quality 50) source from 30-frame away wrong source QCIF input © Tuan Pham (pham@tnw.tudelft.nl) good but not striking poor SR result 11
  • 12. SR using updated HR frames •LR compressed video + a regularly updated HR source QCIF input at JPEG quality 80 CIF output using 1 HR frame / sec •Application: video coding for devices with limited resources © Tuan Pham (pham@tnw.tudelft.nl) 12
  • 13. Conclusions •SR reconstruction of compressed image is difficult because of: – truncated frequency spectrum due to heavy quantization – space-variant quantization noise •DCT-domain SR is better than spatial domain SR because it: – produces better results – is more efficient •Application of SR synthesis using a HR frame: – video compression for systems with limited resources (phones, cameras) © Tuan Pham (pham@tnw.tudelft.nl) 13
  • 14. Thank you ! Tuan Pham pham@tnw.tudelft.nl © Tuan Pham (pham@tnw.tudelft.nl) 14