SlideShare a Scribd company logo
Video Compression Demystified Intro to MPEG, WMV, H.264, AVI
Why compression? Bandwidth constraint Space constraint I don’t know what else?
Basic idea Correlation (Entropy) Space, Time, Bits Human eye models Low-pass filter Brain ‘fills’ in for sharpness Frugality
Pixels and frames Each picture is a set of pixels in two dimensions, third being time. Finite sharpness, resolution issue. Each group of 8 pixels = block 2x2 blocks = Macroblock
Correlation in space Space is ‘gravitated’ towards being continuous.  How do you exploit it? DCT inside a block AC-DC prediction across blocks
Correlation in Time The WHOLE image does not change. Do NOT repeat information. Motion estimation Locate where my marker is moving towards. Send only the coded difference. Bi-predict pictures.
Correlation in bits Run-level-last coding for zeroes. Subsequent bits still have correlation in them. Huffman coding (VLC) Arithmetic coding (CABAC)
Human eye Eye is not responsive to every detail Quantization Smoothening filters Eye is more responsive to brightness  Less effort of chrominance Brain fills in crucial details.
DCT and quantization Random in Time domain is high freq. Spatial redundancy reduced to a few significant values. Zigzag coding + quantization  Focus on lower freq and suppress high freq.
Motion Estimation Find the best match around – ME Diamond, Hierarchical, Logarithmic Generally over a block. Half-pel and quarter pel motion estimation using filters. Better match in sub-pel level.
RLL and Huffman Long sequence of zeroes helps reducing to smaller chunks of data. This chunk also has a probabilistic models. Exploit probability of symbols and be frugal in bits. Context adaptive arithmetic coding.
MPEG MPEG-1: DCT Pixel level ME AC-DC prediction P pictures Huffman coding
MPEG-4 DCT Quarter-pel ME B pictures AC-DC prediction Data partitioning Objects – layers, groups, streams
H.264 Integer transform on 4x4 block Quarter pel ME on 4x4  In-loop de-blocking Multiple reference pictures CABAC – 2 times frugal Bit-plane coding
AVI Audio Video Interleave format One frame of video with equivalent audio frame(s) Necessary information Display info Shit

More Related Content

PPTX
Illustrative Introductory CNN
PPTX
Deeplab
PPTX
WaveNet
PDF
cbs_sips2005
PPTX
Introduction to Segmentation in Computer vision
PDF
Multidimensional RNN
PPTX
Human parsing
PPT
Motion estimation overview
Illustrative Introductory CNN
Deeplab
WaveNet
cbs_sips2005
Introduction to Segmentation in Computer vision
Multidimensional RNN
Human parsing
Motion estimation overview

What's hot (15)

PDF
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
PDF
(slides 3) Visual Computing: Geometry, Graphics, and Vision
PPT
4 4 bopardikar_concealment
PDF
Object Detection Using R-CNN Deep Learning Framework
PPTX
Review on cs231 part-2
PDF
Pr045 deep lab_semantic_segmentation
PPTX
Unit 2 character concept
PDF
HRNET : Deep High-Resolution Representation Learning for Human Pose Estimation
PDF
Alberto Massidda - Images and words: mechanics of automated captioning with n...
PDF
P03 neural networks cvpr2012 deep learning methods for vision
PDF
Building and road detection from large aerial imagery
PDF
Image captioning with Keras and Tensorflow - Debarko De @ Practo
DOCX
video comparison
PPTX
Image captioning
PDF
ViT (Vision Transformer) Review [CDM]
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
(slides 3) Visual Computing: Geometry, Graphics, and Vision
4 4 bopardikar_concealment
Object Detection Using R-CNN Deep Learning Framework
Review on cs231 part-2
Pr045 deep lab_semantic_segmentation
Unit 2 character concept
HRNET : Deep High-Resolution Representation Learning for Human Pose Estimation
Alberto Massidda - Images and words: mechanics of automated captioning with n...
P03 neural networks cvpr2012 deep learning methods for vision
Building and road detection from large aerial imagery
Image captioning with Keras and Tensorflow - Debarko De @ Practo
video comparison
Image captioning
ViT (Vision Transformer) Review [CDM]
Ad

Viewers also liked (18)

PDF
Hw3 0972552
ODP
PPTX
Image compression introductory presentation
PPT
Image compression jpeg coding standards
PPTX
Jpeg compression
PPTX
MPEG video compression standard
PPT
Video Compression Basics - MPEG2
PPTX
video compression techique
PDF
Compression: Images (JPEG)
PPTX
Image Compression
PPT
Image compression
PPT
Image processing
ODP
image compression ppt
PPT
Compression
PPTX
JPEG Image Compression
PPTX
Image processing ppt
PDF
Video Compression Basics
PPT
Digital Image Processing
Hw3 0972552
Image compression introductory presentation
Image compression jpeg coding standards
Jpeg compression
MPEG video compression standard
Video Compression Basics - MPEG2
video compression techique
Compression: Images (JPEG)
Image Compression
Image compression
Image processing
image compression ppt
Compression
JPEG Image Compression
Image processing ppt
Video Compression Basics
Digital Image Processing
Ad

Similar to Video Compression Demystified - Intro to MPEG, AVI, WMV (20)

PPT
Introduction to Video Compression Techniques - Anurag Jain
PPT
Video00.ppt
PPT
Multimedia Presentation
PPT
Digital Video 101.ppt
ODP
MPEG-1 Part 2 Video Encoding
PDF
Encoding Video for the Web - Webinar from ReelSEO.com
PPT
H263.ppt
PPT
mpeg4copy-120428133000-phpapp01.ppt
PDF
Compression: Video Compression (MPEG and others)
PPT
Mpeg4copy 120428133000-phpapp01
PPT
Multimedia Object - Video
PPT
Iain Richardson: An Introduction to Video Compression
PPT
28 h 264-avc_by_dhchang
PPT
ohh dear (1).ppt 1
PPT
Slide
PPTX
A short history of video coding
PPT
MPEG4 vs H.264
PDF
Video Compression Technology
PPT
H 264 in cuda presentation
PDF
A REAL-TIME H.264/AVC ENCODER&DECODER WITH VERTICAL MODE FOR INTRA FRAME AND ...
Introduction to Video Compression Techniques - Anurag Jain
Video00.ppt
Multimedia Presentation
Digital Video 101.ppt
MPEG-1 Part 2 Video Encoding
Encoding Video for the Web - Webinar from ReelSEO.com
H263.ppt
mpeg4copy-120428133000-phpapp01.ppt
Compression: Video Compression (MPEG and others)
Mpeg4copy 120428133000-phpapp01
Multimedia Object - Video
Iain Richardson: An Introduction to Video Compression
28 h 264-avc_by_dhchang
ohh dear (1).ppt 1
Slide
A short history of video coding
MPEG4 vs H.264
Video Compression Technology
H 264 in cuda presentation
A REAL-TIME H.264/AVC ENCODER&DECODER WITH VERTICAL MODE FOR INTRA FRAME AND ...

More from Arpit Agarwal (20)

PDF
Humara Vikas Humare Haath: Personality Development Lessons from the Gita
PDF
46101491 role-of-editor-in-new-media
PDF
What is a BarCamp? What you can expect to get out of it?
PPT
Social Media Marketing
PDF
300: Lessons in strategy from the movie
PPT
Inflation - Good or Bad?
PPT
Rbi Intervention
PPT
Rbi Intervention
PDF
Convertibility of Black money to white
PPT
Taxing Agriculture
PPS
Inflation - Good or Bad?
PPT
Government hand in private sector
PPT
FII is good too!
PPT
Govt Intervention For Markets
PDF
Convertibility of Black money to white
PPT
Is The Present Sez Policy Appropriate
PDF
FDI is good
PPT
Fiscal Expansion of India
PPT
Convertibility of Black money to white
PPS
Taxing Agriculture
Humara Vikas Humare Haath: Personality Development Lessons from the Gita
46101491 role-of-editor-in-new-media
What is a BarCamp? What you can expect to get out of it?
Social Media Marketing
300: Lessons in strategy from the movie
Inflation - Good or Bad?
Rbi Intervention
Rbi Intervention
Convertibility of Black money to white
Taxing Agriculture
Inflation - Good or Bad?
Government hand in private sector
FII is good too!
Govt Intervention For Markets
Convertibility of Black money to white
Is The Present Sez Policy Appropriate
FDI is good
Fiscal Expansion of India
Convertibility of Black money to white
Taxing Agriculture

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Approach and Philosophy of On baking technology
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Mushroom cultivation and it's methods.pdf
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
1. Introduction to Computer Programming.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Tartificialntelligence_presentation.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Spectral efficient network and resource selection model in 5G networks
Building Integrated photovoltaic BIPV_UPV.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
Approach and Philosophy of On baking technology
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Group 1 Presentation -Planning and Decision Making .pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Assigned Numbers - 2025 - Bluetooth® Document
Programs and apps: productivity, graphics, security and other tools
Mushroom cultivation and it's methods.pdf
SOPHOS-XG Firewall Administrator PPT.pptx
TLE Review Electricity (Electricity).pptx
Per capita expenditure prediction using model stacking based on satellite ima...
1. Introduction to Computer Programming.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Tartificialntelligence_presentation.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Univ-Connecticut-ChatGPT-Presentaion.pdf
Heart disease approach using modified random forest and particle swarm optimi...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025

Video Compression Demystified - Intro to MPEG, AVI, WMV

  • 1. Video Compression Demystified Intro to MPEG, WMV, H.264, AVI
  • 2. Why compression? Bandwidth constraint Space constraint I don’t know what else?
  • 3. Basic idea Correlation (Entropy) Space, Time, Bits Human eye models Low-pass filter Brain ‘fills’ in for sharpness Frugality
  • 4. Pixels and frames Each picture is a set of pixels in two dimensions, third being time. Finite sharpness, resolution issue. Each group of 8 pixels = block 2x2 blocks = Macroblock
  • 5. Correlation in space Space is ‘gravitated’ towards being continuous. How do you exploit it? DCT inside a block AC-DC prediction across blocks
  • 6. Correlation in Time The WHOLE image does not change. Do NOT repeat information. Motion estimation Locate where my marker is moving towards. Send only the coded difference. Bi-predict pictures.
  • 7. Correlation in bits Run-level-last coding for zeroes. Subsequent bits still have correlation in them. Huffman coding (VLC) Arithmetic coding (CABAC)
  • 8. Human eye Eye is not responsive to every detail Quantization Smoothening filters Eye is more responsive to brightness Less effort of chrominance Brain fills in crucial details.
  • 9. DCT and quantization Random in Time domain is high freq. Spatial redundancy reduced to a few significant values. Zigzag coding + quantization Focus on lower freq and suppress high freq.
  • 10. Motion Estimation Find the best match around – ME Diamond, Hierarchical, Logarithmic Generally over a block. Half-pel and quarter pel motion estimation using filters. Better match in sub-pel level.
  • 11. RLL and Huffman Long sequence of zeroes helps reducing to smaller chunks of data. This chunk also has a probabilistic models. Exploit probability of symbols and be frugal in bits. Context adaptive arithmetic coding.
  • 12. MPEG MPEG-1: DCT Pixel level ME AC-DC prediction P pictures Huffman coding
  • 13. MPEG-4 DCT Quarter-pel ME B pictures AC-DC prediction Data partitioning Objects – layers, groups, streams
  • 14. H.264 Integer transform on 4x4 block Quarter pel ME on 4x4 In-loop de-blocking Multiple reference pictures CABAC – 2 times frugal Bit-plane coding
  • 15. AVI Audio Video Interleave format One frame of video with equivalent audio frame(s) Necessary information Display info Shit