SlideShare a Scribd company logo
Prepared By:Avni Guna.
Khushali Panasala.
Yogesh Pandey.
Priyanka Pandey.
Hiteshri Patel.
Guided By:Mr.Chandresh
Parekh.
 Introduction
 What Is Audio?
 Why Compression is needed?
 Audio Compression.
 Types Of Audio Compression.
 Standard codecs for audio compression.
 Categories of Audio Files
 MPEG Audio Encoding Steps
 MPEG Audio decoding
 Successor of MP3
 High quality audio coding based on perceptual models has
found its way to widespread application in broadcasting and
Internet audio (e.g. mp3).
 Algorithms defined by the MPEG group (MPEG-1 Audio, e.g.
MPEG Layer-3 (mp3), MPEG-2 Advanced Audio Coding,
MPEG-4 Audio including its different functionalities) still
define the state of the art.
 Audio is an electrical or other
representation of sound.
 An audio file format is a file
format for storing digital audio
data on a computer system.
• Compression is the reduction in size of data in order to
save space or transmission time.
• Compression can be used to:
 Reduce File Size
 Save disk space
 Reduce transmission time
• Compression is performed by a program that uses an
algorithm to determine how to compress or decompress
data.
• Audio compression is a form of data compression designed
to reduce the size of audio files.
• There is a conditions on this definition :
 the audio file must still be playable after
compression, without decompressing it to original size
when you want to play it (for example with WinRAR).
 If the file is compressed 'too much' there will be loss
of quality.
 The compression is done with a thing called a codec.
This is an aggregation of the words: compressor and
decompressor.
 This codec is a special algorithm to reduce the size.
Audio compression
 There are mainly two types of audio compression show
below:
1)Lossy Compression(MP3)
2)Lossless Compression(Winzip)
• A compression technique that does not decompress data
back to 100% of the original.
• Lossy methods provide high degrees of compression
and result in smaller compressed files, but there is a
certain amount of visual loss when restored.
• Example:MP3
• A compression technique that decompresses
data back to its original form without any loss.
• The decompressed file and the original are
identical. For example, the ZIP archiving
technology (WinZip...) is the most widely used
lossless method.
• Lossless audio files typically require more storage
space than Lossy encoded ones.
• However this type of format is often favored by users
wanting to backup original audio CDs.
• A perfect copy can be restored in the event of loss or
damage to the CD. FLAC, Apple Lossless (ALAC) and
WMA Lossless are examples of lossless compression
formats.
 For lossy compression:
• Nero AAC Codec (Nero “advanced audio coding”
codec): It was developed and distributed by Nero AG.
• FAAC(Freeware Advanced Audio Coder):is an audio
compression computer program that creates AAC sound
files from other formats , it is the recommended format
for the company's iPod music player.
 For lossless compression
• LPAC (Lossless predictive audio compression):is an
improved lossless audio compression algorithm developed
by Tilman Liebchen, Marcus Purat and Peter Noll.
• ALAC (Apple Lossless Audio Codec):is an audio coding
format, and its reference audio codec implementation,
developed by Apple.
• FLAC(Free Lossless Audio Codec): can typically reduce
the original size of audio file to 50–60%, and decompressed
it to an identical copy of the original audio data, developed
by Josh Coalson.
• WMA Lossless (Windows Media Audio Lossless):
developed by Microsoft
 Moving Picture Experts Group
 Aim to create standards relating to synchronized
 audio and video compression
 MPEG-1
 MPEG-2
Quite good
high bit-rate is
available
Digital audio tap
Complex
Digital Audio
Broadcasting
most complex
audio transmission
over ISDN lines
There are 3 categories in which certain Audio files
belong to:-
1) Uncompressed:
Ex) .Wav
2) Lossless:
Ex) .WMA
3) Lossy:
Ex) .Mp3
Audio compression
Audio compression
Audio compression
Audio compression
Audio compression
Audio compression
Audio compression
◦ Hearing threshold level – a function of frequency
◦ Any frequency components below the threshold will
not be perceived by human ear
 A frequency component can be partly or fully masked by
another component that is close to it in frequency
 A lower tone can effectively mask higher tone
 This shifts the hearing threshold
◦ A quieter sound can be masked by a louder
sound if they are temporally close Sounds that
occur both (shortly) before and after volume
increase can be masked
Audio compression
 Power-law: larger values have less accuracy
 quantization: uniform or non-uniform quantization.
 coding: quantized spectral components are transmitted
either directly, or as entropy coded words (Huffman
coding)
 For better data compression, variable- length Huffman
codes are used to encode the quantized samples.
 Resultant bitstream is now reduced, because of coarser
quantisation, but can be further reduced by the use of
Huffman coding.
 formats encoded quantized samples into an
encoded bit stream – final form in which the
compressed signal is transmitted.
 Header (First 4 bytes of a frame)
◦ Contains: Frame Sync, MPEG Layer, Sampling Frequency,
Number of Channels, CRC, etc.
◦ Variable bit rate mp3’s switch bitrate between frames
 Decoder side relatively easier. The gain, scale
factor, quantization steps recovered and used for
reconstruct the filter bank responses
 Filter bank responses are combined to reconstruct
the decoded audio signal
 Advanced Audio Coding (AAC) – now part of MPEG-
4 Audio
 Inclusion of 48 full-bandwidth audio channels
 Default audio format for iPhone, iPad, Nintendo,
PlayStation, Nokia, Android, BlackBerry
 Introduced 1997 as MPEG-2 Part 7
 In 1999 – updated and included in MPEG-4
 standard for lossy digital audio compression. Designed
to be the successor of the MP3 format, AAC generally
achieves better sound quality than MP3 at similar bit
rates.
 Opus is a lossy audio coding format developed by the
Internet Engineering Task Force (IETF) that is
particularly suitable for interactive real-time
applications over the Internet.
 Opus incorporates technology from two other audio
coding formats: the speech-oriented SILK and the low
latency CELT.
 The MDCT was proposed by Princen, Johnson, and
Bradley[1] in 1987, following earlier (1986) work by
Princen and Bradley[2] to develop the MDCT’s
underlying principle of time-domain aliasing
cancellation (TDAC), described below.
 In MP3, the MDCT is not applied to the audio signal
directly, but rather to the output of a 32-band polyphase
quadrature filter (PQF) bank. The output of this MDCT
is post processed by an alias reduction formula to
reduce the typical aliasing of the PQF filter bank.
 Such a combination of a filter bank with an MDCT is
called a hybrid filter bank or a sub band MDCT. AAC,
on the other hand, normally uses a pure MDCT; only
the (rarely used) MPEG-4 AAC-SSR variant (by Sony)
uses a four band PQF bank followed by an MDCT.
Similar to MP3, ATRAC uses stacked quadrature
mirror filters (QMF) followed by an MDCT.
 ISO/IEC
• MPEG-1 Layer III (MP3)
• MPEG-1 Layer II
• MPEG-1 Layer I
• AAC
• MPEG-4 ALS
• MPEG-4 SLS
• MPEG-D USAC
 ITU-T
• G.711
• G.718
• G.719
• G.722
• G.723
• G.726
• G.728
• G.729
• audio compression is a key technology
• many algorithms  many applications
• Better algorithms  better quality, more compression
Thank You

More Related Content

PPT
Audio compression 1
PPTX
Audio compression
ODP
Audio compression
PPTX
Audio compression
PPTX
Audio compression
PPTX
Fundamentals of Data compression
ODP
image compression ppt
PPTX
Audio and Video Compression
Audio compression 1
Audio compression
Audio compression
Audio compression
Audio compression
Fundamentals of Data compression
image compression ppt
Audio and Video Compression

What's hot (20)

PPTX
Multimedia communication networks
PDF
Digital Image Processing - Image Compression
ODP
MPEG-1 Part 2 Video Encoding
PPTX
MPEG video compression standard
PPTX
Digital Image Processing
PPTX
Wavelet based image compression technique
PPTX
Digital video
PPTX
Adaptive differential pcm
PPTX
Sub band project
PPTX
Image Sampling and Quantization.pptx
PPTX
Image compression standards
PPTX
digital image processing
PDF
Video compression
PPS
MPEG/Audio Compression
PPTX
Data compression
PPTX
Multimedia:Multimedia compression
PDF
Video Compression
PPTX
Bit plane coding
PPT
Chapter 4 : SOUND
Multimedia communication networks
Digital Image Processing - Image Compression
MPEG-1 Part 2 Video Encoding
MPEG video compression standard
Digital Image Processing
Wavelet based image compression technique
Digital video
Adaptive differential pcm
Sub band project
Image Sampling and Quantization.pptx
Image compression standards
digital image processing
Video compression
MPEG/Audio Compression
Data compression
Multimedia:Multimedia compression
Video Compression
Bit plane coding
Chapter 4 : SOUND
Ad

Similar to Audio compression (20)

PPTX
Audio format
PPTX
Audio file format in computer graphic
PPTX
Ppt on audio file formats
PPT
Digital audio
PPTX
Uncompressed lossless lossy audio
PPTX
introduction to audio formats - Multimedia Students
PPT
Audio Compression
PPTX
audiocompression-130624061221-phpapp02.pptx
PPTX
Chap62
PPTX
Audio format ict
PPTX
1.1.3 DATA STORAGE
PPT
Audio Codec
DOCX
Ig2 task 1 work sheet
PDF
History of digital week4
PPTX
Chapter 2 - MM Data and Compression.pptx
PDF
CHAPTER – 5 Audio
PPTX
Audio Compression_2023.pptx
PPTX
Digital Audio in Multimedia
PPTX
Sound digitalisation
PPT
Speech Compression
Audio format
Audio file format in computer graphic
Ppt on audio file formats
Digital audio
Uncompressed lossless lossy audio
introduction to audio formats - Multimedia Students
Audio Compression
audiocompression-130624061221-phpapp02.pptx
Chap62
Audio format ict
1.1.3 DATA STORAGE
Audio Codec
Ig2 task 1 work sheet
History of digital week4
Chapter 2 - MM Data and Compression.pptx
CHAPTER – 5 Audio
Audio Compression_2023.pptx
Digital Audio in Multimedia
Sound digitalisation
Speech Compression
Ad

Recently uploaded (20)

PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Strings in CPP - Strings in C++ are sequences of characters used to store and...
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Geodesy 1.pptx...............................................
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
composite construction of structures.pdf
PPT
Mechanical Engineering MATERIALS Selection
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Structs to JSON How Go Powers REST APIs.pdf
PPTX
Construction Project Organization Group 2.pptx
PPTX
additive manufacturing of ss316l using mig welding
Foundation to blockchain - A guide to Blockchain Tech
UNIT 4 Total Quality Management .pptx
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Strings in CPP - Strings in C++ are sequences of characters used to store and...
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Internet of Things (IOT) - A guide to understanding
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Geodesy 1.pptx...............................................
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
composite construction of structures.pdf
Mechanical Engineering MATERIALS Selection
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Operating System & Kernel Study Guide-1 - converted.pdf
Structs to JSON How Go Powers REST APIs.pdf
Construction Project Organization Group 2.pptx
additive manufacturing of ss316l using mig welding

Audio compression

  • 1. Prepared By:Avni Guna. Khushali Panasala. Yogesh Pandey. Priyanka Pandey. Hiteshri Patel. Guided By:Mr.Chandresh Parekh.
  • 2.  Introduction  What Is Audio?  Why Compression is needed?  Audio Compression.  Types Of Audio Compression.  Standard codecs for audio compression.  Categories of Audio Files  MPEG Audio Encoding Steps  MPEG Audio decoding  Successor of MP3
  • 3.  High quality audio coding based on perceptual models has found its way to widespread application in broadcasting and Internet audio (e.g. mp3).  Algorithms defined by the MPEG group (MPEG-1 Audio, e.g. MPEG Layer-3 (mp3), MPEG-2 Advanced Audio Coding, MPEG-4 Audio including its different functionalities) still define the state of the art.
  • 4.  Audio is an electrical or other representation of sound.  An audio file format is a file format for storing digital audio data on a computer system.
  • 5. • Compression is the reduction in size of data in order to save space or transmission time. • Compression can be used to:  Reduce File Size  Save disk space  Reduce transmission time • Compression is performed by a program that uses an algorithm to determine how to compress or decompress data.
  • 6. • Audio compression is a form of data compression designed to reduce the size of audio files. • There is a conditions on this definition :  the audio file must still be playable after compression, without decompressing it to original size when you want to play it (for example with WinRAR).  If the file is compressed 'too much' there will be loss of quality.  The compression is done with a thing called a codec. This is an aggregation of the words: compressor and decompressor.  This codec is a special algorithm to reduce the size.
  • 8.  There are mainly two types of audio compression show below: 1)Lossy Compression(MP3) 2)Lossless Compression(Winzip)
  • 9. • A compression technique that does not decompress data back to 100% of the original. • Lossy methods provide high degrees of compression and result in smaller compressed files, but there is a certain amount of visual loss when restored. • Example:MP3
  • 10. • A compression technique that decompresses data back to its original form without any loss. • The decompressed file and the original are identical. For example, the ZIP archiving technology (WinZip...) is the most widely used lossless method.
  • 11. • Lossless audio files typically require more storage space than Lossy encoded ones. • However this type of format is often favored by users wanting to backup original audio CDs. • A perfect copy can be restored in the event of loss or damage to the CD. FLAC, Apple Lossless (ALAC) and WMA Lossless are examples of lossless compression formats.
  • 12.  For lossy compression: • Nero AAC Codec (Nero “advanced audio coding” codec): It was developed and distributed by Nero AG. • FAAC(Freeware Advanced Audio Coder):is an audio compression computer program that creates AAC sound files from other formats , it is the recommended format for the company's iPod music player.
  • 13.  For lossless compression • LPAC (Lossless predictive audio compression):is an improved lossless audio compression algorithm developed by Tilman Liebchen, Marcus Purat and Peter Noll. • ALAC (Apple Lossless Audio Codec):is an audio coding format, and its reference audio codec implementation, developed by Apple. • FLAC(Free Lossless Audio Codec): can typically reduce the original size of audio file to 50–60%, and decompressed it to an identical copy of the original audio data, developed by Josh Coalson. • WMA Lossless (Windows Media Audio Lossless): developed by Microsoft
  • 14.  Moving Picture Experts Group  Aim to create standards relating to synchronized  audio and video compression  MPEG-1  MPEG-2
  • 15. Quite good high bit-rate is available Digital audio tap Complex Digital Audio Broadcasting most complex audio transmission over ISDN lines
  • 16. There are 3 categories in which certain Audio files belong to:- 1) Uncompressed: Ex) .Wav 2) Lossless: Ex) .WMA 3) Lossy: Ex) .Mp3
  • 24. ◦ Hearing threshold level – a function of frequency ◦ Any frequency components below the threshold will not be perceived by human ear
  • 25.  A frequency component can be partly or fully masked by another component that is close to it in frequency  A lower tone can effectively mask higher tone  This shifts the hearing threshold
  • 26. ◦ A quieter sound can be masked by a louder sound if they are temporally close Sounds that occur both (shortly) before and after volume increase can be masked
  • 28.  Power-law: larger values have less accuracy  quantization: uniform or non-uniform quantization.  coding: quantized spectral components are transmitted either directly, or as entropy coded words (Huffman coding)  For better data compression, variable- length Huffman codes are used to encode the quantized samples.  Resultant bitstream is now reduced, because of coarser quantisation, but can be further reduced by the use of Huffman coding.
  • 29.  formats encoded quantized samples into an encoded bit stream – final form in which the compressed signal is transmitted.  Header (First 4 bytes of a frame) ◦ Contains: Frame Sync, MPEG Layer, Sampling Frequency, Number of Channels, CRC, etc. ◦ Variable bit rate mp3’s switch bitrate between frames
  • 30.  Decoder side relatively easier. The gain, scale factor, quantization steps recovered and used for reconstruct the filter bank responses  Filter bank responses are combined to reconstruct the decoded audio signal
  • 31.  Advanced Audio Coding (AAC) – now part of MPEG- 4 Audio  Inclusion of 48 full-bandwidth audio channels  Default audio format for iPhone, iPad, Nintendo, PlayStation, Nokia, Android, BlackBerry  Introduced 1997 as MPEG-2 Part 7  In 1999 – updated and included in MPEG-4  standard for lossy digital audio compression. Designed to be the successor of the MP3 format, AAC generally achieves better sound quality than MP3 at similar bit rates.
  • 32.  Opus is a lossy audio coding format developed by the Internet Engineering Task Force (IETF) that is particularly suitable for interactive real-time applications over the Internet.  Opus incorporates technology from two other audio coding formats: the speech-oriented SILK and the low latency CELT.
  • 33.  The MDCT was proposed by Princen, Johnson, and Bradley[1] in 1987, following earlier (1986) work by Princen and Bradley[2] to develop the MDCT’s underlying principle of time-domain aliasing cancellation (TDAC), described below.  In MP3, the MDCT is not applied to the audio signal directly, but rather to the output of a 32-band polyphase quadrature filter (PQF) bank. The output of this MDCT is post processed by an alias reduction formula to reduce the typical aliasing of the PQF filter bank.
  • 34.  Such a combination of a filter bank with an MDCT is called a hybrid filter bank or a sub band MDCT. AAC, on the other hand, normally uses a pure MDCT; only the (rarely used) MPEG-4 AAC-SSR variant (by Sony) uses a four band PQF bank followed by an MDCT. Similar to MP3, ATRAC uses stacked quadrature mirror filters (QMF) followed by an MDCT.
  • 35.  ISO/IEC • MPEG-1 Layer III (MP3) • MPEG-1 Layer II • MPEG-1 Layer I • AAC • MPEG-4 ALS • MPEG-4 SLS • MPEG-D USAC
  • 36.  ITU-T • G.711 • G.718 • G.719 • G.722 • G.723 • G.726 • G.728 • G.729
  • 37. • audio compression is a key technology • many algorithms  many applications • Better algorithms  better quality, more compression