SlideShare a Scribd company logo
Guided By:-        Presented By:-
Ms. Savita Vijay   Kanika Rathore
                   B.tech (C.S.E) IIIyr
                   BTBTC10168
   Introduction
   What is the need?
   Aesthetic Aspects
   Video Aesthetic Features
   Audio Aesthetic Features
   Pivot Representation
   Advantages
   Applications
   Conclusion
   The PIVOT VECTOR SPACE APPROACH in audio mixing is
    a novel technique that automatically picks the best
    audio clip to mix with the given video shot.

   This technique uses pivot vector space mixing
    framework & High level perceptual descriptors of audio
    & video characteristics.

   It uses a Pivot Vector space mapping method that
    matches video shots with music segments based on
    aesthetic cinematographic heuristics .

   This automatic audio-video mixing technique is suited
    for Home videos.
   Most videos such as movies and sitcoms have several
    segments devoid of any speech. Adding carefully chosen
    music to such segments conveys emotions such as joy, tension
    ,or melancholy.

   In a typical professional video production, skilled audio-
    mixing artists aesthetically add appropriate audio to the given
    video shots. This process is tedious, time-consuming, and
    expensive.

   Many home video users would like to make their videos
    appear like professional productions before they share it with
    family and friends.
   Movies comprise :-
    Images
    Graphic traces
    Recorded speech , music and noises
    Sound effects

   Roles of music in movies :-
    Setting the scene
    Adding emotional meaning
    Serving as a background filler
    Creating continuity across shots or scenes
    Emphasizing climaxes
The table shows Aesthetic Features that correspond in video & music
Zettl based these proposed mixing rules on the following aspects :-

 Tonal matching

 Structural matching

 Thematic matching

 Historical-geographical matching
   A set of attributed features required to describe videos.

   This consists of features which required to describe videos.
     Light falloff :- refers to the brightness contrast between
    the light and shadow sides of an object

     Color features :- it consist of four features
        saturation
        hue
        brightness
        energy
     Motion vectors :- To measure the video segments’ motion
    intensity.
We obtained the mean and standard deviation for estimating the
confidence level of the Video & audio attributed features for any
test shot.
   Low level features :-
    Spectral centroid (brightness):- measure of a sound’s
    brightness.



    Zero crossing :- measure of the frequency content of the
    signal
Volume (loudness) :- represents the subjective measure ,
which depends on the human listener’s frequency response.
   Dynamics :- the volume of musical sound related to the
    music’s loudness or softness.

   Tempo features :- that makes the music flow unique and
    differentiates it from other types of audio signal is temporal
    organization . (beat rate)

   Perceptual pitch feature :- it has an important role in human
    hearing, and the auditory system apparently assigns a pitch to
    any thing that comes to its attention.
   A vector space P acts as a pivot between the audio and video
    representation.



   Independent of any media.



   This space is defined with some aesthetic features in which
    music M and videos V are mapped.
   We consider how to represent video and audio clips into
    their aesthetic spaces V or M

   In the two spaces, a dimension corresponds to an attributed
    feature,

   It includes brightness_high , brightness_low , and so on.

   One video shot is associated with one vector in the V space.

   Obtaining the values for each dimension resembles
    handling fuzzy variables
   The aesthetic feature playing the role of a fuzzy     variable
    and the attribute descriptor acting as a fuzzy value which is
    represented using diagram.
   The X-axis refers to the actual computed feature value and the
    Y-axis simultaneously indicates the aesthetic label and the
    confidence value.
In the below figure shows that
 a) Matching between the video L02_30 & the music T01_5
 b)Sample frame the video
pivot vector space approach in audio-video mixing
   Before the development of the PIVOT VECTOR SPACE
    APPROACH, audio-video mixing process can be carried out
    only by professional mixing artists.

   The Pivot vector space approach enables all the home video
    users and amateur video enthusiasts to give a professional look
    and feel to their videos.

   This technique also eliminates the need for professional
    mixing artists, thereby significantly reducing the cost, time
    and labour involved.
   A large amount of home video footage is being
    produced due to products such as Digital video
    camcorders , Handicams etc.

   Hence, this technique will be of great use to all the
    amateur video enthusiasts and home video users
   This is a technique that all amateur and home video
    artists can use in the creation of video footage that
    gives a professional look and feel.

   Since it is fully automatic, the user need not worry
    about his aesthetic capabilities.
   http://www-mrim.imag.fr/publications/2003/PM001/v_final.pdf
   http://ieeexplore.ieee.org
   www.edutalks.org
   www.scribd.com
pivot vector space approach in audio-video mixing

More Related Content

PDF
IT Introduction - 06. Graphic & Multimedia
PPT
Working of digital camera
PPTX
Introduction to camera & its major parts
PPTX
Film Grading - A Brief Introduction
PPTX
Image Filtering in the Frequency Domain
PPTX
Computer Vision harris
PPTX
Image Sensing and Acquisition.pptx
PPTX
Fundamental Steps Of Image Processing
IT Introduction - 06. Graphic & Multimedia
Working of digital camera
Introduction to camera & its major parts
Film Grading - A Brief Introduction
Image Filtering in the Frequency Domain
Computer Vision harris
Image Sensing and Acquisition.pptx
Fundamental Steps Of Image Processing

What's hot (20)

PDF
Digital Image Processing: An Introduction
PPTX
Lecture Summary : Camera Projection
PPTX
Comparison of image fusion methods
PPTX
HUMAN EMOTION RECOGNIITION SYSTEM
PPT
Digital Image Processing
PPT
Motion estimation overview
PPTX
Fundamental Steps of Digital Image Processing & Image Components
PPTX
Computer Vision - Stereo Vision
PDF
Digital Image Fundamentals
PPT
Image processing presentataion
PPT
PPT
Film Analysis Prod Tech
PPTX
Smart note taker
PPTX
Multimedia animation basic concepts
PDF
AS Level Media Studies - CREATIVE CRITICAL REFLECTION (Film Opening Task)
PPTX
Face Detection Attendance System By Arjun Sharma
PPTX
Cloud computing pricing models
PPTX
Camera model ‫‬
PPTX
Fundamentals and image compression models
PPTX
Video Surveillance
Digital Image Processing: An Introduction
Lecture Summary : Camera Projection
Comparison of image fusion methods
HUMAN EMOTION RECOGNIITION SYSTEM
Digital Image Processing
Motion estimation overview
Fundamental Steps of Digital Image Processing & Image Components
Computer Vision - Stereo Vision
Digital Image Fundamentals
Image processing presentataion
Film Analysis Prod Tech
Smart note taker
Multimedia animation basic concepts
AS Level Media Studies - CREATIVE CRITICAL REFLECTION (Film Opening Task)
Face Detection Attendance System By Arjun Sharma
Cloud computing pricing models
Camera model ‫‬
Fundamentals and image compression models
Video Surveillance
Ad

Viewers also liked (20)

PDF
Vector video standards
PPTX
Zig zag for website
PDF
Prezentace Inkscape a jeho použití v mojí praxi
PDF
Featured Pattern Run Length Coding for Test Data Compression
PDF
The ZIG ZAG
PPT
ZIG ZAG FEEDER
PDF
ประวัติแนะนำตนเอง
PDF
Canon Hd
PPS
Inivitation to an exotic party
PPT
Laboratorios California Janedy
PDF
PPTX
What is the definition of the 6 Sigma?
PPS
Photo Reflections
PPS
Water Themed Pictures
PPTX
Lenguaje Multimedia
PPT
Vasolix Nereyda
DOCX
Jumarni p
PPS
Living Quarters in China
PPTX
Rhinehart emerson power point presentation
PDF
Jumarni.p
Vector video standards
Zig zag for website
Prezentace Inkscape a jeho použití v mojí praxi
Featured Pattern Run Length Coding for Test Data Compression
The ZIG ZAG
ZIG ZAG FEEDER
ประวัติแนะนำตนเอง
Canon Hd
Inivitation to an exotic party
Laboratorios California Janedy
What is the definition of the 6 Sigma?
Photo Reflections
Water Themed Pictures
Lenguaje Multimedia
Vasolix Nereyda
Jumarni p
Living Quarters in China
Rhinehart emerson power point presentation
Jumarni.p
Ad

Similar to pivot vector space approach in audio-video mixing (20)

PPTX
C programming
PPT
Applying Media Content Analysis to the Production of Musical Videos as Summar...
PPTX
Identity fraud evaluation
PDF
Music video codes and conventions. .pdf
PDF
post-production-cmyk-2021-web.pdf
PDF
TVC production process
PDF
Performance Analysis of Audio and Video Synchronization using Spreaded Code D...
PPTX
COMP2 fundamental frequency of secondary succession season and unsure 🤔
PPTX
Identity fraud evaluation
PPTX
Master the Cut: Learn the Basics of Video Editing for Beginners
PPTX
Pod handler
PPTX
10 Ways to Enhance your Corporate Video on Edit Table
PPT
Video Data
PPTX
Evaluation question 6
PPTX
PDF
Example-Based Remixing of Multimedia Contents
PPTX
Forms and conventions of music videos
PDF
How to Edit a Music Video Like a || GXYZ Radio.pdf
DOCX
2nd editing sessionn
C programming
Applying Media Content Analysis to the Production of Musical Videos as Summar...
Identity fraud evaluation
Music video codes and conventions. .pdf
post-production-cmyk-2021-web.pdf
TVC production process
Performance Analysis of Audio and Video Synchronization using Spreaded Code D...
COMP2 fundamental frequency of secondary succession season and unsure 🤔
Identity fraud evaluation
Master the Cut: Learn the Basics of Video Editing for Beginners
Pod handler
10 Ways to Enhance your Corporate Video on Edit Table
Video Data
Evaluation question 6
Example-Based Remixing of Multimedia Contents
Forms and conventions of music videos
How to Edit a Music Video Like a || GXYZ Radio.pdf
2nd editing sessionn

Recently uploaded (20)

PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
Getting started with AI Agents and Multi-Agent Systems
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPT
Module 1.ppt Iot fundamentals and Architecture
PPTX
Modernising the Digital Integration Hub
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Hybrid model detection and classification of lung cancer
PPTX
observCloud-Native Containerability and monitoring.pptx
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
1. Introduction to Computer Programming.pptx
PPT
What is a Computer? Input Devices /output devices
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
OMC Textile Division Presentation 2021.pptx
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Getting started with AI Agents and Multi-Agent Systems
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Module 1.ppt Iot fundamentals and Architecture
Modernising the Digital Integration Hub
A novel scalable deep ensemble learning framework for big data classification...
Chapter 5: Probability Theory and Statistics
Hybrid model detection and classification of lung cancer
observCloud-Native Containerability and monitoring.pptx
Final SEM Unit 1 for mit wpu at pune .pptx
1. Introduction to Computer Programming.pptx
What is a Computer? Input Devices /output devices
Univ-Connecticut-ChatGPT-Presentaion.pdf
WOOl fibre morphology and structure.pdf for textiles
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
O2C Customer Invoices to Receipt V15A.pptx
DP Operators-handbook-extract for the Mautical Institute
Enhancing emotion recognition model for a student engagement use case through...
Hindi spoken digit analysis for native and non-native speakers
OMC Textile Division Presentation 2021.pptx

pivot vector space approach in audio-video mixing

  • 1. Guided By:- Presented By:- Ms. Savita Vijay Kanika Rathore B.tech (C.S.E) IIIyr BTBTC10168
  • 2. Introduction  What is the need?  Aesthetic Aspects  Video Aesthetic Features  Audio Aesthetic Features  Pivot Representation  Advantages  Applications  Conclusion
  • 3. The PIVOT VECTOR SPACE APPROACH in audio mixing is a novel technique that automatically picks the best audio clip to mix with the given video shot.  This technique uses pivot vector space mixing framework & High level perceptual descriptors of audio & video characteristics.  It uses a Pivot Vector space mapping method that matches video shots with music segments based on aesthetic cinematographic heuristics .  This automatic audio-video mixing technique is suited for Home videos.
  • 4. Most videos such as movies and sitcoms have several segments devoid of any speech. Adding carefully chosen music to such segments conveys emotions such as joy, tension ,or melancholy.  In a typical professional video production, skilled audio- mixing artists aesthetically add appropriate audio to the given video shots. This process is tedious, time-consuming, and expensive.  Many home video users would like to make their videos appear like professional productions before they share it with family and friends.
  • 5. Movies comprise :- Images Graphic traces Recorded speech , music and noises Sound effects  Roles of music in movies :- Setting the scene Adding emotional meaning Serving as a background filler Creating continuity across shots or scenes Emphasizing climaxes
  • 6. The table shows Aesthetic Features that correspond in video & music
  • 7. Zettl based these proposed mixing rules on the following aspects :-  Tonal matching  Structural matching  Thematic matching  Historical-geographical matching
  • 8. A set of attributed features required to describe videos.  This consists of features which required to describe videos. Light falloff :- refers to the brightness contrast between the light and shadow sides of an object Color features :- it consist of four features saturation hue brightness energy Motion vectors :- To measure the video segments’ motion intensity.
  • 9. We obtained the mean and standard deviation for estimating the confidence level of the Video & audio attributed features for any test shot.
  • 10. Low level features :- Spectral centroid (brightness):- measure of a sound’s brightness. Zero crossing :- measure of the frequency content of the signal
  • 11. Volume (loudness) :- represents the subjective measure , which depends on the human listener’s frequency response.
  • 12. Dynamics :- the volume of musical sound related to the music’s loudness or softness.  Tempo features :- that makes the music flow unique and differentiates it from other types of audio signal is temporal organization . (beat rate)  Perceptual pitch feature :- it has an important role in human hearing, and the auditory system apparently assigns a pitch to any thing that comes to its attention.
  • 13. A vector space P acts as a pivot between the audio and video representation.  Independent of any media.  This space is defined with some aesthetic features in which music M and videos V are mapped.
  • 14. We consider how to represent video and audio clips into their aesthetic spaces V or M  In the two spaces, a dimension corresponds to an attributed feature,  It includes brightness_high , brightness_low , and so on.  One video shot is associated with one vector in the V space.  Obtaining the values for each dimension resembles handling fuzzy variables
  • 15. The aesthetic feature playing the role of a fuzzy variable and the attribute descriptor acting as a fuzzy value which is represented using diagram.  The X-axis refers to the actual computed feature value and the Y-axis simultaneously indicates the aesthetic label and the confidence value.
  • 16. In the below figure shows that a) Matching between the video L02_30 & the music T01_5 b)Sample frame the video
  • 18. Before the development of the PIVOT VECTOR SPACE APPROACH, audio-video mixing process can be carried out only by professional mixing artists.  The Pivot vector space approach enables all the home video users and amateur video enthusiasts to give a professional look and feel to their videos.  This technique also eliminates the need for professional mixing artists, thereby significantly reducing the cost, time and labour involved.
  • 19. A large amount of home video footage is being produced due to products such as Digital video camcorders , Handicams etc.  Hence, this technique will be of great use to all the amateur video enthusiasts and home video users
  • 20. This is a technique that all amateur and home video artists can use in the creation of video footage that gives a professional look and feel.  Since it is fully automatic, the user need not worry about his aesthetic capabilities.
  • 21. http://www-mrim.imag.fr/publications/2003/PM001/v_final.pdf  http://ieeexplore.ieee.org  www.edutalks.org  www.scribd.com