SlideShare a Scribd company logo
2
Most read
3
Most read
5
Most read
07/05/15 1
11CP913 – DATA MINING
MULTIMEDIA DATA MINING
DATA MINING
07/05/15 2
MULTIMEDIA DATABASE
 Multimedia database system – stores & manages a large
collection of multimedia objects
 Audio data, image data, video data, sequence data,
hypertext data (contain text, text markups & linkages)
 Audio-video equipment, CD-ROM, internet
 Multimedia data mining focuses on image data mining
 Multimedia data mining methods
 Similarity search in multimedia data,
 Multidimensional analysis,
 Classification & prediction analysis and
 Mining associations in multimedia data
DATA MINING
07/05/15 3
SIMILARITY SEARCH IN MULTIMEDIA
DATA
 Two types of multimedia indexing and retrieval systems
 Description-based retrieval system
 Content-based retrieval system
 Description-based retrieval system
 Build indices and perform object retrieval based on image descriptions,
• keywords
• caption
• Size
• Time of creation
 Labor-intensive
 Poor quality
DATA MINING
07/05/15 4
CONTENT-BASED RETRIEVAL SYSTEM
 Object retrieval is based on the image content,
 color histogram
 texture
 pattern
 image topology
 shape of objects and their layouts and locations within the image
 Desirable in many applications
 Two kinds of queries
 Image sample-based queries
 Image feature specification queries
 Image sample-based queries
 Search compares the feature vector extracted from the sample with
images & indexed in image database
 Images closer to the sample images are returned
DATA MINING
07/05/15 5
CONTD…
 Image feature specification queries
 Sketch image features(color, texture or shape)
 Translated into feature vector to be matched with the image database
 Applications – medical diagnosis, weather prediction, web
search engines for images
 QBIC(Query By Image Content)
 Support both sample-based & image feature specification queries
 Approaches for similarity-based retrieval in image database
based on image signature
 Color histogram-based signature
 Multifeature composed signature
 Wavelet-based signature
 Wavelet-based signature with region-based granularity
DATA MINING
07/05/15 6
CONTD…
 Color histogram-based signature
 image signature includes color histogram based on the color
composition of an image
 No information about shape, location or texture
 Two images with similar color results in unrelated semantics
 Multifeature composed signature
 Image signature includes a composition of multiple features
• Color histogram, shape, location and texture
 Separate distance function for each feature
 Few features are used to search for images with similar features
DATA MINING
07/05/15 7
CONTD…
 Wavelet-based signature
 Image signature includes the wavelet coefficients of an image
 Wavelets capture shape, texture & location information in a single
unified framework
 Improves efficiency & reduces the need for multiple search primitives
 Computes a single signature for an entire image
 Wavelet-based signature with region-based granularity
 Computation & comparison of signatures are at the granularity of
regions, not the entire image
 Similar images may contain similar regions
 Region in one image – performs translation/scaling of a matching
region with other
 Similarity measure between query image & target image
DATA MINING
07/05/15 8
MULTIDIMENSIONAL ANALYSIS OF
MULTIMEDIA DATA
 Multimedia data cube
 contain additional dimensions
 Measures for multimedia information – color, texture & shape
 Multimedia miner
 Image contains 2 descriptors – feature descriptor & a layout descriptor
 Original image is not stored directly in database
 Description information
• Image file name
• Image URL
• Image type
• List of keywords
DATA MINING
07/05/15 9
CONTD…
 Feature descriptor
 set of vectors for visual characteristics
 Main vectors – color vector, MFC (Most Frequent Color), MFO (Most
Frequent Orientation) vector
 Layout descriptor
 Color layout vector - MFC
 Edge layout vector – number of edges for each orientation
 Dimensions of multimedia data cube
 Size of the image/video in bytes
 Width & height of the frames
 Date of creation (image/video)
 Format type
 Frame sequence duration in seconds
 Keywords, color & edge orientation
DATA MINING
07/05/15 10
CONTD…
 Construction of a multimedia data cube
 Facilitates multidimensional analysis of multimedia data
 Based on visual content
 Mining of multiple kinds of knowledge
• Summarization
• Comparison
• Classification
• Association
• clustering
DATA MINING
07/05/15 11DATA MINING
07/05/15 12
CONTD…
 Difficult to implement a data cube efficiently for large number
of dimensions
 Attributes are set-oriented instead of single-valued
 Eg : single image corresponds to set of keywords,
set of objects associated with set of colors
DATA MINING
07/05/15 13
CLASSIFICATION & PREDICTION
ANALYSIS OF MULTIMEDIA DATA
 Scientific research – astronomy, seismology & geoscientific
research
 Decision tree classification – essential data mining method
 Eg : sky images – classified by astronomers as the training set
constructing models for recognition of galaxies, stars
based on properties – magnitudes, areas, intensity, image
moments & orientation.sky images taken by telescope are
tested against the constructed models – to identify new
bodies
 Data preprocessing – mining image data
DATA MINING
07/05/15 14
MINING ASSOCIATIONS IN
MULTIMEDIA DATA
 Association between image content & non-image content
features: “if atleast 50% of the upper part of the picture is
blue, it is likely to represent sky”
 Association among image contents that are not related to
spatial relationships:”if a picture contains 2 blue squares, it
is likely to contain one red circle as well”
 Association among image contents related to spatial
relationships:”if a red triangle is in between 2 yellow
squares, it is likely there is a big oval-shaped object
underneath”
DATA MINING
07/05/15 15
CONTD…
 Multiple objects with multiple features – large number of
possible associations
 Essential to promote progressive resolution refinement
 Frequently occurring pattern – mine at rough level & focus on
finer resolution level
 Reduces the cost without loss of quality
 Picture containing multiple recurrent objects is an important
feature in image analysis
 Relative spatial relationships among multimedia objects –
above, beneath, between, nearby
DATA MINING
07/05/15 16
AUDIO & VIDEO DATA MINING
 Demand for effective content-based retrieval & data mining
methods for audio & video data
 Eg: editing video clips, detecting suspicious scenes in videos
 MPEG & JPEG – video compression schemes
 MPEG-7- formally named “Multimedia Content Description
Interface”
 Used in broad range of applications
 Audiovisual description – still pictures, video, graphics, audio,
speech
DATA MINING
07/05/15 17
CONTD…
 Elements in MPEG-7
 A set of descriptors defines the syntax & semantics of a feature
 Structure & semantics of the relationships between its components
 A set of coding schemes for the descriptors
 DDL(Description Definition Language)
 Facilitates content-based video retrieval & video data mining
 Video clip – collection of actions & events in time
 Shot – group of frames/pictures
 Key frame
 Most representative frame in a video
 Sequence of key frames defines the sequence of the events in the video
clip
DATA MINING

More Related Content

PPTX
Introduction to Web Mining and Spatial Data Mining
PPTX
PPTX
web mining
PPTX
Multimedia content based retrieval in digital libraries
PPTX
Data Mining
PPTX
UNIT - 5: Data Warehousing and Data Mining
PPTX
Attribute oriented analysis
PDF
Multimedia Information Retrieval
Introduction to Web Mining and Spatial Data Mining
web mining
Multimedia content based retrieval in digital libraries
Data Mining
UNIT - 5: Data Warehousing and Data Mining
Attribute oriented analysis
Multimedia Information Retrieval

What's hot (20)

PPT
Image segmentation ppt
PPTX
Fundamentals and image compression models
PPTX
Image compression .
PPTX
Bit plane coding
PPTX
Smoothing Filters in Spatial Domain
PDF
Digital image processing using matlab
PPTX
Data mining tasks
PPTX
Image compression standards
PPTX
Digital Image Processing
PPTX
Region based segmentation
PPT
Chapter10 image segmentation
PPT
Digital Image Processing
PPT
Multimedia Mining
PPTX
Module 31
PPTX
COM2304: Intensity Transformation and Spatial Filtering – I (Intensity Transf...
PPTX
digital image processing
ODP
image compression ppt
PPTX
Chapter 9 morphological image processing
PPTX
Introduction to Image Compression
PPT
4.2 spatial data mining
Image segmentation ppt
Fundamentals and image compression models
Image compression .
Bit plane coding
Smoothing Filters in Spatial Domain
Digital image processing using matlab
Data mining tasks
Image compression standards
Digital Image Processing
Region based segmentation
Chapter10 image segmentation
Digital Image Processing
Multimedia Mining
Module 31
COM2304: Intensity Transformation and Spatial Filtering – I (Intensity Transf...
digital image processing
image compression ppt
Chapter 9 morphological image processing
Introduction to Image Compression
4.2 spatial data mining
Ad

Similar to 4.3 multimedia datamining (20)

DOCX
Text Mining with Automatic Annotation from Unstructured Content
PDF
8017 25 image mining
POT
Multi media Data mining
PPTX
Data Mining: Applying data mining
PPTX
Data Mining: Applying data mining
PDF
Multimedia mining research – an overview
PPT
Multimedia Data Mining using Deep Learning
PPT
viretrieval2.ppt chain codes Multimedia Information Retrieval
PDF
Image Indexing and Retrieval
PDF
Searching in high dimensional spaces index structures for improving the perfo...
PDF
50320140502001 2
PDF
50320140502001
PPT
Data_Mining_Applications of various kinds .ppt
PDF
PDF
Data Management For Multimedia Retrieval K Seluk Candan Maria Luisa Sapino
PDF
Multimedia Big Data Management Processing And Analysis
PDF
IRJET- Retrieval of Images & Text using Data Mining Techniques
PPTX
Defense Powepoint
PDF
RECURRENT FEATURE GROUPING AND CLASSIFICATION MODEL FOR ACTION MODEL PREDICTI...
PDF
RECURRENT FEATURE GROUPING AND CLASSIFICATION MODEL FOR ACTION MODEL PREDICTI...
Text Mining with Automatic Annotation from Unstructured Content
8017 25 image mining
Multi media Data mining
Data Mining: Applying data mining
Data Mining: Applying data mining
Multimedia mining research – an overview
Multimedia Data Mining using Deep Learning
viretrieval2.ppt chain codes Multimedia Information Retrieval
Image Indexing and Retrieval
Searching in high dimensional spaces index structures for improving the perfo...
50320140502001 2
50320140502001
Data_Mining_Applications of various kinds .ppt
Data Management For Multimedia Retrieval K Seluk Candan Maria Luisa Sapino
Multimedia Big Data Management Processing And Analysis
IRJET- Retrieval of Images & Text using Data Mining Techniques
Defense Powepoint
RECURRENT FEATURE GROUPING AND CLASSIFICATION MODEL FOR ACTION MODEL PREDICTI...
RECURRENT FEATURE GROUPING AND CLASSIFICATION MODEL FOR ACTION MODEL PREDICTI...
Ad

More from Krish_ver2 (20)

PPT
5.5 back tracking
PPT
5.5 back track
PPT
5.5 back tracking 02
PPT
5.4 randomized datastructures
PPT
5.4 randomized datastructures
PPT
5.4 randamized algorithm
PPT
5.3 dynamic programming 03
PPT
5.3 dynamic programming
PPT
5.3 dyn algo-i
PPT
5.2 divede and conquer 03
PPT
5.2 divide and conquer
PPT
5.2 divede and conquer 03
PPT
5.1 greedyyy 02
PPT
5.1 greedy
PPT
5.1 greedy 03
PPT
4.4 hashing02
PPT
4.4 hashing
PPT
4.4 hashing ext
PPT
4.4 external hashing
PPT
4.2 bst
5.5 back tracking
5.5 back track
5.5 back tracking 02
5.4 randomized datastructures
5.4 randomized datastructures
5.4 randamized algorithm
5.3 dynamic programming 03
5.3 dynamic programming
5.3 dyn algo-i
5.2 divede and conquer 03
5.2 divide and conquer
5.2 divede and conquer 03
5.1 greedyyy 02
5.1 greedy
5.1 greedy 03
4.4 hashing02
4.4 hashing
4.4 hashing ext
4.4 external hashing
4.2 bst

Recently uploaded (20)

PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Pre independence Education in Inndia.pdf
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
Cell Types and Its function , kingdom of life
PDF
RMMM.pdf make it easy to upload and study
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
Business Ethics Teaching Materials for college
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
01-Introduction-to-Information-Management.pdf
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Insiders guide to clinical Medicine.pdf
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Module 4: Burden of Disease Tutorial Slides S2 2025
102 student loan defaulters named and shamed – Is someone you know on the list?
Pre independence Education in Inndia.pdf
Anesthesia in Laparoscopic Surgery in India
Cell Types and Its function , kingdom of life
RMMM.pdf make it easy to upload and study
Week 4 Term 3 Study Techniques revisited.pptx
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Business Ethics Teaching Materials for college
STATICS OF THE RIGID BODIES Hibbelers.pdf
O5-L3 Freight Transport Ops (International) V1.pdf
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
01-Introduction-to-Information-Management.pdf
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Insiders guide to clinical Medicine.pdf
school management -TNTEU- B.Ed., Semester II Unit 1.pptx

4.3 multimedia datamining

  • 1. 07/05/15 1 11CP913 – DATA MINING MULTIMEDIA DATA MINING DATA MINING
  • 2. 07/05/15 2 MULTIMEDIA DATABASE  Multimedia database system – stores & manages a large collection of multimedia objects  Audio data, image data, video data, sequence data, hypertext data (contain text, text markups & linkages)  Audio-video equipment, CD-ROM, internet  Multimedia data mining focuses on image data mining  Multimedia data mining methods  Similarity search in multimedia data,  Multidimensional analysis,  Classification & prediction analysis and  Mining associations in multimedia data DATA MINING
  • 3. 07/05/15 3 SIMILARITY SEARCH IN MULTIMEDIA DATA  Two types of multimedia indexing and retrieval systems  Description-based retrieval system  Content-based retrieval system  Description-based retrieval system  Build indices and perform object retrieval based on image descriptions, • keywords • caption • Size • Time of creation  Labor-intensive  Poor quality DATA MINING
  • 4. 07/05/15 4 CONTENT-BASED RETRIEVAL SYSTEM  Object retrieval is based on the image content,  color histogram  texture  pattern  image topology  shape of objects and their layouts and locations within the image  Desirable in many applications  Two kinds of queries  Image sample-based queries  Image feature specification queries  Image sample-based queries  Search compares the feature vector extracted from the sample with images & indexed in image database  Images closer to the sample images are returned DATA MINING
  • 5. 07/05/15 5 CONTD…  Image feature specification queries  Sketch image features(color, texture or shape)  Translated into feature vector to be matched with the image database  Applications – medical diagnosis, weather prediction, web search engines for images  QBIC(Query By Image Content)  Support both sample-based & image feature specification queries  Approaches for similarity-based retrieval in image database based on image signature  Color histogram-based signature  Multifeature composed signature  Wavelet-based signature  Wavelet-based signature with region-based granularity DATA MINING
  • 6. 07/05/15 6 CONTD…  Color histogram-based signature  image signature includes color histogram based on the color composition of an image  No information about shape, location or texture  Two images with similar color results in unrelated semantics  Multifeature composed signature  Image signature includes a composition of multiple features • Color histogram, shape, location and texture  Separate distance function for each feature  Few features are used to search for images with similar features DATA MINING
  • 7. 07/05/15 7 CONTD…  Wavelet-based signature  Image signature includes the wavelet coefficients of an image  Wavelets capture shape, texture & location information in a single unified framework  Improves efficiency & reduces the need for multiple search primitives  Computes a single signature for an entire image  Wavelet-based signature with region-based granularity  Computation & comparison of signatures are at the granularity of regions, not the entire image  Similar images may contain similar regions  Region in one image – performs translation/scaling of a matching region with other  Similarity measure between query image & target image DATA MINING
  • 8. 07/05/15 8 MULTIDIMENSIONAL ANALYSIS OF MULTIMEDIA DATA  Multimedia data cube  contain additional dimensions  Measures for multimedia information – color, texture & shape  Multimedia miner  Image contains 2 descriptors – feature descriptor & a layout descriptor  Original image is not stored directly in database  Description information • Image file name • Image URL • Image type • List of keywords DATA MINING
  • 9. 07/05/15 9 CONTD…  Feature descriptor  set of vectors for visual characteristics  Main vectors – color vector, MFC (Most Frequent Color), MFO (Most Frequent Orientation) vector  Layout descriptor  Color layout vector - MFC  Edge layout vector – number of edges for each orientation  Dimensions of multimedia data cube  Size of the image/video in bytes  Width & height of the frames  Date of creation (image/video)  Format type  Frame sequence duration in seconds  Keywords, color & edge orientation DATA MINING
  • 10. 07/05/15 10 CONTD…  Construction of a multimedia data cube  Facilitates multidimensional analysis of multimedia data  Based on visual content  Mining of multiple kinds of knowledge • Summarization • Comparison • Classification • Association • clustering DATA MINING
  • 12. 07/05/15 12 CONTD…  Difficult to implement a data cube efficiently for large number of dimensions  Attributes are set-oriented instead of single-valued  Eg : single image corresponds to set of keywords, set of objects associated with set of colors DATA MINING
  • 13. 07/05/15 13 CLASSIFICATION & PREDICTION ANALYSIS OF MULTIMEDIA DATA  Scientific research – astronomy, seismology & geoscientific research  Decision tree classification – essential data mining method  Eg : sky images – classified by astronomers as the training set constructing models for recognition of galaxies, stars based on properties – magnitudes, areas, intensity, image moments & orientation.sky images taken by telescope are tested against the constructed models – to identify new bodies  Data preprocessing – mining image data DATA MINING
  • 14. 07/05/15 14 MINING ASSOCIATIONS IN MULTIMEDIA DATA  Association between image content & non-image content features: “if atleast 50% of the upper part of the picture is blue, it is likely to represent sky”  Association among image contents that are not related to spatial relationships:”if a picture contains 2 blue squares, it is likely to contain one red circle as well”  Association among image contents related to spatial relationships:”if a red triangle is in between 2 yellow squares, it is likely there is a big oval-shaped object underneath” DATA MINING
  • 15. 07/05/15 15 CONTD…  Multiple objects with multiple features – large number of possible associations  Essential to promote progressive resolution refinement  Frequently occurring pattern – mine at rough level & focus on finer resolution level  Reduces the cost without loss of quality  Picture containing multiple recurrent objects is an important feature in image analysis  Relative spatial relationships among multimedia objects – above, beneath, between, nearby DATA MINING
  • 16. 07/05/15 16 AUDIO & VIDEO DATA MINING  Demand for effective content-based retrieval & data mining methods for audio & video data  Eg: editing video clips, detecting suspicious scenes in videos  MPEG & JPEG – video compression schemes  MPEG-7- formally named “Multimedia Content Description Interface”  Used in broad range of applications  Audiovisual description – still pictures, video, graphics, audio, speech DATA MINING
  • 17. 07/05/15 17 CONTD…  Elements in MPEG-7  A set of descriptors defines the syntax & semantics of a feature  Structure & semantics of the relationships between its components  A set of coding schemes for the descriptors  DDL(Description Definition Language)  Facilitates content-based video retrieval & video data mining  Video clip – collection of actions & events in time  Shot – group of frames/pictures  Key frame  Most representative frame in a video  Sequence of key frames defines the sequence of the events in the video clip DATA MINING