Professional Documents
Culture Documents
Video Retrieval
Dr.S.Geetha
Professor
School of Computing Science and Engineering
VIT University, Chennai Campus, Chennai
AB1-205 – geetha.s@vit.ac.in
SYLLABUS
• Fundamentals of Content based image and video
retrieval
• Image Content descriptors –Key Frame features Color
• Image Content descriptors Key frame features– Texture,
Shape
• Motion features
• Similarity Measures and Indexing Schemes Feature
Extraction techniques
• Computer Vision System Toolbox
Recommended Books
• MOODLE
– Lecture Slides
– Reading materials
– Web links
– Tutorials
– Research works, Recent Trends etc.
• Digital Image Processing (3rd Edition)– Gonzalez
• Research and White Papers
Grading Policy
• CAT – 1 – 15%
• CAT – 2 – 15%
• FAT – 40%
• Digital Assignment, Quiz, Programming
Exercise – 30%
• Project
– Team of maximum 3 members
COURSE OBJECTIVES
• To understand the fundamentals for the content-based
image and video retrieval.
• To understand the role of key frame features for content
based image retrieval.
• To understand the motion features of videos.
• To give the exposure on importance of similarity
measure in content based image retrieval.
• To design the algorithm for content based image
retrieval using machine learning algorithm.
Lecture 1
Introduction to Image Retrieval
Applications
Lecture 1: Outline
Whatisisand
•• What andWhy
Why image
image retrieval?
retrieval?
• How to compare and retrieve images?
• How to compare and retrieve images?
− Digital image representation
– Digital image representation
− Common components of the CBIR systems
– Common components of the CBIR systems
− Main problems and research directions
– Main
• What areproblems and research directions
applications?
• What are applications?
What is image retrieval?
Description Based Image Retrieval (DBIR)
Content Based Image Retrieval (CBIR)
Query
Textual Text
query
Query Image
by
example Sketch
DBIR v. s. CBIR
DBIR CBIR
Full text search algorithms are Automatic index construction
applicable
+ Index is objective
Search results correspond to
image semantics
17
Query By Example (QBE)
• The image DB user should be able to:
– show the system a sample image, or
– Paint one interactively on the screen, or
– Just sketch the outline of an object.
• The system should then be able to return
similar images or images containing similar
objects.
18
IBM-QBIC
• The Hermitage Web site was voted the best in
Russia. It uses the QBIC engine for searching
archives of world-famous art.
– http://www.hermitagemuseum.org/fcgi-bin/db2w
ww/qbicSearch.mac/qbic?selLang=English
– Color percentage
– Color layout
19
A sample query
• SELECT CONTENTS(image),
QBScoreFROMStr(`averageColor=
<255,0,0>’, image) AS SCORE
FROM signs ORDER BY SCORE
20
Photobook System
Sketch entry
Results of search
ImageScape System
Collect user’s
feedback Initial Query Results
28/46
Content-based Image Retrieval (CBIR)
http://pro.corbis.com/
http://wwwqbic.almaden.ibm.com
UC Berkeley’s Blobworld
http://elib.cs.berkeley.edu/blobworld
User
Image Database
Distance Measure
36/46
Image retrieval by Google
37/46
Image retrieval by Yandex
38/46
Lecture 1: Outline
• What is and Why image retrieval?
• How to compare and retrieve images?
– Digital image representation
– Common components of the CBIR systems
– Main problems and research directions
• What are applications?
39/46
Digital image representation
Vector image
draw circle
center 0.5, 0.5
radius 0.4
fill-color yellow
stroke-color black
stroke-width 0.05
draw circle
center 0.35, 0.4
radius 0.05
fill-color black
draw circle
center 0.65, 0.4
radius 0.05
fill-color black
draw line
start 0.3, 0.6
end 0.7, 0.6
stroke-color black
stroke-width 0.1
40/46
Digital image representation
Bitmap (raster) image
43/46
Digital image representation
Bitmap (raster) image: color
• RGB – the most common color model (CRT monitors, LCD
screens/projectors)
• Each pixel represented by 3 values: red, green, blue
RGB bands:
color image built up of bands of red, green and blue color
44/46
Digital image representation
Bitmap (raster) image: color
• Pixel-interleaved format (chunky) – is a common one
45/46
Lecture 1: Outline
• What is and Why image retrieval?
• How to compare and retrieve images?
– Digital image representation
– Common components of the CBIR systems
– Main problems and research directions
• What are applications?
46/46
Common components of CBIR
system
indexation
image feature
extraction database
comparison
extraction
47/46
Lecture 1: Outline
• What is and Why image retrieval?
• How to compare and retrieve images?
– Digital image representation
– Common components of the CBIR systems
– Main problems and research directions
• What are applications?
48/46
Problems and directions
• Low-level feature extraction
– How to represent an image in a compact and descriptive
way?
– How to compare features, and, thus, images?
• High dimensional indexing
• How to index huge amounts of high dimensional data?
• Visual interface for image browsing
– How to visualize the results?
49/46
How to: Image features
Textual/metadata features
Semantics
Levels of image content
Shape
Texture
Low-level features / visual features
(signatures, descriptors)
Color, lightness
50/46
How to: Image features
Image features
Visual (low-
Textual
level)
Annotations and metadata: Features extracted from pixel
− tags/keywords; values:
− creation date; − color descriptors;
− geo tags; − texture descriptors;
− name of the file; − shape descriptors;
− photography conditions − spatial layout descriptors.
(exposition, aperture, flash…).
51/46
How to: Image features
Low-level features
Global Local
53/46
How to: Combine results
Image A Image B
Similarity measure
xA1 xA2 … x AN xB1 xB2 … xBN d1
Similarity measure
y A 1 yA 2 … yAM yB1 yB2 … yBM d2
Similarity measure
z A 1 zA 2 … zAK zB1 zB2 … zBK d3
…
…
D c d
i
i i
54/46
How to: Image segmentation
• Fixed regions
– The same region boundaries for all images.
• Segmentation
– Boundaries depends on image content.
• Key points (point of interest) detection
– Points of particular interest in the image, feature extraction for areas
around key points.
55/46
Problems: semantic gap
Objects semantics
(regions)
Levels of image content
Color, brightness
(one pixel) low-level features
57/46
Problems: what’s on the images?
59/46
Problems: high dimensional data
• More information in feature vectors – better search results.
60/46
How to: high dimensional
indexing
• Perform dimension reduction
– The dimension of the feature vectors is normally
very high, the embedded dimension is much lower.
61/46
Problems: visualization
• Image content is very rich and its interpretation is
very contextual and subjective.
62/46
1-D
2-D 3-D
63/46
Neighbour research areas
• Image processing
– Features extraction
– Pattern recognition and machine learning
• Faces, handwritings, thumbprints, …
• Classification tools
– Image enhancement
– Image classification
• The same features are used
• Classification helps to retrieve
• Information retrieval
– Scalability
– Performance measurement
– Fusion of multiple evidences
64/46
Lecture 1: Outline
• What is and Why image retrieval?
• How to compare and retrieve images?
– Digital image representation
– Common components of the CBIR systems
– Main problems and research directions
• What are applications?
65/46
What are applications? – Image Archives.
• Manage image archives
– Personal photo collections (many thousands of photos in mine)
– Professional photograph archives (millions of photos)
– Art collections (millions of photos)
• Browse images
• Organize image collection: delete duplicates, classify images, select “the best”
from the group of similar images
• Posters creation, auto cropping, album creation (www.snapfishlab.hpl.hp.com)
• Better organization of search-by-text results
66/46
What are applications? – Image Archives.
• Manage image archives
• …
• Search for particular image (by its smaller version, by its fragment)
• Search for similar images (landscape paintings, sea views, paintings by the
same author)
• Search for a painting with particular colors (“I want a sea view painting to my
bedroom with an orange carpet and yellow walls”)
• Search for group photos of my family
• Search for an image that will be a good illustration to my article/presentation
• … a lot of other use cases
67/46
What are applications? – Copyrights.
• Trademark and copyright application
– World Wide Web
– Enterprise network
68/46
What are applications? – Medical.
• Medical diagnosis
– Collection of X-ray images
69/46
What are applications? – Security.
• Security issues
– Video surveillance material
– Faces, fingerprints, retina images
70/46
What are applications? – In
industry.
• Quality assurance
(a) CD-ROM controller(b) Pack of pills (c) Level of liquid (d) Air-bladders
in plastic
• Control that all parts of the product are on place (a)
• Control if all places in pill pack are filled (b)
• Control the level of liquid in bottles (c)
• Control the quality of plastic details (d)
• And even control the corn flakes! (e)
(e) Corn flakes
71/46
What are applications? – Others.
• Military-related issues
– Auto aiming, tracking systems
72/46
Lecture 1: Summary
• CBIR is an actual problem and an active research area
73/46
Lecture 1: Bibliography
• Gonzalez R, Woods R. Digital Image Processing, published by
Pearson Education, Inc, 2002.
• Rui Y., Huang T.S., Chang S.-F. Image Retrieval: Past, Present
and Future. In Proc. of Int. Symposium on Multimedia
Information Processing, Dec. 1997.
• Links:
– http://videolectures.net/russir08_vassilieva_cbir/
– http://videolectures.net/kdd08_malik_fis/
– http://imagedatabase.cs.washington.edu/demo/cbir.html
– courses.cs.vt.edu/~cs4624/cache/qbic.htm
– http://homepages.inf.ed.ac.uk/cgi/rbf/CVONLINE/entries.pl?TAG363
http://www.ux.uis.no/~tranden/brodatz.html
http://www.cs.washington.edu/research/imagedatabase/groundtruth/
http://peipa.essex.ac.uk/benchmark/databases/
74/46
Thank You!