Multimedia Systems:
Research & Applications
Dr. Jian Zhang
NICTA & CSE UNSW
COMP9519 Multimedia Systems
S2 2007
jzhang@cse.unsw.edu.au
1.1 Introduction
• Our team’s profile – Multimedia and Visual Communication Research Group (MVC), NICTA
  • Jing Chen received his master's degree in Telecommunication Engineering from Shanghai Jiaotong University in 1993.
  • Jing Chen received his Ph.D. degree in Electrical Engineering from the University of Sydney in 2003.
  • Jing Chen was a senior system engineer with China Telecom from 1994 to 1998.
  • Jing Chen was a senior software engineer with Motorola Australia Software Sydney Center from 1999 to 2003.
  • Jing Chen's research interests include video processing and video communication, video QoS enhancement over wireless networks, and video representation and understanding using machine learning technologies. Jing has published more than 10 papers in these areas.
[Figure: demonstration system diagram. Labels: M’Phone; 3G Network; Embedded Home entertainment unit; Live MPEG-4 Players; Home RF Wireless Camera Network; Error resilient encoder; Error resilient decoder]
COMP9519 Multimedia Systems – Lecture 1 – Slide 13 – J Zhang
1.3 Multimedia Applications: Content Management Demonstration Platform
• Client/server platform demonstrating content-based search using MPEG-7 visual descriptors
• Content can be searched using either “query by specification” or “query by example”.
  • For “query by example”, the Analysis Engine at the server extracts visual features from images.
  • The Search Engine then searches for archived images that have features similar to those of the example region.
[Figure: Query by Example]
1. User retrieves or chooses a sample image.
2. User selects a region of interest in the sample image.
3. Analysis Engine extracts features of the region, e.g. Scalable Colour Histogram.
4. Search Engine locates other images in the database with similar features, e.g. images with similar Scalable Colour Histograms.
5. Search results are returned to the client for display, e.g. an XML document with image URLs to matching images.
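The query-by-example flow above can be illustrated with a minimal sketch. It is not the platform's code and not the MPEG-7 Scalable Colour descriptor: it assumes a plain normalised RGB histogram as the feature and an L1 distance as the similarity measure. The names `colour_histogram`, `l1_distance`, `query_by_example`, and the archive layout (image URL mapped to a precomputed histogram) are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each component in 0..255


def colour_histogram(pixels: Sequence[Pixel], bins_per_channel: int = 4) -> List[float]:
    """Analysis step (assumed feature): normalised joint RGB histogram."""
    if not pixels:
        raise ValueError("region contains no pixels")
    hist = [0.0] * (bins_per_channel ** 3)
    for r, g, b in pixels:
        # Map each 0..255 component to a bin index in 0..bins_per_channel-1.
        rb = r * bins_per_channel // 256
        gb = g * bins_per_channel // 256
        bb = b * bins_per_channel // 256
        hist[(rb * bins_per_channel + gb) * bins_per_channel + bb] += 1.0
    total = float(len(pixels))
    return [count / total for count in hist]


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute bin differences; 0.0 means identical histograms."""
    return sum(abs(x - y) for x, y in zip(a, b))


def query_by_example(
    region_pixels: Sequence[Pixel],
    archive: Dict[str, List[float]],
    top_k: int = 3,
) -> List[Tuple[str, float]]:
    """Search step: rank archived images by histogram distance to the region."""
    query = colour_histogram(region_pixels)
    ranked = sorted(
        ((url, l1_distance(query, features)) for url, features in archive.items()),
        key=lambda item: item[1],
    )
    return ranked[:top_k]


# Toy archive: three "images" given directly as pixel lists (hypothetical URLs).
red = [(250, 10, 10)] * 10
blue = [(10, 10, 250)] * 10
archive = {
    "img/red.jpg": colour_histogram(red),
    "img/blue.jpg": colour_histogram(blue),
    "img/mixed.jpg": colour_histogram(red[:5] + blue[:5]),
}
results = query_by_example([(240, 20, 20)] * 4, archive)
```

In a real deployment the server would return the ranked URLs to the client, for instance inside an XML document as the slide describes; that serialisation step is left out here.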
[Figure: system architecture diagram. Component labels: Server; Video Storage; Metadata Generation; MPEG4 Encoding; Video Digital Recorder server; Video communication server; Video Management Center; Metadata Database server (Xindice); Tomcat Servlets; client; Visualization; Query; Feature Extracting; Search Engine. Link labels: UDP/TCP/IP (Video); UDP/TCP/IP (Metadata); HTTP Req; URL; Query Response]
The Hierarchical Summary denotes the fidelity (i.e., f0, f1) of each key-frame
with respect to the video segment referred to by the key-frames at the next
lower level.
1.4 Introduction to Multimedia Research
Ref: J. Martinez
• Topics include
  • Key frame extraction and video shot segmentation
    • The basic elements for building video scene (story) segmentation
  • Object classification
    • Classify humans, cars, and trucks in indoor and outdoor areas
[Figure: Scanning pattern of TV]
1.5.2. Video Image and Sequence
• Interlaced Video Format
  • Standard frame rates (25 or 30 frames per second) are high enough to provide smooth motion, but are not high enough to prevent flickering.
  • To prevent perceptible flicker in a bright image, the picture must be refreshed at least 50 times per second. With 2-to-1 interlacing, the odd-numbered lines of a frame are displayed first (field 1), followed by the even-numbered lines (field 2).
  • A 25 frame per second sequence is therefore displayed at 50 fields per second.
  • Because the eye does not readily perceive flicker in small objects, the 25 per second repetition rate of any one scan line is not seen as flickering, while the entire picture appears to be refreshed 50 times per second.
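As a rough illustration of 2-to-1 interlacing, the sketch below splits a frame (a list of scan lines) into field 1 (odd-numbered lines) and field 2 (even-numbered lines), and computes the resulting field rate. The function names `split_into_fields`, `weave`, and `field_rate` are hypothetical, and scan lines are assumed to be numbered from 1, so line 1 sits at list index 0.

```python
from typing import List, Sequence, Tuple

Line = List[int]  # one scan line of sample values


def split_into_fields(frame: Sequence[Sequence[int]]) -> Tuple[List[Line], List[Line]]:
    """Return (field 1, field 2) for 2-to-1 interlacing."""
    field1 = [list(line) for line in frame[0::2]]  # lines 1, 3, 5, ...
    field2 = [list(line) for line in frame[1::2]]  # lines 2, 4, 6, ...
    return field1, field2


def weave(field1: Sequence[Sequence[int]], field2: Sequence[Sequence[int]]) -> List[Line]:
    """Re-interleave two fields into a full frame."""
    frame: List[Line] = []
    for i, line in enumerate(field1):
        frame.append(list(line))
        if i < len(field2):  # field 2 is one line shorter for odd line counts
            frame.append(list(field2[i]))
    return frame


def field_rate(frame_rate: float, fields_per_frame: int = 2) -> float:
    """Fields displayed per second for a given frame rate."""
    return frame_rate * fields_per_frame


# A toy 5-line frame; each line holds a single sample equal to its line number.
frame = [[1], [2], [3], [4], [5]]
f1, f2 = split_into_fields(frame)
rebuilt = weave(f1, f2)
rate = field_rate(25)  # the 25 frames/s case from the slide
```

Each field carries half the lines of the frame, which is why the full picture appears to refresh at the field rate while any single scan line is redrawn only at the frame rate.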
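[Figure: 4:1:1 format. Other labels: Transform]
[Figure: scanning raster dimensions. Labels: 525/60: 60 fields/s; 480 lines; 576 lines; 625 lines; pixel counts 122, 16, 132, 12]
[Figure: Rec. 601 U,V processing chain: Select Field, then Horizontal Filter and Subsampling, then Vertical Filter and Subsampling. Dimension labels: 360, 180, 180 (horizontal); 576, 288, 288, 144 (vertical)]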