Professional Documents
Culture Documents
ebffiledoc_490Download textbook Advances In Multimedia Information Processing Pcm 2016 17Th Pacific Rim Conference On Multimedia Xi An China September 15 16 2016 Proceedings Part Ii 1St Edition Enqing Chen ebook all chapter pdf
ebffiledoc_490Download textbook Advances In Multimedia Information Processing Pcm 2016 17Th Pacific Rim Conference On Multimedia Xi An China September 15 16 2016 Proceedings Part Ii 1St Edition Enqing Chen ebook all chapter pdf
https://textbookfull.com/product/neural-information-
processing-23rd-international-conference-iconip-2016-kyoto-japan-
october-16-21-2016-proceedings-part-iv-1st-edition-akira-hirose/
https://textbookfull.com/product/multimedia-modeling-22nd-
international-conference-mmm-2016-miami-fl-usa-
january-4-6-2016-proceedings-part-i-1st-edition-qi-tian/
https://textbookfull.com/product/web-age-information-
management-17th-international-conference-waim-2016-nanchang-
china-june-3-5-2016-proceedings-part-i-1st-edition-bin-cui/
Advances in Multimedia
LNCS 9917
Information Processing –
PCM 2016
17th Pacific-Rim Conference on Multimedia
Xi‘an, China, September 15–16, 2016
Proceedings, Part II
123
Lecture Notes in Computer Science 9917
Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen
Editorial Board
David Hutchison
Lancaster University, Lancaster, UK
Takeo Kanade
Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler
University of Surrey, Guildford, UK
Jon M. Kleinberg
Cornell University, Ithaca, NY, USA
Friedemann Mattern
ETH Zurich, Zurich, Switzerland
John C. Mitchell
Stanford University, Stanford, CA, USA
Moni Naor
Weizmann Institute of Science, Rehovot, Israel
C. Pandu Rangan
Indian Institute of Technology, Madras, India
Bernhard Steffen
TU Dortmund University, Dortmund, Germany
Demetri Terzopoulos
University of California, Los Angeles, CA, USA
Doug Tygar
University of California, Berkeley, CA, USA
Gerhard Weikum
Max Planck Institute for Informatics, Saarbrücken, Germany
More information about this series at http://www.springer.com/series/7409
Enqing Chen Yihong Gong
•
Advances in Multimedia
Information Processing –
PCM 2016
17th Pacific-Rim Conference on Multimedia
Xi’an, China, September 15–16, 2016
Proceedings, Part II
123
Editors
Enqing Chen Yun Tie
Zhengzhou University Zhengzhou University
Zhengzhou Zhengzhou
China China
Yihong Gong
Jiaotong University
Xi’an
China
LNCS Sublibrary: SL3 – Information Systems and Applications, incl. Internet/Web, and HCI
The 17th Pacific-Rim Conference on Multimedia (PCM 2016) was held in Xi’an,
China, during September 15–16, 2016, and hosted by the Xi’an Jiaotong University
(XJTU). PCM is a leading international conference for researchers and industry
practitioners to share their new ideas, original research results, and practical devel-
opment experiences from all multimedia-related areas.
It was a great honor for XJTU to host PCM 2016, one of the most longstanding
multimedia conferences, in Xi’an, China. Xi’an Jiaotong University, located in the
capital of Shaanxi province, is one of the key universities run by the Ministry of
Education, China. Recently its multimedia-related research has been attracting
increasing attention from the local and international multimedia community. For over
2000 years, Xi’an has been the center for political and economic developments and the
capital city of many Chinese dynasties, with the richest cultural and historical heritage,
including the world-famous Terracotta Warriors, Big Wild Goose Pagoda, etc. We
hope that our venue made PCM 2016 a memorable experience for all participants.
PCM 2016 featured a comprehensive program. The 202 submissions from authors
of more than ten countries included a large number of high-quality papers in multi-
media content analysis, multimedia signal processing and communications, and mul-
timedia applications and services. We thank our 28 Technical Program Committee
members who spent many hours reviewing papers and providing valuable feedback to
the authors. From the total of 202 submissions to the main conference and based on at
least three reviews per submission, the program chairs decided to accept 111 regular
papers (54 %) among which 67 were posters (33 %). This volume of the conference
proceedings contains the abstracts of two invited talks and all the regular, poster, and
special session papers.
The technical program is an important aspect but only achieves its full impact if
complemented by challenging keynotes. We are extremely pleased and grateful to have
had two exceptional keynote speakers, Wen Gao and Alex Hauptmann, accept our
invitation and present interesting ideas and insights at PCM 2016.
We are also heavily indebted to many individuals for their significant contributions.
We thank the PCM Steering Committee for their invaluable input and guidance on
crucial decisions. We wish to acknowledge and express our deepest appreciation to the
honorary chairs, Nanning Zheng, Shin’chi Satoh, general chairs, Yihong Gong, Tho-
mas Plagemann, Ke Lu, Jianping Fan, program chairs, Meng Wang, Qi Tian, Abdul-
motaleb EI Saddik, Yun Tie, organizing chairs, Jinye Peng, Xinbo Gao, Ziyu Guan,
Yizhou Wang, publicity chairs, Xueming Qian, Xiaojiang Chen, Cheng Jin, Xiangyang
Xue, publication chairs, Jun Wu, Enqing Chen, local Arrangements Chairs, Kuizi Mei,
Xuguang Lan, special session chairs, Jianbing Shen, Jialie Shen, Jianru Xue, demo
chairs, Yugang Jiang, Jitao Sang, finance and registration chair, Shuchan Gao. Without
their efforts and enthusiasm, PCM 2016 would not have become a reality. Moreover,
we want to thank our sponsors: Springer, Peking University, Zhengzhou University,
VI Preface
Honorary Chairs
Nanning Zheng Xi’an Jiaotong University, China
Shin’chi Satoh National Institute of Informatics, Japan
General Chairs
Yihong Gong Xi’an Jiaotong University, China
Thomas Plagemann University of Oslo, Norway
Ke Lu University of Chinese Academy of Sciences, China
Jianping Fan University of North Carolina at Charlotte, USA
Program Chairs
Meng Wang Hefei University of Technology, China
Qi Tian University of Texas at San Antonio, USA
Abdulmotaleb EI Saddik University of Ottawa, Canada
Yun Tie Zhengzhou University, China
Organizing Chairs
Jinye Peng Northwest University, China
Xinbo Gao Xidian University, China
Ziyu Guan Northwest University, China
Yizhou Wang Peking University, China
Publicity Chairs
Xueming Qian Xi’an Jiaotong University, China
Xiaojiang Chen Northwest University, China
Cheng Jin Fudan University, China
Xiangyang Xue Fudan University, China
Publication Chairs
Jun Wu Northwestern Polytechnical University, China
Enqing Chen Zhengzhou University, China
VIII Organization
Demo Chairs
Yugang Jiang Fudan University, China
Jitao Sang Institute of Automation, Chinese Academy of Sciences,
China
Dynamic Contour Matching for Lossy Screen Content Picture Intra Coding . . . 326
Hu Yuan, Tao Pin, and Yuanchun Shi
Product Image Search with Deep Attribute Mining and Re-ranking . . . . . . . . 561
Xin Zhou, Yuqi Zhang, Xiuxiu Bai, Jihua Zhu, Li Zhu, and Xueming Qian
A New Rate Control Algorithm Based on Region of Interest for HEVC . . . . 571
Liquan Shen, Qianqian Hu, Zhi Liu, and Ping An
GIP: Generic Image Prior for No Reference Image Quality Assessment . . . . . 600
Qingbo Wu, Hongliang Li, and King N. Ngan
Social Media Profiler: Inferring Your Social Media Personality from Visual
Attributes in Portrait . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 640
Jie Nie, Lei Huang, Peng Cui, Zhen Li, Yan Yan, Zhiqiang Wei,
and Wenwu Zhu
Scene Parsing with Deep Features and Spatial Structure Learning . . . . . . . . . 715
Hui Yu, Yuecheng Song, Wenyu Ju, and Zhenbao Liu
Multi-scale Point Set Saliency Detection Based on Site Entropy Rate . . . . . . 366
Yu Guo, Fei Wang, Pengyu Liu, Jingmin Xin, and Nanning Zheng
Deep Metric Learning with Improved Triplet Loss for Face Clustering
in Videos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 497
Shun Zhang, Yihong Gong, and Jinjun Wang
Sparse Matrix Based Hashing for Approximate Nearest Neighbor Search . . . . 559
Min Wang, Wengang Zhou, Qi Tian, and Houqiang Li
With the recent proliferation of fashion web-stores, an important goal for online
advertising systems is to propose items that truly correspond to the expectations
of the users in terms of design, manufacturing and suitability. We put forward
here a method to extract, without user supervision, clothes and other fashion
items from web images. Indeed, localizing, extracting and tracking fashion items
during web browsing is an important step in addressing the needs of professionals
of online advertising and fashion media: present the users with relevant items
from a clothing database, based on the content of the web application they are
consulting and its context of use. Users usually look for characteristics expressed
by very subjective concepts, to describe a style, a brand or a specific design. For
this reason, recent research focused in the development of detection, recognition
and search of fashion items based on visual characteristics [11].
c Springer International Publishing AG 2016
E. Chen et al. (Eds.): PCM 2016, Part II, LNCS 9917, pp. 1–12, 2016.
DOI: 10.1007/978-3-319-48896-7 1
2 L. Yang et al.
Fig. 1. Our goal is to produce a precise segmentation (extraction) of the fashion items
as in (b).
2 Our Proposal
Detecting clothes in images is a difficult problem because the objects are
deformable, have large intra-class diversity and may appear against complex
backgrounds. To extract objects under these difficult conditions and without
user intervention, methods solely relying on optimizing a local criterion (or pixel
classification based on local features) are unlikely to perform well. Some knowl-
edge about the global shape of the class of objects to be extracted is necessary to
help a local analysis converge to a correct object boundary. In this paper we use
this intuition to develop a framework that takes into account the local/global
duality to select the most likely object segmentation.
We investigate here fashion items that are worn by a person. This covers
practically most of the situations encountered by users of fashion and/or news
web sites, while making possible the use of a person detector to restrict the
search regions in the image and to serve as reference for alignment operations.
First, we prepare a set of images containing the object of interest and we
manually segment them. These initial object masks (called templates in the fol-
lowing) provide the prior knowledge used by the algorithm. Of course, a given
manual segmentation will not match exactly the object in an unknown image.
We use each segmentation (after a suitable alignment) as a template to initiate
an active contour (AC) procedure that will converge closer to the true bound-
aries of the real object in the current image. We then extract the object with
a suitable GrabCut procedure to provide the final segmentation. Thus, at the
end we have as many candidate segmentations as hand-made templates. In the
final step we choose the best of them according to a criterion that optimizes
the coherence of the proposed segmentation with the edges extracted from the
image. In the following subsections we detail each of these stages (see also Fig. 2
for an illustration).
Fig. 2. Different stages of our approach: (a) original image, (b) a template segmenta-
tion, (c) output of the person detector, (d) result after the alignment step, (e) result
after the active contour step, (f) the GrabCut band, (g) result after the GrabCut step.
4 L. Yang et al.
Fig. 3. Medoids of the 8 clusters of template segmentations for three classes: jeans
(top), long dress (middle) and coat (bottom).
contour process. But they first need to be aligned into the unknown image at the
right site and with the correct angle and scale. We propose an SVM alignment
technique based on the observation that the person detector places the boxes
centered on the body joints. Thus, the line joining adjacent boxes represents the
body limbs. Since the clothing’s spatial distribution highly depends on the pose
of human body, and thus on limb placement, we use the vector of distances from
a pixel to the limbs as a feature vector to learn a pixel-level SVM classifier that
predicts if a pixel belongs to the object. Learning is performed on the template
image and prediction on the unknown image. Pixels predicted as positives form
the mask whose envelope serves as initialization for the active contour step.
The SVM uses a Gaussian kernel with a scale parameter σ = 1 found through
experiments.
where C is the current contour, c1 and c2 are the average pixel gray-level values
u(x, y) inside and respectively outside the contour C. The curvature term is
controlled by μ and the fitting terms by λ1 and λ2 . The averages c1 and c2
are usually computed on the entire image. Because of the large variability of the
background in real images, these values can be meaningless locally. Consequently,
in our case we replace them by averages computed in a local window of size 40×40
pixels around each contour pixel.
To reinforce the influence of the global shape of the template on the position
of the AC, we include a new term in the energy function (Eq. 1) that moderates
the tendency to converge too far away from the template:
Ft (C) = η Dm (x, y) (2)
on(C)
where Dm (x, y) is the distance between pixel (x, y) and the template. By includ-
ing this term, the contour will converge to those image regions that separate
best the inside from the outside and, at the same time, are not too far away
from the template contour.
A Global-Local Approach to Extracting Deformable Fashion Items 7
2.5 Segmentation
The contours obtained in the previous step suffer from two implicit problems:
(1) only the grey-level information is used by the AC process, and (2) possible
alignment errors may affect the result. To compensate for these problems, an
“exclusion band” of constant thickness is defined around the contour produced
by the previous step, then the inside region is labeled as “certain foreground”
and the outside area as “certain background”. A GrabCut algorithm [14] is then
initialized by these labels to obtain the final result. GrabCut takes into account
the global information of color in the image and will correct the alignment errors
within the limits of the defined band.
where De (x, y) is the distance from the current pixel to the closest edge detected
by [6] and C is the boundary of the segmentation proposal. This score measures
the average distance from the segmentation boundary to the closest edges in
the image. A small value indicates a good fit to the image. See Table 1 for an
illustration of this step.
Table 1. Segmentation selection from the results based on the 8 templates of the class,
using the corresponding fit values. The test image is given top left, with the extracted
edges shown bottom left. The best score is the smallest (outlined in boldface).
8 L. Yang et al.
Fig. 4. Qualitative evaluation: are original images and associated segmentation results.
3 Experimental Results
To assess the performance of the proposed method, we perform two sets of
experiments. In the first set, our method is compared to a recent improvement
of GrabCut [14] that is the standard approach in generic object extraction, on a
novel fashion item benchmark we built. The second set of experiments compares
our proposal to the recent PaperDoll [19] fashion item annotation method on
the Fashionista database [20].
Table 2. Comparison with the One Cut algorithm. The comparison measure is the
Jaccard index.
Class Boots Coat Mid dress Jeans Shirt T-shirt Short dress Long dress Vest Pull
Our method 0,54 0,74 0,84 0,78 0,77 0,67 0,80 0,74 0,65 0,74
One Cut 0,26 0,31 0,54 0,71 0,77 0,45 0,47 0,57 0,35 0,36
Language: English
ESCAPE,
——AND——
JOHN L. RANSOM,
LATE FIRST SERGEANT NINTH MICH. CAV.,
AUTHOR AND PUBLISHER.
AUBURN, N. Y.
1881.
“Entered according to act of Congress, in the year 1881, by
John L. Ransom, in the office of the Librarian of
Congress, at Washington.”
D E D I C AT I O N .
TO THE
—IN—
ANDERSONVILLE,
BY THE AUTHOR.
John L. Ransom.
(From a photograph taken two months
before capture.)
INTRODUCTION.
THE CAPTURE 9
NEW YEAR’S DAY 23
PEMERTON BUILDING 34
ANDERSONVILLE 41
FROM BAD TO WORSE 65
THE RAIDERS PUT DOWN 75
AN ACCOUNT OF THE
81
HANGING
MOVED JUST IN TIME 91
HOSPITAL LIFE 97
REMOVED TO MILLEN 109
ESCAPE BUT NOT ESCAPE 120
RE-CAPTURED 127
A SUCCESSFUL ESCAPE 136
SAFE AND SOUND 154
THE FINIS 160
MICHAEL HOARE’S ESCAPE 167
REBEL TESTIMONY 172
SUMMARY 187
THE WAR’S DEAD 188
EX-PRISONERS AND
189
PENSIONERS
LIST OF THE DEAD 193
A LIST OF OFFICERS
IMPRISONED AT CAMP 289
ASYLUM
THE CAPTURE.