You are on page 1of 19

MAULANA AZAD NATIONAL INSTITUTE

OF TECHNOLOGY, BHOPAL

Department of Computer Science and Engineering

END TERM SEMINAR

3D VIRTUAL TRIAL ROOM


DEBANGANA RAM GUIDED BY :
CSE – COMPUTER NETWORKING DR. BHOLANATH ROY
202112102 DR. VAIBHAV SONI
CONTENTS
INTRODUCTION

LITERATURE SURVEY

PROBLEM STATEMENT

PROPOSED SOLUTION

CONCLUSION

REFERENCES
INTRODUCTION
What is VIRTUAL TRIAL ROOM ?

 Artificial environment created with advanced software.


 Online equipment of an in-store changing room.
 Enables shoppers to try on clothes virtually.

 Develops advance 3D technology to make precise and photo realistic virtual dressing room
where the customer can try clothes before buying.
 Attracts more customers.
 Increase profitability.
OBJECTIVE
 Reality gap between the size on the label and the actual fit.

 Variation of sizes from one country to another.

 Online 2D shopping only provides general measurements.

 Excess returns which drops the profit if the shops.

 Missing or Unclear Product Information.


CONTINUE..
 Currently, Virtualization is used by the customers in over 100 countries.

 Comparison of garments : visual illustration of the two garments to show the


comparison of fit .

 Allows the customer to test different sizes.

 Leads to fewer online returns : up to 50 % .

 About 80% of customers prefer virtualise sizes over the standard size charts.
How It works ?
 Kinect sensor mainly provides three streams: image stream, depth stream
and audio stream, with detected range from 1.2 to 3.5 meters.
 Existing virtual fitting approaches can be divides into two main groups as :-
 2D image based
 3D model based
 Body Scanner
 Size recommendation services.
 Helps to satisfy customers needs and wants.
LITERATURE SURVEY
S.NO AUTHORS PAPER TOPIC METHODOLOGY ADVANTAGES LIMITATIONS

1. AUTHOR : 3D VIRTUAL TRIAL - Used Open CV and web - Using AR technology - The video files are very
Anagaha Ramesh , ROOM Camera to capture video. reduces the time of the large in side which may
Ankit Raj Kushal , customer. affect the size of the space
Brinda D ,   - OpenCV for identifying the - Can calculate specific of the computer.
Vaishnavi S , user and to change the areas to augment the
colour and logo according to - Difficult to analyse based
Purohit logos. on colour differentiation.
Srinivasacharya user's choice.
- After the colour boundary - Detecting wrong colour
boundaries results in
analysis, the image is unexpected outputs.
converted into grayscale
and thresholding is applied
on the image.
- The hierarchy is extracted
then the work of detecting
the contour.
- After this calculation of
specific areas is augment to
logos on the real time user
video feed or can change
colours of cloths according
to the user's desire.
SNO. AUTHORS PAPER TOPIC METHODOLOGY ADVANTAGES LIMITATIONS

2. AUTHOR : IMPLEMENTATION - Used MATLAB Software.   - Used only men’s wear.


Srinivasan, K., OF VIRTUAL - Works well in light - In background daylight
and S. Vivek FITTING ROOM - Extracted foreground from intensities. conditions it doesn’t give
USING IMAGE background by using k accurate results.
PROCESSING means clustering. - Pre-processing the image - It gives inaccurate results
- Extracted human silhouette removes the small in a noisier environment.
from the image by applying objects which may later
become errors. - Customers had to keep a
Laplacian filter.
certain distance from the
- The feature point like hips system for the accurate
and shoulder points had results with less noisier
extracted by finding the surroundings.
maximum single line
- If the customer doesn’t
silhouette.
stand on the correct spot,
- The face is detected by it produces an inaccurate
Viola -Jonas Algorithm. calculation of their height
- Taking the features points and width of the
as reference the virtual silhouette.
shirt is selected for the
customer.
- The last process is to
superimpose the shirt into
the customer’s body and
eliminate the errors during
the imposing process.
 
S.N AUTHORS PAPER TOPIC METHODOLOGY ADVANTAGES LIMITATIONS
O
3. AUTHOR : Minar, CLOTH- VTON:    
- Photo realistic
Matiur Rahman, CLOTHING 3D - Used Open Pose for results with great - The end product has
and Heejune Ahn. RECONSTRUCTION prediction of human details, high a white patch across
FOR HYBRID IMAGE posture. the collar and
BASED VIRTUAL TRY- resolution.
ON
  - Does not suffer borders of the
- Used 3D model based from blurring dresses.
and image-based effects or texture - If the colour of the
approaches. alterations in most dress is white or
  of the parts of the similar to the
dress. background it is
- SGN, MGN, PGN deep
- Works effectively in unable to detect the
learning classifiers are
the wild. dress.
clubbed.
 
- A fusion algorithm for
blending 3D warped
clothes to 2D human
images.
 
SNO. AUTHORS PAPER TOPIC METHODOLOGY ADVANTAGES LIMITATIONS

4. AUTHOR : MULTI- GARMENT   - The dataset contains - Lacking in reality and


Bhatnagar, NET: LEARNING - Used Multi-Garment control.
TO DRESS 3D actual clothing from
Bharat Lal, et al Network (MGN) for
PEOPLE FROM
predicting body shape and actual scans. - Posing is very
IMAGE
clothing. - Introduced and nonlinear.
  registered real
- Used CNN for connecting garments form the - Shape estimation is
the input to the body form, taking place by
posture and garments. real scans for better marginalizing over
results. pose fitting faults.
 
- Ignores the effect of
- Trained separate garment - Garment geometries
the camera intrinsic.
network for every class of change dramatically
the garment. between occurrences.
 
- Assumed that the
- The first predicts the human’s 3D shape
overall shape and second has a fixed height.
adds details.
- Also assumed that
  the distance from the
camera is fixed.
 
S.NO AUTHORS PAPER TOPIC METHODOLOGY ADVANTAGES LIMITATIONS

5. AUTHOR : IMAGE - BASED    


Yamada, VIRTUAL FITTING - Used OpenGL and wxWidgets in - Automatically - A 2D shape model is
Hiroki, et al. SYSTEM WITH C++ code. adjusts the used which limited
GARMENT IMAGE - The input is the whole-body brightness the input postures
RESHAPING image of the model image to those with
differences between extended arms and
wearing the garment which will the garment image feet to prevent
be used in the fitting. and the customer blocks of body
- Estimated the body silhouette image based on the outlines resulting in
models from the human bodies face colour. fewer variable
in the both model and human   results.
images. - Took less time to  
- The garment is reshaped from - The body contour
produce the results. points were fixed
the body silhouette of the model and there were
image to the human body. - The system let the
customer choose the fewer points,
- The customer adjusts the resulting in
position of the garment image appropriate face inaccurate and
which will be composited and colour from the face detailed outcomes.
virtual fitting image is the dataset with  
output. different colours. - The body lines and
- A virtual fitted result is obtained   clothing picture
have less fixed
by correcting the brightness of points.
the human image.  
   
PROBLEM STATEMENT

 User should be apart from the machine to maintain particular


distance.
 It may give incorrect measurement of height of the customer.
 Due to network issue softcopies of dresses will not going to
impose on target image correctly.
 Lighting conditions affected depth map.
 No provision for 3D viewing and sensitive to light conditions.
S.NO AUTHORS PAPER TOPIC METHODOLOGY ADVANTAGES LIMITATIONS

1. Zhongguo Li , Detailed 3D • 3D human body • Because the false • The dataset have 2
Magnus human body reconstruction from reconstruction on datasets o same colour
Oskarsson , reconstruction Multiview images through the coarse 3D and clothes.
Anders from multi-view learning pixel-aligned models can be • If new model with
Heyden images combining implicit function based on deleted and the colourful clothes the
voxel super- multi-scale features (MF- information on the method is not showing
resolution and PIFu). shape can be kept, good results.
learned implicit • Voxel super-resolution the coarse 3D • The texture of the
representation (VSR) from low-resolution models can be detailed model is not
voxel grids obtained by improved to have considered in their
MF-PIFu. greater accuracy and method.
• The structure of multi- better surface
stage networks is designed quality due to the
to provide multi-scale pixel super-
features for the MF-PIFu, resolution.
and a deep convolutional • To extract multi-
neural network predicts scale features from
the vacancy values of the low-resolution pixel
features to implicitly grids, multi-stage 3D
represent 3D models. convolutional layers
are performed.
S.NO AUTHORS PAPER TOPIC METHODOLOGY ADVANTAGES LIMITATIONS

1. Xiu Li ,Zhen 3D Pose • They use Open pose to • A fully automatic • The pose error is
Fan ,Yebin Detection of estimate each person's method is proposed detected due to the
Liu 2, Yipeng Closely 2D joints in each image to detect the 3D inaccuracy of 2D joint
Li , and Interactive and discover all valid pose of interactive detection.
Qionghai Dai Humans joints with a high humans with spare • The opacity between
Using Multi-View confidence score. multi-view images the closely interacted
Cameras • Triangulation of related as input. individuals, obtaining
2D key points using an • Reduces error the color information
epipolar geometry accumulation in a single image is
generates 3D joints. efficiency without difficult, even with the
• Based on the semantic tracing process. human figure
meaning of each joint, all • To decrease reprojected to original
feasible 3D pose seeds crossovers between images.
 have been constructed. human models, an
• At the end, a pre- unique
assembling process is interpenetration
proposed to decrees pose error function is
seeds number. And the proposed.
final result is generated • To boost the
with post assembling efficiency of their
process. process, a two-
stage assembly
design is
implemented.
PROPOSED SOLUTION
DATASET
 Two dataset is being used :
 The dataset as been split into 80% frames (1600) for training and 20% frames
(400) for testing.
 1. Articulated dataset
 Captured by 8 cameras and it contains 10 indoor scenarios.
 Two male subjects have four scenarios and one female subject performs two
scenarios.
 2. CAPE dataset
 Is 3D dynamic dataset of clothed humans generated by learning the clothing
deformation from the SMPL body model.
 There are 15 generative clothed SMPL models with various poses.
 There are 2000 frames with eight-view images and 3D meshes.
CONCLUSION

 Provides the photo-realistic results which come with great details, high
resolution, and quality.

 Overall the virtual trial room seems to be a good solution for a quick, easy
and accurate try on of garments.

 From this point of view it is an optimal addition for a cloth store.

 Cloth simulation can be viewed at different angles.

 Quality is highly realistic .


REFERENCES
 Botre, Saurabh, Sushant Chaudhari, and Shamla Mantri. "Virtual Trial Room." International Journal of Computer
Science Trends and Technology (IJCST)–Volume 2 (2014).
 Srinivasan, K., and S. Vivek. "Implementation of virtual fitting room using image processing." 2017 International
Conference on Computer, Communication and Signal Processing (ICCCSP). IEEE, 2017.
 Minar, Matiur Rahman, and Heejune Ahn. "CloTH-VTON: Clothing three-dimensional reconstruction for
hybrid image-based virtual try-on." Proceedings of the Asian Conference on Computer Vision. 2020.
 Bhatnagar, Bharat Lal, Garvita Tiwari, Christian Theobalt, and Gerard Pons-Moll. "Multi-garment net: Learning
to dress 3d people from images." In Proceedings of the IEEE/CVF International Conference on Computer Vision,
pp. 5420-5430. 2019.
 Yamada, Hiroki, Masaki Hirose, Yoshihiro Kanamori, Jun Mitani, and Yukio Fukui. "Image-based virtual fitting
system with garment image reshaping." In 2014 International Conference on Cyberworlds, pp. 47-54. IEEE, 2014.
 Li, Zhongguo, Magnus Oskarsson, and Anders Heyden. "Detailed 3D human body reconstruction from multi-
view images combining voxel super-resolution and learned implicit representation." Applied Intelligence (2021):
1-21.
 Li, Xiu, Zhen Fan, Yebin Liu, Yipeng Li, and Qionghai Dai. "3d pose detection of closely interactive humans using
multi-view cameras." Sensors 19, no. 12 (2019): 2831.
THANK YOU

You might also like