
Appen - Person/Object Segmentation

Escalation Log

1. High-Level Project Objective


Objective: Draw pixel-level masks for each person and each held object, and classify each subject based on how it
is positioned in the image. Each class is instanced: every instance of a class must have its own mask, even if the
boundaries of two instances are touching.

IMPORTANT NOTE: 

Subjects' faces will be blurred in the image. Please proceed with annotation as long as no other rejection
criterion applies.

Consistency: 

Every person present in a frame must have a mask, plus a separate mask if they are holding an object. If
visible, please annotate accordingly. If not visible, please leave the mask empty (add it in the UI matching the
instance of the person you're annotating, but don't draw any mask).
To maintain consistency, especially in images with multiple people and objects, please classify the PERSON
annotations from LEFT to RIGHT.
For the OBJECT class, where 1 PERSON could be associated with multiple OBJECT instances, please annotate
from LEFT to RIGHT (as instructed for the PERSON class), irrespective of who is holding the object.
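The left-to-right rule can be pictured with a small Python sketch. It is illustrative only and not part of the annotation tool: the Instance record and its left_x field are hypothetical, and the sketch assumes that "left to right" is judged by the leftmost pixel column of each mask, which the guideline does not specify. The label formats (Person1, object-00) follow the ones used elsewhere in this document.

```python
from dataclasses import dataclass


@dataclass
class Instance:
    cls: str      # "person" or "object" (hypothetical field names)
    left_x: int   # leftmost pixel column of the mask (assumed sort key)


def label_left_to_right(instances):
    """Return (label, instance) pairs, numbering each class from left to right."""
    people = sorted((i for i in instances if i.cls == "person"), key=lambda i: i.left_x)
    objects = sorted((i for i in instances if i.cls == "object"), key=lambda i: i.left_x)

    labeled = []
    # Persons are numbered from 1: Person1, Person2, ...
    for n, inst in enumerate(people, start=1):
        labeled.append((f"Person{n}", inst))
    # Objects are numbered from 00, irrespective of who holds them.
    for n, inst in enumerate(objects):
        labeled.append((f"object-{n:02d}", inst))
    return labeled
```

Objects are sorted independently of persons here, which matches the instruction that object order does not depend on who is holding the object.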

Summary:

We segment 2 masking categories:

1. Person 
2. Objects being held by Person(s)

2. Annotation Instructions
RULES

Quality
Please make sure that each mask stays within the boundary of the individual
Occlusion
In general, only annotate portions of classes that are visible; we are not to estimate boundaries when
portions of classes are occluded.
Hats can be included in the mask of the person as long as they are not oversized.
Glasses should be ignored (annotated over) entirely
Annotate over necklaces and other jewelry
Reflections
Please ignore reflections

CLASS DEFINITIONS
Person
Overall mask of a subject in a frame
For frames with multiple people, please classify as Person1, Person2, etc. so that each person has their own
unique mask.
Object held by Person
Create a detailed mask over any object that is held, used, or played by the person.
The object should be relatively small or medium in size with respect to the person.
Holdable: Camera, Cup, Bottle, Toy
Pets: Dog, Cat, Rabbit, Hamster
Instrument: Guitar, Violin. 
Fashion Items: Bag
Do not draw a mask for:
Relatively large objects: Desk.
Examples:
Visual Examples
Description: Focus on the visible part. For the “person” segmentation, the segmentation only includes the visible
region.
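As a rough illustration of "visible region only", the sketch below treats masks as 2D lists of 0/1 values and removes every pixel covered by an occluder. Both inputs are hypothetical: annotators never estimate the hidden extent of a person, so this only shows what the final mask should and should not cover.

```python
def visible_mask(full_mask, occluder_mask):
    """Keep only the pixels of full_mask that occluder_mask does not cover.

    Both arguments are 2D lists of 0/1 values with the same shape
    (an assumed representation, not the tool's actual format).
    """
    return [
        [int(f and not o) for f, o in zip(full_row, occ_row)]
        for full_row, occ_row in zip(full_mask, occluder_mask)
    ]
```

Any pixel where the occluder is present ends up as 0 in the result, so the person mask never extends behind the occluding object.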

Description: If the person holds more than one object, please label them as different classes, such as object-00,
object-01. The ordering here does not matter.

Rejections
WHEN TO REJECT IMAGES:

No people in an image
People are too small (e.g., area is less than 5% of the image)
People are extremely blurry (e.g., motion blur), making it too hard to draw the mask.
Subject(s)' face isn't / aren't visible
Black-and-white or grayscale images
There are more than 6 people in the image
Artificial overlays (e.g. stickers, face filters from FB Messenger or Snapchat)
IMPORTANT NOTE: SUBJECTS' FACES WILL BE BLURRED IN THE IMAGE, AND THIS DOES NOT COUNT AS
A REJECTION FOR ARTIFICIAL OVERLAY
Animated images
Photo collages
Person appears inside another object (e.g., painting, book cover)
Unnatural borders on the image
“Blurred area significantly prevents me from drawing high quality mask along important boundary”
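A few of the criteria above are measurable and could be pre-checked automatically. The following Python sketch is a hypothetical helper, not part of the project tooling. It assumes person areas are given in pixels, that "too small" means even the largest person covers less than 5% of the image, and that an image is grayscale when every pixel has equal R, G and B values. Subjective criteria (blur, overlays, collages) still need a human decision.

```python
MIN_PERSON_AREA_FRACTION = 0.05  # "area is less than 5% of the image"
MAX_PEOPLE = 6                   # reject when there are more than 6 people


def is_grayscale(pixels):
    """True if every (r, g, b) pixel has identical channel values."""
    return all(r == g == b for r, g, b in pixels)


def rejection_reasons(image_width, image_height, person_areas, pixels=()):
    """Return the measurable rejection reasons that apply to an image.

    person_areas: list of per-person areas in pixels (hypothetical input).
    pixels: optional iterable of (r, g, b) tuples sampled from the image.
    """
    reasons = []
    image_area = image_width * image_height

    if not person_areas:
        reasons.append("no people")
    else:
        if len(person_areas) > MAX_PEOPLE:
            reasons.append("more than 6 people")
        # Assumption: reject only if even the largest person is under 5%.
        if max(person_areas) / image_area < MIN_PERSON_AREA_FRACTION:
            reasons.append("people too small")

    pixels = list(pixels)
    if pixels and is_grayscale(pixels):
        reasons.append("grayscale image")

    return reasons
```

An empty result means none of the measurable criteria fired; the image may still need to be rejected for one of the other reasons in the list.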

REJECTED EXAMPLES:

“Blurred area significantly prevents me from drawing high quality mask along important boundary”

Extreme Blur (the person on the left side)

Collages

Person in a photo

Artificial Overlay

Animated Object
5. Conflicted Examples
PEOPLE OCCLUDED BY COMPLEX OBJECTS

If excluding an object will make the mask very complex, you can include the object as part of the body (wire netting is
included in the example).

BLURRY PEOPLE

Please annotate the images where people are slightly blurry (the three examples below should be annotated). Only
reject the images if people are extremely blurry, so much that you cannot annotate them.

BOUNDARIES OF BLURRY PARTS

For the boundaries, include just the solid parts and exclude blurry parts (the parts that are slightly transparent where
the background can be seen). See the hands in the example.
