Ocean WORDLINE BOX Full-Page Correction Task

Overview
In this task, you will check full-page documents whose wordline bounding boxes were
drawn by annotators. We’d like your help identifying incorrect wordlines as well as
wordlines that were missed during the annotation process. For each error you find,
draw a corrected bounding box and assign it the class that corresponds to the error.
Please see the step-by-step instructions and examples of common annotation errors
below.

Steps
• Step 1: Access the task and you will see a fully annotated document with green
wordlines, like this:

• Step 2: Zoom in on the document to identify incorrect or missing wordlines; make sure
EVERY PART of the document is zoomed in on, checked, and corrected if applicable.

Example (Use class ‘wordline_error’ when fixing if you see this example):

• Step 3: Draw a bounding box around all missing or incorrect wordlines using the
following classes:
o wordline error: If a wordline box is incorrect, such as too tight (covering
text) or too loose (extra space between the box and the text), draw the correct
wordline box with this class. THIS WILL REPLACE ALL WORDLINES
TOUCHING THE BOX!!
▪ Example of Corrected Wordline Box:

o wordline missing: If a wordline box is missing, draw the missing wordline
with this class.
o wordline delete: Draw a tight box around the wordline that you wish to
delete. ALL WORDLINES TOUCHING THE BOX WILL BE DELETED!!!
Use this when there are stray word boxes that shouldn’t be on the
document.
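
The effect of the three classes on the existing wordlines can be sketched in a few lines of Python. This is only an illustration of the rules stated above, not the tool's actual implementation: the `Box` type, the `touches` and `apply_correction` helpers, the underscored names `wordline_missing` and `wordline_delete`, and the reading of "touching" as "overlapping or sharing an edge" are all assumptions made for the example.

```python
from __future__ import annotations
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned box in pixel coordinates (hypothetical representation)."""
    left: float
    top: float
    right: float
    bottom: float


def touches(a: Box, b: Box) -> bool:
    # Assumption: "touching" means the rectangles overlap or share an edge.
    return (a.left <= b.right and b.left <= a.right
            and a.top <= b.bottom and b.top <= a.bottom)


def apply_correction(wordlines: list[Box], correction: Box, cls: str) -> list[Box]:
    """Return the wordlines that remain after one correction box is applied."""
    if cls == "wordline_missing":
        # A missing wordline is simply added; nothing is removed.
        return wordlines + [correction]
    # Both other classes first drop every wordline touching the drawn box.
    kept = [w for w in wordlines if not touches(w, correction)]
    if cls == "wordline_error":
        # The drawn box replaces everything it touched.
        return kept + [correction]
    if cls == "wordline_delete":
        # Nothing is added back.
        return kept
    raise ValueError(f"unknown class: {cls}")
```

Under these assumptions, a `wordline_error` or `wordline_delete` box that strays over a neighboring line removes that line too, which is why the box should be drawn only as large as the wordline being fixed.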

Examples:

For wordline errors: *** NOTE: YOU DON’T NEED A VALID TRANSCRIPTION FOR WORDLINES.
You can enter any text, such as “error”, “missing”, or “delete”, in the transcription
box for wordlines. ***

Notice in the .gif below the transcription provided is "t". You must provide a transcription,
but it does not need to be correct.

Demo: https://nembar.s3.us-east-2.amazonaws.com/OCR/Untitled_+Aug+3%2C+2020+4_06+PM.gif

• Step 4: Move to other parts of the document and repeat Step 2 and Step 3 until all
the wordline box errors and missing wordline boxes have been boxed.

Common Error 1: Overlapping Boxes


The following example shows overlapping wordline boxes. In this case, please re-box both
lines with the "wordline_error" class. Do your best to draw the corrected boxes so that
they no longer overlap each other.

For example:
Incorrect Annotation:
How to Correct: Use the 'wordline_error' class to rebox the incorrect wordline.
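
For reviewers who want a mechanical way to think about this error, the check can be sketched as follows. This is a minimal illustration, assuming each wordline box is a `(left, top, right, bottom)` tuple in pixels; the helper names `overlap_area` and `overlapping_pairs` are hypothetical and are not part of the annotation tool.

```python
from itertools import combinations


def overlap_area(a, b):
    """Area shared by two (left, top, right, bottom) boxes; 0.0 if they do not overlap."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return width * height if width > 0 and height > 0 else 0.0


def overlapping_pairs(boxes):
    """Return index pairs (i, j) of boxes that share a non-zero area."""
    return [
        (i, j)
        for (i, a), (j, b) in combinations(enumerate(boxes), 2)
        if overlap_area(a, b) > 0
    ]


# Two adjacent text lines whose boxes overlap by 2 pixels, plus one clean line.
boxes = [(0, 0, 100, 12), (0, 10, 100, 22), (0, 30, 100, 42)]
print(overlapping_pairs(boxes))
```

Each pair flagged this way corresponds to two lines that would both be re-boxed with the 'wordline_error' class.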

Common Error 2: Hyphens As Bullet Points

If the hyphens are used as bullet points, please box them with the rest of
the sentence(s).

Correct Annotation:

Flags
Please choose any of the following flags, if applicable:

• Examples for image has no errors

• Examples for dual pages
