RWS LATTE Image Dialogue Data Collection Guidelines CrowdFacing
Squirrel UI
Notes:
● You must use Google Chrome as a browser to open Squirrel. Also, please note
that iPhones are not compatible.
● You will be provided with an access link to the tool where you can start
working right away.
● No special computer requirements are needed, only a stable internet
connection.
● You may only open the production link on one device at a time.
● Below you can find instructions and screenshots on how to use the tool.
● The average handling time is 15 minutes per task.
● Important: Please be very mindful of the time. The task will
automatically time out after 15 minutes.
Steps:
1. Determine whether the image is AI-generated or user-generated. Use your own
judgment and make your best assumption.
● If the image is user-generated, you can pretend in the conversation
that you uploaded the image yourself. In this case, leave the
“AI Generated Image Caption (Long Description)” textbox empty.
Note: The caption you provide in the “AI Generated Image Caption (Long
Description)” textbox must be a longer and more detailed description of the
AI-generated image. Please focus on the key elements of the image. You can check
Example 3 in the AI Generated Image Caption (Long Description) section.
Approach
Collect fictional conversations, each about a single image, that incorporate
elements of the tasks above. Here are three examples:
Example/Conversation 1
Image: AI Generated Image
User Request:
Draw me a picture of a cat riding a bicycle to work.
AI Initial Response:
Here is an image of a cat riding a bicycle to work:
<IMAGE>
User Question 1:
Nice! Why do you think he’s not wearing a helmet?
AI response 1:
The absence of a helmet in the image of the cat riding a bicycle could be due to a
few reasons. Firstly, it might be an oversight in the artistic interpretation, as the focus
was on creating a whimsical, fun image rather than emphasizing safety measures.
Secondly, depicting animals in human-like scenarios often prioritizes human
characteristics like clothing, while omitting elements that are not typically associated
with the animal's natural form, like a helmet on a cat. Lastly, it could be a deliberate
choice to keep the image simple and focused on the primary theme of a cat
engaging in a human activity. In real-life scenarios, of course, wearing a helmet while
cycling is important for safety.
Example/Conversation 2
AI Generated Image Caption (Long Description): (please keep this empty when you
assume the image is User Uploaded)
User Request:
I'm considering buying this car. What do you think?
<IMAGE>
AI Initial Response:
The car in the image is a sleek and contemporary SUV, which likely boasts the
reliability and efficiency that Toyota is known for. It would be a solid choice if you're
seeking a vehicle with a good balance of space, comfort, and possibly advanced
technology, especially if it's one of Toyota's hybrid or electric models. You'd benefit
from considering its safety features, fuel economy, and the driving experience it
offers. Before making your decision, however, it's advisable to compare it with other
vehicles in its class, check for consumer reviews, and take it for a test drive to ensure
it meets your specific needs and preferences.
User Question 1:
How can you tell it's an SUV?
AI Response 1:
The vehicle in the image has several distinctive features that suggest it's an SUV
(Sport Utility Vehicle):
* **Tall Stature:** It has a high roofline, providing ample headroom and often
indicating a more spacious interior.
* **Elevated Ride Height:** The ground clearance is higher than what you'd find in
typical sedans or hatchbacks, which is useful for rough terrain or poor road
conditions.
* **Robust Wheels and Tires:** The wheel design and size are more pronounced,
which can suggest an ability to handle a variety of road conditions.
These visual cues, when taken together, can help differentiate an SUV from other
vehicle types.
Example/Conversation 3
AI Initial Response:
Here’s an illustration of your funny dream with the sloth giving you nail and stock
market advice: <IMAGE>
User Question 1:
lol whats going on with that dollar polish?
AI Response 1:
In the illustration, the "dollar polish" adds a humorous touch to the scene. It's a nail
polish bottle, but instead of a regular label, it has a dollar sign on it. This whimsical
detail cleverly ties together the themes of nail care and financial advice, reflecting
the unique nature of your dream where a sloth not only does your nails but also talks
about the stock market. It's a playful element that highlights the dream's quirky
blend of a beauty salon and financial wisdom.
User Question 2:
How many colors are present in the objects on the table in front of the sloth?
AI Response 2:
In the illustration, there are several objects on the table in front of the
sloth, and they display a variety of colors. I counted nine colors in total:
red, orange, yellow, green, teal, light blue, blue, magenta and purple.
Methodology
We collect images skewed towards aesthetics and manually create fictional
conversations about each image.
Conversations
We recommend following the methodology below when creating the conversations:
Format
● All “generated” images (i.e. images that appear on the assistant’s side of the
conversation) should have the tag element <IMAGE> at the end of the
assistant’s turn and must be provided with an “AI Generated Image Caption
(Long Description)”. This caption should be as informative and grounded as
possible. This teaches the model to “think” about what it wants to generate
before it does.
● For “uploaded” images (i.e. images that appear on the user’s side of the
conversation), there should not be any “AI Generated Image Caption (Long
Description)”. Please just add the <IMAGE> tag at the end of the user’s
turn.
● All conversations need at least 2 user interactions
(user-assistant-user-assistant). Some dialogues should have more than 2
interactions, without sacrificing the natural flow of conversation.
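The format rules above can be sketched as a small self-check. This is only an illustration, not part of Squirrel: the `Turn` class, the `check_conversation` function, and the way a conversation is represented as a list of turns are all hypothetical. It also assumes exactly one <IMAGE> tag per conversation, since each conversation is about a single image.

```python
from dataclasses import dataclass
from typing import List

IMAGE_TAG = "<IMAGE>"


@dataclass
class Turn:
    """One message in a conversation (hypothetical structure)."""
    role: str  # "user" or "assistant"
    text: str


def check_conversation(turns: List[Turn], caption: str) -> List[str]:
    """Return a list of format problems; an empty list means all checks passed."""
    problems = []

    # Assumption: one image per conversation, so exactly one turn carries the tag.
    tagged = [t for t in turns if IMAGE_TAG in t.text]
    if len(tagged) != 1:
        return ["exactly one turn should contain the <IMAGE> tag"]
    image_turn = tagged[0]

    # The tag belongs at the end of the turn that carries the image.
    if not image_turn.text.rstrip().endswith(IMAGE_TAG):
        problems.append("<IMAGE> should be at the end of the turn")

    # Generated image (assistant side) needs a caption; uploaded image must not have one.
    has_caption = bool(caption.strip())
    if image_turn.role == "assistant" and not has_caption:
        problems.append("generated image needs a long-description caption")
    if image_turn.role == "user" and has_caption:
        problems.append("uploaded image must not have a caption")

    # At least 2 user interactions (user-assistant-user-assistant).
    if sum(1 for t in turns if t.role == "user") < 2:
        problems.append("conversation needs at least 2 user interactions")

    return problems


# Usage: a shortened version of Example/Conversation 1 (AI-generated image).
example_1 = [
    Turn("user", "Draw me a picture of a cat riding a bicycle to work."),
    Turn("assistant", "Here is an image of a cat riding a bicycle to work:\n<IMAGE>"),
    Turn("user", "Nice! Why do you think he's not wearing a helmet?"),
    Turn("assistant", "The absence of a helmet could be due to a few reasons."),
]
print(check_conversation(example_1, caption="A cat riding a bicycle to work."))
```

Running the same check with an empty caption would report the missing caption, because the image sits on the assistant's side of the conversation.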
Payment
You are paid at your hourly rate for the actual time spent, which is
automatically captured by Squirrel, with a cap of 15 minutes per task. Taking a
break is completely allowed, but please make sure to submit your current task
and close the Squirrel page before leaving so that idle time is not
captured. This gives us a realistic idea of how much time is spent
per unit and lets us compare across participants.
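As a rough illustration of how the 15-minute cap affects billable time, here is a minimal sketch. The function names and the hourly rate of 12.0 are made up for the example, and the guidelines do not state how Squirrel records time or how amounts are rounded, so treat this as an estimate for your own tracking only.

```python
CAP_SECONDS = 15 * 60  # cap of 15 minutes per task, as stated in the guidelines


def billable_hours(task_seconds):
    """Sum the time per task, counting at most 15 minutes for any single task."""
    capped_total = sum(min(seconds, CAP_SECONDS) for seconds in task_seconds)
    return capped_total / 3600


def estimated_pay(task_seconds, hourly_rate):
    """Estimate pay for a list of task durations (hypothetical rounding to 2 decimals)."""
    return round(billable_hours(task_seconds) * hourly_rate, 2)


# Usage: three tasks of 10, 15 and 20 minutes; the last one counts as 15 minutes.
durations = [10 * 60, 15 * 60, 20 * 60]
print(billable_hours(durations))       # 40 capped minutes, expressed in hours
print(estimated_pay(durations, 12.0))  # hourly rate of 12.0 is an assumption
```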
You can always check your profile information on the Partners Portal website
https://locpartners.moravia.com.
You can keep an eye on your total job count via the personalized link on Squirrel.
As a rule of thumb, as long as you’re completing 3-4 tasks per hour, you are on the
right track. We also highly suggest that you track your own work time.
You can also keep track of your performance by hovering over the Statistics Icon.
Payment Schedule:
● The production team will create POs at the beginning of the month for the
work completed in the previous month. Once POs are created, the Finance
team will process them and issue payments depending on your supplier
type. Please carefully review the following documents to avoid any
payment delay.
● Depending on your supplier type, payments will be released according to the
applicable payment terms.
Documents - Freelancers:
● EVERYONE MUST READ: RWS Moravia Invoicing and Payments Guide.pdf
● Freelancers payment terms: Payment Terms Info_Freelancer.pdf
Once you have received notification that a payment has been created, or if you
have any further invoicing questions, please contact our Accounts Payable team at
ap@rws.com.
Contact
If you have any questions regarding these instructions or the project in general,
please get in touch with the project team by sending an email to
latte_prod_humanevals@rws.com.