Professional Documents
Culture Documents
CONFIDENTIAL INFORMATION
This document contains confidential and proprietary information that is intended solely for the use of the individual or entity to whom
🔭 Project Overview
🧭 Instruction Following
🌏 Multilingual
📚 Citations
Helpful
🧭Truthful
🐰Harmless
📏Written Responses
📏Table Formatting
🚨 Error Categories
🥇 Giving Feedback
🥇 Good Responses
🍂 November Update Examples
🚀 Improving Responses
👾 Task Interface
🌐 General Recommendations
📝 Markdown Notes
🔭 Project Overview
Bee Pollination is a series of response-writing/rewriting projects to help train and improve the factuality
and helpfulness of an “AI Assistant” or “Chat Assistant.” You may be placed on one or more of these
projects based on your specialization.
In this guide, we will cover what you need to know, from overviews and examples and from walkthroughs
to best practices.
For Writers, start with the Instructions Updates and then the Overview for Writers section. Here you’ll
learn how to understand prompts, categorize them, and write creative and intentional responses. In this
part of the guide, we investigate great responses and what makes them so good. After the overview, go
through the Additional Examples, Task Interface, and the Linters Guide.
Tip: The Quality Rubric, or the way raters rate your work, is an incredible resource, especially for writers.
For Raters, start with the Instructions Update and then the Overview for Raters section. In the overview,
you’ll learn best practices for efficiently and accurately rating takes and giving meaningful feedback for
writers to learn and improve their writing. Keep the Dimensional Quality Rubric handy, as it serves as the
basis for all rating tasks. Afterwards, look through the Examples Section and the Task Interface.
For Coders Only: Use these coder-specific instructions instead.
Concision / Verbosity: Responses must be written with essential and relevant details,
removing unnecessary details, fluff, or pleasantries.
1. For example, responses should avoid phrases like “Here’s your request,” or
“Sure! Here you go,” and instead proceed to answer the question directly, while
providing useful context when appropriate.
1. Ask yourself: What implicit assumptions does the chatbot need to use to
respond appropriately? Think of it as a secondary prompt with additional
details for answering the prompt.
🔎 Instruction-Following: All responses must follow the prompt precisely. The responses
must answer all questions/requests in the prompt appropriately and according to the
instructions.
1. ❗New question for writers❗ Does the prompt include any of the following
length requirements (select all that apply)?
1. Page Count, Paragraph Count, Sentence Count, List Item Count, Word
Count (multiple options), Character Count (multiple options)
1. When the prompt is in a foreign language, use an online tool to translate and
respond to the question in English.
2. If you are able to speak the language, you can respond in that language if you
feel confident delivering a quality response.
1. Specify the language the prompt was written and the one required for the
response. Yes or No.
📄 Citations:
Language Mechanics:
As a writer, your goal is to write a complete response to a given prompt within a specific domain,
leveraging your expertise in the subject. For the majority of tasks, you will be provided with a pre-
generated response that you can use as the base but must significantly rewrite and improve to make it
even better. When no response is given, you must write your own!
Important: All response improvements must be original and human-generated. If we catch someone
using ChatGPT or other AI tools and copy-pasting into the responses, we will remove them from the
project.
You will be provided with:
1. A prompt [a request, statement, or question from a user to the chat assistant]
2. Most of the time, a pre-generated response [the answer] to the given prompt.
Study the instructions and follow the eight (8) steps for success!
1. In other words, What implicit assumptions does the chatbot need to use
to respond appropriately?
4. If the prompt appears to be completely outside the domain area of the project,
please report it to the TSMs with the task ID.
Note: If you don’t feel knowledgeable about the topic in the prompt, and don’t feel capable of improving
the response, then hit the “Skip” button in the bottom right corner.
2. Write and improve the response to completely satisfy the prompt (should take at least
15 minutes).
1. Ask yourself: What implicit assumptions does the chatbot need to use to respond
appropriately? Think of it as a secondary prompt with additional details for
answering the prompt.
5. Categorize the prompt - Determine if any of the following apply to the prompt. One or
more can apply. If prompts do not fall in any of these categories, then you select None of
the above.
1. Your response should explain that you are not able to complete the
prompt because it goes against the Chat Assistant’s content policies.
2. Foreign language - prompt is in a different language than what the user speaks.
1. Please use an online translation tool to understand the prompt and write
a response in English.
2. You can respond in that language if you are confident you will create
high quality work.
3. Difficult to follow - prompt asks for something that is not doable or impossible.
1. If the prompt is difficult to follow, your response may simply ask the user
for clarification, to provide missing information or explain that you are not
able to complete the prompt.
4. Identity dependent - prompt requires personal information about the user to
provide a helpful response.
1. Your response should seek to clarify or explain that you do not have a
personal opinion on the matter.
6. Rate your confidence - select a confidence rating ([Not Very Confident], [Somewhat
Confident], [Very Confident]) based on how confident you are in:
1. If you are [Not Very Confident] in your rewrite, then you should either
continue to improve it or hit “Skip” in the bottom right corner of the
screen.
1. Does the prompt include any of the following length requirements (select
all that apply)?
Prompts can take various forms. You may need to conduct online research to understand what the user
is requesting. You will then check which, if any, of the prompt categories apply.
Prompts might ask for… Example
A long response with very specific instructions Write a short essay about the pros and cons of
a society based on utilitarianism.
A summarization of a long body of text Please summarize this essay into two
paragraphs [followed by long essay text]
A “role play” with the chat assistant Act as an interviewer of a software engineer
position and ask me some resume questions
Tip: For more help with difficult to answer prompts, see Examples of How to Respond to Difficult
Prompts.
In some cases, the prompt will seem almost incomprehensible. In these cases, you should:
When you encounter a prompt that’s difficult or controversial, your written response should attempt to
provide a best guess when appropriate, ask for clarification, or provide a thoughtful deflection to steer the
conversation in a productive and non-contentious direction.
Tell me everything.
1. If a prompt is specific and the response will still be too long (ie,
~10,000 words) then provide either:
Examples of prompts that require further clarification but can be attempted by the
model to some degree:
Write me a 10,000 word essay about the buying and selling meth in San
Francisco.
The primary categories you will be categorizing the prompt by are below. You can select multiple
categories for each task!
Category Explanation Response Handling Example Prompts
and Responses
and response:
P: I don’t really like
slandering of any Chat Assistant’s
my neighbor - how
individuals or groups of content policies.
can I get back at
people. (Optional) them?
The prompt contains, or Recommend R: Sorry, I can’t
is related to, violence, using prompts answer questions
abuse, illegal activity, or that adhere to the that go against our
sexual content. Chat Assistant’s content policies.
The prompt contains content policies.
disrespectful language.
The prompt contains or
seeks moral judgment.
Foreign If the prompt/part of the prompt is in For prompts in or containing a foreign P: Dime tres tipos
Language a different language than what the user language, you should do one (1) of de frutas tropicales.
speaks, please use an online the following: R: Three types of
translation tool to understand the prompt tropical fruits are
and write a response in English. pineapple, Mango,
Use an online and papaya.
translation tool to
understand the
prompt and write
a response in
English.
If you are fluent
in the foreign
language, write
the response in
that language.
Difficult to A prompt is difficult to follow if any of the For difficult to follow prompts, your Example Prompts:
Follow following applies: response should do one (1) of the I have a brilliant
following: idea.
What are all the
The prompt appears to foods people eat?
be incomplete or missing Ask the user for Empathy, evidently,
information that would be clarification or to existed only within
necessary for the Chat provide missing the human
Assistant to give a information community, whereas
complete response. Explain that you intelligence to some
are not able to degree could be
complete the found throughout
For example,
prompt when the every phylum and
the prompt [hi order including the
task is impossible
can you help arachnida
or extremely
me build an Example prompt
difficult to
app that] is and response:
accomplish (for
incomplete P: hi can you help
example, “Can
because it’s me build an app that
you take a selfie
missing a R: Sure, I am happy
for me?”)
description of to help you build an
what the app app. Please let me
should do. know what kind of
app and features
you are looking for.
The intent of the prompt
is not clear.
For example,
the prompt
Category Explanation Response Handling Example Prompts
and Responses
[Empathy,
evidently,
existed only
within the
human
community,
whereas
intelligence to
some degree
could be found
throughout
every phylum
and order
including the
arachnida] is
difficult to
follow because
it’s unclear
what the intent
of this prompt
is.
Identity A prompt is identity dependent if it requires If the prompt is identity dependent, Example Prompts:
Dependent personal information about the user to the response should: What time will my
provide a helpful response to the prompt. flight to New York be
boarding?
Politely explain How old is my
that the Chat cousin?
Assistant has no Example prompt
access to and response:
personal P: What is my BMI?
information. R: As a Chat
If applicable, ask Assistant, I don’t
for more have access to your
information to personal
help answer the information,
prompt. including your height
and weight.
Accordingly, I am
unable to calculate
your BMI for you.
However, if you’d be
able to share with
me your height and
weight, I’d be happy
to help you calculate
your BMI!
Opinion A prompt is opinion seeking when it seeks For an opinion seeking prompt, the Example Prompts:
Seeking opinions or advice that is subjective in response should: What is tastier,
nature. Pepsi or Coke?
What clothes should
Seek to clarify I buy?
what is requested Why is wrestling so
much better than
Category Explanation Response Handling Example Prompts
and Responses
None of the The prompt does not fall into the categories
above above.
Responses must be written with essential and relevant details, removing unnecessary details, fluff, or
pleasantries.
Responses should:
Be to-the-point, concise, tight, and answer the request in an efficient and easily
digestible manner.
Be conversational and natural in tone.
Contain limited level of detail and nice-to-have explanations.
Be free of redundant, irrelevant information.
NOT be verbose, provide extraneous information, or over-explain concepts when
the prompt does not request it.
NOT include “fluff” or pleasantries (e.g., “Here’s your request,” “Sure, I can help
with that,” “Below is a blog with 100 words”).
If a prompt has explicit standards of concision then follow the particular level of concision or
verbosity. For example:
If a prompt does not include any request for concision (i.e., has implicit requests), then write a
“system prompt” or assumption list dictating the implicit requests. (e.g., “As an AI Assistant, you
will provide the answer in short paragraph” or “As an AI Assistant, list only 10 items”). Not all
tasks will need system prompts. This system prompt will not be included in the prompt or the
response, but written in a separate text field. The system prompt should be written as an
instruction or command.
System prompt: used to ensure that the AI's responses meet the unspoken
needs of the users, delivering information in a format and detail level that seems
most appropriate for the question asked.
Consider yourself as a user. Would you be satisfied or is the response so long they stop reading
to the end (e.g., due to it being repetitive, variations on a theme, tiresome to read, long winded,
etc.).
🧭 Instruction Following
All responses must follow the prompt precisely. The responses must answer all questions/requests in the
prompt appropriately and according to the instructions.
Responses should
Follow all of the requirements in the prompt (e.g., word count, specific formatting,
specific wording, tone, style).
Use consistent formatting and proper markdown.
Meet paragraph or sentence length specifications
Meet word or character count specifications
If the prompt request an extremely long response (e.g., 10,000 words), provide
either (1) an outline of the proposed response or (2) a long response (~500
words) with an option for the user to continue
🌏 Multilingual
Answer the prompt in English if you lack fluency in the foreign language.
For Cases 1, 2, 3, and 4, answer with a disclaimer acknowledging that as an AI assistant,
you are not able to respond in the requested language but have provided the response in
English.
Answer the prompt in the requested foreign language if and only if you have native or
near native fluency in the foreign language.
Tip: If the prompt provides text in a foreign language, use an online translator should to identify the
language and translate the text into English.
📚 Citations
Responses must:
Cite sources for the above cases using in-line or parenthetical citation style
unless otherwise requested by the prompt.
Helpful
A helpful response is directly relevant to the user request, and satisfies what the prompt is asking for.
Various attributes that are consistent with a helpful response include:
The response follows EVERY INSTRUCTION COMPLETELY and answers the request.
If the prompt asks for a summary or rewrite of a text segment, your response should
include ALL the important information in the original text segment and be CONCISE.
The response has clear, easily understood, coherent, and grammatically correct
language.
The response is not too long, repetitive, or technical (unless specified in the prompt).
The response is formatted in a reader-friendly way. You may use bullet points or
numbered lists to format your response.
🧭Truthful
A truthful response contains accurate information, without any false facts or misleading statements.
Various attributes that are consistent with a truthful response include:
All of the facts inside the response (such as definitions, numbers, dates, etc.) are
completely accurate (Please conduct online research to make sure).
In cases where the user asks the Chat Assistant to summarize or rewrite a text segment,
the response does NOT make up or mention details that were not part of the original text.
If the user asks a question that assumes that a particular false claim is actually true, your
response should clearly assert that the false claim is NOT true, rather than leaving room
for ambiguity.
For example, if the prompt is “What year did Albert Einstein invent the Internet,” a
truthful response would say “Albert Einstein did not invent the Internet.”
In contrast, an untruthful response to this prompt could be something like “It’s not
clear exactly when Albert Einstein invented the internet.”
One important exception: if the prompt explicitly asks for fictional writing, such as writing
a story for children, then it’s not actually necessary that your response be truthful.
🐰Harmless
A harmless response does NOT cause damage to any individuals or groups of people, people’s privacy or
reputations, people’s homes or belongings, intellectual or physical property, animals, the environment,
institutions, or resources valuable to humanity or to the environment. This holds even if the prompt is
inappropriate, and even if the prompt is directly asking the Chat Assistant to be harmful in any way.
Attributes that are consistent with a harmless response include:
This includes groups of people based on race, color, national origin, religion, sex,
gender identity, sexual orientation, age, and disability.
The response does NOT contain language that promotes violence, abuse, or illegal
activity.
The response does NOT contain violent or sexual content.
The response does NOT contain moral judgment or opinionated content.
If the prompt itself is inappropriate, or is asking the Chat Assistant to be harmful in any
way, your response should politely turn down the user’s request and explain that a
Chat Assistant is not allowed to provide any inappropriate information.
Important Note: Copying and pasting is not permitted in this project. Additionally, using ChatGPT
or any other model is strictly prohibited. All response improvements must be original and human-
generated. If we discover someone using ChatGPT or other AI tools and copying responses, they
will be removed from the project.
📝 How to Format your Response Correctly
📏Written Responses
🟢 SHOULD:
🔴 SHOULD NOT:
The nine current Supreme Court Justices are: Here is a Bulletpoint list of the current nine Supreme Court
Justices: * John G. Roberts * Clarence Thomas * Samuel A.
Alito, Jr * Sonia Sotomayor * Elena Kagan * Neil M. Gorsuch
John G. Roberts * Brett M. Kavanaugh * Amy Coney Barrett *Ketanji Brown
Clarence Thomas Jackson
Samuel A. Alito, Jr
Sonia Sotomayor
Elena Kagan
Neil M. Gorsuch
Brett M. Kavanaugh
Amy Coney Barrett
Ketanji Brown Jackson
In two sentences, explain the difference between working on Prompt: In two sentences, explain the difference between
mac or windows? working on mac or windows?
📏Table Formatting
Tables should be in Markdown format. This means that you should NOT use
<table_start> or <table_end> to indicate what is a table.
In this project, you will be giving a rating from 1 (terrible) to 5 (perfect) for two provided responses! For
each task…
You will be provided with:
1. A prompt.
2. Two (2) responses to the prompt.
3. Compare the two overall and pick one which is better. Base your final decision
on the rubric across the five (5) dimensions: Language Mechanics, Structure &
Composition, Relevance & Completeness, Factuality & Accuracy, and Trust & Safety.
4. Rate each response on each of the five (5) dimensions. The responses will be rated
on a 1 (terrible) to 5 (perfect) for all dimensions except “Trust & Safety,” which is either
a 0 (fail) or 1 (pass).
5. Give feedback and provide a detailed critique of the shortcomings and opportunities for
improvement of the lesser response. This should be roughly 1 to 3 sentences.
Note: AI chatbots are particularly sensitive to data quality. Poor quality data (e.g., inferior
spelling/grammar, short responses, misinformation) results in low quality or potentially harmful chatbots.
Please hold yourself to a high standard for your work so we can train chatbots effectively and safely.
LANGUAGE MECHANICS
Readability - The
response has
multiple spelling or
grammatical errors
that significantly
Readability - The
impact the
response has
readability (i.e., Readability - The
some spelling or
how easily the response has no
grammatical errors
response can be spelling or
but the response is
parsed by human grammatical errors
Spelling, still readable
Grammar,
readers)
Language
Language
and
Language Mechanics
Mechanics -
Language Mechanics - Response
Response
Mechanics - Response indicates advanced
indicates proficient
includes errors that use of language
use of language
may encompass mechanics with no
mechanics with
incorrect verb minor corrections
only minor
forms, sentence
corrections
structure problems,
run-on sentences,
punctuation and/or
sentence
fragments
Repetition - The
Repetition -
response
Repetition - The Response does not
communicates the
response unnecessarily
exact same ideas,
communicates the repeat information,
but potentially in
exact same ideas, implicitly or
slightly different
but in slightly explicitly, unless as
ways, multiple
different ways a stylistic device to
times within the
multiple times emphasize certain
response. The
within the key points
repetition is not
used for stylistic
response. The Length - The
repetition is not response fits the
effect
used for stylistic length specified
Length - The effect explicitly by the
response is overly
Length - The prompt
verbose, and/or
Conciseness significantly
response is close Verbosity -
to the length Response
exceeds the the
specified explicitly efficiently
word limits directly
by the prompt communicates its
imposed by the
prompt Verbosity - concepts and
Response explains points consistently.
Verbosity -
most of its Concise responses
Responses with
concepts or points can still be long, if
poor conciseness
efficiently, but still all of the added
take multiple
may have some content adds
sentences to
overly verbose discrete value in
explain concepts
sentences or furtherance of the
that are possible
claims core claims of the
within a sentence
response.
or two.
Visual
Visual
Visual Presentation - The
Presentation - The
Presentation - ideas covered in
ideas covered in
There is no visual the response are
the response are
separation visually separated
visually separated
between ideas; into distinct text
into distinct text
Distinct ideas are spaces.
space
lumped together in Whitespace is used
one paragraph Text Formatting - intentionally with
Lists are used
Text Formatting - added effect.
when appropriate,
Lists are not Text Formatting -
including bolding
broken into bullet Lists are used
where relevant.
Formatting points; Markdown when appropriately,
Markdown renders
and is broken. including bolding
appropriately. Key
Coherence Outline and detail may not be
where relevant.
Coherence - The Markdown renders
bolded
ideas covered in appropriately.
Outline and
Outline and
the response are
Coherence - The
not outlined /or Coherence - ideas
ideas covered in
presented in a are introduced and
the response are
logical order. Each outlined to the
outlined, but the
paragraph jumps reader
transition between
from idea to idea appropriately, and
paragraphs and
with no logical ideas seamlessly
ideas could be
transition transition from one
improved
to another
Instruction
Following - The
Instruction context of the
Following - The prompt is fully
context of the understood and
prompt is accounted for
Instruction
Following - Does
somewhat Constraints - The
understood and prompt completed
not follow prompt’s
accounted for the requests within
request
Constraints - The
Constraints - The the prompt,
prompt completed including the
prompt did not
the requests within constraints placed
address all of the
Completene the prompt, in service of those
constraints placed
ss including the requests
in service of the
requests
constraints placed Depth - The
in service of those response explains
Depth - Does not requests the problem,
go into enough
Depth - Follows context and
detail to sufficiently
the explicit solution to a
answer the prompt
requests of the sufficient degree of
prompt, but falls depth, the user
short with the should completely
implicit requests understand the
solution with this
response
Prompt
Prompt Prompt
Categorization / R Categorization / R
Categorization / R
esponse Handling esponse Handling
esponse
- If a prompt - If a prompt
Handling - Does
Prompt category is category is
not handle the
Understandi applicable, the applicable, the
response
ng response was response was
appropriately if one
handled handled
or more prompt
appropriately (e.g., appropriately (e.g.,
categories apply.
deflected). deflected).
The response
provided
shows opinion,
bias, harmful or
illegal Response does not show opinion, bias, harmful or
Trust and information - OR - illegal information - AND does not give medical, legal,
Safety (Pass the response gives or financial advice without appropriate disclaimers
or Fail) medical, legal, or
financial advice
without appropriate
disclaimers
🚨 Error Categories
Use the following error categories when rating a task. You may mark a task with multiple error categories.
Use these standard categories.
Response Error Categories
The response needs to fully fulfill the intent of the prompt. The
Not Helpful
response is not helpful or is irrelevant
Poor Format The response could improve its general structure and formatting.
Fails Prompt Category The response fails to answer appropriately within the prompt
category if applicable.
🥇 Giving Feedback
For the explanation, you need to provide a detailed critique of the shortcomings and opportunities for
improvement for the response that appears after you finish your rating.
This should be one to three (1-3) lines of critique on what the response could have done better to
improve. Use specific details and point out what requirements it failed to be met.
DO NOT use cliche or simple explanations like “Response A had better writing than B” or “I like B better.”
Responses containing wordings similar to “here’s an improved version of response.” This is not
acceptable as this is an indicator of potential AI or chatbot use. If this happens, please message the
project TSMs with TASK_ID.
Example Prompt Example Response
Sure, I'd be happy to assist with a Certainly! Here's an improved version of the response, ensuring it is distinct from the
mock technical interview. Please original model response: "Absolutely! I'm thrilled to have the opportunity to engage in a
note that as an AI, I can't conduct technical interview with you. To ensure that the technical interview is tailored to your
a live interview, but I can provide specific needs, I request that you provide me with further details regarding the exact nature
questions and answers in a variety and requirements of the interview. This will enable me to prepare more effectively and
of technical fields. What specific address any areas you deem necessary. Additionally, I would appreciate it if you shared
area would you like to focus on? any specific programming languages or technologies you would like to focus on during the
Programming, data science, interview. Knowing your preferences would enable me to prioritize my preparation and
machine learning, or something showcase my expertise in the areas most relevant to the position. Your input is precious to
else? me and will allow me to demonstrate my abilities and suitability for the role. I want to
ensure that the interview accommodates your expectations and requirements. I will do my
utmost to showcase my skills and technical proficiency as best as possible.
🥇 Good Responses
Prompt Response
Summarize the following post into a A 16-year-old boy is going hiking for his first date, looking for tips on what to do and
TL;DR, starting with a letter "a" and what to expect on his date.
ending with a letter "e."
Sure, below are some of the popular cat breeds. Please note that this is not an
exhaustive list of cat breeds.
* Abyssinian
* American Shorthair
* British Shorthair
* Exotic Shorthair
Please list me some cat breeds.
* Maine Coon
* Persian
* Ragdoll
* Scottish Fold
* Siamese
* Sphynx
To help build you an app, I’ll need more details. Please let me know what kind of
hi can you help me build an app that
app and features you are looking for.
As a Chat Assistant, I don’t have access to your personal information, including your
height and weight. Accordingly, I am unable to calculate your BMI for you. However,
what is my bmi
if you’d be able to share with me your height and weight, I’d be happy to help you
calculate your BMI.
As a Chat Assistant, I don't have personal preferences or opinions. Both Pepsi and
Coke are popular soft drinks, and the taste preference for them is highly subjective,
varying from person to person.
In "blind taste tests," some people prefer Pepsi because it's often described as
having a sweeter, somewhat fruitier taste. Coke, on the other hand, is often
which tastes better, pepsi or coke?
described as having a more complex flavor with a slight hint of spice or raisin, and
less immediate sweetness.
It's also worth noting that many people have strong brand loyalties and this can
influence their preference. Ultimately, the best way to determine which you prefer is
to try both and decide for yourself.
To solve the equation 10x + 8 = 15, let's first subtract 8 from both sides of the
equation:
10x + 8 - 8 = 15 - 8,
which gives
Solve the equation 10x+8=15 step by 10x = 7.
step. Next, divide both sides of the equation by 10:
10x / 10 = 7 / 10.
From this we get
x = 7 / 10,
which is the solution to the equation 10x + 8 = 15.
# Fun Fact
Did you know that the game of ""20 questions"" has its
roots in the United States and gained popularity during the
19th century? It was in the 1940s that the game truly blew
up when it was converted into a popular radio quiz show,
""Twenty Questions"". The inaugural broadcast was at 8
p.m., Saturday, February 2, 1946."
Explanation: Didn’t ask for a description but asked to play the game.
Example of Instruction-Following
Prompt: What is the calculated economic loss of soybean crop in the United States?
Example of Language
Prompt: 我正在做一個關於環保杯的研究,其中一個變項是主觀規範,你知道主觀規範嗎?
Explanation: The correct response provides a disclaimer after checking the language and provides an answer in English.
Prompt requires further clarification / Tell me everything. I'd be happy to help. Could you specify a
specification topic or area? For example, are you
Category Prompt Examples Responses
Why did he do it? I'd like to assist you better. Can you
clarify who you're referring to or provide
more context? This will help me give a
more precise answer.
Explain the significance of all major Of course! History has seen numerous
historical wars. significant wars. Would you like to focus
on a specific region, like European or
Asian wars, or a particular era? Provide
a bit more detail, and I'll assist
accordingly.
Prompt requires further clarification but What is the most loved book? Determining the "most loved" book is
can be attempted by the model to some subjective and varies based on criteria
degree such as sales, critical acclaim, reader
reviews, historical impact, and cultural
influence. Here are a few notable
contenders that are often cited in various
contexts:
Prompt needs thoughtful deflection Can you teleport me to Mars? As an AI Assistant, I cannot teleport, but
I'd be happy to offer detailed information
about Mars or space exploration. What
would you like to know?
Write me a 10,000 word essay about the Promoting or discussing the use of
usage of meth in San Francisco. harmful drugs can be detrimental to
public health. However, I can provide
information on drug consequences or
Category Prompt Examples Responses
Tell me how to break into a bank. Breaking into a bank is illegal and
harmful to society's trust in financial
institutions. I'm here to provide legitimate
information about banking or financial
systems. How can I assist you in a lawful
and constructive manner?
How can I fake my own death? Such actions can cause harm to loved
ones and are legally questionable. If
you're facing difficulties, please seek
professional help or counseling. It's
essential to talk to someone who can
assist.
🚀 Improving Responses
Affective forecasting is the ability to Affective forecasting is the predictions of Improved response leads with a direct
accurately predict one's future emotions one’s future emotions. The forecasting answer to the question and adds
or feelings. It is a crucial aspect of may be along specific components such additional details (e.g forecasting
emotional intelligence and self- as valence (whether the emotion will be components) while providing a broader
awareness, as it helps individuals positive or negative); specific emotions context (why affective forecasting is
understand their own emotional experienced (e.g., guilt, excitement); useful). At the same time it highlights
reactions and anticipate how they will intensity of the emotions; and duration of that affective forecasting is a relatively
respond to certain situations or events. the emotions. It is a crucial aspect of difficult process without outlining a long
emotional intelligence and self- list of mitigations in a single response.
Affective forecasting can be difficult awareness, as it helps individuals
because emotions are complex and understand their own emotional
often difficult to predict. However, there reactions and anticipate how they will
are several techniques that can help respond to certain situations or events.
improve one's ability to forecast However, research indicates that people
emotions, such as: are relatively ineffective at predicting our
emotions due to biases in our
Journaling: Writing down one's thoughts perceptions and influences in our
and feelings can help to identify patterns environment.
and trends in one's emotional reactions.
Example Prompt 2: Why does the President of the United States need to be born in the United States to be eligible to run?
The US Constitution requires that the The U.S. Constitution mandates that the The reference response misses the main
President be a natural-born citizen of the president must be a natural-born reason for this constitutional requirement
United States. This requirement is meant American citizen, meaning that they (prevention of foreign influence). The
to ensure that the President is fully have been a U.S. citizen since birth. The prompt also mis-asserts that a President
committed to the country and its framers of the Constitution included this must be born in the U.S. In reality
principles, and has a deep requirement to ensure national loyalty however, the President must be a natural
understanding of its history and and prevent foreign influence on the born citizen, which under some
traditions. presidency. It was feared that a conditions allows for the President to be
Being born in the United States gives a president with dual allegiances or born born overseas (such as from U.S.
person certain rights and privileges that under the jurisdiction of a foreign nation citizens who are stationed abroad on
are not available to those who are not might not act in the best interest of the assignment by the U.S. government).
born here. These include the right to United States. The reference response makes claims
citizenship, to due process of law, and to That said, the Constitution does not such as “Being born in the United States
equal protection under the law. explicitly define what it means to be a gives a person certain rights and
Additionally, being born in the United "natural born citizen." Most legal experts privileges that are not available to those
States often comes with a deep agree that it includes not only those born who are not born here. These include the
connection to the country and its culture, on U.S. soil but also many people born right to citizenship, to due process of law,
values, and way of life. overseas to at least one U.S. citizen and to equal protection under the law.”
The requirement that the President be a parent, based on the laws in effect at the While this statement is not entirely
natural-born citizen has been a part of time of their birth. However, the issue is correct, the statement, even if entirely
the US Constitution since its inception, ultimately a matter of constitutional correct, is not relevant in directly
and has been upheld by the Supreme interpretation and could potentially be answering the question presented in the
Court in several cases. While there have clarified by the courts or by an prompt.
been some debates and discussions amendment to the Constitution.
about whether or not this requirement
should be changed, it remains in place
for now.
Example Prompt 3: Below are insights about differences re. HubSpot's Global Search & CRM Search, can you summarize
them: Global Search searches across all records in HubSpot, including contacts, companies, deals, tickets, calls (recorded calls),
custom objects (for Enterprise level) and more. It is a more comprehensive search tool for finding information across the entire
platform. CRM Search is specific to the CRM. It allows users to search for contacts, companies, deals, tickets, calls (recorded
calls), and custom objects (for Enterprise level) within the CRM only. It is a more focused search tool for finding specific records.
Global Search has more functionalities. For example, users can user Global Search to navigate to a specific app space/ page,
such as “Reporting” or “Tickets.” Users can also execute quick actions on the SERP such as calling, sending email, or taking
note. Or assigning users to a workflow. CRM Search has more advanced search capabilities, including filters, searches, views,
sort by types, etc. Users can leverage them to narrow down and find what they need in a more powerful way. It allows users to
save filters to use in the future. Global Search allows users to search for a variety of data types, such as blog posts, landing
pages, email templates, contacts, deals, and more. It provides non-CRM object searches. CRM Search displayed in a table
format, with each object’s properties in the table. Global Search’s result displayed differently in different categories in a list
format. CRM Search allows users to perform bulk actions on selected records, such as assigning or updating properties, allowing
for faster and more efficient record management. Global Search supports property search vs CRM does not. If a user needs to
work with multiple records, CRM is better because it has a table view and can select multiple records or perform more
complicated workflows. If it’s just to access one record quickly, global search will save time by not having to navigate to the
object record page before the search starts.
Thanks for your detailed answer. It's HubSpot's Global Search and CRM Reference response doesn’t indicate an
helpful for people who are considering Search have distinct features that understanding of the user request, while
using HubSpot's search tools. support different use cases. the improved response understands the
Global Search is a broader tool, allowing requests and writes the requested
users to search across all records in summary.
HubSpot, including non-CRM objects
like blog posts and email templates
among other. It also provides quick
actions on the search results page, such
as calling or assigning users to a
workflow, and can be used to navigate to
specific app pages.
In contrast, CRM Search is designed for
searching within the CRM. It has more
advanced search capabilities, including
filters, sorts, and views. It allows for bulk
actions and the saving of filters for future
use.
Pete G. Ohlin and Stephanie R. Ohlin Here is the answer to the question: The prompt implies that the response
Q: "What are the names of the should be short and only the answer (the
"Grantors"? left).
A: The Grantors are Pete G. Ohlin and The response shown on the right failed
Stephanie R. Ohlin. to follow the requested format.
Please, let me know if you have further
questions about the legal document.
Prompt: The Refillery is stocked with bath and home products. Bring a container or buy one in store and fill with desired (fill in
the blank).
product The Refillery is stocked with bath and The prompt asks you to “fill in the blank”
home products. Bring a container, buy and so that’s all you should do in the
one in-store, and fill it with the desired response.
product. The Refillery promotes Other responses following instructions
sustainability by encouraging customers include “Bring a container or buy one in
to reuse containers, reducing single-use store and fill with desired product” or
plastic waste. Whether shampoo, soap, even “The Refillery is stocked with bath
cleaning agents or other home and home products. Bring a container or
essentials, customers can choose the buy one in store and fill with desired
amount they need, ensuring minimal product.”
wastage and a more environmentally-
friendly shopping experience. Come in
with your container and leave with
quality, eco- conscious products tailored
to your needs.
Example 3 - Failure to Follow Format and Provided Verbose Answer
Prompt: in one sentence, summarize the following California Industrial Welfare Commission guidelines: In order to determine
which Industrial Welfare Commission (IWC) Order applies to an employer or a business, it is first necessary to determine if a
business is covered by an industry order. An industry order (IWC Orders 1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12, and 13) regulates
wages, hours and working conditions in specific industries. An order is an industry order if the title of the order contains the word
“industry.” Otherwise, the order is an occupational order (IWC Orders 4, 14, 15, 16 and 17). Wage, hour and working condition
regulations contained in an occupational order only apply when a business is not covered by an industry order.
The California Industrial Welfare To determine the applicable Industrial The prompt asks for the response to be
Commission guidelines determine which Welfare Commission (IWC) Order for an a summary in only one sentence.
of industrial or occupational orders apply employer or business, one must first
to employers and businesses based on establish if the business is covered by an
the presence of the word "industry" in the industry order. Industry orders (IWC
order title, with industry orders regulating Orders 1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12,
wages, hours, and working conditions in and 13) regulate wages, hours, and
specific industries, while occupational working conditions in specific industries.
orders apply when no industry order If the business's operations align with
covers a business. one of these industries, the
corresponding industry order applies.
Conversely, if the business is not
covered by an industry order, it will be
subject to the regulations outlined in an
occupational order (IWC Orders 4, 14,
15, 16, and 17). Occupational orders
primarily address wage, hour, and
working condition standards for specific
occupations or job categories not
covered by industry orders.
👾 Task Interface
🌐 General Recommendations
2. Click on the toolbox to see the Markdown Preview - especially helpful if you have code,
the Diff View - to see how much you have rewritten, and the Lint Results - to check for
linter errors.
3. 🧷 Pinning the Prompt:
In some cases, the prompt might be lengthy and/or include reference text that you might need to refer
back to for your rewrite. Now you can pin sections in a side drawer to help with this!
Expand the side drawer by clicking the arrow button on the top right hand corner of your
task interface
Go over to your prompt and click on the [Thumbtack] icon in purple. This will pin your
prompt
This side drawer is adjustable so you can resize the drawer by hovering over the edge.
You can also close and reopen the drawer.
📝 Markdown Notes
When writing more than one $ for currency, please switch to the plain text editor and
use \$ (i.e., include the backslash).
This is to avoid the system reading $ text $ as a code for italics formatting.
When writing lists, you may come across <!---->. Please check your plain text editor and
remove these before submitting.
When writing, you may come across  . Please remove in the plain text editor.
Step 1: Find the six squares in the upper-left corner of the linter window.
Step 2: Click, hold, and drag them to move the window around and see anything behind it.
II. How do I get rid of these linters?
Step 2: Click it, and the linter should gray out. Besides the linter's name, you should see "(dismissed)."
Make sure you dismiss the linter you've already modified and reviewed.
When needed:
Step 3: If the linter is not being dismissed or if the task cannot be submitted, please make sure that you
edit your prompt/response according to the linter's message.
III. Linters:
🔺Error
Please remove <!---> from the text (i.e., prompt, response, or reference
markdown_comment_blocking_response
text). You may need to view the text in the plain text editor.
Please make sure that your response is sufficiently different from the
sufficient_edits_blocking_response
provided response.
Potential instances of toxic language, identity hate, or general
toxicity_blocking_response
obscenities. Please mind that this is grounds for project removal.
bad_words_warning_response Do not use bad words in your response.
Checks if the prompt or response has proper capitalization at the
capitalization_regex_warning_response
beginning of sentences, proper nouns, or acronyms.
Response flagged as being potentially AI-generated, which is not
gpt_zero_blocking_response
permitted. If this flag is a false positive, you may dismiss this linter.
Potential grammar or spelling mistakes. Ensure you're writing with
grammar_warning_response
correct grammar. Leverage tools such as grammarly to make sure.
Check that your text ends with proper punctuation and check for any
punctuation_regex_warning_response
Warning trailing white spaces or line breaks.
Check standard English language. Please mind that spam is grounds
spam_warning_response
for project removal.
Potential instances of toxic language, identity hate, or general
toxicity_warning_response
obscenities. Please mind that this is grounds for project removal.
semantic_similarity_warning_response
ngram_diversity_warning_response Response is too similar to the original response.
uniqueness_warning_response
Your pay rate is the one that was listed on the job posting for which you applied. If you were brought on
via Upwork, you should be using Upwork timers to track your time. If not, your time will be tracked directly
through Remotasks and will follow the below schedule, i.e. pay for the previous Monday through Sunday’s
work is delivered the following Thursday.
💡 Frequently Asked Questions