Professional Documents
Culture Documents
Guidelines Happyscribe Com en
Guidelines Happyscribe Com en
Scope
1. General Information
2. Flavour
4. Text Reduction
5. Interjections
6. Consistency
7. Special Content
8. Special Formatting
9. Language-specific Items
10. Terminology
15. Reminders
Each section is completed with a list of search keywords in your language to facilitate your navigation.
Scope
‣
Welcome to Happy Scribe Guidelines, a set of rules and hacks for granting consistency and delivering high-
quality transcription and subtitling projects.
✨Take the most out of them! ✨
How to read the Guidelines?
‣
1st time → Check all the sections thoroughly before starting working on your first assignment.
Following times → Use CTRL+F (or Command+F) to find the solution to your issue.
💡 You are not expected to read the Guidelines every single time you work on a file. However, you
are expected to be aware of the latest additions all the time.
Are you working on a file with no audio? Run a search for “no audio” and you will know what to
do with that.
The Guidelines have been designed to be a dynamic tool to fulfil your needs.
Feel free to share your feedback to expand, enrich and improve them.
Add a reliable source reference any time you make any suggestions.
WARNING: If customer-specific instructions are available, they prevail over these Guidelines.
‣
🇬🇧🇺🇸
The following Guidelines include specifications for the following flavours:
Flavour-specific rules will be noticed accordingly. If no flavour is specified, the rule is meant to be applied
across all variants.
1. General Information
‣
i. Service Level
‣
💯 Quality tip: Check the “Text Reduction” section for further details on how to edit the text
accordingly and “Punctuation” to know how to report long pauses.
💡 If the file you are transcribing or subtitling has some embedded subtitles in your language, please
DO transcribe the speaker’s speech. DON’T mirror the embedded subtitles even when they are
slightly different from the speech spoken by the speaker.
Has no audio.
2. Report the file as “unworkable” on the Editor and select either “No speech: there is nothing to
transcribe” or “Inaudible: it is impossible to transcribe because the audio cannot be heard”.
💡 If a file with no/little/very poor audio is submitted as “Done”, no payment will be processed.
💡 If there is very little audio in the file (less than 20% of the total length), flag it immediately by
sending an e-mail to human@happyscribe.com.
‣
You claimed a file but more than 80% of the audio is a song or lyrics only.
1. DON’T transcribe the lyrics of the song and DON’T submit the file.
3. Report the file as “unworkable” on the Editor and select “Song: the majority of the audio is a song
(80%+)”.
- If the song takes less than 10 seconds, transcribe the words to the song. Use italics.
- If the song takes more than 10, use the proper tag. Check the section “Notation Tag” below to
know how to use the right one!
Scenario DO
💯 Quality tip: If the speaker says “The children play with the… How do you say that? The qìqiú.”
when they mean “The children play with the ball”, DO type it as “The children play with the… How
do you say that? The [foreign language 00:00:00]”. DON’T type “The children play with the ball”.
In this example, the interviewee is using a word from his native language that is not known to the
interviewer: it is not a case of code-mixing or loan word, but a case of communication loss. Even
if you were to speak Chinese and know the word, please don’t add it as it is not relevant in this
specific context.
2. Flavour
‣
Depending on the geographical area in which they are spoken, some languages can have locales, also known
as “flavours”. This is the case of Dutch, for instance, there are two flavours: Dutch for Belgium and Dutch for
the Netherlands.
Always follow the flavour as selected by the customer and displayed on your job dashboard.
💡 DON’T confuse the flavour with the accent. The speaker can be speaking Dutch for Belgium with an
English accent.
💯 Quality tip: Before claiming your job, check the flavour requested. This will help you anticipate some
of the challenges of the file you will be working on and apply the relevant linguistic conventions for
that specific flavour, like the appropriate spelling and punctuation preferences, accent, and other
factors.
💯 Quality tip: In case of multiple English accents, unless otherwise specified, please use the British
spelling.
🔑 Search keywords: locale, flavour, localisation, adaptation, accent
‣
The style and tone of voice of speakers should be reported faithfully. This doesn’t refer to the intonation, but
rather to the usage of a formal or informal language (i.e., the register).
Stick to the following pillars when working on a file:
Avoid long sentences/paragraphs. Look for natural points in the speech, like shifts
Be concise
in topic, emphasis, or references, and start a new sentence.
Never use standard informal contractions for formal speeches and vice-versa.
Be proper e.g.: if the speaker says “I’m” don’t type “I am”. The same applies to other
grammatical contractions like they’re, who’s, etc.
💡 What should I do when speakers use non-standard informal contractions when speaking?
gonna → going to
wanna → want to
gotta → got to
💯 Quality tip: Before completing your job, run a quick search looking for unnecessary filler words using
CRTL+F (or Command+F).
🔑 Search keywords: contractions, contracted forms, informal tone, formal tone, formality
4. Text Reduction
‣
Unless otherwise specified, DON’T use unnecessary filler words, feedback words, interruptions, false starts,
and repetitions.
DO DON’T
Stutters & She said they should not be She s-s-said th-th-they should not
Stammers invited. So I didn’t. be in-in-invited. So I didn’t.
Self interruptions, She said they should not be They sa... She said they should
and false starts invited. So I didn’t. not be invited. So I didn’t.
In that case, the word cannot be considered unnecessary and it should be included in the text.
DO DON’T
Mm-hmm.
Speaker 2: Uh-huh.
Speaker 2: Uh-huh.
💡 If a short false start includes the crucial meaning, like the speakers correcting themselves, be sure to
include it.
We want to include clearer parts, I mean, linguistic conventions to these
DO
guidelines.
If conjunctions don’t work as such, but rather as fillers, please remove them. The same applies to
any other connectors (relatives pronouns, adverbs, etc.) when they don’t work for the purposes they
should.
DO DON’T
Truncate further content exclusively where reading speed and synchronicity to the audio are causing you
issues in subtitles. Never omit meaningful content.
💯 Quality tip: If you spot an unnecessary filler word, run a quick search throughout the file by using
CTRL+F (Command+F) functionality and remove them all.
Interjections should not be treated as filler words. Check out the relevant section below for further
details.
🔑 Search keywords: text reduction, filler words, stutters, stammers, repetitions, interruptions
5. Interjections
‣
As part of speech, interjections are exclamations that express speakers’ reactions or feelings. They are
grammatically independent of the rest of the sentence where they appear.
Only the interjections below are allowed in transcriptions and subtitles:
Oh my God!
Oh, dear!
Oh my!
Oh boy!
Wow!
DON’T use any interjections currently not included in the list or the reference.
DO reach out to add new interjections that you consider are missing.
In the case of subtitles, interjections can be omitted to meet character constraints, if their omission does not
imply any loss in meaning.
💡 Exclamation marks can be used only for interjections and shouts. Check the relevant section about
punctuation to discover when they are forbidden.
If the phrases work as a reaction from the speaker, they should rather be considered interjections.
Unless you have been instructed otherwise, ask yourself “Would omitting this item impact the core
message the speech is conveying?”. If the answer is “No”, it should be pretty safe to omit it.
6. Consistency
‣
Consistency means taking decisions on how to tackle specific items and applying the same approach
throughout a file.
What items should you apply consistency to?
Punctuation, capitalization, numbers, dates, spelling, symbols, terminology, interjection rendering... Be sure to
stick to these guidelines to ensure the highest level of consistency!
The more the consistency, the greater the user experience.
💡 By keeping the consistency, Scribes will be able to increase precision by reducing ambiguity and
preventing loss of meaning in case the transcriptions/subtitles are going to be translated into other
languages.
💯 Quality tip: You run a search for a proper name and you want to be sure the correct spelling is
applied also when files on the same topic arrive in bulk, do you? Or do you opt for one out of several
accepted spelling for a single word?
Then use the “Glossary” option and add some specific terms there to help grant consistency. Check
the section “Terminology” for more details.
To check internal consistency in the file you are working on, use the CTRL+F (Command+F) option to
navigate through the file and check all instances by typing the first three characters or potential
variants.
💡 Still in doubt with some specific items? Post your issue on the English Channel on Discord. Sharing is
caring!
7. Special Content
💯 Quality tip: The rule of thumb is always to check for spelling company/brand/product names by
doing a quick search. Always refer to the official websites to be sure to use the correct names. In
the case of book/movie titles, check them on the main online catalogues.
If several different spelling/formatting alternatives are available, choose one and be consistent
throughout the file. Add it to the Glossary!
🔑 Search keywords: proper nouns, spelling proper nouns, titles, product name, company name,
brand name
DON’T censor profanity or curse words. If they appear in the audio, transcribe them word for word.
DO DON’T
What the fuck is going on? What the f*** is going on?
💡 Exception: If the curse words are censored and beeped in the audio, DO use asterisks in the
transcription/subtitles.
DON’T use the [sic] tag and don’t edit the speech.
DO DON’T
DO DON’T
She told me to be here at 5 PM She told me to be here at 5 pm
Can you please switch off the TV? Can you please switch off the T.V?
The news they gave about UNESCO were quite The news they gave about the U-N-E-S-C-O was
inaccurate. quite inaccurate.
💡 Et cetera and its more common abbreviation, etc., are used to show that a list of at least two
items is incomplete. Either form can be used. However, choose an option and stick with the
same one throughout the file for keeping consistency.
If not otherwise specified, please choose either form and be consistent throughout the file.
Remember to always check the flavour and add the selected option to the Glossary!
DO DON’T
💡 What if the speaker spells out their e-mail address? Follow the rule for spelt-out items under the
section “Capitalization”.
vi. Onomatopoeias
‣
Since we are currently providing clean transcriptions and subtitles, onomatopoeias should not be
included in the text unless they convey a meaningful message, i.e. they are the only answer in a two-
person dialogue or they are said by the speakers instead of actual words, for instance.
Said DO
DO spell the onomatopoeias as per the list in the link below: https://www.vocabulary.com/lists/240643
💡 Check out the “Text Reduction” section for more details on how and when to reduce the text
being compliant with our service level.
Since we are currently providing clean transcriptions and subtitles, atmospherics and sounds should not
be included. If the speaker is interrupted by either of them, only the spoken speech should be typed.
Use numerals and follow conventionally written formatting for chapter/verse references of religious texts.
DO DON’T
Acts 27:1 (Said: Acts 27 verse 1) Acts 27 colon 1 (Said: Acts 27 verse 1)
‣
Italics, Bold, and Underline
Happy Scribe Editor supports Italics formatting only. Stick to the chart below for its correct usage.
◦ Narration
◦ The speaker is not in the scene(s), not merely off-screen, ◦ Foreign loan words which appear
behind a door, or out of shot in your language’s nominated
◦ Dialogue that is heard through electronic media, such as a dictionary, e.g. rendezvous, zeitgeist,
phone, television (especially if we see the television and hear etc. for English
the audio), computer, loudspeaker, non-sentient robots,
robotic voices or AI, etc
◦ Songs
🔑 Search keywords: italics, titles, book, movie, italics, when to use italics
9. Language-specific Items
‣
i. Punctuation
‣
marks
◦ After interjections
◦ Compound names
someone else
Speaker 2: Busy, can’t speak now.
someone else and then resumes Speaker 2: Busy, can’t speak now.
with unaltered grammar Speaker 1: -you would want to come and grab a
bite?
ii. Capitalization
‣
Ms, Mr
People titles
Chief Executive Officer John
Formal job titles:
Titles Smith/ I’m John Smith, the
◦ referring to specific people
Tables
Charts Appendix C
Names of
Figures Chapter 13
charts/sections
Bigger sections (NOT smaller BUT slide 3, page 100
divisions)
Variables Variables referred to in a X marks the spot.
sentence (NOT in an equation,
in that case, use lower case)
‣
Capitalized Item 💡 Further Explanation DOs & DONT’S
◦ Text messages
Books
Titles Use title case
Movies
Yahoo!
AirPods
Company names
Brand-relevant Items
Product names 💯 Check on official
websites/reference material the
proper capitalization
AfD
UNESCO
Saas
Institution acronyms
COVID-19
Acronyms Product types
Diseases
💯 Always check the proper
capitalization on reliable
resources
iii. Numbers
‣
1. General Rules
‣
Item Rule DO
Million
They processed three million minutes
Billion The above rules apply.
last month for a total of 59 billion over
Trillion Descriptors are kept.
the year.
and above
1.1 Exceptions
‣
Item Rule DO
Name of a
Always in figure The number 5 is my favourite one.
number
Repeated
-I read that you have 2,500 employees-
numbers in
-3,500.
consecutive Always in figures
-I beg you pardon?
sentences/cap
-3,500. We have 3,500 employees.
sules
Religious
Always in figures Acts 27:1 (Audio: Acts 27 verse 1).
references
Range of Always in figures unless at Kids aged 1-12 weren’t allowed to the
numbers the beginning of a sentence show.
2. Number Separators
‣
Phone numbers
- Hyphen 1-508-555-2232
units
💯 Quality tip: You found a number in the audio track of the file you are working on, but you don’t
remember how to handle that. Quickly surf the Guidelines by using CTRL+F (Command+F) and
search for the number type that is causing you troubles.
Warning: Use an apostrophe for truncated decades instead of a single quote mark.
Format Rule DO
Date ◦ Follow the format used by the Audio: The company was founded on January
speaker(s), also in the case of twenty seventeen, one one twenty seventeen to
centuries be exact.
Typed: The company was founded on January
2017, 1/1/2017 to be exact.
‣
Special rules apply to symbols and currencies as per the chart below:
Item Rule DO
dollars each.)
◦ Symbols should go before the
They should earn $3 million.
number and close up.
◦ Don’t convert.
Item Rule DO
Trigonometric
Spell them out. Let’s calculate the cosine of 1 radian.
functions
Use symbols and don’t spell Audio: 3 times x plus 4 times y equals 23
out the math operator. Typed: 3x plus 4y equals 23
Variables
Check capitalization rules X marks the spot. Add example and talk
here. with CS
vii. References
‣
Use the following references in case you have any doubts on items not included in these language
guidelines:
EF: https://www.ef.com/wwen/english-resources/
Merriam-Webster: https://www.merriam-webster.com/
Cambridge: https://dictionary.cambridge.org/dictionary/
‣
10. Terminology
i. Domain Relevance
‣
Always refer to the context to catch the appropriate words or phrases.
Lack of proper research might lead to accuracy issues and impact the final quality of your job.
💯 Quality tip: Some foreign words have become part of the common terminology for specific
verticals.
Depending on the industry, it is not uncommon to see how certain loan words have become part
of a specific industry domain.
DON’T use tags for loan words or any foreign word that is domain-specific.
Example:
The English term hashtag is part of the specific terminology of Social Media almost across all
languages. If the video that you are transcribing/subtitling is about Social Media, then you
needn’t use any italics or tags when typing the term hashtag. If the video you are
transcribing/subtitling is not about Social Media, and the term hashtag is used, then it should go
in italics.
ii. Glossary
‣
The Editor allows customers and Scribes to add a specific set of terms that should be used in one or
multiple files.
The on-screen text has typos. In this case flag this to the Admins.
The glossary provided by the customer includes options that differ from the on-screen text.
If the file you are working on has unclear, muffled portions of audio, you can use one of the notation tags listed
below, including a timestamp of the audio location.
Scenario Ta DO
Unintelligible or inaudible
[inaudible hh:mm:ss] [inaudible 00:01:32]
words are stated.
[singing 01:59:02]
The speaker starts singing
and it is more than 10 [singing hh:mm:ss] If it takes less than 10
seconds long. seconds, transcribe the
words to the song in italics.
General DON’Ts
DON’T use upper case/all-caps in any notation tags. [Foreign Language 02:03:55]
DON’T include the name of the foreign language even though you spot it. [Polish 02:03:55]
💯 Quality tip: Do you want to use a tag accurately? Type [ on the Editor at the exact point you want to
add a tag and choose from the drop-down menu.
When the tag is replacing a word, a phrase or a portion of a sentence, DO use standard punctuation
afterwards.
However, when a tag is used to replace a whole sentence or paragraph, DON’T include any
punctuation sign after the tag.
🔑 Search keywords: foreign, foreign language, inaudible, crosstalk, singing, notation tag, loan word
i. Multiple Speakers
‣
Refrain from creating your own descriptive labels and rather stick to the following rules based on different
scenarios:
Scenario Rule DO
💯 Quality tip: Remember you must assign a speaker to all paragraphs, even if there are multiple
consecutive paragraphs from the same speaker.
ii. Segmentation
‣
Avoid writing excessively long paragraphs, even if there is only one speaker in the file.
Look for natural points in the speech, such as a shift in topic, emphasis, or reference, to start a new
paragraph.
i. Multiple Speakers
‣
Use hyphens (not followed by any space) to separate speech uttered by two speakers speaking at the
same time, or speaking too closely to each other – putting them in one capsule preserves CPS limits
and avoids fast transition from one capsule to another.
Only one speaker per line is allowed.
DO DON’T
- Hi mum. - Hi John.
-Hi mum.
How was
-Hi John. How was your day?
your day?
ii. Segmentation
‣
1. Line Treatment
‣
Subtitles can have at most two lines, even in the case of multiple speakers.
Try to always stick to 1 line unless the character per line (CPL) limit is exceeded.
DO DON’T
- Hi mum. - Hi John.
-Hi mum.
How was
-Hi John. How was your day?
your day?
2. Line Breaks
‣
Ideally, try to seek subtitles balance by keeping the first line shorter than the second line.
Lines should be broken at logical points, matching a punctuation sign like a period, comma or colon
whenever possible.
If the break has to be elsewhere in the sentence, avoid splitting the parts of the speech in the chart
below.
DON’T sacrifice logical line breaks to keep a pyramidal text distribution in subtitles.
Parts of the
DO DON’T
speech
Preposition and
They arranged a conference They have arranged a conference in
the following
in Rome. Rome.
phrase
Conjunction and
They arranged a conference They have arranged a conference but
the following
but I didn’t go. I didn’t go.
item
💯 Quality tip: Are you still struggling to understand what a logical point is? Have a look at the
sentences below for further examples.
The orange slashes indicate where it would be OK to break if breaking at the green slashes
were not possible.
Mary wants/to go/to the store,/but as far as I know,/all the stores/are closed/on Translation Day.
3. CPL
‣
The subtitles CPL should not exceed 42 CPL. Whenever exceeded, it is marked in red in the Editor.
If the line is over 42 CPL, you can either:
Split into multiple subtitles by pressing Enter to stay within this limit;
Omit content without altering the core meaning of the speech. Check the section “Text Reduction”
for further details.
💯 Quality tip: Did you have to break many lines to meet the CPL and you are afraid of having
messed it up with line-break rules?
Be sure that no line ends with an article or a preposition and watch out for those ending with
an auxiliary verb – they might potentially contain issues.
💡 CPL for portrait mode is 32. For further details, check the section “Portrait Mode”.
1. CPS
‣
The subtitle CPS (Characters per second) is an indicator of the comfort of the viewer. A high CPS
indicates that the viewer will have to read faster, making the experience less comfortable.
Values higher than 20 CPS indicate a degraded viewer experience, so you should make an effort to
try to reduce it when possible. Whenever exceeded, it is marked in red in the Editor.
There is a slight tolerance in exceeding the 20 CPS soft limit if the speaker talks too fast and it is
not possible to reduce the subtitle further without losing meaning or worsening the final user’s
experience.
💡 If the content is for children, the CPS should not exceed 15.
2. Sync
‣
To be sure your text is synced with the audio and thus granting a great experience to the final user,
stick to the guidelines listed below (in order of priority):
Subtitle appearance should coincide with speech onset. Subtitle disappearance should coincide
roughly with the end of the corresponding speech segment.
Your subtitles should not anticipate speech by more than 1.5 seconds or hang up on the screen
for more than 1.5 seconds after speech has stopped.
Avoid creating subtitles that straddle a shot change (i.e. a subtitle that starts in the middle of
shot one and ends in the middle of shot two).
You might need to split a sentence at an appropriate point or delay the start of a new one to
coincide with the shot change.
To meet the 20 CPS limit, you can split or merge subtitles or extend their timespan.
💡 Use the timeline interface located at the bottom of the editor to synchronize the subtitles
with the audio and extend or shorten their timespan.
💯 Quality tip: While you should aim to meet the CPS, it is paramount that the text on the screen
coincides with the speech – the final user’s experience is worsened when they have to read
something different from what they are hearing.
Avoid pushing the text forward or backwards to meet the CPS if this leads to a mismatch
between speech and text.
In the example below, for instance, the third subtitle has been pushed back to meet the 20-
character limit of the second one (boxes in green show where the synced subtitles should be
appearing). It means that a new sentence in the speech would start while the previous subtitle is
still on screen, this worsening the final user’s experience, even though the CPS has been met.
Portrait Mode
In some cases, you might find the video has been recorded vertically.
This can be easily identified because the picture will not cover the entire screen.
Please follow the adjusted criteria (in order of priority):
Maximum 32 CPL
Keep 20 CPS
Ignore the number of lines appearing on the video, the font size will be adjusted later.
Square Videos
In some cases, you might find the video has a square format (1:1).
Please follow the adjusted criteria (in order of priority):
🔑 Search keywords: vertical video, portrait mode, video format, square video
14. Useful Tools
‣
Here is a list of life-saving tools for transcriptionists and subtitlers:
Grammarly
Language Tool
Sound booster
Chrome extension
15. Reminders
‣
Always check if there are specific instructions from the customer for a file. In that case, the customer’s
instructions prevail over these Guidelines.
Never transcribe in an external editor (Word for example) and then paste the text into Happy Scribe's
editor.
Be sure to run a spell-check by using Grammarly or any other available extension available for your
browser.
If you struggle with some specific words or a part of the speech, be sure to run some searches on the
Internet. Check pages related to the topic of the file you are working on for better results.
If you cannot find the answer, reach out to the Tribe on Discord and seek further support.
Please note, once you have submitted a file, you cannot alter and correct the text afterwards.
If you accidentally submit a file, please reach out to human@happyscribe.com