You are on page 1of 7

NOTE: All information provided in this document is confidential.

Any
publication, provision, or dissemination of this content is strictly
prohibited.​ Do not share or post the contents on the internet.

Flathead Tamil Guidelines


This task requires that you listen to short audio clips and select the 9 labels that
best describe each clip.

Please ​use headphones when working on this task. This will ensure you can hear
the audio clearly. Set your volume to a comfortable level (80%) so that it is not so
loud that your ears hurt, but also not so quiet that you might not hear important
sounds and speech.

Notes:
● If the file is corrupted, please select Corrupted for ​all categories​.
⚠​ If a file contains only silence, please do ​not​ choose corrupted
and answer all questions instead.

● If the file contains speech, listen to it again to check if it contains personal


information (see ​3. UII​)

Labels are grouped in the following 9 categories:

1
NOTE: All information provided in this document is confidential. Any
publication, provision, or dissemination of this content is strictly
prohibited.​ Do not share or post the contents on the internet.

1.​ ​Does the audio contain human speech?

The ​entire audio has human speech​ with normal


All Speech human speech pauses (less than one second between
words).

Only ​part of the audio contains human speech​.


Some Speech There is silence (for more than one second) or other
sounds that are not human speech.

No Speech There is ​no human speech​ in the audio.

The file is corrupted: There are static or machine


Corrupted augmented/contorted sounds (so it does not sound like
a human voice) in the audio.


Consider the level of confidence with which you answered the question. If the
audio is difficult to hear and you are not sure the answer you provided is
accurate, please select the label ​1. low confidence​.

2. Is the speech noisy?


The majority of the speech is not understandable
because there is loud noise drowning out the audio.
Noisy Speech Noise may include cross-talk of several speakers,
shouting, cheering, too much background noise, etc.
The speech cannot be understood​ because of the noise.

The above condition does not apply. The speech is


Intelligible Speech
clear.

No Speech There is ​no human speech​ in the audio.

The file is corrupted: There are static or machine


Corrupted
augmented/contorted sounds (so it does not sound like

2
NOTE: All information provided in this document is confidential. Any
publication, provision, or dissemination of this content is strictly
prohibited.​ Do not share or post the contents on the internet.

a human voice) in the audio.

3. Does the audio contain UII?

The audio contains user-identifiable information (UII)


UII includes full names, usernames, gamertags, street
addresses, telephone numbers, credit card numbers,
social security numbers, and email addresses.
UII
Self-promotional videos where full names are provided
are not considered UII. The names of public figures
should not be considered UII. News broadcasts are not
considered UII.

The audio ​does not​ contain user-identifiable


information (UII). UII includes full names, usernames,
gamertags, street addresses, telephone numbers, credit
card numbers, social security numbers, and email
No UII
addresses. Self-promotional videos where full names
are provided are not considered UII. The names of
public figures should not be considered UII. News
broadcasts are not considered UII.

The file is corrupted: There are static or machine


Corrupted augmented/contorted sounds (so it does not sound like
a human voice) in the audio.


Consider the level of confidence with which you answered the question. If the
audio is difficult to hear and you are not sure the answer you provided is
accurate, please select the label ​3. low confidence​.

3
NOTE: All information provided in this document is confidential. Any
publication, provision, or dissemination of this content is strictly
prohibited.​ Do not share or post the contents on the internet.

4. Does the audio contain foreground human singing?

Foreground Singing The audio contains dominant ​foreground singing​.

The audio ​does not​ contain dominant foreground


No Foreground
singing. (There may be background music or singing in
Singing
the background).

The file is corrupted: There are static or machine


Corrupted augmented/contorted sounds (so it does not sound like
a human voice) in the audio.


Consider the level of confidence with which you answered the question. If the
audio is difficult to hear and you are not sure the answer you provided is
accurate, please select the label ​4. low confidence​.

5. Does the audio contain foreground music (instrumental)?

The audio contains foreground ​instrumental music​,


Music
which may or may not be accompanied with singing.

The audio does not contain foreground instrumental


No Music music. (There may be background music or singing in
the background).

The file is corrupted: There are static or machine


Corrupted augmented/contorted sounds (so it does not sound like
a human voice) in the audio.

4
NOTE: All information provided in this document is confidential. Any
publication, provision, or dissemination of this content is strictly
prohibited.​ Do not share or post the contents on the internet.

6. Is the audio in Tamil?

All Tamil All human speech or singing in the audio file is in Tamil.

Some of the speech and singing in the audio is in Tamil,


Some Tamil
and some is in another language.

None of the human speech or singing in the audio file is


No Tamil
in Tamil.

No Speech And No There is ​no human speech​ and ​no singing​ in the
Singing audio.

The file is corrupted: There are static or machine


Corrupted augmented/contorted sounds (so it does not sound like
a human voice) in the audio.


Consider the level of confidence with which you answered the question. If the
audio is difficult to hear and you are not sure the answer you provided is
accurate, please select the label ​6. low confidence​.

Can you recognise another language in the audio (speech or singing)? Great!

If you hear another language than Tamil in the audio, make sure to choose the
appropriate label for question 6!
If you recognise the foreign language​, please ​use the text box provided to
write down the name of the language​ you hear.

By default, the text box will display a hyphen (​-​) as a placeholder. Please remove it
before typing the name of the language.
You do not need to type anything in the text box if you are not sure of the foreign
language you hear in the audio, or if there are no foreign languages in the audio.

5
NOTE: All information provided in this document is confidential. Any
publication, provision, or dissemination of this content is strictly
prohibited.​ Do not share or post the contents on the internet.

7. Does the audio contain any human sounds (other than


speech or singing)?

The audio has human sounds (other than singing or


Human Sounds speech). This includes laughing, crying, coughing,
grunting, screaming, humming.

The audio ​does not​ have human sounds (other than


No Human Sounds singing or speech) such as laughing, crying, coughing,
grunting, screaming, humming.

The file is corrupted: There are static or machine


Corrupted augmented/contorted sounds (so it does not sound like
a human voice) in the audio.

8. Does the audio contain other noises (non-human)?

The audio contains ​non-human sounds​ such as


Other Noises environmental noises, inanimate objects, animals or
any other sorts of noises.

No Other Noises The audio ​does not ​contain any non-human sounds.

The file is corrupted: There are static or machine


Corrupted augmented/contorted sounds (so it does not sound like
a human voice) in the audio.

6
NOTE: All information provided in this document is confidential. Any
publication, provision, or dissemination of this content is strictly
prohibited.​ Do not share or post the contents on the internet.

9. Is the audio explicit?

Select this label if the audio contains explicit/graphic


content such as pornography, extreme violence, or hate
speech.
Explicit
Note that a recording containing bad language alone
(i.e. without violence/harassment/hate speech) should
not​ be considered as containing explicit content.

Not Explicit The audio ​does not​ contain explicit/graphic content.

The file is corrupted: There are static or machine


Corrupted augmented/contorted sounds (so it does not sound like
a human voice) in the audio.

You might also like