1. Project Introduction
In this project, all audio clips come from unscripted conversation or narration by native speakers
in Nepal.
- The annotator's job is to judge whether the audio is valid, adjust the timestamp, make the
annotation result match the audio exactly, and select the category that corresponds to the audio.
2. Annotation work
If you encounter any of the following kinds of audio, mark it as invalid and select the corresponding
invalid reason. There is no need to adjust the timestamp or the transcription.
a. Non-target language. The whole audio is not Nepali.
b. Noise & non-speech. The whole audio is full of noise or silence.
c. Meaningless sentence. The whole audio consists only of filler or modal words like ah, um, haha... Or the
speaker realizes that he said something wrong and says the word again correctly.
d. Illegal content. The audio involves pornography, violence, racial discrimination, anti-government
content, etc.
e. Non-native speaker/wrong accent. The audio is not recorded by a native Nepali speaker or does not meet
the requirements of the standard Nepali accent.
f. Read off a script. It is obvious from the tone that the speaker is reading from a text.
g. Interruption/Pause/Stutter/Overlap. The sentence is interrupted by sudden noise, or the
speaker is not fluent. The pause here refers to an abnormal pause.
h. Incomplete sentence. An incomplete sentence is a sentence that is cut off, or one that does not
carry enough meaning or information to determine the category.
i. Low volume. The audio volume is adjusted to 50% and the speaker's voice is still too quiet and
unclear.
j. Repetition sentence. The content of the current sentence is the same as the content of a
sentence you encountered before.
k. Children’s sound. The voice of the person recording is obviously the voice of a child.
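As an illustration of the rule above, the invalid reasons a to k and the "no timestamp or transcription for invalid audio" rule could be modelled as follows. This is only a sketch: the names `InvalidReason` and `Judgement` are hypothetical and are not part of any annotation tool described in this guideline.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class InvalidReason(Enum):
    """Invalid reasons a to k from the guideline (member names are illustrative)."""
    NON_TARGET_LANGUAGE = "a"
    NOISE_OR_NON_SPEECH = "b"
    MEANINGLESS_SENTENCE = "c"
    ILLEGAL_CONTENT = "d"
    NON_NATIVE_OR_WRONG_ACCENT = "e"
    READ_OFF_SCRIPT = "f"
    INTERRUPTION_PAUSE_STUTTER_OVERLAP = "g"
    INCOMPLETE_SENTENCE = "h"
    LOW_VOLUME = "i"
    REPETITION_SENTENCE = "j"
    CHILDRENS_SOUND = "k"


@dataclass
class Judgement:
    """Hypothetical record of one validity decision for one audio clip."""
    valid: bool
    reason: Optional[InvalidReason] = None

    def __post_init__(self) -> None:
        # An invalid clip needs a reason; a valid clip must not have one.
        if self.valid and self.reason is not None:
            raise ValueError("a valid clip must not carry an invalid reason")
        if not self.valid and self.reason is None:
            raise ValueError("an invalid clip needs an invalid reason")

    def needs_timestamp_and_transcription(self) -> bool:
        # Only valid clips go on to timestamp adjustment and transcription.
        return self.valid
```

For example, `Judgement(valid=False, reason=InvalidReason.LOW_VOLUME)` describes a clip that is skipped without any timestamp or transcription work.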
A. Timestamp
a. Click on the spectrum and drag the mouse to determine the starting point and the end point.
b. Leave 1-1.5 seconds of silence at the beginning and end of valid speech. Please note that it
should be more than 1 second, and the timestamp must not cut any of the speaker's valid speech.
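The timestamp rule can be sketched as a small check. This is a minimal illustration, not the behaviour of the annotation tool: the names `Segment` and `check_timestamp` are hypothetical, times are assumed to be in seconds, and both ends of the 1-1.5 second range are treated as allowed.

```python
from dataclasses import dataclass

MIN_PAD = 1.0  # seconds of silence required on each side (guideline lower bound)
MAX_PAD = 1.5  # seconds of silence allowed on each side (guideline upper bound)


@dataclass
class Segment:
    start: float  # start time in seconds
    end: float    # end time in seconds


def check_timestamp(selection: Segment, speech: Segment) -> list[str]:
    """Return the problems found in a selection; an empty list means it passes.

    `selection` is the region chosen on the spectrum, `speech` is where the
    valid speech actually starts and ends (both assumed known here).
    """
    problems = []
    if selection.start > speech.start or selection.end < speech.end:
        problems.append("selection cuts valid speech")

    lead = speech.start - selection.start  # silence before the speech
    tail = selection.end - speech.end      # silence after the speech
    for name, pad in (("leading", lead), ("trailing", tail)):
        if pad < MIN_PAD:
            problems.append(f"{name} silence shorter than {MIN_PAD} s")
        elif pad > MAX_PAD:
            problems.append(f"{name} silence longer than {MAX_PAD} s")
    return problems
```

With speech running from 2.0 s to 5.0 s, a selection from 0.8 s to 6.2 s passes, while a selection that starts at 2.5 s is reported as cutting valid speech.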
B. Transcribe
C. Valid audio rule
1.1 Strictly follow the principle of RECORDING EXACTLY WHAT YOU HEAR. DO NOT ADD OR OMIT
ANY CONTENT.
Example 1: repeated words
Transcription: where where are we going?
Example 2: stutters
1.4 Numbers
Numbers should be written out completely in Nepali words according to their
pronunciation. Arabic/Nepali numerals are NOT allowed.
Example 1:
“156” ->
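A transcription can be checked automatically for forbidden numerals. The sketch below is an assumption about how such a check might look, with a hypothetical function name `contains_numerals`; it flags ASCII digits (0-9) and Devanagari digits (U+0966 to U+096F) and does not convert numbers to words.

```python
import re

# ASCII digits 0-9 plus Devanagari digits U+0966 to U+096F.
NUMERAL_PATTERN = re.compile(r"[0-9\u0966-\u096F]")


def contains_numerals(transcription: str) -> bool:
    """Return True if the transcription contains any Arabic or Nepali numeral."""
    return NUMERAL_PATTERN.search(transcription) is not None
```

A transcription for which `contains_numerals` returns True still needs its numbers spelled out in Nepali words.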
1.6 Category
The category was pre-selected when the audio was collected. You only need to check whether the
category is consistent with the audio, and finally select the most obvious one for the audio.
Daily conversation: Any valid audio whose category you cannot determine can be assigned to this
category.
Travel shopping
Number/ Time (Please include number or time in your sentence): There must be a specific
word that identifies the sentence. For example: 1. There are two stars in the sky. 2. It’s seven
o’clock now.
Social/ Economy
Education
Medical/COVID
Political/ Diplomacy:
Some politically sensitive sentences should be marked invalid (illegal content):
Sentences that make native speakers feel uncomfortable, such as inappropriate remarks about their country,
political situation, political party, etc.
Sports/ Entertainment
Technology/ Digital Products/ Games: There must be a specific word that identifies the sentence.
For example: 1. My telephone needs to be repaired.
Name/ Location/ Address: There must be a specific word that identifies the sentence. For example:
1. Starbucks sells coffee. 2. Hynix school is the place for education. 3. Jake is my
friend.