You are on page 1of 1

‘Guess who Am I?

’ Application
Digital Siganls Processing

Name: Rami Musleh


Name Amjad Shomali Name: Nadine Eideh
ID: 1141304
ID: 1161957 ID: 1161961
Faculty of Engineering and Technology
Faculty of Engineering and Technology Faculty of Engineering and Technology
Birzeit University
Birzeit University Birzeit University

I. INTRODUCTION Hence comparing the user’s input voice with the calculated
averages for example if the zero crossing for the user is high
A simple python voice recognition program. The project it may highly mean that it’s a YES and vice versa.
consists of 3 phases, first being collecting voice data of the Note that splitting the signal and calculating the zero
words “YES” and “NO” by different people in order to crossing for it gives more accurate results.
analyze this data and compare it with data that the computer Finally comparing the cosine spatial distance of the bulk
has trained for. Second being a basic programming guess calculated data and the input data in order to determine the
“Who am I?” game through the terminal. The program has similarity.
12 pictures in which the user picks one, the program
afterwards starts asking yes or no questions in order to guess Making a guess who am I game was fairly easy, it was just
the picture. Third and final phase is connecting both phase made up of reducing the characteristics array after each
one and two to complete a whole Guess who am I program. input until we finally have one entry left.
Simply said the program asks the user through computer
generated speech (TTS) and records the answer for analysis, And so was connecting both phases, it was just a matter of
it being a YES or a NO. code organization.

II. PROBLEM SPECIFICATION VI. RESULT AND ANALYSIS

Methods for analyzing the user’s voice pattern in order to Asserting a simple collection of test data after training,
determine if it’s a yes or a no. optimizing the recognition to which had a 100% accuracy rate. But it’s hard to confirm
be fairly accurate for each case by adding more test data the accuracy rate with such data
with a variety of spectrum cases
After merging into phase 3 we encountered a shortfall where
the test data was fed US voice training data, hence, running
III. DATA
the program on our own voices almost never gave a right
We’ve used two parts in phase one’s data. Part one: answer.
Training voice samples where the program feeds the training
So we went off our shoes and made our own data with a
data to itself and compares it to Part two: which are the test
simple program that records 10 samples of yes/no and uses
voice samples.
it as training data. We’ve added subtitles in the console for
Phase II we pulled a few easily characterized cartoonish better observation, we can see that... IT WORKS!
pictures straight out of google for the user to use.
VII. DEVELOPMENT
Phase III we’ve added a simple program that asks the user to
say yes/no to the mic 5 times each and adds it to the training For better recognition there has to be a furthermore accurate
data for better recognition next time! way, for example calculating a more precise ZCR by
chipping the signal to small pieces before calculation.

Most importantly, more data for comparison! As we can see


IV. EVALUATION CRITERIA
after we’ve added our own data, the program started to
Simply said, the more data we collect the better the program actually work, and with a rather high accuracy rate!
will perform! In phase one the training results are compared
with each test sample and assert if the analysis is right or
not! VIII. CONCLUSION

This was probably one of the most interesting projects


V. APPROACH we’ve run through, even though the method was
Bulk calculating the zero-crossing and energy of the training oversimplified compared to the more advanced methods like
data, knowing that “YES” would have a higher frequency pattern recognition and phonetic approach, this project was
(high zero crossing) compared to the “NO” which is a low definitely a good method to learn by doing as we’ve learned
dominant vowel. a lot about the course, moreover, as a team.

You might also like