Professional Documents
Culture Documents
Arshan Shivam PDF
Arshan Shivam PDF
Group Members:
Arshan Zaman (2016136)
Shivam Singh (2016196)
Proposal: We plan to do audio processing on a voice sample and perform voice morphing. The
main aim is to perform morphing. However, we will first try to work with a self-recorded sample
and try to remove noise if possible. We will then perform morphing on an audio sample to
change voice for the source speaker to a designated target speaker for eg. try to make it
metallic as in the music videos done by the electronic duo band Daft Punk (the motivation for
the project). We went through some papers and blogs on google (mentioned underneath) and
found that this can be done by employing a variety of techniques like pitch modification,
changing loudness, thresholding, etc.
There is no such strict work division presently. We plan to divide the paper readings among us
and go through more blogs/papers etc. to gain more ideas. After this, each of us will work on
some specific features to change in the audio sample.
References:
● https://ieeexplore.ieee.org/document/1325909
● http://www.123seminarsonly.com/CS/001/Voice-Morphing.html
● https://www.researchgate.net/publication/246021550_Relationship_between_changes_i
n_voice_pitch_and_loudness
● https://www.researchgate.net/publication/252824047_Voice_Conversion_using_Pitch_S
hifting_Algorithm_by_Time_Stretchingwith_PSOLA_and_Re-Sampling
● https://xamat.github.io/pubs/mosart2001-xamat.pdf
● https://pdfs.semanticscholar.org/9b8d/c317ee9428206c4f13ddba0dc215eb6f5d98.pdf