Professional Documents
Culture Documents
A TECHNICAL SEMINAR ON
“VOICE MORPHING”
CO-ORDINATOR: PRESENTED BY :
E.SRI LAXMI(M.Tech,Asst prof) P.SHIVA SHANKAR
(19631A0518)
CONTENTS
WHAT IS VOICE MORPHING ?
APPROACHS TO THE PROBLEM.
CONVERSION OF VOICE.
TYPES OF VOICE MORPHING.
REFRANCES OR METHODS.
APPLICATION OF VOICE MORPHING.
AVAILABLE SOFTWARE FOR VOICE
MORPHING.
CONCLUSION.
WHAT IS VOICE MORPHING ?
Voice Morphing which is also referred to as voice
transformation and voice conversion is a technique to
modify a source speaker's speech utterance to sound as
if it was spoken by a target speaker.
There are many applications which may benefit from
this sort of technology. For example, a TTS system with
voice morphing technology integrated can produce
many different voices. In cases where the speaker
identity plays a key role, such as dubbing movies and
TV-shows, the availability of high quality voice
morphing technology will be very valuable allowing the
appropriate voice to be generated (maybe in different
languages) without the original actors being present.
APPROACHES TO THE PROBLEM
Proposed model.
Wavelet Decomposition :-
Wavelets are a class of functions that possess compact
support and form a basis for all finite energy signals.
They are able to capture the non-stationary spectral
characteristics of a signal by decomposing it over a set of
atoms which are localized in both time and frequency. The
DWT uses the set of dyadic scales and translates of the
mother wavelet to form an orthonormal basis for signal
analysis.
EXAMPLE
The original signal S is
Click icon to add picture split into an approximation
cA1 and a detail cD1.
The approximation is then
itself split into an
approximation and a detail
and so on.
Decomposing a signal
into k levels of
decomposition therefore
results in k+1 sets of
coefficients at different
frequency resolutions, k
levels of detail and 1 level
of approximation
coefficients.
Proposed model :
Voice morphing is performed in two steps: training and
transformation. The training data consist of repetitions
of the same phonemes uttered by both source and
target speakers.
The source and target training data is divided into
frames of 128 samples and the data is randomly
divided into training and validation sets.
A 5-level wavelet decomposition is then performed to
the source and target training data.
TYPES OF VOICE MORPHING
IN THIS SECTION WE KNOW THAT IN WHICH
FORM WE CAN TRANFORM A NORMAL VOICE
OR SPEECH.
ENTERTAINMENT.
IN FILM INDUSTRY.
SECURITY.
IN COMPUTER GAMING
AVAILABLE SOFTWARE FOR VOICE
MORPHING
MORPH VOX PRO VOICE CHANGER 2.0.6.
MORPH VOX PRO VOICE CHANGER 4.2.2.
MORPH VOX PROVOICE CHANGER 4.3.8.
TERA VOICE SERVAER 2004.
FLASH VOICE BUTTONS 3.0.
VOICE TWISTER 1.0.4.
VOICE AGAIN 1.5.2.
QUICK VOICE FOR OSX 2.2.0.
QUICK VOICE FOR WINDOWS 2.2.0.
CONCLUSION