You are on page 1of 16

1

VIRTUAL VOICE
ASSISTANT IN
PYTHON
Submitted By:
OM BAJPAI
PALAK RANI
PUSHPENDRA
Contents 2

 Introduction
 How does an intelligent personal assistant work?
 Dataflow diagrams
 Automatic Speech Recognition System Model in Google AP
 Use case diagrams
 Modules
 Requirements
 Conclusion
Introduction 3

 A virtual assistant is a technology based on artificial intelligence. The software


uses a device’s microphone to receive voice requests while the voice output takes
place at the speaker. But the most exciting thing happens between these two
actions.
 It is a combination of several different technologies: voice recognition, voice
analysis and language processing.
 It is completely developed using one of the most powerful language python.
How does an intelligent personal assistant 4

work?
 User asks a personal assistant to perform a task.
 The natural language audio signal is converted into digital data that can be
analyzed by the software.
 Compared with a database of the software using an innovative algorithm to find a
suitable answer.
 This database is located on distributed servers in cloud networks. For this
reason ,it must have a reliable Internet connection.
Dataflow Diagrams 5

DFD - 0

Personal
Voice Speech to Action
User voice Computer
Text
assistant

Response

Voice output
Text to speech
6
DFD - 1

virtual voice assistant

Voice Command Perform action


User Voice to text Computer
Action perform
Database
7
DFD - 2

Microphone Google voice API

Flow sensor Voice audio


User This will convert
This will convert
value data voice data into text
voice into binary
form

Perform action

Computer
Automatic Speech Recognition System Model in 8
Google API

Acoustic Pronunciation Language


models Dictionary Models

Speech signal Recognized words


Feature Extraction Decoder
Feature Extraction 9

 Feature Extraction is a common to extract a set of features from speech signal.


 Classification of Feature extraction is carried out on the set of features instead of
the speech signals themselves.
 The feature extraction stage seeks to provide a compact representation of the
speech waveform. This form should minimize the loss of information that
discriminates between words, and provide a good match with the distributional
assumptions made by the acoustic models.
 ACOUSTIC MODELS 10
• Acoustic model is a relationship between audio signal and phoneme
• Phoneme means one of the smallest unit of speech that make one word different from another word
 PRONUNCIATION DICTIONARY
• The act or result of producing the sounds of speech, including articulation, stress, and intonation
• A phonetic transcription of a given word, sound, etc.
• An accepted standard of the sound and stress patterns of a word, phrase, etc.
 LANGUAGE MODELS
• The language model provides context to distinguish between words and phrases that sound similar.
for example, In American English the phrases “recognize speach” and “wreck a nice beach” sound
similar , but mean different things.
Use case diagrams arc
h 11
Se
en n ew
Op b

e
calculator

Open
chrom
whether ta

Yo pen
e
ub
uT
O
at eb er
ch W ws
o en
br Op il
a
em

Input Read
voice Wikipedia search

User

Ask
Music
Play

Time
User Microphone Computer Google API 12
Open Personal Assistant Accessing G-API

Voice response

Start Mic

Wait until user speak

Receive data
Convert audio to text

Match text with action

Perform action

Voice / Text
Response
 MODULES 13

 Speech recognition

 Process and system utilities ( psutil )

 PlaySound

 SMTP Protocol client ( smtplib )

 Google Text To Speech ( gtts )


Requirements 14

 Software requirements

 PyCharm IDE/visual studio code


 Python 3.8.2 and its Sub modules
Conclusion 15

 Virtual Voice Assistant System will use the Natural language processing
 and can be integrated with artificial intelligence techniques to achieve a smart
assistant that can control the computer
 and applications and even solve user queries using web searches..
 It can be designed to minimize the human efforts to interact with many other
subsystems, which would otherwise have to be performed manually.
 By achieving this, the system will make human life comfortable
16

THANK YOU

You might also like