Abstract:
Voice desktop architecture helps blind people access the desktop, application software, and
operating system functions.
The project takes the user's voice as input and performs tasks according to the spoken command. To
open a software application, the user speaks the corresponding command. The
system recognizes the words and carries out the action on the computer. No mouse or keyboard is needed.
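The command flow described above can be sketched as follows. This is a minimal illustration in Python, not the project's actual implementation: the `APPLICATIONS` table, the `resolve_command` function, and the executable names are assumptions made for the example, and the recognized text is assumed to arrive as a plain string from a speech recognizer.

```python
from typing import Optional

# Hypothetical mapping from spoken application names to executables.
# The entries are illustrative assumptions, not taken from the project.
APPLICATIONS = {
    "notepad": "notepad.exe",
    "calculator": "calc.exe",
    "paint": "mspaint.exe",
}


def resolve_command(recognized_text: str) -> Optional[str]:
    """Return the executable for a phrase such as "open notepad", or None."""
    words = recognized_text.lower().split()
    # Accept only the form "open <application name>".
    if len(words) < 2 or words[0] != "open":
        return None
    name = " ".join(words[1:])
    return APPLICATIONS.get(name)


if __name__ == "__main__":
    for phrase in ("open notepad", "Open Calculator", "close notepad"):
        print(phrase, "->", resolve_command(phrase))
```

A complete system would hand the returned executable to a process launcher (for example `subprocess.Popen`) and speak a confirmation or an error message back to the user.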
A text-to-speech system (or "engine") is composed of two parts: a front-end and a back-end. The front-end has two major tasks. First, it converts raw text containing symbols like
numbers and abbreviations into the equivalent of written-out words. This process is often called
text normalization, pre-processing, or tokenization. The front-end then assigns phonetic
transcriptions to each word, and divides and marks the text into prosodic units, like phrases,
clauses, and sentences. The process of assigning phonetic transcriptions to words is called text-to-phoneme or grapheme-to-phoneme conversion. Phonetic transcriptions and prosody
information together make up the symbolic linguistic representation that is output by the front-end. The back-end, often referred to as the synthesizer, then converts the symbolic linguistic
representation into sound.
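The text normalization step described above can be illustrated with a small sketch. It assumes a tiny hand-written abbreviation table and only handles integers from 0 to 999; the names `ABBREVIATIONS`, `number_to_words`, and `normalize` are hypothetical and chosen for this example only. A real front-end would also need context to decide, for instance, whether "St." means "street" or "saint".

```python
import re

# Small illustrative tables; a real front-end would use far larger ones.
ABBREVIATIONS = {"dr.": "doctor", "st.": "street", "no.": "number"}
ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out an integer from 0 to 999."""
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[ones] if ones else "")
    hundreds, rest = divmod(n, 100)
    words = ONES[hundreds] + " hundred"
    return words + (" " + number_to_words(rest) if rest else "")


def normalize(text: str) -> str:
    """Replace known abbreviations and small integers with written-out words."""
    out = []
    for token in text.split():
        lower = token.lower()
        if lower in ABBREVIATIONS:
            out.append(ABBREVIATIONS[lower])
        elif re.fullmatch(r"\d{1,3}", token):
            out.append(number_to_words(int(token)))
        else:
            out.append(token)
    return " ".join(out)


if __name__ == "__main__":
    print(normalize("Dr. Smith lives at 42 Oak St."))
    # doctor Smith lives at forty-two Oak street
```

The output of `normalize` is what the front-end would then pass on to grapheme-to-phoneme conversion.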
Speech recognition (in many contexts also known as automatic speech recognition,
computer speech recognition, or, erroneously, voice recognition) is the process of converting
a speech signal to a sequence of words, by means of an algorithm implemented as a computer
program. Speech recognition applications that have emerged in recent years include voice
commands (e.g., "open notepad"), simple data entry (e.g., entering a credit card number), and
preparation of structured documents (e.g., a radiology report). Voice verification, or speaker
recognition, is a related process that attempts to identify the person speaking, as opposed to what
is being said.
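For the data-entry case mentioned above, the recognizer's word sequence still has to be turned into the value the application needs. As a rough sketch, assuming the recognizer returns digit names as English words (the `DIGIT_WORDS` table and the `words_to_digits` function are hypothetical names introduced for this example):

```python
# Spoken digit names mapped to characters; "oh" is a common spoken form of 0.
DIGIT_WORDS = {
    "zero": "0", "oh": "0", "one": "1", "two": "2", "three": "3",
    "four": "4", "five": "5", "six": "6", "seven": "7", "eight": "8",
    "nine": "9",
}


def words_to_digits(recognized_words):
    """Convert a list of spoken digit words into a digit string.

    Raises ValueError on any word that is not a digit name, so that a
    misrecognized word is not silently dropped from the number.
    """
    digits = []
    for word in recognized_words:
        key = word.lower()
        if key not in DIGIT_WORDS:
            raise ValueError(f"not a digit word: {word!r}")
        digits.append(DIGIT_WORDS[key])
    return "".join(digits)


if __name__ == "__main__":
    print(words_to_digits(["four", "one", "one", "one"]))  # 4111
```

Rejecting unknown words, rather than skipping them, lets the application ask the user to repeat the number instead of storing a shortened one.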
This project contains 5 modules.
1. Login Module
2. Dictation Module
3. Application Open
4. Text to Voice
5. Report Module
HARDWARE CONFIGURATION
RAM: 128 MB SDRAM
MONITOR: 15" COLOR
HARD DISK: 20 GB
FLOPPY DRIVE: 1.44 MB
CD DRIVE: LG 52X
KEYBOARD
MOUSE: 3 BUTTONS
SOFTWARE CONFIGURATION
OPERATING SYSTEM: Windows XP, 7
FRONT END
BACK END: SQL Server
OTHER