You are on page 1of 2

Voice desktop architecture

Abstract:
Voice desktop architecture helps blind people to access desktop and other programs and
softwares functions of operating system.
The project gets user voice as input and performs the task according to their commend. If
user wants to open any software or application, then the user wants to tell the command. The
system will recognize the word and access the computer. This doesnt need mouse or keyboard.
A text-to-speech system (or "engine") is composed of two parts: a front-end and a backend. The front-end has two major tasks. First, it converts raw text containing symbols like
numbers and abbreviations into the equivalent of written-out words. This process is often called
text normalization, pre-processing, or tokenization. The front-end then assigns phonetic
transcriptions to each word, and divides and marks the text into prosodic units, like phrases,
clauses, and sentences. The process of assigning phonetic transcriptions to words is called textto-phoneme or grapheme-to-phoneme conversion. Phonetic transcriptions and prosody
information together make up the symbolic linguistic representation that is output by the frontend. The back-endoften referred to as the synthesizerthen converts the symbolic linguistic
representation into sound.
Speech recognition (in many contexts also known as 'automatic speech recognition',
computer speech recognition or erroneously as Voice Recognition) is the process of converting
a speech signal to a sequence of words, by means of an algorithm implemented as a computer
program. Speech recognition applications that have emerged over the last years include voice
dialing (e.g: open notepad), simple data entry (e.g., entering a credit card number), and
preparation of structured documents (e.g., a radiology report).Voice Verification or speaker
recognition is a related process that attempts to identify the person speaking, as opposed to what
is being said.
This project contains 5 modules.
1. Login Module
2. Dictation Module

3. Application Open
4. Text to Voice
5. Report Module

HARDWARE CONFIGURATION

The hardware used for the development of the project is:


PROCESSOR

PENTIUM III 866 MHz

RAM

128 MD SD RAM

MONITOR

15 COLOR

HARD DISK

20 GB

FLOPPY DRIVE

1.44 MB

CDDRIVE

LG 52X

KEYBOARD

STANDARD 102 KEYS

MOUSE

3 BUTTONS

.2 SOFTWARE CONFIGURATION

The software used for the development of the project is:


OPERATING SYSTEM

Windows XP,7

FRONT END

Visual Basic.Net 2010

Back end

SQL Server

OTHER

Microsoft Speech SDK 5.0

You might also like