
Topic:-

Artificial Intelligence for Speech Recognition

Artificial Intelligence (or AI): Definition:- The study and design of intelligent agents. The term is also used to describe a property of machines or programs. Among the traits researchers hope machines will exhibit are reasoning, knowledge, planning, learning, communication, perception and the ability to move and manipulate objects.

Applications of AI
Pattern Recognition
Hand Recognition
Speech Recognition
Natural Language Processing
Face Recognition
Artificial Creativity
Non-linear Controls and Robotics

Speech Recognition
Speech recognition converts spoken words to machine-readable input. It is also called Voice Recognition.

Speech recognition includes:
voice dialing
content-based spoken audio search
speech-to-text processing

SPEECH RECOGNITION PROCESS

[Figure: block diagram. Labels: User, Voice, Sound, Speech recognition device, Display, Applications (Dictating; Commands to computers; Input to other robots, expert systems), NLP, Understanding, Dialogue with user]

Speech Recognition in Cellphones
The caller's words are captured and digitized by the speech-recognition system. The digitized voice is split into individual frequency components, called spectral representations. These components are translated into phonemes. Complex models and algorithms then determine a likely translation.
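The step of splitting digitized voice into frequency components can be sketched with a plain discrete Fourier transform. This is only an illustration under stated assumptions: the function names, the 8000 Hz sample rate and the 64-sample test tone are hypothetical, and real recognizers use faster transforms and further processing before phonemes are estimated.

```python
import cmath
import math

def dft_magnitudes(frame):
    """Return the magnitude of each frequency bin (0 .. N/2) for one frame."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        # Correlate the frame with a complex sinusoid at bin k.
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        mags.append(abs(acc))
    return mags

def dominant_frequency(frame, sample_rate):
    """Return the frequency in Hz of the strongest non-DC component."""
    mags = dft_magnitudes(frame)
    k = max(range(1, len(mags)), key=lambda i: mags[i])  # skip the DC bin
    return k * sample_rate / len(frame)

# Assumed input: 64 samples of a 1000 Hz tone digitized at 8000 Hz.
SAMPLE_RATE = 8000
frame = [math.sin(2 * math.pi * 1000 * t / SAMPLE_RATE) for t in range(64)]
peak_hz = dominant_frequency(frame, SAMPLE_RATE)
```

Each entry of the list returned by `dft_magnitudes` is one frequency component of the frame; a sequence of such lists over successive frames is one simple form of spectral representation.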

Utterances
When the user says something, this is known as an utterance. An utterance is any stream of speech between two periods of silence. Utterances are sent to the speech engine to be processed. Silence, in speech recognition, is almost as important as what is spoken, because silence delineates the start and end of an utterance.
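How silence delineates an utterance can be sketched with a simple energy threshold. The function name, the threshold and the minimum silence length below are assumptions chosen for illustration; the source does not say how a real speech engine detects silence.

```python
def split_utterances(frame_energies, threshold, min_silence_frames):
    """Return (start, end) frame index pairs, end exclusive, one per utterance.

    A frame counts as speech when its energy is at or above `threshold`.
    An utterance ends once `min_silence_frames` silent frames occur in a row.
    """
    utterances = []
    start = None      # index where the current utterance began
    silent_run = 0    # consecutive silent frames seen inside an utterance
    for i, energy in enumerate(frame_energies):
        if energy >= threshold:
            if start is None:
                start = i
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # Close the utterance just after its last speech frame.
                utterances.append((start, i - silent_run + 1))
                start = None
                silent_run = 0
    if start is not None:
        # Audio ended mid-utterance; drop any trailing silent frames.
        utterances.append((start, len(frame_energies) - silent_run))
    return utterances

# Hypothetical per-frame energies: two bursts of speech separated by silence.
energies = [0.0, 0.1, 0.9, 0.8, 0.1, 0.7, 0.0, 0.0, 0.0, 0.6, 0.9, 0.0]
segments = split_utterances(energies, threshold=0.5, min_silence_frames=2)
```

The single quiet frame at index 4 is shorter than the minimum silence length, so it stays inside the first utterance instead of splitting it.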

Pronunciations
The speech recognition engine uses all sorts of data, statistical models, and algorithms to convert spoken input into text. One piece of information that the speech recognition engine uses to process a word is its pronunciation, which represents what the speech engine thinks a word should sound like.

Grammars
A grammar uses a particular syntax, or set of rules, to define the words and phrases that can be recognized by the engine. A grammar can be as simple as a list of words, or it can be flexible enough to allow such variability in what can be said that it approaches natural language capability.
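A grammar at the simple end of that range can be sketched as a sequence of slots, each listing the words allowed at that position. The slot format, the `DIAL_GRAMMAR` vocabulary and the `matches` helper are hypothetical and not the syntax of any particular speech engine.

```python
# Hypothetical grammar: each slot lists the words allowed at that position.
# An empty string in a slot marks the slot as optional.
DIAL_GRAMMAR = [
    {"call", "dial"},
    {"", "my"},
    {"home", "office", "mobile"},
]

def matches(grammar, words):
    """Return True if the word sequence can be produced by the slot grammar."""
    def match_from(slot, pos):
        if slot == len(grammar):
            return pos == len(words)  # every word must be consumed
        options = grammar[slot]
        # Try skipping an optional slot.
        if "" in options and match_from(slot + 1, pos):
            return True
        # Try consuming one word with this slot.
        return (pos < len(words)
                and words[pos] in options
                and match_from(slot + 1, pos + 1))
    return match_from(0, 0)

accepted = matches(DIAL_GRAMMAR, "call my office".split())
also_accepted = matches(DIAL_GRAMMAR, "dial home".split())
rejected = matches(DIAL_GRAMMAR, "please call home".split())
```

Adding words to a slot or marking more slots optional widens what can be said, which is the sense in which a richer grammar moves toward natural language.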

Accuracy
The performance of a speech recognition system is measurable. Perhaps the most widely used measurement is accuracy. It is typically a quantitative measurement and can be calculated in several ways. Arguably the most important measurement of accuracy is whether the desired end result occurred.
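One common way to calculate accuracy is the word error rate: the number of word substitutions, deletions and insertions needed to turn the recognized text into the reference text, divided by the number of reference words. The source does not name a specific metric, so the sketch below is an assumption; it is applied to the misrecognition example quoted under Disadvantages.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the ref words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)

wer = word_error_rate("I sure look forward to seeing you",
                      "Assure look forward to seen in you")
# 4 word errors over 7 reference words, about 0.57
```

A lower value is better, and 0.0 means the recognized text matches the reference word for word. Whether the desired end result occurred is a separate, task-level measurement that this number does not capture.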

Training
With speech recognition systems, both the software and the user require training. Patience and practice are required. The user needs to take things slowly, practising putting their thoughts into words before attempting to use the system.

SPEAKER-DEPENDENT WORD RECOGNIZER

SPEAKER INDEPENDENCY

A speaker-independent system can be used by anybody and can recognize any voice, even though voice characteristics vary widely from one speaker to another. Most of these systems are costly and complex, and they also have very limited vocabularies.

It is important to consider the environment in which the speech recognition system has to work. The grammar used by the speaker and accepted by the system, noise level, noise type, position of the microphone, and speed and manner of the user's speech are some factors that may affect the quality of the speech recognition.

Applications of Speech Recognition
Health Care - Even in the wake of speech recognition technologies, medical transcriptionists (MTs) have not become obsolete.

Military - High-performance fighter aircraft
Helicopters - As in fighter applications, the overriding issue for voice in helicopters is the impact on pilot effectiveness.
Battle Management - Speech recognition equipment was tested in conjunction with an integrated information display for naval battle management applications.

Telephony and other domains - Automatic speech recognition (ASR) in the field of computer gaming and simulation is becoming more widespread.
Disabled people - People with disabilities are another part of the population that benefits from speech recognition programs.

Advantages:
Speech is a very natural way to interact, and it is not necessary to sit at a keyboard or work with a remote control. Users need little training to get started. Speech is preferred as an input because speaking comes naturally and is usually faster than typing.

Disadvantages:
Even the best speech recognition systems sometimes make errors. If there is noise or some other sound in the room (e.g. the television or a kettle boiling), the number of errors will increase.
Speech recognition works best if the microphone is close to the user (e.g. in a phone, or if the user is wearing a microphone). More distant microphones (e.g. on a table or wall) will tend to increase the number of errors.
The computer has trouble with "sound-alike" errors. It's hard to get mad at the computer for not recognizing mumbling. But it can be frustrating when you think you are speaking clearly, and it just isn't good enough. For example:
When I said: "I sure look forward to seeing you"
The computer heard: "Assure look forward to seen in you"

Conclusion
This paper presents speech recognition in artificial intelligence systems. It is important to consider the environment in which the speech recognition system has to work.

The grammar used by the speaker and accepted by the system, noise level, noise type, position of the microphone, and speed and manner of the user's speech are some factors that may affect the quality of speech recognition.

Queries