
Speech Recognition

Introduction

What is Speech Recognition?
- Voice Recognition?
Where can it be used?
- Dictation
- System control/navigation
- Commercial/Industrial applications
- Hand held digital recorders

Contents:
- Continuous/Discrete
- How does it work?
- Recent improvements
- Current software options
- Future of SR

Continuous or Discrete?
Continuous speech - dictation
Discrete speech - system controls

How does SR work?


- Recognition
- Training
- Correction
- Command/Control

Recognition (1)
Diagram labels: Voice Input; Analog to Digital; Acoustic Model; Language Model; Speech Engine; Display; Feedback
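One way a speech engine could weigh its acoustic model against its language model is to add their log probabilities for each candidate word and keep the best one. The sketch below is only an illustration: the function name `pick_word` and the probability values are hypothetical, and the slides do not say how any particular product combines the two models.

```python
import math

def pick_word(acoustic_scores, language_scores):
    """Return the candidate word with the highest combined log score."""
    best_word, best_score = None, -math.inf
    for word, p_acoustic in acoustic_scores.items():
        p_language = language_scores.get(word, 0.0)
        if p_acoustic <= 0.0 or p_language <= 0.0:
            continue  # a zero probability rules the candidate out
        score = math.log(p_acoustic) + math.log(p_language)
        if score > best_score:
            best_word, best_score = word, score
    return best_word

# Hypothetical scores: the two words sound alike, so context decides.
acoustic = {"there": 0.5, "their": 0.5}
language = {"there": 0.9, "their": 0.1}
chosen = pick_word(acoustic, language)
```

With equal acoustic scores, the language model alone settles the choice between the two homophones.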

Recognition (2)
Acoustic Modeling
Spoken words: I think there are...
Phonemes: ay th-in-nk-kd dh-eh-r aa-r
H.M.M.s: 5 state representation
Speech Engine
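The "5 state representation" can be pictured as a left-to-right hidden Markov model in which each state either repeats or hands over to the next. The sketch below scores an observation sequence with the standard forward algorithm. The symbols, the emission probabilities and the `stay` value are made-up assumptions for illustration, not values from any real engine.

```python
N_STATES = 5  # matches the slide's "5 state representation"

def left_to_right_transitions(stay=0.6):
    """Build a 5x5 matrix: each state repeats or advances one state."""
    matrix = [[0.0] * N_STATES for _ in range(N_STATES)]
    for i in range(N_STATES):
        if i + 1 < N_STATES:
            matrix[i][i] = stay
            matrix[i][i + 1] = 1.0 - stay
        else:
            matrix[i][i] = 1.0  # last state can only repeat
    return matrix

def forward_likelihood(observations, transitions, emissions):
    """Return P(observations | model), starting in state 0 (forward algorithm)."""
    if not observations:
        raise ValueError("need at least one observation")
    alpha = [0.0] * N_STATES
    alpha[0] = emissions[0].get(observations[0], 0.0)
    for symbol in observations[1:]:
        alpha = [
            sum(alpha[i] * transitions[i][j] for i in range(N_STATES))
            * emissions[j].get(symbol, 0.0)
            for j in range(N_STATES)
        ]
    return sum(alpha)

# Hypothetical toy model: state j mostly emits the j-th symbol of "abcde".
SYMBOLS = "abcde"
emissions = [
    {s: (0.8 if k == j else 0.05) for k, s in enumerate(SYMBOLS)}
    for j in range(N_STATES)
]
transitions = left_to_right_transitions()
in_order = forward_likelihood(list("aabbccddee"), transitions, emissions)
reversed_order = forward_likelihood(list("eeddccbbaa"), transitions, emissions)
```

A sequence that walks through the symbols in model order scores higher than the same symbols reversed, which is the property a recognizer relies on when comparing candidate phoneme models.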

Recognition (3)
Language Modeling
- Word context
- Word frequency
- Transition possibilities
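Word frequency and transition possibilities can be illustrated with a simple bigram count model. This is a minimal sketch under the assumption that the language model is estimated from plain text; the tiny corpus and the function names are hypothetical.

```python
from collections import Counter, defaultdict

def train_bigram(sentences):
    """Count word frequencies and word-to-word transitions."""
    unigrams = Counter()
    bigrams = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        unigrams.update(words)
        for prev, nxt in zip(words, words[1:]):
            bigrams[prev][nxt] += 1
    return unigrams, bigrams

def transition_probability(bigrams, prev, nxt):
    """Estimate P(nxt | prev) from the counts; 0.0 if prev was never seen."""
    followers = bigrams.get(prev, Counter())
    total = sum(followers.values())
    return followers[nxt] / total if total else 0.0

# Hypothetical training text.
corpus = ["I think there are two", "there are many", "their dog is here"]
unigrams, bigrams = train_bigram(corpus)
p_there_are = transition_probability(bigrams, "there", "are")
p_their_are = transition_probability(bigrams, "their", "are")
```

In this toy corpus "are" always follows "there" and never follows "their", so word context separates two words that sound the same.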

Voice Training (1)


Can be done by:
- Predetermined text segments
- Individual words
Compares new acoustic data with the old and combines them
More training = better recognition
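Comparing new acoustic data with the old and combining them can be sketched as a running-mean update of a stored feature vector. This assumes, purely for illustration, that a voice profile keeps one mean vector per sound along with a sample count; real products may adapt their models quite differently.

```python
def adapt_mean(old_mean, old_count, new_sample):
    """Fold one new feature vector into a stored running mean."""
    if len(old_mean) != len(new_sample):
        raise ValueError("feature vectors must have the same length")
    new_count = old_count + 1
    new_mean = [m + (x - m) / new_count for m, x in zip(old_mean, new_sample)]
    return new_mean, new_count

# Hypothetical two-dimensional features for one sound.
mean, count = adapt_mean([1.0, 2.0], 1, [3.0, 4.0])
```

Each additional sample moves the stored mean by a smaller step, one simple way to picture how more training yields a steadier profile.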

Voice Training (2)


- User specific
- Voice file
- Voice qualities
- Pronunciation
- Patterns of word use
- Preferred vocabulary


Making Corrections
- Move cursor by voice command
- Memorize edit commands
- List of possible alternatives
- Make correction manually


Command/Control
- Desktop grid
- Program or Link name/number
- URL name
- Memorized commands
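A desktop grid lets the user say a cell number to move the pointer. The sketch below assumes a hypothetical 3x3 grid numbered 1 to 9, row by row from the top left; the layout and the function name are illustrative and not taken from any of the products discussed.

```python
def grid_cell_center(cell, width, height, rows=3, cols=3):
    """Return the (x, y) screen position at the center of a numbered grid cell."""
    if not 1 <= cell <= rows * cols:
        raise ValueError("cell number outside the grid")
    row, col = divmod(cell - 1, cols)
    cell_w, cell_h = width / cols, height / rows
    return (col + 0.5) * cell_w, (row + 0.5) * cell_h

# Saying "five" on a hypothetical 900x600 screen targets the middle cell.
x, y = grid_cell_center(5, 900, 600)
```

Calling the same function again on the chosen cell's own width and height would subdivide it, so a few spoken numbers can narrow the pointer down to a small area.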


Recent Improvements in SR
- Faster training ~10 min.
- Better recognition ~95%
- More compatible software
- Better system control/command
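The slides do not say how the ~95% figure is measured. One common convention, assumed here, is word accuracy: one minus the word-level edit distance between what was said and what was recognized, divided by the number of words spoken. The function names below are hypothetical.

```python
def word_errors(reference, hypothesis):
    """Count word substitutions, insertions and deletions (edit distance)."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(prev[j] + 1,              # word dropped
                           cur[j - 1] + 1,           # extra word inserted
                           prev[j - 1] + (r != h)))  # match or substitution
        prev = cur
    return prev[-1]

def word_accuracy(reference, hypothesis):
    """Return 1 - (word errors / reference length)."""
    n = len(reference.split())
    if n == 0:
        raise ValueError("reference must contain at least one word")
    return 1.0 - word_errors(reference, hypothesis) / n
```

Under this measure, one wrong word in a 20-word sentence corresponds to 95% accuracy.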


Current Software Options for PC


- Dragon Systems NaturallySpeaking
- Philips FreeSpeech
- IBM ViaVoice
- Lernout & Hauspie Voice Xpress


How well do they work?


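Ratings table. Products: Dragon, Philips, IBM, L&H. Criteria: Training, Dictation, App. Integrat., Correct., Command - Control.
Ratings in extracted order (cell positions not recoverable): Excellent, Excellent, Good; Fair, Fair, Good, Good, Good; Good; Good, Excellent, Good; Excellent, Good, Good, Good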

Future of SR
SUI: Speech-based User Interface
Improvements needed:
- Greater accuracy
- Greater system control/command
- More compatible software


Conclusion
- SR Uses
- How does it work?
- Current Software
- Problems of SR
- More SR coming soon.


References
1. Alwang, Greg. Speech Recognition. PC Magazine, December 1, 1999.
2. Hauptmann, Alexander G.; Jang, Photina Jaeyun. Carnegie Mellon University. Learning to Recognize Speech by Watching Television. IEEE Intelligent Systems, September/October 1999.
3. Miastkowski, Stan. Latest Speech Software Gets You Up and Running Faster. PC World, November 1999.

