RECOGNITION
Er. Sarbjeet Singh, Er. Manjit Thapa, Er. Gurpreet Singh, Er. Sukhvinder Singh
Department of Computer Science, Sri Sai College of Engg. & Tech., Badhani (Pathankot).
Abstract: This paper presents an overview of speech recognition technology, software, development,
and applications. It begins with a description of how such systems work and the level of accuracy that can
be expected. Applications of speech recognition technology in education and beyond are then explored. A
brief comparison of the most common systems is presented, as well as notes on the main centers of speech
recognition research in the educational sector. The paper concludes with potential uses of speech
recognition in education, probable main uses of the technology in the future, and a selection of key
web-based resources. We introduce original visual descriptors related to the dominant and residual image
motions. The different summary types are obtained by specifying adapted classification criteria, which
involve audio features, to select the relevant segments to be included in the video aids. Such systems are
now capable of understanding continuous speech input for vocabularies of several thousand words in
operational environments.
Keywords: Introduction, Conventional system, Audio and Video aids, Uses of recognition & application
of speech system.
In this paper, it is very simple and speech recognition will revolutionize the way people conduct
business over the Web and will, [...] Voice ties speech recognition and telephony together and provides
the technology with which businesses can develop and deploy voice-enabled Web solutions today. Speech
recognition refers to the ability to listen (input in audio form) and identify various sounds present in
it, and [...] system domain may then be defined as the ability [...] audio format - such as wav or raw -
and then generate its content in text format. Visual speech in itself does not contain sufficient
information

[5] L. Birgé, P. Massart, From model selection to adaptive estimation, in D. Pollard (ed), Festschrift for L. Le Cam, Springer, vol. 7, No. 2, pp. 55-88, [...]
[6] D. L. Donoho, De-noising by Soft-thresholding, IEEE Trans. Inform. Theory, Vol. 41, No. 3, pp. 613-627, May 1995.
[7] D. L. Donoho, Nonlinear Wavelet Methods [...] Recovering Signals, Images, and Densities [...] Symposia in Applied Mathematics, Vol. 47, pp. [...]
[8] [...] Gaussians Front End for Speech Recognition, [...] 2001, pp. 675-678, Scandinavia, 2001.
[9] J. Potamifis, N. Fakotakis, G. Kokkinakis, Improving the robustness of noisy MFCC