Chapter 1: Introduction
Artificial intelligence:
➢ Making artificial objects work like natural objects
➢ Artificial objects act like natural creations

Natural objects and their artificial counterparts:
Human : Robot
Animals : Artificial animal toys
Birds : Drone or aeroplane
Trees : Artificial trees
Earth : Earth map
Mountains : Artificial mountains
Sea : Artificial sea
Sky : Artificial sky
Cloud : Cloud computing
Wind : Fan
✓ In the modern, fast-growing information and communication era, the concept of artificial intelligence (AI)
has brought tremendous advances to the research community.
✓ This remarkable attraction stems from AI's ability to handle both the numerical representation and the
complexity of the various types of problems prevailing in academia and industry.
Machine Learning
✓ Machine learning is the scientific study of algorithms and statistical models that machines use to
perform a specific task
✓ The tasks are performed without explicit instructions from humans
✓ It is considered a subset of AI
✓ Training data is used to make predictions or decisions without the machine being explicitly programmed
to perform the task
✓ Machine learning algorithms are used in a wide variety of applications, such as email
filtering and computer vision
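As a minimal sketch of the idea that training data, rather than hand-written rules, drives the prediction, the example below learns a toy email filter from four labelled messages. The messages, the labels "spam" and "ham", and the word-counting approach are illustrative assumptions, not a method taken from these slides.

```python
from collections import Counter

def train(examples):
    """Count how often each word appears in spam and in non-spam (ham) messages."""
    counts = {"spam": Counter(), "ham": Counter()}
    for text, label in examples:
        counts[label].update(text.lower().split())
    return counts

def predict(counts, text):
    """Label a message by the class in which its words were seen more often."""
    words = text.lower().split()
    spam_score = sum(counts["spam"][w] for w in words)
    ham_score = sum(counts["ham"][w] for w in words)
    return "spam" if spam_score > ham_score else "ham"

# Hypothetical training data: (message, label) pairs.
training_data = [
    ("win a free prize now", "spam"),
    ("free money claim now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("lecture notes for the course", "ham"),
]
model = train(training_data)

print(predict(model, "claim your free prize"))
print(predict(model, "notes for monday meeting"))
```

No rule such as "the word free means spam" is written anywhere in the code; the behaviour comes only from the labelled examples, so changing the training data changes the predictions.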
Deep Learning
✓ Deep learning is part of a broader family of machine learning methods
✓ Deep learning methods are based on artificial neural networks with complex structures for solving
complex problems
✓ Deep learning methods have been applied to fields including computer vision, speech recognition, natural
language processing, audio recognition, social network filtering, machine translation, bioinformatics, drug
design, medical image analysis, material inspection and game programs
Prepared By: Dr. Fazli Wahid;
Course: Speech Processing
[Figure: Artificial Intelligence, Machine Learning and Deep Learning drawn as nested subsets. Machine Learning is a subset of Artificial Intelligence, and Deep Learning is a subset of Machine Learning. Each level consists of algorithms, techniques and models. Artificial Intelligence: act like a human. Machine Learning: learn like a human. Deep Learning: think like a human. Other labels in the figure: "Easy for human", "Easiest for human".]
Note: The figure may have some minor inaccuracies.
➢ Speech processing is the study of speech signals and the methods used to process these signals.
➢ Aspects of speech processing include the acquisition, manipulation, storage, transfer and output of
speech signals.

Speech acquisition
➢ The acquiring of speech by human beings or computers
➢ Speech consists of an organized set of sounds or phonemes
➢ Sounds or phonemes are used to convey meaning
➢ Phoneme: any of the distinct units of sound in a specified language that distinguish one word from
another, for example p, b, d, and t in the English words pad, pat, bad, and bat.
➢ Language, by contrast, is an arbitrary (random) association of symbols used according to prescribed
rules to convey meaning
➢ Our focus here is speech acquisition by a computer system
➢ When a computer is used for the acquisition of sound, many devices can be used for this purpose,
e.g. a microphone
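The phoneme example (pad, pat, bad, bat) can be illustrated with a short sketch that finds word pairs differing in exactly one sound. It assumes, for simplicity, that each letter in these four words stands for one phoneme; that holds for this example but not for English spelling in general. The function names are hypothetical.

```python
def differs_by_one_sound(a, b):
    """True if two equal-length transcriptions differ in exactly one position."""
    return len(a) == len(b) and sum(x != y for x, y in zip(a, b)) == 1

def minimal_pairs(words):
    """Return all pairs of words that differ in exactly one position."""
    pairs = []
    for i, a in enumerate(words):
        for b in words[i + 1:]:
            if differs_by_one_sound(a, b):
                pairs.append((a, b))
    return pairs

# Letters are used as a stand-in for phonemes (an assumption for this example).
words = ["pad", "pat", "bad", "bat"]
pairs = minimal_pairs(words)
for a, b in pairs:
    print(a, b)
```

Each printed pair is distinguished by a single sound, such as p versus b or d versus t, which is what makes those sounds separate phonemes.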
➢ For example, there may be some noise in the sound
➢ That noise is an unwanted signal that creates disturbance
➢ Therefore some mechanism is required to remove this noise

➢ For example, a given sound may be music, normal sound or noise
➢ Many artificial intelligence and machine learning models are used for this process of distinguishing
sounds

➢ For example, speech recognition systems store the sounds of different individuals for future
reference
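One simple mechanism for reducing noise is a moving-average filter. The sketch below is only an illustration under stated assumptions: the signal is a synthetic slow sine wave, the noise is random Gaussian noise, and the window length of 9 samples is an arbitrary choice. Real speech systems typically use more sophisticated methods.

```python
import math
import random

def moving_average(signal, window=5):
    """Smooth a signal by replacing each sample with the mean of its neighbours."""
    half = window // 2
    smoothed = []
    for i in range(len(signal)):
        lo = max(0, i - half)
        hi = min(len(signal), i + half + 1)
        smoothed.append(sum(signal[lo:hi]) / (hi - lo))
    return smoothed

def mean_squared_error(a, b):
    """Average squared difference between two equal-length signals."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

random.seed(0)  # fixed seed so the run is repeatable
# Synthetic "wanted" signal: two cycles of a sine wave over 200 samples.
clean = [math.sin(2 * math.pi * 2 * n / 200) for n in range(200)]
# Add the unwanted signal (noise) to it.
noisy = [s + random.gauss(0, 0.3) for s in clean]
denoised = moving_average(noisy, window=9)

print("error before filtering:", mean_squared_error(noisy, clean))
print("error after filtering: ", mean_squared_error(denoised, clean))
```

Averaging works here because the wanted signal changes slowly while the noise changes from sample to sample, so neighbouring noise values largely cancel.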
Speech Transfer
➢ The process of moving sound data from one location to another location is called speech transfer
➢ For transferring sound, different devices are used, such as radio or even TV
➢ Different communication media are used for transferring sound
➢ Both wired and wireless media are used for this purpose

Two most important components of speech processing
1. Speech recognition
2. Speech synthesis

Speech recognition
➢ Speech recognition is an interdisciplinary subfield of computer science and computational linguistics
➢ It develops methodologies and technologies that enable the recognition and translation of spoken
language into text by computers.
➢ It is also known as automatic speech recognition (ASR), computer speech recognition or speech to
text (STT).
➢ It incorporates knowledge and research in the computer science, linguistics and computer
engineering fields.

Speech Synthesis
➢ Speech synthesis is the artificial production of human speech.
➢ A computer system used for this purpose is called a speech computer or speech synthesizer,
➢ and can be implemented in software or hardware products.
➢ A text-to-speech (TTS) system converts normal language text into speech;
➢ other systems render symbolic linguistic representations like phonetic transcriptions into speech
Fields for Speech processing
These disciplines are involved in a speech processing system:
1. Signal processing
2. Physics
3. Pattern recognition
4. Linguistics
5. Physiology
6. Computer Science
7. Psychology

1. Signal processing
➢ The process of extracting relevant information from the speech signal in an efficient and robust manner.
➢ Using this process we can characterize the time-varying properties of the speech signal,
➢ as well as apply various types of signal preprocessing and postprocessing to make the speech signal robust
2. Physics
➢ The science of understanding the relationship between the physical speech signal and physiological
mechanisms or ...
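The idea of characterizing the time-varying properties of a speech signal, mentioned under signal processing, can be sketched with two common short-time measurements: energy and zero-crossing rate, each computed per frame. The synthetic signal (silence followed by a 200 Hz tone), the 8000 Hz sample rate and the frame sizes are assumptions chosen for illustration.

```python
import math

def frames(signal, frame_len, hop):
    """Split a signal into overlapping frames of frame_len samples."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

# Synthetic signal: 400 samples of silence followed by 400 samples of a tone.
sample_rate = 8000
silence = [0.0] * 400
tone = [0.5 * math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(400)]
signal = silence + tone

signal_frames = frames(signal, frame_len=200, hop=100)
energies = [short_time_energy(f) for f in signal_frames]
rates = [zero_crossing_rate(f) for f in signal_frames]

for i, (e, z) in enumerate(zip(energies, rates)):
    print(f"frame {i}: energy={e:.4f} zcr={z:.4f}")
```

The energy stays at zero for the silent frames and rises once the tone begins, which shows how a per-frame measurement tracks a property that changes over time.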
3. Pattern Recognition
➢ The set of algorithms used to cluster data to create patterns and to compare a pair of patterns on
the basis of feature measurement.
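Comparing a pair of patterns on the basis of feature measurement can be sketched as a distance computation between feature vectors. The example below covers only the comparison step, not clustering. The template names and the two-element feature vectors (imagined as energy and zero-crossing rate) are hypothetical values made up for illustration.

```python
import math

def euclidean_distance(a, b):
    """Straight-line distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_pattern(features, templates):
    """Return the name of the stored pattern closest to the measured features."""
    return min(templates, key=lambda name: euclidean_distance(features, templates[name]))

# Hypothetical stored patterns: [energy, zero-crossing rate].
templates = {
    "silence": [0.00, 0.02],
    "vowel": [0.12, 0.05],
    "fricative": [0.03, 0.45],
}
measured = [0.10, 0.07]  # hypothetical measurement from an unknown sound
label = nearest_pattern(measured, templates)
print(label)
```

Euclidean distance is one of several possible comparison measures; it is used here only because it is simple to state and compute.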
4. Linguistics (Language related)
➢ The relationship between sounds (phonology), words in a language (syntax), meaning of spoken words
(semantics), and sense derived from the meaning (pragmatics).
5. Physiology (the study of how the human body works)
Recognition