You are on page 1of 3

Digital Processing of Speech Signals, by L. R. Rabiner and R. W.

Schafer
Victor W. Zue

Citation: The Journal of the Acoustical Society of America 67, 1406 (1980); doi: 10.1121/1.384160
View online: https://doi.org/10.1121/1.384160
View Table of Contents: https://asa.scitation.org/toc/jas/67/4
Published by the Acoustical Society of America

ARTICLES YOU MAY BE INTERESTED IN

Speech and Audio Signal Processing: Processing and Perception of Speech and Music
The Journal of the Acoustical Society of America 132, 1861 (2012); https://doi.org/10.1121/1.4742973

The Evaluation and Calibration of Ultrasonic Transducers


The Journal of the Acoustical Society of America 67, 1407 (1980); https://doi.org/10.1121/1.384109

Digital Signal Processing and Theory and Application of Digital Signal Processing
Physics Today 29, 63 (1976); https://doi.org/10.1063/1.3023261

Cepstrum Pitch Determination


The Journal of the Acoustical Society of America 41, 293 (1967); https://doi.org/10.1121/1.1910339

Perceptual linear predictive (PLP) analysis of speech


The Journal of the Acoustical Society of America 87, 1738 (1990); https://doi.org/10.1121/1.399423

Audio Signal Processing and Coding


The Journal of the Acoustical Society of America 122, 15 (2007); https://doi.org/10.1121/1.2737746
BOOK REVIEWS
R. Bruce Lindsay
Brown University, Providence, Rhode Island 02912
The opinions expressed are those of the individual reviewers and are not
necessarily endorsed by the Editorial Board of this Journal.

Digital Processingof Speech Signals tion in concatenatedlossless tubesand relatingthe resultsto digital
filters, a discreterepresentationof the vocal tract transferfunction
L. R. Rabiner and R. W. Schafer is obtained. The resultingdigitalmodel for the speechsignalsforms
Prentice-Hall,EnglewoodCliffs, NJ, 1979. the basisfor subsequentdiscussions.
xvi + 512 pp. Price $23.95. The next five chaptersof the book are devotedto specificmeth-
odsfor speechprocessing.Chapter4 dealswith time-domainmeth-
During the past 15 years we have witnesseda flourish of research odsin which the processing involvesthe direct utilization of the
activitiesin the field of digital signalprocessing.Suchactivities, speechwaveform. The chapterbeginsby presentinga generalnotion
broughtabout at leastin part by the increasedavailabilityof high- of "short-time" processing.This is followed by a discussionof the
speed,low-costdigitalcomputersand specialpurposehardware,have definition and propertiesof severalshort-timemeasurements,suchas
contributed significantlyto the field in terms of theoretical as well as the short-timeenergy,short-timeaveragezero-crossing rate, and
algorithmicdevelopments.Sincemany of thesedevelopmentswere short-time autocorrelation, that are commonly used for time-domain
actually carriedout within the context of speechprocessing,our speechprocessing.Also includedin this chapteris a discussion
on the
understanding of the fundamentalsunderlyingthe processing of median smoothingtechnique, which hasbeen demonstratedto be
speechsignalsby digital meanshas advancedto the extent that a particularlyeffectivefor time-domainspeechmeasurements.Time-
textbook on the subjectis clearly waxranted. The authors of this domainpitch detectorsand algorithmsfor voiced/unvoiced
classifica-
book havehad, betweenthem, over30 yearsof exceptionallyactive tion arealsodescribed
asexamplesof time-domainspeechprocessing
researchexperiencein digital signalprocessing in generaland speech systems.Chapter5 dealswith the direct digital representationof the
processingin particular. The outcome is an extremely valuablebook speechwaveform. The chapterbeginswith a brief descriptionof the
devotedto a timely subject. statisticalpropertiesof the speechwaveform, followed by a discuss-
The book is organizedaroundseveralbasicapproachesto digital ion of a variety of digital representationsthat includeinstantaneous
representationsof speechsignals,suchas the time-domainanalysis, quantization,adaptivequantization, and predictivequantization.
short-timeFourier analysis,homomorphicanalysis,and linear predic- Variouswaveformcodersrangingfrom simplepulsecodemodulation
tion analysis. Estimation techniquesfor speechparameterssuchas (PCM) to adaptivedifferential coding(ADPCM) are alsodiscussed in
pitch and formant frequenciesas well asspecificspeechprocessing this chapter.
applicationsare left asexamplesof the utility of eachrepresentation. In contrastto the previoustwo chapters,Chapters6-8 deal with
Thus, for example,time-domainpitch detectionalgorithmsare dis- frequency-domainspeechprocessing methodsin that they are based
cussedwithin the context of the time-domainmodel of speechpro- on a spectralrepresentationof the speechsignal. Chapter6 is de-
cessingwhile cepstralpitch detectionis discussed in the chapter voted to the notion of short-time Fourier analysisand synthesisof
dealingwith homomorphicanalysisof speech.The authorshave speech.It is shownthat the time-dependentFourier representation
chosensuchan organizationsincethey feel that "the most funda- of a signalcan be interpretedin the context of linear filtering where-
mental notionswere thoserelatedto digitalspeechrepresentations by the signalsis passedthrougha bank of filters with the appropriate
and that a soundunderstandingof suchrepresentations would allow characteristics.This leadsto a discussion of the designtechniques
the reader both to understand and to advance the methods and tech- for digital filter banks, as well asa discussionon the theoretical and
niquesfor parameterestimationand for designing speechprocessing implementationalissuesregardingthe f'fiterbank summationmethod
systems."This reviewer,for the most part, agreeswith theirjudg- for speechanalysisand synthesis.From this generalperspective,
ment. While it is true that specificparameterestimationtechniques severalspeechprocessing systems,includingthe spectrographic dis-
and speechprocessing applicationsappearto be scatteredamongthe play, the channelvocoder,and the phasevocoder,are then dis-
variouschapters,the repetition of topicsoften servesto reinforcethe cussed.
issuesrelatedto a particulartechniqueor application. Chapter7 concerns
the techniqueof hornorphicspeechproces-
The first three chaptersof this book developthe necessary back- sing. From the generalnotion of the homomorphicsystemfor con-
groundfor digital speechprocessing.Chapter 1 is an introduction to volution, it is shownthat the complex cepstmm,defined asthe in-
the notion of speechprocessing, with an overviewof the application versetransformof the complex logarithm of the Fourier transform,
areasthat are directly relatedto the topicscoveredin subsequent of a convolvedsignalis the sumof the individualcomplexcepstra.
chapters. Chapter2 providesa brief review of the fundamentalsof For the speechsignal,the homomorphictechniquethusenablesthe
digital signalprocessing.The topics include the z-transform,the separation of the excitationfunctionandthe transferfunctionpara-
Fourier transform,the discreteFourier transform,digital filters, metersin the cepstraldomainby simplelinearfilteringtechniques.
samplingtheorem,and decimationand interpolationof sampled Thischapterpresents
an excellentsummaryof the importantprop-
waveform. Thesetopicsaxecovered,in 21 pages,in a clear and con- ertiesof this analysistechnique. Also includedis a discussionon the
cisefashion,providingthe readerwith quick accessto key equations computationalissues.The homomorphicanalysistechniquehasbeen
and conceptsof digital signalprocessingthat are relevantto speech appliedto speechanalysis/synthesis, pitch detection,and formant
processing.Chapter 3 dealswith digital modelsfor the speechsignal. estimation. Chapter7 endswith a discussion of the basicideasbe-
The chapterbeginswith a brief introduction to the mechanismof hind thesespeechprocessing systems.
speechproductionand the acousticpropertiesof speechsounds. Chapter8 dealswith the topic of linearprediction. From the
This is followedby a presentationof the acoustictheory of speech digitalmodeldevelopedin Chapter3, it is shownthat a speechsam-
production,resultingin a modelin whichthe speechsignalis the ple canbe represented
asa linearcombinationof the pastsamples
output of a time-varyinglinear system excited by either random andthat by imposingpropererrorcriteria,the parameters of the
noiseor a quasiperiodic impulsetrain. By considering wavepropaga- speechmodelcanbe estimatedfrom the speechsignal.The authors

1406 J. Acoust.Soc.Am. 67(4), April 1980 0001-4966/80/041406-02500.80 ¸ 1980 Acoustical


Societyof America 1406
firstpresentthe generalframeworkof linearpredictionanalysis. The Evaluation and Calibration of Ultrasonic
This is followed by a comparisonof the differencesamongthree
Transducers
different formulations, namely the autocorrelation,the covariance,
and the lattice formulations. These formulations are discussed both
[Conferenceorganizedby: Materials and Testing
in termsof their theoreticalimplicationsand computationalcom-
Groupof the Instituteof Physics(London)]
plexities.Theparameters derivedfrom linearpredictionanalysisare IPC Scienceand TechnologyPressLimited, Guildford,
then relatedto other speechparametersthat havebeenpresentedin
Surrey, England, 19 78.
earlierchapters. The chapterendswith a discussion of the applica-
viii + ! 40 pp: Price œ!6. 00.
tion of linearpredictionto pitch detection,formant extraction,and
speechanalysis/synthesis.
Chapter9, the final chapter,discusses severalspeechprocessing Recent years have seenmany developmentsin the application of
systemsin the area of man/machinecommunicationby voice. The ultrasonicmeasurementsto nondestructivetestingand biomedical
chapterprovidessomeconcreteexamplesof specificspeechproces- applications.The key element for quantitative measurementsin every
singsystemssuchasvoiceresponse, speakerrecognition,and speech applicationis the transducerby which the ultrasonicwavesare gen-
recognitionsystems,and demonstrates how the ideasdiscussed in erated and/or receivedin the material under test. Unfortunately, in
earlierchaptersare appliedto real world problems. practice,this element is leastunderstood.The literature is replete
In general,this reviewerfindsthe organizationof the materialto with transducer"horror stories"in which presumablyidentical trans-
be very cohesiveand effective. Each chapter of the book openswith ducerspossesswidely differing transruction and field characteristics.
an introductionthat motivatesthe topicsand endswith a summary The Materials and Testing Group of the Institute of Physics
of the key ideasthat havebeendiscussed.Extensivereferencesas (London) organizeda two-day conferencein May 1977 to addressvar-
well as a set of homework problemsare alsoincludedat the end of ious aspectsof the evaluation,calibration, and standardizationof ul-
mostchapters.This reviewerfindsthe referencematerialto be com- trasonictransducers,and this book contains 17 of the 18 paperspre-
pleteand the homeworkproblemsappropriate.Occasionally,the sentedat this conference.A detailed report of this conferencehas
problemsbecomesomewhatlengthyin their mathematicalderiva- beenpublished
• in whicheachof the paperspresented
hasbeenade-
tion, but suchexerciseis often rewardedwith addedinsights. quately summarized;therefore, there is no need to repeat the report
The coverageof individualtopicsis accomplishedwith varying in its entirety here.
degreesof success.This reviewerfindsthe materialin Chapters2, The book is divided into four sections,entitled (1) "Theoretical
6, 7, and 8 to be exceptionally well presentedin that the central Considerations,"(2) "Practical Evaluation," (3) "Development and
issuesare discussedsuccinctlywithout unwieldy detail while enough Design Features," and (4) "Absolute Calibration."
referencesare providedsuchthat studentswho are interestedin the Despiteits title, the principalthemesof the five paperscomprising
topic canlook up additionalmaterialwith little trouble. The review the first sectionare the approachesfor finding-experimentally and
of digitalsignalprocessingfundamentalsin Chapter2 is extensiveand analytically-the ultrasonic fields (transient or steady state) of various
to the point. The treatmentof short-timeFourieranalysisin Chap- contactand angle-beamtransducers.The paperby Wickhamand Cof-
ter 6 providesa strongtheoreticalframeworkfrom which valuable fey summarizesa ray analysisof a thickness-modevibrating piezoelec-
insightscan be obtained on someof the more familiar speechpro- tric transducerattachedto a steelspecimenvia a wedgeto generate
cessingsystemssuchasthe spectrographic displayand the channel shearwavesby mode conversionin the steel specimen.Also in this
vocoder. On the negativeside,this reviewerfinds parts of Chapters first sectionis a paper by Weight and Hayman, who report on their
3 and 5 to be too lengthy. The developmentof the acoustictheory measurementsand analysesof transducer sound fields in a fluid re-
of speechproduction,takinginto accountlossesin the vocaltract, sultingfrom singlecycle pulseexcitations.The transduceris modeled
seemsto be anunnecessarydiversionfrom the goal of developingthe as a piston radiator and the measurementsare made either with a
digitalmodelfor the speechsignal. Most of the material coveredin miniature probe transduceror a stroboscopicschlierenvisualization
Chapter5 dealswith the codingof waveformsin generaland thus,in technique.
the opinionof this reviewer,shouldnot be treatedwith suchpromin- The secondsectionof the book consistsof three papers.Two of
enceand length. In contrast,the coverageof speechprocessingsys- these,authoredby Wiistenberg
and Schulzandby Whittinghamand
temsfor man/machinecommunicationby voicein Chapter 9 is Farmery, give practical proceduresfor determining someof the pa-
somewhatsketchy. It is felt that someprinciplesof speechproces- rameterswhich can be usedto describethe performanceof a piezo-
electric transducer.
sing,suchasthe clusteringtechniquefor pattern recognitionand the
dynamicprogrammingtechniquefor time normalization,perhaps Among the five paperswhich comprisethe "Development and
shouldbe coveredwith greaterdetail. DesignFeatures" section,two are particularly noteworthy. The paper
This book was usedasthe principal textbook in a one-semester by Whittle and Smallmangivesa descriptionof the proceduresthey
graduatecoursein digital speechprocessingoffered in 1979 at the haveproposedas a standardfor assessing the performanceof piezo-
Massachusetts Institute of Technology. The coursewas organizedin- electrictransducers.The paper of Jehensonand Bredaelgivesan out-
to three hours of lecture and one hour of laboratory/demonstration line of the measurementsthey considerimportant for evaluating
per week. At the end of the term, eachstudent completed a term piezoelectrictransducersusedin medicalapplications.
project that is either a literature survey,a hardwaredesign,or a com- The last section of the book, entitled "Absolute Calibration" con-
puterproject.Theresponse
of th• students
to thetextbookhasbeen sistsof three papers.Speakein his paper showsthe utility of a laser
overwhelminglyfavorable,and severalof the 30-odd studentsactual- interferometer to make absolutedisplacementmeasurementsof a
ly adoptedresearchtopicsin speechprocessingasa direct conse- specimento which a transduceris attached.Such a noncontactmea-
quenceof their exposureto the materialscoveredin the book. suringsystemhasthe potential for allowingan absolutecalibrationof
In summary,the bookDigitalProcessing of SpeechSignalsby a piezoelectrictransducerto be achieved.Also in this sectionis a
L. R. Rabiner and R. W. Schafer is a very well written book. It f'ffis paperby Perdrix, Baboux, and Lakestani,who demonstratea novel
the gapbetweenbooksthat are more devotedto signalprocessing procedurefor determiningthe impulseresponseof varioustrans-
and those that are more related to speechsciencein general. It is a ducersin a liquid. They step-excitea thick piezoelectrictransduceras
book well suited to be a textbook for a courseon the subject, and it a known source,and with a reciprocity condition they can alsofind
is indispensableto researchersand seriousstudentsof speechpro- the transmitting sensitivity of the transducer.
cessing.- This book providesan indicatorof the Europeanactivity on the
ultrasonictransducercalibration problem, and for that purpose,the
VICTOR W. ZUE book may be worth its purchaseprice. The book appearsto be un-
ResearchLaboratory of Electronics editedandthereare severalmislabeled
figures(in the paperby Weight
Massachusetts
Institute of Technology and Hayman). Also, pages73 and 108 were missingin the copy pro-
Cambridge,Massachusetts 02139 videdfor review.Most of the papersappearto be reportsof work in

1407 J. Acoust.Soc.67(4), April 1980; 0001-4966/80/041407-02500.80; ¸ 1980 Acoust.Soc.Am.; Book Reviews 1407

You might also like