United States Patent (191: Antill Et A1

l l l l l l l l l l l l l l l l l l l l l l l l l l Il l l l l l l l l l l
USOO5297236A
United States Patent [191
[11] Patent Number:
Antill et a1.
[45]
Speech, IEEE Trans. Acoust, Speech, Signal Proa,

ASSP-27, Oct., 1979, pp. 512-530.
DECODER, AND ENCODER/DECODER
Jayant and N011, Digital Coding of Waveforms, Engle
[75] Inventors: Michael B. Antill, San Rafael; Grant

A. Davidson, Oakland, both of Calif.
[73] Assignee: Dolby Laboratories Licensing

Corporation, San Francisco, Calif.
[21] Appl. No.: 710,805

[22] Filed:
Jun. 5, 1991
Related US. Application Data

Continuation-impart of Ser. No. 508,809, Apr. 12,
1990, abandoned, and Ser. No. 458,894, Dec. 12, 1989,
Pat. No. 5,109,417, which is a continuation-in-part of
Ser. No. 303,714, Jan. 27, 1989, abandoned.
[51]
[52]
[58]
Mar. 22, 1994
DIGITAL FILTER BANK FOR ENCODER,
[54] LOW COMPUTATIONAL-COMPLEXITY
[63]
Date of Patent:
5,297,236
Int. Cl.5 .............................................. .. G10L 9/00

U5. C1. ................................................. .. 395/212
Field of Search .................................. .. 381/29-46;
395/212
[56]
References Cited
U.S. PATENT DOCUMENTS
5,109,417
4/1992
wood Cliffs, N.J.: Prentice-Hall, Inc., 1984, pp.

563-576.
Princen, Bradley, Analysis/Synthesis Filter Bank De

sign Based on Time Domain Aliasing Cancellation,
IEEE Trans, ASSP-34, 1986, pp. 1153-1161.
Princen, Johnson, Bradley, Subband/Iransform Cod

ing Using Filter Bank Designs Based on [TDAC],
ICASSP 1987 Conf Proc., May 1987, pp. 2161-2164.
Edler, Coding of Audio Signals with Overlapping
Block Transform and Adaptive Window Functions,
Frequenz, vol. 43, No. 9, 1989, pp. 252-256.
Malvar, Lapped Transforms for Efficient Transform
/Subband Coding, IEEE Trans. Acoust, Speech, Signal

Proc., ASSP-38, Jun., 1990, pp. 969-978.
Gluth, Regular FFT-Related Transform Kernels for
DCT/DST-Based Polyphase Filter Banks, ICASSP
1991 Conf Proc, vol. 3, May 1991, pp. 2205-2208.
Duhamel, Mahieux, and Petit, A Fast Algorithm for
the Implementation of Filter Banks Based on [TDAC],
ICASSP 1991 Conf Proc, May 1991, pp. 2209-2212.
Vetterli, Perfect Reconstruction FIR Filter Banks:
Some Properties and Factorizations, Jul. 1989, pp.
1057-1071, IEEE Transactions on Acoustics, Speech
and Signal Processing.
Fielder et al. ....................... .. 381/36
FOREIGN PATENT DOCUMENTS
Barnwell, Filter Banks for Analysis-Reconstruction

Systems: A Tutorial, May 1990, pp. 1999-2003, IEEE
Int. Symp. on CAS.
9116769 10/1991 World Int. Prop. 0. .
Primary ExaminerAllen R. MacDonald
OTHER PUBLICATIONS
Brigham, The Fast Fourier Transform, Englewood

Cliffs, N.J.: Prentice-Hall, Inc., 1974, pp. 166-167.
Oppenheim and Schafer, Digital Signal Processing, En

glewood Cliffs, N.J.: Prentice-Hall, Inc., 1975, pp.
307-314.
Harris, On the Use of Windows for Harmonic Analysis
Assistant Examiner-Michelle Doerrler

Attorney, Agent, or Firm-Thomas A. Gallagher; David
N. Lathrop
[57]
ABSTRACT
The invention relates in general to digital encoding and
decoding of information. More particularly, the inven
tion relates to efficient implementation of digital analy
with the Discrete Fourier Transform, Proc. IEEE, vol.
sis and synthesis ?lter banks used in encoding and de
66, Jan., 1978, pp. 51-83.

Narasimha and Peterson, On the Computation of the
Discrete Cosine Transform, IEEE Trans. Communica
tions, COM-26, Jun., 1978, pp. 934-936.
Tribolet and Crochiere, Frequency Domain Coding of
coding. The invention perrnits the length of a digital

transform used to implement critically-sampled analysis
and synthesis ?lter banks to be adaptively selected.
100
[102
K
BUFFER
/106
FoRwARD
PRETRANSFORM
34 Claims, 6 Drawing Sheets
V [105
-
FORWARD
TRANSFORM
f 110
FoRwARD
POSTTRANSFORM
[112
114
FDRMATTER 1L
US. Patent
Mar. 22, 1994
Sheet 2 of 6
5,297,236
01234567
01234567
US. Patent
XIn)
Mar. 22, 1994
TIME FIEVERSAL
Sheet 3 of 6
5,297,236
TIME REVERSAL
< REGION 1 >< REGION 2
I
I
TIME REVERSAL I TIME REVERSAL
REGION 2
REGION 1 >1<
| DST BLOCK ->I
FIG._5
US. Patent
Mar. 22, 1994
Sheet 4 of 6
5,297,236
~ DCT BLOCK M:
I
X(")A TIME REVERSAL 1 TIME REVERSAL I
- REGION 1 ->|~ REGlON2->i
l
a
TIME REVERSAL I TIME REVERSAL
IT'
'
REGICN 1
TIT
'
REGION 1
I< DST >!~ DST

BLOCK 1
FIG._6
BLOCK 2
US. Patent
X(")
Mar. 22, 1994
Sheet 5 of 6
5,297,236
REGION 2
REGION 1 ;
_\_
I
I
I
I
I
I
I
I
I
I
I
I
3
4
_ N
FIG... 7
> n
7
8
- N
A
Xm)
REGION 2
<--__-REGION 1
*4?
f 'l
I
I
I
!
!
l
I
I
l
!
I
I
I
I
l
l
l
I
b n
i -2- SUBBLOCK w SUBBLOCK

k BRIDGE TRANSFORM ->!
FIG._8
US. Patent
Mar. 22, 1994
Sheet 6 of 6
5,297,236
01234567
1234567890
FIG._10 i
1
k
15
44
01234567
5,297,236
bandwidths which are sums of individual transform
coef?cient bandwidths.
The mathematical basis for digital subband ?lter
LOW COMPUTATIONAL-COMPLEXITY DIGITAL

FILTER BANK FOR ENCODER, DECODER, AND
ENCODER/DECODER
banks and digital block transforms is essentially the

same. See Tribolet and Crochiere, Frequency Domain
Coding of Speech, IEEE Trans. Acoust, Speech, and _
CROSS-REFERENCE TO RELATED
APPLICATIONS
This application is a continuation-in-part of copend
Signal Proc, ASSP-27, October, 1979, pp. 512-30.

Therefore, throughout the following discussion, the
term subband coder shall refer to both a true subband
coder and a transform coder. The term subband shall
ing U.S. application Ser. No. 07/458,894, ?led Dec. 12,

1989, now U.S. Pat. No. 5,109,417, which was a con
tinuation-in-part
of U.S.
application
Ser.
refer to portions of the useful signal bandwidth whether
No.
implemented by a true subband coder or a transform
07/303,714, ?led Jan. 27, 1989, now abandoned a con
coder. The terms transform and transforming shall
tinuation-in-part of copending U.S. application Ser. No.
include digital ?lters and digital ?ltering, respectively.
07/508,809, ?led Apr. 12, 1990, now abandoned and a
In most digital coding applications, processing re

quirements can be reduced by increasing the ef?ciency
continuation-in-part of copending PCT/US9l/025l2,

?led Apr. 12, 1991.
of subband ?ltering. Improved processing ef?ciency

permits implementation of encoders and decoders
TECHNICAL FIELD
The invention relates in general to digital encoding

and decoding of information. More particularly, the
invention relates to ef?cient implementation of digital
analysis and synthesis ?lter banks used in digital encod
ing and decoding. In a preferred embodiment of the
invention, the length of the ?lter bank used to imple
ment critically-sampled analysis and synthesis ?lter
which are less expensive to build, or which impose

20
lower signal propagation delays through an en

coder/decoder system.
In many subband coder systems, the analysis and
synthesis ?lter banks are implemented by discrete time
25
Discrete Fourier Transform (DPT), the Discrete Co

sine Transform (DCT), and the Discrete Sine Trans
form (DST). The number of time-domain signal sam
ples, referred to herein as the time-domain signal sample
block length, processed by such transforms is some
times called the transform length, and the amount of
processing required to perform these transforms is gen
erally proportional to the square of the time-domain
domain to frequency-domain transforms such as the
banks may be adaptively selected.
Throughout the following discussion and especially

in the background discussion, more particular mention
will be made of audio applications, however, it should
be understood that the present invention is applicable to
a range of digital coding applications wider than just
that of audio encoding and decoding.
signal sample block length.

The number of frequency-domain transform coeffici
BACKGROUND ART
Introduction
There is considerable interest among those in the ?eld
of signal processing to develop ef?cient means to trans
mit or store information. Improving coding ef?ciency
35 ents generated by a transform is also sometimes called
includes (1) reducing informational requirements, that

is, reducing the amount of information required to ade
quately represent a signal during transmission or stor
age, and (2) reducing processing requirements, that is,
the transform length. It is common for the number of
frequency-domain transform coefficients generated by

the transform to be equal to the timedomain signal
sample block length, but this equality is not necessary.
For example, one transform referred to herein as the
E-TDAC transform is sometimes described in the art as
a transform of length 5N that transforms signal sample

blocks with a length of N samples. It is possible, how
ever, to also describe the transform as one of length N

reducing the amount of processing required to imple
45 which generates only 5N unique frequency-domain
ment the encoding and decoding processes.
transform coefficients. Thus, in this discussion the time
In high-quality audio coding applications, informa
domain signal sample block length and the discrete
tional requirements can sometimes be reduced without
loss of perceptible audio quality by exploiting various
psychoacoustic effects. Signal recording, transmitting,

or reproducing techniques which divide the useful sig
nal bandwidth into narrow bands with bandwidths ap
proximating the human ears critical bands can exploit
transform length are generally assumed to be synonyms.

Various techniques have been utilized to reduce the
amount of time required to perform a transform, or to
reduce the processing power required to perform a
transform in given amount of time, or both. One tech
psychoacoustic masking effects. Such techniques divide
nique is taught in Narashima and Peterson, On the

Computation of the Discrete Cosine Transform, IEEE
55 Trans. on Communications, COM-26, June, 1978, pp.
934-36. Brie?y, this technique evaluates an N-point
DCT by rearranging or shuffling the samples repre
senting the input signal, performing an N-point DFI' on
the shuffled samples, and multiplying the result with a
coders can reduce the informational requirements in 60 complex function. It is approximately twice as ef?cient
as other techniques using a 2N-point FFT, however,
particular frequency bands where the noise caused by
Narashima and Peterson only teach how to improve the
the resulting coding inaccuracy is psychoacoustically
ef?ciency of ?lter banks implemented by one particular
masked. Subband coders may be implemented by a bank
the signal bandwidth with an analysis ?lter bank, pro

cess the signal passed by each ?lter band, and recon
struct a replica of the original signal with a synthesis
?lter bank.
Two common coding techniques are subband coding
and transform coding. Subband coders and transform
of digital bandpass ?lters de?ning subbands of varying
DCT.
bandwidth. Transform coders may be implemented by 65
Another technique which yields approximately a

two-fold increase in processing ef?ciency concurrently
any of several time-domain to frequency-domain trans

forms. One or more adjacent transform coef?cients are
grouped together to de?ne subbands having effective
performs two real-valued discrete transforms of length

N with a single complex-valued FFT of length N. A
5,297,236
subband coder utilizing this technique to concurrently
DISCLOSURE OF INVENTION
It is an object of the present invention to provide a
subband encoder and a subband decoder of digital infor
perform a modi?ed DCT with a modi?ed DST is de
scribed in International Patent Application PCT/US

90/00501, Publication No. WO 90/09022 (published
Aug. 9, 1990). The signi?cance of these modi?ed DCT
mation by means of analysis ?ltering and synthesis ?l
and modi?ed DST is discussed in Princen and Bradley,

Analysis/Synthesis Filter Barik Design Based on Time
Domain Aliasing Cancellation, IEEE Trans. on
tering requiring lower processing requirements, or im-_
Acoust, Speech, Signal Proc., ASSP-34, 1986, pp.
vide a subband encoder and a subband decoder of digi
evenly-stacked critically-sampled single-sideband anal
means of analysis ?ltering and synthesis ?ltering permit
posing lower processing delays, or both.

It is another object of the present invention to pro
1153-1161. The authors describe a speci?c application 10 tal information requiring lower processing require
ments, or imposing lower processing delays, or both, by
of these transforms as the time-domain equivalent of an
ting adaptive selection of the ?lter-bank length.

ysis-synthesis system. They are referred to collectively
herein as the Evenly-stacked Time-Domain Aliasing 15 Furtherdetails of the above objects and still other
objects of the invention are set forth throughout this
Cancellation (E-TDAC) transform.
document, particularly in the Detailed Description of
Another technique to reduce processing require
the
Invention, below.
ments is taught by Malvar, Lapped Transforms for
In accordance with the teachings of the present in
Ef?cient Transform/subband Coding, IEEE Trans.
vention in one embodiment, an encoder provides for the
Acoust, Speech, Signal Proc., ASSP-38, June, 1980, pp. 20 encoding of input signal samples corresponding to an
969-78. This technique implements an N-point modi?ed
analog signal. The input samples are buffered into time
DCT by performing a iN-point DST after combining
domain signal sample blocks. Pairs of signal samples in
pairs of the samples representing the input signal, or
the time-domain signal sample blocks are combined to
folding the N input signal samples into a smaller set of
produced modi?ed samples. Frequency-domain trans
5N points. It is approximately twice as ef?cient as per 25 form coef?cients are generated by a discrete digital
forming the modi?ed DCT in a straight-forward man
transform in response to the modi?ed samples. Quan
ner, however, Malvar only teaches how to fold input

samples for a ?lter bank implemented by one speci?c
modi?ed DCT whose input samples have been
weighted by a speci?c sine-tapered analysis window.

The speci?c modi?ed DCT implemented by Malvar
tized information is generated in response to the fre
quency-domain transform coef?cients. Digital informa

30 tion including the quantized information is assembled
into a format suitable for transmission or storage.
Also in accordance with the teachings of the present
is discussed in greater detail by Princen, Johnson, and
invention in one embodiment, a decoder provides for
Bradley, subband/Transform Coding Using Filter
the decoding of digitally encoded information. Quan
Bank Designs Based on Time Domain Aliasing Cancel 35 tized information is extracted from the digitally en
coded signal. Frequency-domain transform coef?cients
lation, ICASSP 1987 Conf Proc., May 1987, pp.
2161-64. The authors describe this transform as the
are generated in response to the extracted quantized
time-domain equivalent of an oddly-stacked critically

sampled single-sideband analysis-synthesis system. It is
information. Time-domain transform coef?cients are
generated by an inverse discrete digital transform in

response to the frequency-domain transform coef?ci
ents. Signal samples are generated in response to the
referred to herein as the Oddly-stacked Time-Domain
Aliasing Cancellation (O-TDAC) transform.

It is desirable to implement encoders and decoders
with the ability to use different time-domain signal sam
time-domain transform coef?cients, and output samples

are generated which correspond to the input samples to
ple block lengths in order to optimize coder perfor
a companion encoder.
mance. It is well known in the art that longer time 45
Further in accordance with the teachings of the pres

ent invention in one embodiment, an encoder/decoder
domain signal sample block lengths improve the selec
system provides for the encoding of input signal sam

ples corresponding to an analog signal, and for the de
coding of a digitally encoded signal. For encoding, the
ity of a subband coder to exploit psychoacoustic mask
input
samples are buffered into signal sample blocks.
50
ing effects. See International Patent Application
Pairs of signal samples in the signal sample blocks are
PCT/U S 90/00507, Publication No. WO 90/09064
combined to produced modi?ed samples. Frequency
tivity or frequency-resolving power of subband coders,

and better ?lter selectivity generally improves the abil
(published Aug. 9, 1990) which discusses the impor

tance of time-domain signal sample block length and
subband ?lter selectivity.
But longer time-domain signal sample block lengths
domain transform coef?cients are generated by a dis

crete digital transform in response to the modi?ed sam
55
ples. Quantized information is generated in response to
the frequency-domain transform coef?cients. Digital
degrade the time-resolution of a subband ?lter bank.

Inadequate time-resolution can produce audible distor
information including the quantized information is as
tion artifacts when quantizing errors of signal events,

such as transients, producing pretransient and post-tran
sient ringing which exceed the ears temporal psycho
acoustic masking interval. See for example, Edler,
age. For decoding, quantized information is extracted

from a digitally encoded signal. Frequency-domain
Coding of Audio Signals with Overlapping Block

Transform and Adaptive Window Functions, Frequ
sembled into a format suitable for transmission or stor
transform coef?cients are generated in response to the

extracted quantized information. Time-domain trans
form coef?cients are generated by an inverse discrete
digital transform in response to the frequency-domain

enz, vol. 43, no. 9, 1989, pp. 252-56. Hence, it is desir 65 transform coef?cients. Signal samples are generated in
able that techniques which improve subband ?lter bank
response to the time-domain transform coef?cients, and
processing ef?ciency should also permit adaptive selec
output samples are generated which correspond to the
tion of the time-domain signal sample block length.
input samples to a companion encoder. I
5,297,236
forward post-transform 110 which generates quantized
The various features of the present invention and its

preferred embodiments are set forth in greater detail in
the following Detailed Description of the Invention and
information in response to the frequency-domain trans

form coef?cients and the information received from
in the accompanying drawings.
path 104, and formatter 112 which assembles digital
information including the quantized information into a

BRIEF DESCRIPTION OF DRAWINGS
form suitable for transmission or storage along path 114.
FIGS. 1 and 2 are functional block diagrams illustrat
The functions performed by buffer 102 and formatter 1
ing the basic functional structure of a preferred embodi
12 are not discussed in detail herein.
ment of the present invention.
A preferred embodiment of a decoder, as shown in
FIG. 3 is a ?owgraph illustrating the forward pre
FIG. 2, comprises deformatter 202 which extracts quan
transform function, applied to a 16-sample time-domain
tized information and information establishing the in
signal sample block to form an 8-sample modi?ed sam
verse transform length from the encoded digital signal
ple block, for a basic embodiment of the present inven
received from path 200, inverse pretransform 206 which
tion permitting implementation of an E-TDAC trans
generates frequency-domain transform coefficients in
form analysis ?lter bank by a DCT and DST.
response to the extracted quantized information and the
information establishing the inverse transform length
transform function, applied to a 16sample time-domain
received along path 204, inverse transform 208 which
signal sample block to form an S-sample modi?ed sam
transforms the frequency-domain transform coef?cients
ple block, for an alternative embodiment of the present
into time-domain transform coef?cients using a trans
invention permitting implementation of an E-TDAC 20 form whose length is adapted in response to information
transform analysis ?lter bank by a DFT.
received from path 204, inverse post-transform 210
FIG. 5 is a hypothetical raphical representation illus
which generates signal samples in response to the time
trating the time-reversal regions of the time-domain
domain transform coefficients and information received
aliasing component created by the E-TDAC transform

from path 204, and output processor 212 which gener
using the conventional phase term.
25 ates along path 214 output samples corresponding to the
FIG. 6 is a hypothetical graphical representation
input samples to a companion encoder in response to the
illustrating the time-reversal regions of the time-domain
signal
samples. The functions performed by deformatter
aliasing component created by the E-TDAC transform
202 and output processor 212 are not discussed in detail
using the phase term required to cancel time-domain
aliasing in an N-sample length block overlapped with a 30 herein.
A basic embodiment of the present invention is intro
subsequent iN-sample length block.
duced in some detail before alternative embodiments

are discussed. This basic embodiment uses fixed-length
FIG. 7 is a hypothetical graphical representation
illustrating the boundary between time-reversal regions
E-TDAC transforms to implement the analysis and

synthesis ?lter banks. Preferred embodiments of various
35 features are described throughout the discussion.
FIG. 8 is a hypothetical graphical representation of a
bridge transform, illustrating the time-reversal regions
II. Basic Embodiment of Invention
of the time~domain aliasing component.
A. Input Sample Buffering
transform function, applied to a l6-sample time-domain
A buffer, represented by box 102 in FIG. 1, receives
signal samples and groups them into a sequence of time
ple block, permitting implementation of an adaptive
domain signal sample blocks. Each block comprises N
length E-TDAC transform analysis ?lter bank by a
signal samples. The signal samples may be received
DCT and DST.
from the sampling of an analog signal, from the genera
FIG. 10 is a ?owgraph illustrating the forward pre 45 tion of samples representing or simulating an analog
transform function, applied to a l6-sample time-domain
signal, or from any other source of discrete-valued sam
ples which correspond to an analog signal.
ple block, permitting implementation of an adaptive
It is well known in the art that the frequency-resolv
length O-TDAC transform analysis ?lter bank by a
ing power or selectivity of a ?lter bank implemented by
DST.
a discrete transform improves as the transform length
increases. It is also well known that ?lter selectivity
DETAILED DESCRIPTION OF THE
may be affected signi?cantly by weighting the time
INVENTION
domain signal samples by a weighting function com
1. Overview of Functional Structure
monly called a window. See generally, Harris, On the
FIGS. 1 and 2 show the basic functional structure of 55 Use of Windows for Harmonic Analysis with the Dis
crete Fourier Transform, Proc. IEEE, vol. 66, January,
a preferred embodiment of the present invention. Ac
1978, pp. 51-83.
cording to this embodiment, as shown in FIG. 1, an
The E-TDAC transform used in the basic embodi
encoder comprises buffer 102 which buffers input sam
ment of the present invention requires window
ples received from input path 100 into time-domain
of the time-domain aliasing component in a iN-sample
length block.
signal sample blocks, forward pretransform 106 which
60
weighting, both weighting of the time-domain signal
samples in an encoder prior to forward transform ?lter

generates modi?ed samples in response to signal sam
ing, referred to as analysis windowing, and weighting of
ples received from buffer 102 and in response to infor
the recovered time-domain signal samples in a decoder
mation received from path 104 establishing the number
after inverse transform ?ltering, referred to as synthesis
of signal samples constituting a time-domain signal sam
ple block, forward transform 108 which transforms the 65 windowing. Analysis- and synthesis-window weighting
are discussed below only briefly. It is assumed herein
modi?ed samples into frequency-domain transform co
that the buffered signal samples are weighted by an
ef?cients using a transform whose length is adapted in
analysis window as may be required or desired. Signal
response to information received from input path 104,
5,297,236
samples may be weighted by an analysis window prior
celled by choosing the appropriate phase term m for

equations 1 and 2, applying the forward transform to
to or subsequent to their receipt by the buffer without

departing from the scope of the present invention.
overlapped analysis-window weighted time-domain

signal sample blocks, and by synthesis-window
weighting and adding adjacent overlapped time-domain
B. Analysis Filter Bank-Forward Transform

Although the forward pretransform function dis
cussed below is applied to time-domain signal sample
blocks prior to application of the forward transform, it
signal sample blocks recovered by the inverse trans

form. The phase term In in equations 1 and 2 controls
the phase shift of the time-domain aliasing distortion.
is necessary to introduce the forward transform before

the forward pretransform function can be fully de
To cancel this alias distortion and accurately recover
scribed. The forward transform is represented by box
aliasing to be as follows: for the MDCT, the time

domain alias component consists of the ?rst half of the
the original time-domain signal, E-TDAC requires the
108 in FIG. 1.
ment of the present invention is equivalent to the alter
nate application of a Modi?ed Discrete Cosine Trans
form (MDCT) with a Modi?ed Discrete Sine Trans
sampled and windowed signal reversed in time about

the one-quarter point of the sample block, and the sec
ond half of the sampled and windowed signal reversed
in time about the three-quarter point of the sample
form (MDST). The MDCT and the MDST, shown in
block; for the MDST, the alias component is similar to
equations 1 and 2 respectively, are
that for the MDCT except its amplitude is inverted in
sign. These relationships are illustrated in FIG. 5 in
(1) 20 which the time-domain aliasing component, shown by a
N-l
broken line, and the desired signal, shown by a solid
qk) = n:20 x(n)cos [21m ujpljrmo k < N
line, have been weighted by a synthesis window.
The phase term required to produce the appropriate
N_}
(2)
aliasing components for alias cancellation
S(k) .-. n::0 x(n)sin (2% %)rm 0 k < N
25
(6)
1
where
k=frequency~domain transform coefficient number,

n=time-domain signal sample number,
N=time-domain signal sample block length,
m=phase term required for TDAC (see equation 6),
x(n)=time-domain signal sample n,
C(k)=MDCT frequency-domain transform coeffici
ent k, and
S(k)=MDST frequency-domain transform coef?ci
+2.
30
The processing requirements of the technique used to

evaluate the MDCT and the MDST may be reduced by
applying a forward pretransform function to the time
35
domain signal sample blocks to produce blocks of modi

?ed samples, and applying a N-point DCT and a 5N
point DST to the modi?ed sample blocks for the N
ent k.
The E-TDAC transform produces one of two alter
point MDCT and MDST, respectively. The forward

pretransform function is represented by box 106 in FIG.
1. For E-TDAC, the pretransform function combines
pairs of signal samples in each time-domain signal sam
ple block of length N to produce a block of modi?ed
nating sets of frequency-domain transform coef?cients

in response to each time-domain signal sample block.
These sets of frequency-domain transform coefficients
are of the form
N
C. Forward Pre-Transform Function
samples of length 5N.
(3)
The mathematical basis for using a iN-point DCT
C(k) forO k <T
applied to modi?ed samples to perform the N-point

MDCT may be seen by ?rst substituting equation 6 for
the phase term In into equation 1. The MDCT in equa
tion 1 may be expressed as
50
(78)
C(k) =
where i=time-domain signal sample block number.

Each set of coefficients generated by the MDCT and
the MDST are referred to herein as MDCT coefficient 55
sets and MDST coefficient sets, respectively.

Princen and Bradley showed that with the proper
phase term In and a suitable pair of analysis-synthesis
windows, the E-TDAC technique can accurately re
cover an input signal from an alternating sequence of 60
overlapped ?xed-length MDCT coefficient sets and

MDST coefficient sets of the form
{C(k)}o, {30011, {C002, {500b,
(5)
65
Using only the alternate MDCT coefficient sets and

MDST coefficient sets produces a time-domain aliasing
component, but the aliasing component may be can
5,297,236
By setting d=n+N/4 and substituting it into expression

7a, by setting d=3N/4l-n and substituting it_into
expressions 7b and 7c, and by setting d=n-3N/4 and
10
embodiment of the present invention. The minus signs

shown within parenthesis denote terms which are sub
tractively combined with an associated sample for the

function shown above in equation 12. This subtractive
combination may be accomplished in circuitry, for ex
substituting it into expression 7d, it may be seen that

equation 1 can be rewritten as
ample, by negating the value of the signal sample repre

T __1
3N
sentations corresponding to the nodes in FIG. 3 with
(8a)
C(k) = dio {x(4 + d)+

10
minus signs in parenthesis, and additively combining the

resultant representations.
D. Forward Post-Transform Function
xtai-l-dllwtrtdelf
Frequency-domain transform coef?cients generated

by the forward transform are generally not suitable for
low bit-rate transmission or efficient storage. Various
quantization techniques may be used to reduce informa
(8b)
tional requirements by taking advantage of a signals

redundancy and irrelevancy.
In a basic embodiment of the present invention, the
forward post-transform function represented by box

110 in FIG. 1 may comprise quantizing the frequency
domain transform coefficients, however, quantizing is
not required to practice the present invention. Quantiz
ing is frequently used in audio coding applications be
Finally, by de?ning a new sequence
25 cause it may reduce coded signal informational require
ments without degrading subjective signal quality by
(9)
exploiting psychoacoustic masking effects. For ease in

discussing the basic embodiment of the present inven
tion, it is assumed hereafter that the forward post-trans
form function includes quantizing the frequency

domain transform functions into quantized information.
Any of various quantizing techniques may be utilized

without departing from the scope of the present inven
where [i] mod M represents the value of i modulo M,

the expressions 8a and 8b may be combined and written
tion. See generally, .layant and Noll, Digital Coding of

Waveforms, Englewood Cliffs, N.J.: Prentice-Hall, Inc.,
as
1984, pp. 563-76. An example of one quantizing tech

a. _,
(10)
nique employing adaptive bit allocation is discussed in

International Patent Application PCT/U S 90/00501,
Publication No. WO 90/09022 (published Aug. 9, 1990).
E. Output Formatting
which is a iN-point DCT for y(n).
Output formatting represented by box 112 in FIG. 1

assembles a coded digital signal, including quantized
From a similar derivation, it can be shown that the

MDST of length N can be implemented by a DST of
information, for transmission or storage. Any additional
45 side-information needed by a decoder is also assembled
into the formatted signal. Frame synchronization bits

and error detection/correction codes may be used as
needed for transmission. Database pointers or keys may

50
be added as needed for storage. The formatted data is
ready for transmission or for storage along path 114

shown in FIG. 1.
F. Input Deformatting
A deformatting process takes place in a decoder
55
when the encoded signal is received from path 200
either by receipt of a transmitted signal or retrieval
from storage. The deformatting process is represented

by box 202 in FIG. 2. Deformatting extracts quantized
It should be appreciated that the forward pretrans 60 information and any side information passed by the
encoder.
form function for this basic embodiment, as well as for

alternative embodiments discussed below, can be per
formed by any of several implementations including

software-controlled processors, and circuits capable of
combining pairs of time-domain signal samples to form
modified samples. One ?owgraph for a l6-sample block
is shown in FIG. 3 which illustrates the forward pre
transform functions of equations 9 and 12 for a basic
G. Inverse Pre-Transform Function
The inverse pretransform function, represented by

65
box 206 in FIG. 2, obtains frequency-domain transform

coefficients by performing a function essentially inverse
to that used by the forward post-transform function in
the encoder which encoded the signal.
5,297,236
11
In a basic embodiment of the present invention, the
inverse pretransform recovers spectral information by

dequantizing the encoded digital information into a
form suitable for input to the inverse transform ?lter
bank.
H. Synthesis Filter Bank-Inverse Transform

Box 208 in FIG. 2 represents a bank of synthesis
?lters which transforms each set of frequency-domain
transform coefficients into time-domain transform coef
?cients. A transform inverse to that used in analysis
?lter bank 108 in FIG. 1 implements synthesis ?lter
bank 208. The inverse discrete transforms for E-TDAC
used in the basic embodiment of the present invention is
an alternating application of an Inverse Modi?ed Dis
With an appropriate inverse post-transform function,

the IMDST of length N can be implemented by an
IDST of length 5N;
crete Cosine Transform (IMDCT) and an Inverse Mod
i?ed Discrete Sine Transform (IMDST) shown in equa
in) =
tions 13 and 14, respectively;
(13)
20
211k
N
M"
II
where i(n)=recovered time-domain transform coeffici

ent n.
Each recovered time-domain signal sample may be

obtained from the time-domain transform coef?cients
i(n) according to
C(k)=recovered MDCT frequency-domain trans

form coefficient k,
30
S(k)=recovered MDST frequency-domain trans

form coef?cient k, and
.i(n)=recovered time-domain signal sample n.
I. Inverse Post-Transform Function
35
The processing requirements of the technique used to

evaluate the IMDCT and the IMDST may be reduced
by instead evaluating an Inverse DCT (IDCT) and an
Inverse DST (IDST) and applying an inverse post
transform function after application of the inverse trans
forms. This inverse post-transform function is repre
sented by box 210 in FIG. 2.
J. Output Sample Processing

An overlap-add process is required to generate sam
ples corresponding to signal samples encoded by a com
panion encoder. This process, represented by box 212 in
For E-TDAC, the inverse post-transform function

splits time-domain transform coef?cients into signal
FIG. 2, overlaps adjacent time-domain sample blocks
samples. Using a derivation similar to that discussed 45 recovered from the inverse transform and adds the
above for the forward transform, it can be shown that,
samples in one block to samples in the adjacent over
with an appropriate inverse post-transform function,

discussed below, the IMDCT of length N can be imple
mented by an IDCT of length 5N;
in) =
O a(k)(k)oos
lapped block.
(l5)

ment of the present invention requires, in addition to
overlap-add, the application of a synthesis window to
the time-domain sample blocks. The constraints the
[n + -Dmo 5 n < ; N
E-TDAC transform places upon the design of the syn

thesis window, the analysis window, and the overlap
add process is_ discussed fully in the paper by Princen
55
where
y(n)=recovered time-domain transform coefficient
and Bradley referred to above.
III. Alternative Fixed-Length Embodiments

Alternative embodiments of the present invention
may achieve greater reductions in processing require
ments. The following description discusses the differ
n, and
ences between these alternative embodiments and the

basic embodiment described above.
2 otherwise.
65
Each recovered time-domain signal sample 2(n) may
A. E-TDAC Implemented by DFT
be obtained from the time-domain transform coeffici
In one alternative embodiment of the present inven

tion for an encoder, the forward E-TDAC transform is
ents y(n) according to
implemented by a Discrete Fourier Transform (DFT).
5,297,236
13
A forward pretransform function generates an alter

nating sequence of two types of blocks comprising mod
ified samples; one block type comprising modi?ed sam
ples p(n), and a second block type comprising modi?ed
14
(27)
M) cos
@[l _ k)
[1(k)=sin(i)-(k)-cos(l)-(l-k)
'
'
samples r(n). Each modi?ed sample is formed from the

combination of one pair of signal samples x(n) accord
ing to
6(k) + sin? "k

N
(23)
N
(21)
'
(19)
+ 2n ]modN)+
m) = cos
1v
(l
2 - k )1 sin
sun
(3O)
15
An inverse transform generates an alternating se

quence of two types of blocks comprising recovered
time-domain transform coefficients by applying an
IDFT to the alternating sequence of frequency-domain
transform coefficient blocks; one block type comprising
recovered time-domain transform coefficients 150:), and
a second block type comprising recovered time-domain
transform coefficients i'(k). The IDFI' used to recover
the time-domain transform coefficients is shown in
A flowgraph for a 16-sample block illustrating this for

ward pretransform function is shown in FIG. 4.
The forward E-TDAC transform is implemented by

a DFT which generates alternating sets of complex
equations 31 and 32;
valued frequency-domain transform coefficients, P(k)

of the form T(k)+j-U(k) and R(k) of the form
V(k) + j -W(l<), in response to the alternating sequence of 30
modi?ed sample blocks;
35
Time-domain signal samples i(n) are recovered by

applying an inverse post-transform function to each
recovered coefficient from the alternating sequence of
blocks comprising recovered time-domain transform

coefficients. Signal samples are recovered from blocks
Spectral information corresponding to E-TDAC

transform coefficients C(k) and S(k) is obtained by ap
plying a forward post-transform function according to;
comprising the fade) coefficients according to

(33)
45
(25)
C(k) = cos ( 1',
m) + sin
Um,
V(k) - cos
W(k).
(26)
S(k) = sin
(34)
In one alternative embodiment of the present inven

tion for a decoder, the inverse E-TDAC transform is 55
3N
implemented by an Inverse DPT (IDFT).
An inverse pretransform function recovers spectral

information C(k) and S(k), corresponding to E-TDAC
transform coefficients C(k) and S(k), respectively, from
the encoded signal, and generates in response to the
recovered spectral information an alternating sequence
of two types of blocks comprising recovered frequency
domain transform coefficients; one block type compris
Signal samples are recovered from blocks comprising

the i-(k) coefficients according to
ing regoveredcomplex-valued coefficients P(k) of the

form T(k) +j-U(k), and a second block type comprising
r_ecovered complex-valued coef?cients R(k) of the form

V(k) +j-W(k). Each frequency-domain transform coef
?cient is obtained according to
65
(35)
5,297,236
15
-continued
decoder, the IMDCT and the IMDST of one or more

inverse E-TDAC transforms are implemented concur
fa: 4 3N ]mod [%:Dfor0 n < N, n even,
rently by one or more IDFTS.
An inverse pretransform function recovers spectral

information C(k) and S(k), corresponding to E-TDAC
transform coefficients C(k) and S(k), respectively, from
(36)
3H- l n
i0!) = F
mod
16
In another embodiment of the present invention for a
the encoded signal, and generates in response to the

recovered spectral information a sequence of blocks
comprising recovered complex-"valued frequency

domain transform coefficients Q(k) of the form
-'r([ 3N_; 7' 4" ]mod [l%:l)for0 n < N. n odd.
G(k)+j-H(k) where G(k) and H(k) are obtained from

recovered spectral information according to
B. E-TDAC Implemented by Concurrent DFT

(40>
In another embodiment of the present invention for
60:) = cos
an encoder, the MDCT and the MDST of one or more

forward E-TDAC transforms are implemented concur
rently by one or more DFTS. In single channel encoder
applications, two adjacent frequency-domain coeffici
20
ent sets as illustrated in expression 5 above may be gen
[600 + Mm +
17k
sin[ N
erated concurrently by a single DFT. In two channel
(41)
applications, a MDCT coef?cient set for channel one
may be generated concurrently with a MDST coeffici

ent set for channel two, immediately followed by a 25
MDST coefficient set for channel one generated con
currently with a MDCT coefficient set for channel two.
Other combinations of coefficient sets for concurrent
processing are possible. For additional detail on concur
rent transforms, see generally, Brigham, The Fast Fou 30
rier Transform, Englewood Cliffs, N.J.: Prentice-Hall,

Inc., 1974, pp. 166-67.
The IMDCT and the IMDST constituting the in

verse E-TDAC transform are concurrently imple
mented by an IDPT which generates complex-valued
time-domain transform coefficients q(n) of the form
A forward pretransform function generates a se
quence of blocks comprising complex-valued modi?ed

samples q(n) of the form p(n)+j-r(n) where p(n) and
carefree-kitted]
p(n) +j-f(n) according to
35
r(n) are formed from the application of the forward

pretransform function described above and shown in
equations 21 and 22.
The MDCT and the MDST constituting the forward
E-TDAC transform are concurrently implemented by a
Time-domain signal samples i(n) are recovered from

the application of the inverse post-transform function
described above and shown in equations 33 through 36.
DFT which generates complex-valued frequency

domain transform coef?cients Q(k) of the form
G(k) +j-H(k) according to
vIV. Adaptive-Length Embodiments

45
A. Bridge Transform
As mentioned above, it is desirable that the technique
which improves transform processing efficiency should

also permit adaptive selection of the transform length.
Spectral information corresponding to E-TDAC

transform coefficients C(k) and S(k) is obtained by ap
Although the means and considerations required to

implement an adaptive-transform-length coder is not
included within the scope of the present invention, it is
plying the forward post-transform functions according
noted that a more detailed discussion of adaptive-length
transform coding is set forth in copending US patent

55
application Ser. No. 07/508,809, filed Apr. 12, 1990,

which is hereby incorporated by reference,
Changes in the length of either the E-TDAC trans
form or the O-TDAC transform may require changes in
the phase term In in order to realize time-domain alias
ing cancellation. FIG. 5 is a hypothetical graphical
representation of two adjacent overlapped N-sample

length time-domain signal sample blocks recovered by
an inverse E-TDAC transform, one block recovered
from the IMDCT and the second block recovered from

the IMDST after synthesis windowing but before over
lap-add of the adjacent blocks has cancelled time
domain aliasing. The representation in this and other
figures does not show individual signal samples, but
17
5,297,236
rather illustrates only the envelope of windowed signal

sample blocks.
Each recovered signal sample block comprises two
18
tion exceed the scope of this discussion, a bridge trans
form improves coder performance by instead trans
forming a single block of 2N samples.

The bridge transform required to process the 2N
components: one component represented by a solid line
substantially corresponds to the analysis- and synthesis

windowed input signal samples, and the second compo
sample block shown in FIG. 8 may be implemented by

an FFI to compute the transform for three iN blocks
nent represented by a broken line is the time-domain
followed by a recombination operation. This technique
aliasing distortion. As discussed above, the aliasing

component is a time-reversed replica of the windowed
input signal samples which occurs in two separate re
gions. The phase term In for the E-TDAC and the O
TDAC transforms controls the location of the bound
ary between these two regions. For ?xed-length E
TDAC and O-TDAC transforms, the boundary is lo
cated at the mid-point of the signal sample block. The 15
phase term required for time-domain aliasing cancella
is known in the art and is discussed in Oppenheim and
Schafer, Digital Signal Processing, Englewood Cliffs,

N.J.: Prentice-Hall, Inc., 1975, pp. 307-14. The FFT
with this recombination operation can also be used to
concurrently process two E-TDAC bridge transforms

in the same manner as that brie?y discussed above for
?xed-length transforms. It is important to note, how

ever, that concurrent processing in E-TDAC is possible
only for a MDCT and MDST which have the same

tion under this condition is shown in equation 6.
FIG. 6 is a hypothetical graphical representation of
length.
three time-domain signal sample blocks recovered from
For the decoder, the length of the inverse transform
an inverse E-TDAC transform prior to overlap-add. 20 may be established by side-information passed by the
The ?rst block is an N-sample length block which has
encoder in the encoded signal. The same considerations
been recovered from the IMDCT. The second and third
for adaptive-length transforms and bridge transforms
blocks are iN-sample length blocks which have been
that were discussed above for the forward transform,
recovered from the IMDST. The aliasing component in
including the phase term required for time-domain alias
the N-sample length MDCT block comprises a replica 25 ing cancellation, also apply to the inverse transform.
of the ?rst half of the signal sample block reversed in
time about the one-quarter point of the sample block,

and a replica of the second half of the sampled signal
reversed in time about the three-quarter point of the
sample block. If overlap-add of the MDCT block and
The following describes differences between adap

tive-length embodiments of the present invention and
the ?xed-length embodiments discussed above. The
structure of each adaptive-length embodiment is sub
cancel time-domain aliasing, the time-domain aliasing

component in the ?rst MDST iN-sample length block
length embodiment. The most signi?cant differences
time-domain aliasing component with these characteris
overlapping the immediately prior block by a samples,

and overlapping the immediately subsequent block by b
the ?rst MDST sample block shown in FIG. 6 is to 30 stantially the same as that for a corresponding ?xed
pertain to the pre- and post-transform functions, and to

the length and phase terms of the transform functions.
must be a replica of the entire block inverted in sign and
In the following discussions, each time-domain signal
time-reversed end-for-end. The phase term m required
by the MDST and IMDST transforms to produce a 35 sample block is de?ned to be a+b samples in length,
tics is m: .
It can be shown that the phase term may be written
generally as
samples. It is assumed that the number of samples in the

two overlap intervals may vary from block to block.
According to the conventions established in the previ

ous discussion, the bridge transform applied to each
time-domain signal sample block is an adaptive-length
(43)
(a+b)-point transform.
where d) is the location of the boundary between time

reversal regions, expressed as the number of time
domain signal samples from the right-hand or trailing
edge of the time-domain signal sample block.

For example, FIG. 7 illustrates two window
weighted time-domain signal sample blocks. The right

hand block is 5N samples in length. Within this block,
50
the boundary between time-reversal regions is at a point

N/ 8 samples from the right-hand or trailing edge of the
B. E-TDAC Implemented by DCT/DST

One adaptive-length embodiment corresponds to the
?xed-length basic embodiment discussed above. The
forward pretransform, corresponding to the functions
shown in equations 9 and 12 of the ?xed-length embodi
ment, generates an alternating sequence of modified
sample blocks according to
block. Thus, the phase term In required to cause time
reversal of the aliasing component within each region of

55
the iN-sample block is
With this background established, it is now possible

to introduce the bridge transform. A bridge trans
form is a transform which bridges the shift from one
transform length to another. For example, as shown in
FIG. 8, suppose the present invention is called upon to 65
process one block of {:N samples followed by another
block of {N samples. It is possible to perform a separate
transform for each block. For reasons whose explana
(45)
5,297,236
19
20
A flowgraph illustrating this forward pretransform

function, for a l6-sample block with a==4 and b=12, is
shown in FIG. 9.
The forward transform comprises a DCT and a DST
according to
The forward E-TDAC transform is implemented by

a DFI' which generates alternating sets of complex
The inverse transform comprises an IDCT and an
IDST according to
valued frequency-domain transform coefficients, P(k)

of the form T(k)+j-U(k) and R(k) of the form
V(k) +j-W(k), in response to the alternating sequence of
modi?ed sample blocks according to
2
0
a+b
2
25
aib
2
(49)
A
a+b kil a(k)S(k)sin (
M)
30
The inverse post-transform function, corresponding
The forward post-transform, corresponding to the
to the functions shown in equations 16, 17, 19, and 20

for the ?xed-length embodiment, recovers time-domain
signal samples from time-domain transform coefficients

according to
functions shown in equations 25 and 26 for the ?xed

3 III
length embodiment, generates alternating sets of spec

tral information according to
(53)
C(k) = 605K alkb
7(k) + sin( affb
U(k),
S(k) = sin ( afb
V(k) - cos( affb
W(k).
(59)
45
The inverse pretransform function, corresponding to

the functions shown in equations 27 through 30 for the
?xed-length embodiment, generates an alternating se
quence of blocks comprising recovered frequency
domain transform coefficients; one block type compris
ing recovered complex-valued coefficients P(k) of the

form T(k) +j-U(k), and a second block type comprising
recovered complex-valued coefficients ll(k) of the form

V(k) +j-W(k). Each frequency-domain transform coef
55
C. E-TDAC Implemented by DFT

Another adaptive-length embodiment corresponds to
the ?xed-length embodiment of the E-TDAC transform
implemented by a DFT, discussed above. The forward
pretransform, corresponding to the functions shown in 65
equations 21 and 22 for the ?xed-length embodiment,
generates an alternating sequence of modi?ed sample
blocks according to
ficient is obtained according to
5,297,236
22
D. O-TDAC Implemented by DST
The O-TDAC transform utilizes a MDCT of the

form
10
where E(k)=frequency-domain transform coefficient k.
The inverse transform generates an alternating se
The processing requirements needed to implement
quence of two types of blocks comprising recovered

time-domain transform coefficients by applying an
IDFT to the alternating sequence of frequency-domain
transform coefficient blocks; one block type comprising
recovered timesdomain transform coefficients {)(k), and
a second block type comprising recovered time-domain
transform coefficients f'(k). The IDFT used to recover
the time-domain transform coefficients is shown in
equations 64 and 65;

Z
ples to generate modified samples e(n), then applying a

DST to the modi?ed samples to generate frequency
domain transform coefficients X(k). The forward pre
transform function is
20
(71)
217i _]
I
p(n)= a+b
this transform can be reduced by applying a forward

pretransform function to the time-domain signal sam
2
.
121 m-
kio P(k)e
(64)
a
for0<n<-b-2 ,
25
22a -1
2 k
2,
<6
Rn)=aib k5 Rage "+5ror0n<-2Lb.
(72)
The inverse post-transform function, corresponding

to the functions shown in equations 33 through 36 for
the fixed~length embodiment, recovers time-domain
signal samples from time-domain transform coefficients

according to
35
A flowgraph illustrating this forward pretransform
(66)
function, for a l6-sample block with a=4 and b=12, is

shown in FIG. 10. The minus signs denote terms which
are subtractively combined with an associated sample
for the function shown above in equation 71.

The forward transform comprises a DST according
to
(67)
45
(73)
- l
O e(n)sin (
211
[Haw]
The inverse transform comprises an IDST according

0
(63)
a") =
55
(74)
[Haw-1)
(69)
where
(k)=recovered time-domain transform coefficient,

A and
65
E(k) = recovered frequency-domain transform coeffi

cient k.
The inverse post-transform function recovers time
domain signal samples x(n) from time-domain transform

coefficients according to
23
5,297,236
24
samples x(n) from another respective one of said
time-domain signal sample blocks according to
(76)
5 Matti 3:
(77)
said forward transform means generates spectral co
iii-M
ef?cients C(k) by applying a discrete transform

15
sample blocks and generates spectral coefficients
We claim:
1. In an encoder a combination for the ?ltering of one
or more signals comprising input samples, said combina
tion comprising
means for receiving said input samples representing
function substantially corresponding to a Discrete

Cosine Transform function to said ?rst modi?ed
20
stimuli intended for human perception,

input buffer means for grouping said input samples
into time-domain signal sample blocks of length N,
where N is a positive integer, wherein said input 25
samples are analysis-window weighted samples,
S(k) by applying a discrete transform function sub

stantially corresponding to a Discrete Sine Trans
form function to said second modi?ed-sample
blocks.
3. The combination according to claim 1 wherein
said forward pretransform means generates ?rst
modi?ed-sample blocks comprising modi?ed sam
ples p(n) formed from the additive combination of
a pair of analysis-window weighted samples x(n)
analysis means for generating spectral information in

response to said time-domain signal sample blocks,
from a respective one of said time-domain signal
said spectral information comprising spectral coef

?cients C(k) and S(k) substantially corresponding
to the frequency-domain transform coef?cients of
an Evenly-Stacked Time-Domain Aliasing Cancel
lation transform applied to said time-domain signal
sample blocks, wherein said spectral coef?cients

C(k) and S(k) substantially correspond to Modi?ed
Discrete Cosine Transform coefficients and Modi
?ed Discrete Sine Transform coef?cients, respec
35
tively, comprising
and said forward pretransform means generates
forward pre-transform means for generating modi

?ed-sample blocks comprising QN modi?ed sam
ples by combining one or more pairs of analysis
window weighted samples to form said modi?ed
second modi?ed-sample blocks comprising modi

?ed samples r(n) formed from the subtractive com
bination of a pair of analysis-window weighted

samples x(n) from another respective one of said
samples, and
forward transform means for generating frequency
domain transform coef?cients by applying one or
more discrete transform functions to said modi?ed
sample blocks, and

means for generating an encoded signal in response to
said spectral information.


ples y(n) formed from the additive combination of
said forward transform means generates a ?rst set of

55
complex-valued frequency-domain transform coef

?cients P(k) of the form T(k) +j'U(k) by applying
a discrete transform function substantially corre
sponding to a Discrete Fourier Transform to said
?rst modi?ed-sample blocks and generates a sec
ond set of complex-valued frequency-domain

transform coef?cients R(k) of the form
V(k) +j-W(k) by applying a discrete transform
Fourier Transform to said second modi?ed-sample
blocks, and
and said forward pretransform means generates 65 wherein said analysis means further comprises forward
second modi?ed-sample blocks comprising modi

?ed samples z(n) formed from the subtractive com
bination of a pair of analysis-window weighted
post-transformmeans for generating said spectral coef

?cients C(k) by applying a forward post-transform
function to said ?rst set of complex-valued frequency
5,297,236
25
domain
transform
coef?cients
according
to
26
-continued
C(k)=cos(1rk/N)-T(k)+sin('rrk/N)-U(k) and for gener
ating said spectral coefficients S(k) by applying a for

ward post-transform function to said second set of com
plex-valued frequency-domain transform coefficients 5

according to S(k)=sin(1rk/N)-V(k)-cos(1rk/N)-W(k).
5. In an encoder a combination for the ?ltering of one _
or more signals comprising input samples, said combina

tion comprising
for receiving said input samples representing
modi?ed-sample blocks comprising complex- 10 means
stimuli intended for human perception,
valued modi?ed samples q(n) of the form
input buffer means for grouping said input samples
p(n) +j-r(n) wherein each p(n) is formed from the
into time-domain signal sample blocks of length
additive combination of a pair of analysis-window
a+b, where a and b are non-negative integers and
weighted samples x(n) from a respective one of said
the sum a+b is positive, wherein said length varies
15
from block to block one or more times, and
wherein said input samples are analysis-window

said forward pretransform means generates said
weighted samples,
forward pre-transform means for generating modi
tied-sample blocks comprising (a+b) modi?ed

samples by combining pairs of analysis-window
weighted samples to form said modi?ed samples,
forward transform means for generating frequency
domain transform coefficients by applying one or
and each r(n) is formed from the subtractive combi- 25
nation of a pair of analysis-window weighted sam
ples x(n) from another respective one of said time
more discrete transform functions to said modi?ed
sample blocks, and

means for generating an encoded signal in response to
said frequency-domain transform coefficients.
domain signal sample blocks according

30


ples y(n) formed from the additive combination of

said forward transform means generates a set of com
plex-valued frequency-domain transform coeffici

ents Q(k) of the form 604) +j-H(k) by applying a 40
discrete transform substantially corresponding to a
Discrete Fourier Transform to said modi?ed-sam
ple blocks,
and
wherein said analysis means further comprises forward

post-transform means for generating said spectral coef
45
?cients C(k) by applying a forward post-transform

function to said set of complex-valued frequency
domain transform coefficients according to
50
and
said forward pretransform means generates second
modified-sample blocks comprising modified sam
ples 2(n) formed from the subtractive combination

of a pair of analysis-window weighted samples x(n)
from another respective one of said time-domain
signal sample blocks according to
and for generating said spectral coefficients 50:) by 60

applying a forward post-transform function to said set
of complex-valued frequency-domain transform coeffi
and
said forward transform means generates spectral co
cients according to
efficients C(k) by applying a discrete transform

65

Cosine Transform function to said ?rst modi?ed
sample blocks and generates spectral coefficients

S(k) by applying a discrete transform function sub

United States Patent (191: Antill Et A1

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

United States Patent (191: Antill Et A1

Uploaded by

Copyright:

Available Formats

l l l l l l l l l l l l l l l l l l l l l l l l l l Il l l l l l l l l l l

United States Patent [191

[11] Patent Number:

Speech, IEEE Trans. Acoust, Speech, Signal Proa,

DECODER, AND ENCODER/DECODER

Jayant and N011, Digital Coding of Waveforms, Engle

[75] Inventors: Michael B. Antill, San Rafael; Grant

[73] Assignee: Dolby Laboratories Licensing

[21] Appl. No.: 710,805

Related US. Application Data

Mar. 22, 1994

DIGITAL FILTER BANK FOR ENCODER,

[54] LOW COMPUTATIONAL-COMPLEXITY

Int. Cl.5 .............................................. .. G10L 9/00

wood Cliffs, N.J.: Prentice-Hall, Inc., 1984, pp.

Princen, Bradley, Analysis/Synthesis Filter Bank De

Princen, Johnson, Bradley, Subband/Iransform Cod

/Subband Coding, IEEE Trans. Acoust, Speech, Signal

and Signal Processing.

Fielder et al. ....................... .. 381/36

FOREIGN PATENT DOCUMENTS

Barnwell, Filter Banks for Analysis-Reconstruction

9116769 10/1991 World Int. Prop. 0. .

Primary ExaminerAllen R. MacDonald

Brigham, The Fast Fourier Transform, Englewood

Oppenheim and Schafer, Digital Signal Processing, En

Assistant Examiner-Michelle Doerrler

with the Discrete Fourier Transform, Proc. IEEE, vol.

sis and synthesis ?lter banks used in encoding and de

66, Jan., 1978, pp. 51-83.

coding. The invention perrnits the length of a digital

34 Claims, 6 Drawing Sheets

Mar. 22, 1994

Mar. 22, 1994

< REGION 1 >< REGION 2

Mar. 22, 1994

X(")A TIME REVERSAL 1 TIME REVERSAL I

- REGION 1 ->|~ REGlON2->i

TIME REVERSAL I TIME REVERSAL

I< DST >!~ DST

Mar. 22, 1994

i -2- SUBBLOCK w SUBBLOCK

Mar. 22, 1994

bandwidths which are sums of individual transform

LOW COMPUTATIONAL-COMPLEXITY DIGITAL

banks and digital block transforms is essentially the

Signal Proc, ASSP-27, October, 1979, pp. 512-30.

ing U.S. application Ser. No. 07/458,894, ?led Dec. 12,

refer to portions of the useful signal bandwidth whether

implemented by a true subband coder or a transform

07/303,714, ?led Jan. 27, 1989, now abandoned a con

coder. The terms transform and transforming shall

tinuation-in-part of copending U.S. application Ser. No.

include digital ?lters and digital ?ltering, respectively.

07/508,809, ?led Apr. 12, 1990, now abandoned and a

In most digital coding applications, processing re

continuation-in-part of copending PCT/US9l/025l2,

of subband ?ltering. Improved processing ef?ciency

The invention relates in general to digital encoding

which are less expensive to build, or which impose

lower signal propagation delays through an en

Discrete Fourier Transform (DPT), the Discrete Co