Professional Documents
Culture Documents
demands on a digital storage medium, system (CD-Audio) format was de- ity, and flexibility. Thus the CD is
A 60-min musical selection, recorded veloped to meet these demands. A highly suitable for many nonaudio
in stereo with pulse-code modulation CD-Audio disk holds a specification- applications. As a result, a number
· (PCM), with a sampling rate of 44.1 defined maximum of 74 min of high- of alternative CD formats have been
kHz and 16-bit quantization, gen- fidelity music on a highly robust and developed. For example, a CD-ROM
erates over 5 billion bits. Error cor- economically manufactured disk. A disk may store over 550 Mbytes of
rection, synchronization, and mod- Compact Disc (CD) player provides software or database information, and
ulation are needed for successful access to any part of the audio pro- a CD-I disk may interactively present
storage; total capacity required is gram within a second or less. with several hours of music, along with
over 15 billion bits. In addition, a many user-convenience features. The video and text information. The CD
commercially popular music storage format is well suited to audio's stor- is only the first of many optical disk
medium demands random access, age demands [1]. storage technologies for both the
small size, convenience of use, ro- The magnitude of the task of digital consumer and the audio professional.
bustness, low cost, and ease of rep- audio storage and the magnitude of This article informally reviews the '
lication. Clearly, digital audio's re- the achievement of the CD have re- technical specifications of the CD
quirements are considerable, sulted in an optical storage medium format, in terms of both physical and
250 ,.. J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April
data specifications. Also presented ameter 15 mm, and the thickness 1.2 surface is covered with a very thin
is the encoding method used to pre- mm. The innermost diameter does (50-i00-nm) metal layer, such as
pare audio data for disk storage on not hold data; it provides a clamping aluminum, silver, or gold, and an-
the CD-Audio format. In addition, area for the player to hold the disk other thin (10-30-1xm) plastic pro-
the developing family of CD appli- to the spindle motor shaft. Data are tection layer, with the identifying
cations, including CD-ROM, CD-I, recorded on a 35.5-mm-wide area. label (5 Ixm)printed on top, as shown
CD-V, and recordable CD formats, A lead-in area rings the innermost in Fig. 3. A laser beam is used to
are described [2]. data area, and a lead-out area rings read the data. It is applied from below
the outermost area. They contain and passes through the transparent
COMPACT DISC nonaudio data, which are used to substrate and back again. The beam
[ _ O) )) PHYSICAL control the player's optical pickup, is focused on the metalized pit sur-
_j SPECIFICATION polycarbonate)
A transparentsubstrate
plastic forms
(typically
most data
face, on
embeddedinsidethe
a d4sk are read disk.
by a Since
light
of a disk's i.2-mm thickness. Data beam, playing a CD causes no more
The physical characteristics of the are physically contained in pits that wear to the recording than reading
CD are shown in Fig. 1. The disk are impressed along the disk's top causes to the words printed on this
diameter is 120 mm, the hole di- surface, as shown in Fig. 2. The pit page.
between the disk and the pickup. The fact that the disk data surface
Compact Disc Pit Track To accomplish this, the rotation is physically separated from the
_._ formrelative
speed of a CDvelocityis
varies depending
maintained
on reading side of the substrate provides
the radial position of the pickup. Be- one of the most significant assets of
In the microscopic world on the cause each outer-track revolution the CD system. Dust and surface
data surface, the disk contains a contains more pits than each inner- damage do not lie in the focal plane
track of pits arranged in a contin- track revolution, the disk must be of the reading laser beam and hence
uous spiral running from the inner slowed down as it plays outward, to their effect is minimized. Specifi-
circumference to the outer. By maintain a constant data rate. The cally, any obstruction less than 0.5
starting from the inside, future disk rotates at a speed of 500 r/min mm becomes insignificant and causes
adoption of smaller and larger di- when the pickup is reading the inner no error in the readout.
ameter disks is facilitated, circumference, and as the pickup The polycarbonate substrate has a
The pits are amazingly small, moves outward, the rotatiorial speed refractive index of 1.55 (as opposed
considerably smaller than we bulky gradually decreases to 200 r/min, to 1.0 for air); the velocity of light
humans can appreciate. For example, Thus a constant linear velocity is slows from 3 x 105 to 1.9 x 105 km/
a pit is about 0.5 Ixm wide, or about maintained. In other words, all of s. When the velocity of light slows,
700 times smaller than a pin prick, the pits are read at the same speed, the beam is bent. Because of the
The track pitch, the distance between regardless of the circumference of bending from the refractive index,
successive tracks, is 1.6 ixm, as thatpart of the spiral. Thisis accom- the thickness of the substrate, and
shown in Fig. 4. There is a defined plished through a CLV servo system, the numerical aperture of the lens of
maximum of 20 188 revolutions The player constantly reads frame the laser pickup, the size of the laser
across the disk's signal surface of synchronization words from the data beam decreases from approximately
35.5 mm. Depending on the playing
time, a pit track might contain 3 bil-
lion pits. If unspiraled, this-track
would stretch about 3 miles (5 km).
The construction of the CD is dif-
fraction limited, that is, the dimen-
sions are as small as· permitted by \
252
tween pits, virtually all of its light
CD is reflected, and when it strikes a
120 mm pit, virtually all of its light is can-
flected.
i celed--hencevirtuallynoneis re-
In practice, the laser spot is larger
15 mm than required for complete cancel-
000x MAGNiFiCATiON lation between pit and land reflec-
tions, and pits are made slightly
shallower than the theoretical figure
5 _rn of one-quarter wavelength; this yields
a better tracking signal, among other
things. Typically the presence of a
bump reduces reflective power by
about 25%. In any case, the presence
of pits and land is thus read by the
laser beam; specifically, the disk
10-30 _m
surface modulates the intensity of the
light beam. Thus the data physically
encoded on the disk can be recovered
by the laser and later converted to an
METALIZEDPIT electric signal.
SURFACE
0.8 mm on the substrate surface to beam to interfere destructively with The music on a CD is the end result
approximately 1.7 _m at the pit sur- one another and to cancel each other, of an elaborate data transformation
face. The laser beam is thus focused Optically, if the CD pit surface is that takes place during master en-
to a point slightly larger than a pit considered a two-dimensional re- coding, and that undergoes decoding
width, flective grating, the focused laser each time the disk is played. The
The top of the pit surface is met- beam diffracts into higher orders, complete encoding process, from
alized. This reflective surface, known resulting in interference. In theory, sampling to data bits on the disk, is
as land, causes almost 90% of the when the beam strikes an area be- shown in Fig. 5. The premastered
laser light to be reflected back into
the pickup. When viewed from the 1.____,
1
perspective of the laser underside,
the pits appear as bumps. The height
of each bump is between 0.11 and -o.5tm_..,
0.13 I_m.than
smaller Thisa dimension is slightly
carefully chosen di- _'"' _ _) I
mension:the laserbeamwavelength | III DISK
' 1
I .J L.. I I I i SAMPL,NG
PE.,ODS
j
s
f '"..
110111001
I I
01110010 I 10001111 0101001011!32BITS PER SAMPLING PERIOD
I I_SYMBOLS OF 8
I I I ! BITS
DATABITS B2 . +8PARITY
33 DATASYMBOLS
·. _ .- 33 x 17 CHANNEL BITS
·. / . - + 27 SYNC BITS
. I -' ' 588 CHANNEL BITS
CHANNEL BITS B
_ lO00010010001000000_OOmO_OOmO00_ I I mOOOOOOOOO_OOOOOOOOOO_O×xx
I
SYNC
2xT_
Fig. 5. CD encoding process from input samples to channel bits. (Courtesy of Philips Technical Review.)
data are typically stored on a U-matic data. Six 32-bit PCM audio sampling about one error for every 0.1 to 1.0
video cassette via a digital audio periods (alternating from 16-bit left million bits. This is impressivestor-
processor with a 44.1-kHz sam- and right channels) are grouped in a age capability, but considering that
pling rate and 16-bit linear quanti- frame, left channel preceding right, a disk will output over 4 million bits
zation. Once the audio data adhere Each 32-bit sampling period is di- per second, the need for error cor-
to these basic specifications, they are vided to yield four 8-bit audio sym- rection is obvious. With error cor-
ready to be encoded to the CD for- bols. Considerable signal processing rection, over 200 errors per second
mat. follows topreparethe audiodata for can be corrected. To achieve such
CD encoding is the process of storage on the disk surface. In par- results, the CD employs the corner-
placing audio data in a format suitable ticular, error-correction encoding stones of error correction, interleav-
for storage on the disk. A frame must be accomplished, lng to distribute errors and parity to
structureprovidesa meanstodistin- correctthem[5].
guishthedatatypes.Theinformation The algorithm used for error
contained in a CD frame prior to Compact Disc Error- correction in the CD system is the
modulation includes a 27-chafinel- Correction Encoding cross-interleaveReed-Solomoncode
bit synchronization word, an 8-bit (CIRC). The CIRC algorithm uses
subcode(labeledC andD control twocorrectioncodesfor additional
and display in Fig. 5), 192 data The raw error rate from a CD. that correcting capability, and three in-
bits, and 64 parity bits [4]. is, the rate before error correction is terleaving stages to encode data be-
Encoding begins with the audio applied, is around 10-s to 10-6, or fore they are placed on a disk and to
256 J, Audio Eng. Soc., Vol. 36, No. 4, 1988 April
decode the data during playback, sures in one word. An erasure is a
Thanks to cross-interleaving, the symbol that has been negated by the
separation of two error-correction decoders because detection has as-
codes by an interleaving stage, one certained that its value is unreliable.
Reed-Solomon code can check the The parity symbols are placed in the
accuracy of the other code. The center of the block to permit inter- Following CIRC encoding, an 8-
Reed-Solomon codes used in CIRC polation in the event of large burst bit subcode symbol is added to each
are particularly well suited for the errors, frame. The eight subcode bits are
CD system because their decoding Cross-interleaving follows the C2 designated as P, Q, R, S, T, U, V,
requirements are relatively simple encoder. The 28 symbols are dela3)ed and W. Only the P or Q bits are re-
[6], [7]. by differing periods to store the data quired in the audio format. The CD
The complete CIRC encoding in 28 different words. The C1 error- player collects subcode symbdls from
scheme is shown in Fig. 6. With this correction encoder accepts 28 sym- 98 consecutive frames to form a sub-
encoding algorithm, data from the bols (from 28 different C2 words) code block, with eight 98-bit words.
audio signal are cross-interleaved, and produces four symbols of P par- Thus the eight subcode bits (P
and two encoding stages generate 8 ity. P parity is used to correct single through W) are used as eight different
bits of parity. Error-correction en- symbol errors and to detect and flag channels, with each frame containing
coding begins with the first stage of double and triple errors for Q cor- one P bit, one Q bit, one R bit, and
interleaving, designed to assist in- rection, so on. A subcode block is complete
terpolation. A word of 24 8-bit sym- A final interleave stage introduces with its own synchronization word,
bols is applied to the CIRC encoder, a fixed delay of one symbol to alter- instruction and data, commands, and
A delay of two symbols is placed be- nate symbols. This delay causes data parity. The start of each subcode
tween even and odd samples; this fa- in the output word to be spread over block is denoted by the presence of
cilities interpolation. In addition, the two words, it prevents small random So and S_ synchronization patterns
symbols are scrambled; this facili- errors from destroying more than one in the first symbol positions of two
tares concealment when errors are symbol, even if two adjacent symbols successive blocks. So and Si are two
beyond the capabilities of the system, in one word are destroyed. Finally patterns not used in the EFM code
The C2 error-correction encoder the P and Q parity symbols are in- (explained below), and thus can be
produces four symbols of Q parity, verted to provide nonzero P and Q uniquely identified. The subcode
Q parity is designed to correct one symbols with zero data. 32 8-bit block rate is 75 Hz.
erroneous symbol or up to four era- symbols leave the CIRC encoder. The P and Q subcode bits contain
C2
SCRAMBLER C2 INTERLEAVER C1SCRAMBLER
ENCODER C1ENCODER
W1 1
W2 2
Wa 3
W4 _ 4
W5 5
- 6
13 Ik
W21 25
! W22 _ 26
]_W23 27
· W24 _ __ _ 28
D= 4 28
29
30
31
32
information essential to accessing suppression method. Digital copy start of the lead-out area. Time and
audio data on the disk. The P channel prohibited (yes/no) is indicated, absolute time are expressed in mm-
contains a flag bit. It designates the The address information consists utes, seconds, and frames (75 frames
start of a track, as well as the lead- of 4 bits designating the three modes per second). Using indexing, up to
in and lead-out areas on a disk. Lead- for the Q data bits. Primarily mode 99 locations within tracks can be
in and lead-out signals tel1 the player I contains the number and start times marked.
where the music program begins and of tracks, mode 2 contains a catalog Mode 2 contains the catalog num-
ends. A lead-in signal consists of all number, and mode 3 contains other ber of the disk using the universal
Os appearing just prior to the begin- product codes Mode 1 stores infor- product code (UPC). Mode 2 also
ning of the music data. A start flag mation in the disk lead-inarea, pro- continues absolute time count from
appears just prior to the start of mu- gram area, and lead-out area; the data adjacent subcode blocks. Mode 3 is
sic. During the last music track, pre- format in the lead-in area differs from used to give the international standard
ceding the lead-out, a start flag ap- those in the other areas. Mode 1 in- recording code (ISRC) for identifying
pears again. That flag gives the start formation occupies at least nine out each music track. It is present only
of lead-out and the P channel remains of ten successive subcode blocks, in the program area. The ISRC num-
ar 0 for 2 to 3 seconds. Following Mode 1 lead-in information is con- bet gives the country code, the owner
that time, a P signal consisting of tained in the table of contents (TOC). code, the year of recording, and a
alternating ls and Os (at a 2-Hz rate) The TOC stores data indicating the serial number. Mode 3 also continues
appears. These various sets of iden- number of music selections (up to absolute time count. Modes 2 and 3
tity signals can be used by aba- 99) as a track number, and the starting occupy at least one out of 100 suc-
sic player to control the optical points of the tracks in disk running cessive subcode blocks and can be
pickup, time. The TOC is repeated continu- present only in the program area.
The Q channel contains four basic ously in the lead-in area, and the Modes 2 and 3 can be deleted from
kinds of information: control, ad- starting point data are repeated in the subcode if not needed.
dress. Q data, and cyclic redun- three successive subcodeblocks. The Every audio CD contains addi-
dancy check code (CRCC). Each TOCis read during disk initialization, tional data capacity in the guise of
subcode block contains 72 bits of before the disk begins playing, the subcode bits. The other six bits
Q data, as described below, and 16 In the program and lead-out areas, (R, S, T, U, V, and W), which ac-
bits for CRCC, used for error de- mode 1 contains track numbers, in- count for about 20 Mbytes of 8-bit
tection on the control, address, and dices (subdivision numbers) within storage, are available for video or
Q data information. The control in- a track, time within a track, and ab- other information.
formation flag bits handle several solute time. Time is set to zero at
player functions. The number of the beginning of each track and in- __
dicated. This distinguishes between the beginning of a pause, time de- Encoding
aaudio
two- channels
and a four-channel
(two or four)CDis re-
in- creases, ending
creases to withofzero
the end the at the end
track. At (_ Compact Disc EFM
cording. Preemphasis (on/off) is of the pause. The absolute time is
indicated. A CD track may be en- set to zero at the beginning of the After the audio, parity, and sub-
coded with preemphasis, a noise- program area and increases to the code data are assembled, the bit
16-BITSAMPLE
MERGING+cHANNELBITSBiTS
IsITs [ I elis i SITS [
/,
F'l. I.lotol 1Iolololol _Iolol _1ololololo Io
EFM
S,*NALJ F"-"I t' L_._J 1
EFM
Fig.7. Assignmentof EFMwordsand additionof mergingbits.
stream is modulated using eight-to- succeeding patterns) is added to each is thus a compromise between several
fourteen modulation (EFM), as code pattern to aid in clock synchro- conditions. Because only 256 pat-
shown in Fig. 7. Blocks of 8 data nization and suppress the signal's terns are needed, 11 of the 267 pat-
bits are translated into blocks of 14 low-frequency component. The latter terns are discarded. (Two of them
channel bits, assigning an arbitrary is accomplished by selecting merging are used for subcode synchronization
and unambiguous word of 14 bits to bits that maintain the signal's average words So and Sp)
each 8-bit word. By choosing select digital sum value at zero [9]. The The resultant NRZ channel stream
14-bit words with a large distance ratio of bits before and after modu- produces pits and lands that are at
between transitions, greater data lation is 8:17. During demodulation, least 3 channel bits (3T), and no more
density can be achieved. It would be only 14 bits will be processed; the than 11 channel bits (1 IT) long, as
inefficient to store the 8-bit symbols three merging bits are discarded, shown in Fig. 8. It is the combination
directly on the disk. In addition, 8- The 8 data bits require 2s, or 256, of these varying dimensions that
bit symbols have many similar pat- different code patterns. However the physically encodes the data. The se-
terns. For example, if I bit was 14-bit channelword can offer 16 384 lection of EFM bit patterns defines
wrong, 00001011 would be confused combinations. To achieve pits of con- the physical relationship of the pit
with 10001011. With 14-bit words, trolled length, only those combina- dimensions. The pits and the inter-
more unique patterns can be selected, tions are selected in which more than vening reflective land on the CD sur-
EFM thus provides a kind of error two but fewer than ten Os appear face do not directly designate ls and
protection [8]. continuously. In addition, unique Os. Rather, each pit edge, whether
Blocks of 14 bits are linked by patterns are sought. Only 267 corn- leading or trailing, is a 1 and all in-
three merging bits; at least two binations satisfy these criteria. With crements in between, whether inside
merging bits (always Os) are required these patterns, the minimum inver- or outside a pit, are Os, as shown in
to prevent the possibility of succes- sion width of the nonreturn to zero Fig. 9. As discussed, this is a much
sive Is between serial words (a vi- (NRZ) signal is limited by the re- more efficient storage technique than
olation of the coding scheme). The cording density, and the maximum coding the binary bits directly with
additional merging bit (either a 1 or inversion width is limited by the pits.
a 0, depending on the preceding and clock bit extraction. The conversion With EFM there are more bits to
asa
For the pas;tfew years, audio
* professionals
' capabilities,it'simprovedin sonicqualityas well.
have been pmisingYamah_ REV7 digital reverb to We added more DSP chips to boost the
the skies So there was incrediblepressure to make REV5§processing power,creating smoother reverb
its successor even better than expected, sounds and multi-effect combinations Fullband-
Introducingthe REVS.Representinga break- width extends reverb to 20 KHz.
through in the sound barrier forreverb.And a In additionto master analogEQ,the REV5
collectivesigh ofsatisfactionfrom the overachieving has three-band parametric, programmable digital
designengineersat Yamaha. EQ.Sowhen youmake individualEQsettings,
Becausenot onlyisthe REV5matured in they'rerecalledwith each program.
accommodate, but with modulation
the highest frequency in the output
3T '_'"x_':-_ signalis decreased;thereforea lower
track velocity can be utilized. Hence
longer playing time is achieved. This
4 T '_ .... ":' is a very efficient encoding method
because the number of bits trans-
ST tr'
'__::.:_
"'_.....'....
'_"'
_.
....
"' _'-' '" mitted divided by the minimum
W"'"'"
'":_' ';'_"'_:':':':":"-:'_::':_'_ length between transitions needed on
6T ...............
""_"
"'_'___'".:_'!
:_'"
--" "' the medium to convey them is high.
7X :.:'._3-::¥_:!.-"
-' *' ' '' '__:.
C-"______}
"___'"
........... "--'
''
'.
' '-.
".._..
' ' Compact Disc Frame
Assembly
';:':...?_..'3.
_';'
'...: '_::3_'.:
_'-'
"'_
' '_-
'""
' '..
';"'
:-· "'-.
':;-:_.:::::'_;_:'.
Fig.8. Pit(and land)length;createdby EFM rangefrom 3Tto 11I Minimum 24-bit synchronization word is
pit length_0.833-0.972 ,,m; maximumpit length_3.054-3.56 t_m(1.2-1.4 100000000001000000000010 plus 3
m/s). merging bits.
The total number of channel bits
sameCheck
asthe REV7yotfreused
with your friendlyYamaha
to using. Professional OYAMAHA ®
Audiodealerabout thenew REV5. Once you hear Engineering
Imagination TM
CD FORMATS
4T 10T 4T 4T 4T 7T 5T 4T
I
I I I I I I I I I
I I I_1_1 J 1 J_l
I I I I I I I I I
01 000 1 000000000 1 0001 000 1 000 1 0 0 0 00 0 1 0000 1 000 1 0
k,,_ COMPACT
PLAYER DISC
DESIGN [
/ / 1
cuits,ultimately
yieldinga stereo
analogsignal. Thedata pathtypically .
consists of elements such as data
separator, deinterleaving RAM, error
detection, correction,
digital-to-analog
and conceal-
ment circuits, oversampling filters,
converters,and
HFSIGNAL
INPUT
FROM DISK
4 SIT DETECTION
EFM
DEMODULATION
H BUFFER
analogoutputfilters.Theservo,the i I
control, and the display system must CLOCK I I SYNCHRONIZATION
I
the disk,
direct the including
mechanical spindle
operation drive,of REGE
NERATION
1'_ TDETECGTION
AND I
autotracking,andautofocus,and I I,I ]
handle user interface with the play-
er's controls and displays. An op- CRYSTAL
erational block diagram of the data Fig. 10. Architecture of CD player, with oversampling filters.
path is shown in Fig. 10.
J CONTROL DISPLAY
I FUNCTIONS FUNCTIONS
MEMORY ]
Ti t
ti o
CONCEALMENT CONVERTER FILTER OUTPUT
AND DIGITAL
T
CONVERTER FILTER
_o,i,,,tor theIndisk.
many three-beam designs the
,,mi.t_,_,n,_, property of astigmatism is used to
_ A beamsare alignedaboveandbelow
ersthepittrackwhilethetwotracking
TRACK
SHIFTS
LEFT B -]--_'_ _=_ t_ error and to either side of the center beam.
ONTRACK
'[-'0_ :_ C _ _ (A+ B) - (C + D) When the part
properly, beamof is
each
tracking
trackingthe beam
disk
LENS
SPOT
POLARIZATION OBJECTIVE LENS_'_ f tN
__ J PHOTODIODE
..u- LENS I
ci_ CYLINDRICAL I .__ FACE PLATE
E
photo,,od
array
an, s
cessing circuits produce a signal re-
0 0
o°"::_oO
o°%Oo°'_'?_?_o
I"i 1-+-I 1IoI
I...."01] IoI . 1 I
CIRCUIT
TRACKINGERROR
+ 0 - SIGNAL
3T 4T _ 6T 7T 8T 9T 10T 11T
tion in each frame. A synchronization rection decoder C2. Given this prior frame of 32 8-bit symbols; 24 are
pulse is generated to aid in locating knowledge, and help from deinter- audio symbols and eight are parity
the zero crossing of the EFM pattern, leaving, C2 is able to correct burst symbols. Odd-numbered symbols are
and to generate a transition at those errors as well as random errors that delayed and parity symbols are in-
points, in order to produce a binary C1 was unable to correct, vetted. The delay lines have a delay
signal. The use of two correction decoders equal to the duration of one symbol.
The EFM signal is demodulated and cross-interleaving helps tackle a This deinterleaving places the au-
so that every 17-bit EFM word again particularly difficult error scenario, dio symbols back into their original
becomes 8 bits. Depending on player Interleaving distributes burst errors, order by rearranging their order as
design, demodulation is accom- perhaps caused by surface contain- read from the disk. Any sequence of
plished by logic circuitry or by a ination, over different words for errors on the disk is thus distributed
lookup table. A buffer is used to re- easier correction. However, correc- among valid data. This aids in the
move the effect of disk rotational ir- tion is difficult when a burst error C1 decoder's correction efforts, es-
regularities. Data input to the buffer coincides with a random error, per- pecially for small errors in adjoining
may be irregular, but clocking en- haps introduced by a manufacturing symbols.
sures that the buffer output is precise, defect. The C 1 decoder corrects sin- In the C 1 decoder, errors are de-
To guarantee that the buffer neither gle random errors and flags burst er- tected and corrected by the four P
overflows nor underflows, a correc- rors, so that interleaving can accom- parity symbols. Short-duration ran-
tion signal is generated and is used plish its task. EFM coding guarantees dom errors are corrected, and longer
to control the disk rotating speed, that a random error will never corrupt burst errors are passed along. Spe-
By varying the rate of data from the more than two symbols, and the even/ cifically, the C 1 decoder can correct
disk, the buffer is maintained at 50% odd interleave guarantees that a two- a symbol error in every word of 32
capacity, symbol random error will always ap- symbols. If there is more than one
Following demodulation, data are pear as single errors in two different erroneous symbol, then all 28 data
sent to a CIRC decoder for deinter- C 1 words after deinterleaving. Ran- symbols are marked with an erasure
leaving, error detection, and error dom errors are thus always correct- flag and passed on. Valid symbols
correction. It utilizes parity from two able, and the C2 decoder retains its are passed along unprocessed.
Reed-Solomon decoders as well as burst error-correction capability. The deinterleaving delays between
deinterleaving. The first error-cor- The CIRC decoding process re- the decoders enable the C2 decoder
rection decoder C1 is designed to verses the processing steps performed to correct longer burst errors. Be-
correct random errors and to detect during encoding; the complete CIRC cause the word arriving at C2 contains
burst errors. It puts a flag on all burst decoding process is shown in Fig. symbols fromC1 decoded at different
errors to alert the second error-cor- 15. The CIRC decoder accepts one times, symbols marked with an era~
266 J.Audio
l=ng.
Soc.,
VoL36,No.4, 1988April
CD FORMATS
C1 c1 c2 c2
UNSCRAMBLER DECODER DEINTERLEAVER DECODER UNSCRAMBLER
J b
' We
m) __ _ cONVERTER
DECODER I_o If J
J'_ D
M --
m--
si ' C2 FLAG
)2_ C1 FLAG
sure flag are now distributed among converter could result in an audible is typically used to oversample (per-
valid symbols. This helps the C2 de- click. Muting is accomplished haps at a two-, four-, or _ight-time
coder correct burst errors. Symbols smoothly by beginning attenuation rate); image components appear at
without a flag are assumed error-free many samples before the invalid data, multiples of the new sampling rate
and are passed through unprocessed, muting the invalid data, then [13]. Because the distance between
In the C2 decoder errors are corrected smoothly restoring the signal level, the baseband and the sidebands is
by the four Q parity symbols. C2 can This method of muting is often larger, a gentle analog filter can be
detect and correct single-symbol er- largely inaudible, used to remove the images, without
rors and correct up to four symbols causing phase shift or other artifacts
if flagged. Burst errors and errors that _'-'_X attributed to analog brickwall filters.
process itself (rather than in the me- Reconstruction stage of every player is an audio
dium)are corrected,
might have occurred in the encoding _) Compact Disc Signal deemphasis
Also presentcircuit.
in theSomeCDs are
audio output
In addition,C2 cancorrect symbols encoded with an audio preemphasis
miscorrected by C1 decoding. When Finally the digital data must be characteristic with time constants of
C2 cannot accomplish correction converted to a stereo analog signal. 15 and 50 I_S. On playback, deem-
(mom than four symbols are flagged), This reconstruction process requires phasis is carried out, resulting in an
24 data symbols are flagged as un- one or two digital-to-analog con- improvement in the signal-to-noise
corrected and passed on for inter- verters and low-pass filters to sup- ratio. CD tracks that are encoded with
polation, press high-frequency image eom- preemphasis are so denoted by an
The final step in error protection ponents. If the data are first converted appropriate flag in the subcode. The
is error concealment. Thus interpo- to an analog signal (with a digital- player reads the preemphasis flag and
lation and muting circuits follow the to-analog converter), they must be automatically switches in the deem-
CIRC decoder. Uncorrected words filtered with brickwall analog filters, phasis circuit when required.
are detected through flags and dealt Rather than use an analog brickwall Increasingly, CD players offer a
with, while valid data pass through filter after the signal has been con- digital output connection, adhering
unprocessed. Using error flags from verted to analog form, the digitized to the AES/EBU serial transmission
the C2 decoder, the player's sig- signal may be processed before dig- format. To permit an orderly ex-
nal processing circuits determine ital-to-analog conversion using an change of data between digital de-
whether to output the data directly, oversampling digital filter [ 12]. vices, the Audio Engineering Society
to interpolate them, or to mute the An oversampling digital filter uses has established a standard transmis-
sound, samples from the disk as input, then sion format for linearly represented
For continuous errors, muting is computes interpolation samples, digital audio data [14].
employed as a last resort; bad data digitally implementing the response The digital audio format transmits
passed on to the digital-to-analog of an analog filter. A transversal filter both left- and right-channel data us-
ing one digital cable. A time-sharing tended mainly as a medium for becomes 2352 bytes (24 bytes × 98
multiplexing transmission transmits professional databases and mass frames), a workable area.
or receives frames, each containing storage for computer-related appli- This 98-frame area is called a sec-
left- and right-channel data alter- cations. CD-ROM is a cost-effective tor; it forms the basic CD-ROM data
nately. The transmission rate corre- way of distributing large amounts of unit. This area is sufficiently large
sponds exactly to the source sampling information [15]. to handle numerical data without al-
frequency.°When the sampling fre- To clarify, CD-Audio is a specific tering the audio CD format. Audio
quency is 44.1 kHz, that number of application of CD technology. It is data are replaced by user data and
frames is transmitted per second. The a standard for recording PCM audio, system data; this occupies 2048
length of one channel data period is The CD-ROM standard is derived bytes. The remaining 304 bytes are
thus 11.34 Ixs. One frame consists from the CD-Audio standard, but used for additional synchronization,
of two subframes, each with 32 bits. defines a format for general data headers, mode selection, and error
Each subframe carries data for one storage. UnlikeCD-Audio, CD-ROM detection/correction, as shown in
audio channel. The first 4 bits are is not tied to any specific application. Fig. 16. Each disk is divided into
used for synchronization and for Both media use a 120-mm-diameter 330 000 sectors, and each sector is
identifying preambles. The next 24 optical disk, but with different data divided into four fields. Data are re-
bits carry audio data, with the most formats, corded with EFM, as in audio CDs.
significant bit transmitted last. Since Apart from modulation and error The first 12 bytes from the 2352-
most CD players use 16-bit words, correction overhead, a CD-ROM byte block are used as a synchroni-
only the last 16 bits in the field are holds a theoretical maximum of 682 zation word. The next 4 bytes form
used. In addition, the first 4 bits in Mbytes (8-bit byte) of user infor- a header field used for time and ad-
the field are set aside for auxiliary mation. In practice, a storage capac- dress flags. Specifically, it contains
audio or other data. The last 4 bits ity of approximately 550 Mbytes 3 address bytes and 1 mode byte. The
form a control field that carries spe- permits a better yield for manufac- address bytes store location as time:
cial information, including an audio turers. This is a large storage area, one address byte holds minutes (74
sample validity bit, a user data bit, equivalent to 3600 still video pic- min maximum), the second byte
an audio channel status bit, and a tures, 1500 floppy disks, 18 000 holds seconds (0-59 seconds), and
subframe parity bit. pieces of computer graphics, or 275 the third holds the sector number
The digital audio signal may be 000 pages of alphanumerics, de- within the second (75 sectors per
transmitted through a coaxial cable, pending on resolution, number of second). For example, an address of
a glass fiber cable, or an ordinary colors, and so on [16]. 59-20-45 denotes the 45th sector
audio phono cable. Through this in the 20th second of the 59th minute
transmission standard, different __ I on the disk. Although this repeats
manufacturers are assured of com- Memory Data Format [ subcodechannel, it facilitates address
digital
patibleinterconnectionand
audio devices from different
transfer _ Compact Disc Read-Only ]I information already found in the Q
searching.
of digital audio information. The modebyte indicatesthe nature
The CD digital audio standard is Although a CD-ROM disk is vis- of the user data. There are two data
perhaps the most sophisticated audio ually identical to an audio CD, it modes, as shown in Fig. 17. The
playbacl_ medium ever devised. Its employs a modified data format. A mode 1 format specifies that 2048
small size, economy, robustness, and CD-ROM disk automatically iden- bytes of each block are devoted to
capacity make it an ideal music car- titles itself as differing from an audio user data. The mode 2 format allows
rier. However, the utility of the CD through the Q subcode channel, for 2336 bytes of user data. The dif-
plastic disk is not limited to music. Data in a music CD are derived from ference between modes 1 and 2 is
Other formats have been derived from the b44.1-kHz sampled signal, with that mode 1 uses 288 bytes to provide
the original audio standard. 16-_it quantization. The 16 bits are an additional layer of error detection
divlded into higher and lower 8-bit and correction (EDC/ECC). This is
bytes. These data bytes (before EFM in addition to the basic CIRC error
mo_lulation) are grouped into frames correction used on all CDs. It ensures
of 24 bytes, and parity and a syn- a level of data integrity essential for
chromzatlon word are added. Whale information that does not degrade
satisfactory for music applications, gracefully, such as text and database
CD read-only memory (CD-ROM) a frame is too short for numerical binary code. Mode 2 trades this ben-
is the logical extension of the CD applications, and there is no provi- efit of data security for maximum data
format toward the much broader ap- sion for addressing. A solution was rate by making the additional 288
plication of information storage in derived from the subcode field in bytes per sector available for user
general. Rather than store music, the which subcode from 98 frames is data. The basic error-correction en-
CD format can be treated as a general summed. Specifically, in CD-ROM, coding is suitable for gracefully de-
ROM system, used for any kind of 98 24-byte frames are summed, thus grading data such as video and audio
program material. CD-ROM is in- the effective size for the d_ita area [17].
operating
cessor in system andcomputer.
the host the micropro-
CD- DO009 130010 . I:X)011 IXX)12 J I
aROM
CD-ROM
can be stored con-
veniently on a few disks; for example,
is thus telephone
an open_ended
book storage/
for the
United States would be feasible. As-
suming 100 million listed telephones
and 40 bytes of information associ-
I
._[ 0
37
IY20lll
1)2045
D2049
l:kn053
[
D2038
D2042
!:)2046
1)2050
[.I I)2054
II t
D2039
D2043
D2047
1)2051
1_55
D2040
D2044
I)2048
111_52
13_056
USERDATA
(2048BYTES)
Header (4)
!--
object code as the coding method for (ADPCM) is used for three levels of is a l-3-s pause when switching from
application software. Multimedia varying fidelity. The A Hi-Fi audio the end of one channel to the begin-
applications also call for physical level uses 8-bit ADPCM, with a ning of the next channel. The spec-
interleaving of the three basic types sampling rate of 37.8 kHz. The ifications of the audio quality levels
of data (audio, video, and text/bi- quality is similar to that of vinyl LP are summarized in Table 1 [21].
nary) to ensure synchronized pre- records. The B Mid-Fi audio level ADPCM is one of several spe-
sentation of different data types. Fi- uses 4-bit ADPCM with a 37.8-kHz cialized methods used to encode
nally, the disk must provide efficient sampling rate. The quality is similar digitized audio. It is a variation of
storage for both data that degrade to that of an FM broadcast. The C linear PCM and extremely useful in
gracefully and data that do not. The Speech audio level uses 4-bit certain applications. Because of its
former require maximum bandwidth ADPCM with an 18.9-kHz sampling ability to store digital data with fewer
while the latter require extended error rate. The quality is similar to that of bits, it is thus more efficient. On the
detection and correction, an AM broadcast. The CD-I format other hand, ADPCM requires addi-
also supports an information audio tional processing beyond regular
n,er
The CD-I format permits a high
degree of real-time interactivity. The
digitally, a CD-I disk is compatible picture qualities, namely, studio data format allows for real-time
with both the NTSC (United States picture quality and two graphics switching between sectors. Thus it
and Japan) and the PAL/SECAM qualities. Natural (photographic) is possible to have sectors with dif-
(Europe) television standards. CD-I pictures use DYUV (delta YUV, ferent information contents and with
disks may be manufactured to provide where Y refers to luminance and U different physical formats. CD-I is
either 525 scanning lines (NTSC) or and V refer to chrominance) coding, not a peripheral, but a self-contained
625 scanninglines (PAL); either type One full-frame natural picture is system. To ensure universal disk/
of disk is playable on any standard transferred in just over 0.6 s. The drive compatibility, dedicated hard-
television. When a standard disk is first graphics mode is designed for ware and interfaces are specified. A
played on a corresponding tandard end-user manipulation applications; CD-I player contains a CD-ROM disk
television (525 through 525, or 625 it is based on absolute RGB (red- drive as the system's input, decoder
through 625), negligible visual dis- green-blue) 555 coding, and supports chips for text, graphics, video, audio,
tortion is produced. When a disk is either 8-bit/256 colors or 15-bit/ and microprocessor controllers. The
played through a television adhering 32 768 colors. The second graphics CD-I system uses the Motorola 68000
to another standard (525 through 625, mode is based on CLUT (color lookup microprocessor family. A CD-I
or 625 through 525), approximately table) graphics, which permit 4-bit/ player could be interfaced to a tele-
7% visual distortion results. Gen- 16-, 7-bit/128-, or 8-bit/256-color vision and stereo sound system.
erally, the eye is able to perceive 5% full-screen animation. With com- Because the CD-I format recog-
distortion in a side-by-side compar- pression, the interleaving of sound nizes 16-bit PCM data (the first audio
ison, or 10% distortion otherwisb, and CLUT picture results in the abil- quality level), a CD-I player can play
Alternatively, a CD-I disk may be ity to provide full-screen animation regular CD-Audio disks. An audio
produced witha525/625 compatible with a picture refresh rate of 17 CD player cannot play CD-I disks
video format, which is playable on frames per second. (unless connected to a CD-I con-
either a 525 or a 625 television with troller). As specified in the High
a 3.6% distortion. _x, x Sierra Group proposal, CD-ROM
quality levels for resolution and pixel Text Encoding systems. On the other hand, a mi-
coding.
The CD-IThree standards
format calls forofvarying
video _ Compact Disc Interactive crocomputer
mode 1 diskswith can aoperate
CD-ROM on drive
CD-I
resolution are supported: normal is not always capable of processing
resolution of 384 × 240 pixels Text encoding can be accomplished the information on a CD-I.
(NTSC) and 384 × 280 pixels (PAL), either as a bit-map process or by The CD-I format has inspired the
double resolution (twice the hori- character encoding. The bit-map development of alternative technol-
zontal resolution) with 768 × 240 process requires 5 bytes per char- ogles. Digitalvideo interactive (DVI)
pixels (NTSC) and 768 x 280 pixels acter, resulting in a maximum of 120 is an all-digital optical disk format
(PAL), and high resolution (twice the million characters per disk. Char- capable of reproducing full-motion,
horizontal-and the vertical resolution) acter-encoded text can use 1 byte per full-screen video, computer-gener-
J. AudioEng.Soc.,Vol.36,No.4,1988April 273
CD FORMATS
ated video graphics, and digital audio standards [24]. the outer diameter, encoded as an
via a CD-ROM drive. A DVI disk is However, because of incompatible analog signal according to the LV
effectively a CD-ROM disk contain- analog television standards, both format. In the NTSC standard the disk
ing DVI-specific data. Its data spec- NTSC and PAL players and disks speed ranges from 2700 to 1815 r/
ification permits motion video to be must be manufactured. When the rain, to yield a scanning velocity of
combined with foreground video ob- audio CD was standardized, design- 11-12 m/s. The track pitch is 1.7
jects, text, dynamic graphics, and ers were free to design universality I_m. In the PAL standard the disk
multitrack audio, all under the user's into the product. Thus any CD disk speed varies from 2250 to 1512 r/
control. Sophisticated data com- may be played on any CD player, min, with a scanning velocity of
pression techniques permit the system Not so with CD-V. TWO primary 9.2-10.2 m/s. The track pitch is also
to utilize disk storage space more ef- world standards exist, NTSC and 1.7 Ixm.
fectively and to output relatively PAL. Because video information is The player must sense the presence
larger amounts of data more quickly, stored and reproduced in analog of either digital audio-only or video
Although data on DVI disks are for- form, a CD-V player must be con- signal and adjust its operating pa-
matted to CD-ROM specifications figured for one standard or another, rameters accordingly. Typically, the
and can be played on a CD-ROM as are CD-V disks. In other words, player's pickup would read the audio
drive (other digital media may be the video portion of a PAL CD-V disk lead-in area, skip to the video por-
used as well), special DVI decoding will be unplayable on an NTSC player tion, then return to the audio-only
technology is also required. The DVI and vice versa. Also, the analog audio portion.
format is thus incompatible with portion of the video program is in-
other interactive CD formats, such compatible because it is integrated _'"_'X
However, in the PAL format the dig- digital filtering. Features encompass from a 3/4-in U-matic machine, syn-
ital audio signal replaces the analog those of both CD and LV players, chronized with time code. The high
FMcarriers. Thus a digital audio CD- such as programmability, picture bandwidth of 10 MHz (versus 1.5
V disk may nbt have analog audio as search, fast/slow motion, stills, and MHz for CD-Audio) necessitates
well. A PAL CD-V player is able to speed increase/decrease. Although it manufacturing procedures closer to
reproduce both digital audio LVs and uses primarily constant linear veloc- those of video disks, as opposed to
existing analog audio LVs. However, ity (CLV) scanning, CD-V also pro- audio CDs. However, CDV-Single
existing LV players will not be able vides for constant angular velocity disks will be manufactured at CD
to reproduce the audio portion of new (CAV) scanning in the 200- and 300- audio plants.
digital audio CD-V disks, mm formats. Other enhancements
In addition to the CD-V combi- call for high-definition (CDV-HD) _x,
DIGITAL AUDIO
SAMPLING An output Iow-pass filter is used to re- QUANTIZATION
I f move these image spectra. Aliasing, a [ 1
form of distortion, occurs when the input
° An analog waveform such as an frequency exceeds the half-sampling As we have seen, sampling is a
acoustic pressure function in air exists rate. When a digital audio system sam- method of taking a measurement pe-
continuouslyin time, over a continuously pies input frequencies greater than the riodically. Sampling represents the time
variableamplituderange.Suchan analog half-sampling frequency, spurious new of the measurement, quantization rep-
function may be discrete-time sampled, frequencies are "recorded" aswell. Spe- resents the value of the measurement,
Moreover,the sample points can be used cifically, if S is the sampling frequency such as the amplitude of an audio wave-
to reconstructthe original analog wave- and F is a frequency higher than half form at sample time. Whereas sampling
form. This digitization of audio forms the the sampling rate, then a new frequency is perfect within its prescribed context,
basis for the encoding and decoding of Ff is created as Ff = S - F. Furthermore, quantization is imperfect. In the same
the audio signals in any digital audio aliasing occurs at multiples of the sam- way that length, an analog quantity, can
format, including the CD. piing frequency. Forexample,all of these be measured (for example, in inches),
The Nyquist theorem states that given alias components would be produced: an audio waveform can be measured
correct bandlimitedconditions, sampling S _ F, 2S _+F, 3S -+ F.... (for example, in volts). But in the same
can be a Iossless process. Just as some Thus the input signal must be band- way that a measurement of length is
relationship is needed between a movie limited with an input Iow-pass filter. It always proneto error,quantizingis prone
camera's shutter rate and the speed of ensures that all frequencies above the to error.
movingobjects to produceusable motion half-samplingpoint are attenuatedbelow An analog waveform may be repre-
pictures, the relationship between the the amplitude resolution threshold of the sented as a series of samples; the am-
sampling frequency and audio frequen- system. To achieve a maximum audio plitude of each will yield a number that
cies must be defined. The Nyquist theo- bandwidth for a given sampling rate, fil- represents the analog value at that in-
rem defines this relationship: if samples ters with a very sharp characteristic, so- stant. By definition, an analog waveform
are taken twice as fast as the highest called brickwall filters, are employed, has an infinite number of amplitude val-
audiofrequency,the completewaveform Disallowed frequency components are ues. However, quantization selects from
can be reconstructed [1]. In other words, not allowed to enter the sampler; thus a finite number of digital values. The
S samples per second are needed to aliasing cannot occur. In other words, selected value will be only an approxi-
represent a waveform with a bandwidth when sampling, one must obey the mation to the actual. Specifically, after
of S/2 Hz. The choice of sampling fie- sampling theorem [2]. sampling, the analog staircase signal is
quency thus determines the frequency In accordance with the sampling rounded to the numerical value closest
response of the digitization system, theorem, the sampling rate must be set to the analog value. The difference be-
Sampling may be considered a mod- to a frequency at least twice that of the tween the actual values of the signal
ulation process. One result of that mod- highest input frequency. In the case of and values after quantization appears
ulation is the creation of a spectrum with the CD, the sampling frequency is 44.1 as error.
sidebands, or image frequencies, cen- kHz (a sample every 22.6757 millionths The number of quantization steps
tered on multiples of the sampling fie- of a second). This permits a flat fre- available is determined by the length of
quency. These sidebands are a con- quency response from 0 to 20 kHz, with the data word in bits. This determines
sequence of sampling and result even a filter guardband to permit complete the resolution of the digitization system.
when the Nyquist theorem is observed, attenuation. For example: 16 bits yield 216 = 65 536
276 J.Audio
Eng.
Soc.,
Vol.
36,No.
4,1988
April
professionalaudio applications. Con- ternatively, written data could be holes in a thin film. Other mecha-
siderable research effort has been destroyed, but not recorded over), nisms include lasers that cause huh-
applied to the development of re- While it may at first be uncomfortable bles or blisters to form in the medium.
cordable optical-disk formats. There to magnetically inclined minds, a Tellurium and tellurium alloys have
is little question that future genera- write-once system is very workable been selected by many manufacturers
tions of the CD standard will provide for many applications. For example, for the thin film because of their low
for both recordable and fully erasable in some applications a permanent data melting points and high sensitivity
CDs. A variety of technologies is trail is precisely what is called for. [29].
being explored. In short, a write-once optical disk In other systems an irreversible
does not necessarily present fatal phase change is used to provide an
important. Indeed, other nonerasable of the medium at the point where the
Write-OnceDisk formats have met with success before; laser strikes. In this way the reading
for example, Erasability
drawbacks. photographyis not
is aalways
write- laser may in
alteration differentiate
the index ofdata. Some
reflectivity
once technology, systems use a thin metallic recording
The simplest recordable optical A write-once optical disk may be layer that changes its physical prop-
systems are write-once. The user may implemented in a variety of ways. erty from crystalline to amorphous
record data only once, permanently, In some systems a mechanism called when it is thermally heated (to ap-
until the disk capacity is filled. (Al- ablation uses a laser writer to burn proximately 170°C) by the writing
-ch .....
PRIMER
steps (from 0000000000000000 to comparable to, or better than, the best ers, output sample-and-hold circuits,
1111111111111111 ). The more bits the analog systems, output (anti-imaging) Iow-pass filters, and
better, because the magnitude of the output amplifiers [6]. Alternatively, a
error is smaller. But there will always be digitalfilter may precededigital-to-analog
a marginof error, conversion. TheoutputsideofthePCM
Rarely the selectedstep will be exactly A PULSE-CODE MODULATION system forms the basis for the design.
at the analog value; usually it will not be SYSTEM of a CD player.
quite exact. At worst, the input analog [ I
level will be one half step away, that is,
there will be a maximum error of one Modulation is the process of encod-
half the least significant bit of the quan- lng source information prior to trans- REFERENCES
tization word. Interms of decibels, every mission or storage. In pulse-code mod- [ I
added bit yields about a 6-dB increase ulation (PCM) the amplitude of the digital
in the system's signal-to-error (S/E) ratio waveform encodes binary digits directly. [1] M. Mathews, The Technologyof
because it reduces the quantization error Because of its relatively efficient use of Computer Music (MIT Press, Cam-
by a factor of 2. The S/E relationship bandwidth, its compatibility with off- bridge, MA, 1969).
can be expressed in terms of word length the-shelf circuitry, and its ability to be [2] B. Blesser, "Elementary and Basic
as S/E (dB) = 6.02n + 1.76, where n easily multiplexed,PCM has proveditself Aspects of Digital Audio," DigitalAudio,
is the number of bits [3]. Using the for- to be an expedient means of represent- CollectedPapersfromtheAESPremiere
mula, a 16-bit system is seen to yield lng audio data for recording or repro- Conference(Rye, NY, 1982 June), pp.
an S/E ratio of about 98 dB. duction. When it comes to storage on 5-11.
Dither can help suppress the effect of magnetictape or optical disk,otherforms [3] B. Blesser, "Digitization of Audio:
quantization error. Dither is a Iow-am- of modulation may be more effective. A Comprehensive Examination of The-
plitude analog white noise added to the There is one consistent drawback to ory, Implementation, and Current Prac-
input analog signal. (Othertypes of dither PCM; like many other digital coding tice," J. Audio Eng. Soc., vol. 25, pp.
have been researched.) It is important schemes,it requiresa much wider band-
that this small amount of noise removes widththan the corresponding analogsig- 739-771 (1978 Oct.).
[4] J. Vanderkooy and S. P. Lipshitz,
the quantization artifacts from a sig- nal [5].
nal--it does not mask the artifacts, it The architecture of a fixed, linear PCM "Resolution Below the Least Significant
Bit in Digital Audio Systems with Dither,"
removesthem [4]. digitization system is comprised of a re-
Sampling and quantization are the two cording section of input amplifiers, a J. Audio Eng. Soc., vol. 32, pp. 106-
fundamental design problems for a dig- dither generator, input (anti-aliasing) Iow- 113 (1984 Mar.).
itization system. Sample rate determines pass filters, sample-and-hold circuits, [5] J. Strawn, Ed., DigitalAudio En-
bandlimiting and thus frequency re- analog-to-digital converters, a multi- gineering:An Anthology(William Kauf-
sponse, and word length determines plexer, digital processing circuits for error mann, Los Altos, CA, 1985).
signal-to-error ratio. Although bandlimited correction and modulation, and a storage [6] S. Yoshikawa, "An Introduction to
sampling is a Iossless process, quan- medium such as digital tape. On the Digital Audio," Proc. AES 3rd Interna-
tizing is one of approximation. In general, playback side are processing circuits for tional Conference, Present and Future
a sampling frequency of 44.1 or 48 kHz demodulation and error correction, a of Digital Audio (1985 June), pp. 14-
and a word length of 16 bits yield fidelity demultiplexer, digital-to-analog convert- 23.
Magnetooptical optics a magnetic field is used to re- ware address sectoring, and uses a
Recording cord data magnetically, but it is grooved disk in which address in-
weaker
area thanthan conventional
necessary. With recording
magneto- formation is physically
dressing. The method isformed
called in the
hard-
fields. By itself, it is too weak to groove and detected by light-beam
For some applications the advan- reorient the magnetic particles, reflection. Using this system, any
rages of write-once permanence are However, a unique property of mag- magnet()optical player will automat-
negated by the need to erase and re- netic materials is utilized: as they are ically track both address and data in-
record. For example, full erasability heated, their coercivity--the field formation contained on any disk. By
would be preferable for a multitrack strength required to bring a saturated superimposing the hardware ad-
278 J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April
dressing information with the re-
corded data signal, playing time is
not sacrificed.
A magnetooptical disk would re-
tain the protective properties of other
MAGNETIC optical media. The recording layer
PARTICLES/ would be sandwiched between a
, _ transparent substrate and a protective
_ _ _ __ _ { { t _fi _ _ _ _ _ _ _ _ _ _ _ _ t _ _ _/ layer. The laser light would shine
x, through the substrate, placing surface
I
MAGNETIC
_ FIELD
(REVERSED)
!
ERASE
LASER The CD standardincorporatesover
50 years of theoretical and practical
(c) work in the digital audio field.
Moreover, it is the first widely avail-
able consumer digital audio product.
Fig. 19.Magnetoopticaldisk.(a) Writing.(b) Reading.(e) Erasing. Its success has sparked numerous
evolutionary steps in optical storage
J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April 279
CD FORMATS
technology. First,therewasCDfor [ ]
digital audio (CD-Audio), a robust,
:i;
Diffraction Grating
............... Opticalcomponent usedin three-
beam pickups to split the laser beam
Byte Clock into secondary beams; to be used for
Group of bits operating together, Timing device that generates the tracking.
Usually 8 bits. basic periodic signal used as a source
of synchronizing signals in digital Digital
CAV (Constant Angular equipment. Crystal control ensures Use of numbers (typically binary)
Velocity) precision, to represent information.
Disk rotating at a consant number
of revolutions per second. The LP is CLV (Constant Linear Velocity) Digital Audio
a CAV system at 33V3 r/min. Disk rotating at a varying number Use of sampling and quantization
of revolutions per second to maintain techniques to store or transmit audio
CD-Audio (Compact Disc Digital a constant relative velocity between information.
Audio System) pickup and track across the disk ra-
System storing of digital audio and dius. The CD is a CLV system, ro- Digital Filter
subcode information on a 120-mm- tating at 200-500 r/min.
diameter optical disk. Any filter characteristic accom-
plished in the digital domain. Digital
Compact Disc filters may use oversampling to yield
CD-I (Compact Disc Interactive) 120-mm-diameter disk holding the characteristic needed for the anti-
System storing digital audio, audio program or other nonaudio aliasing and anti-imaging filters.
video, text, and graphics information data. The disk is made of plastic, However, digital filtering does not
interactively, with user control over with a top metalized layer, and read introduce phase distortion common
content and presentation, on a 120- by reflected laser light, to brickwall analog filters.
mm-diameter optical disk.
CD-ROM (Compact Disc Read- CRCC (Cyclic Redundancy Digital Sum Value
Only Memory) Check Code) Value representing the difference
Method of storing digitally coded Error-detection code that is re- between the lengths of the pits and
information, usually computer or corded with the audio information at fiats comprising the bit stream; min-
database related, on a 120-mm-di- a specified interval; code data derived imized to suppress low-frequency
ameter optical disk. from the audio data. content of signal.
Highest audio frequency that may as bumps by the reading laser, they Sampling Rate
be accurately sampled. The Nyquist diffract the light, decreasing the in- Frequency that an analog signal is
frequency is one half the sampling tensity of light returned to the pickup, sampled; expressed in hertz. The
frequency; for a CD system it is 22.05 sample rate used on CDs is 44 100
kHz. Player samples per second (44.1 kHz). Since
Mechanical and electrical device the CD reproduces stereo sound,
Objective Lens used to recover data from a disk. 88 200 samples are stored for each
Optical component used in laser second of sound on the disk.
pickups to focus laser light on the Polycarbonate
disk data surface. Transparent plastic material used Sampling Theorem
to form the substrate for CDs. Theorem stating that a bandlimited
One-Beam Tracking continuous waveform may be exactly
Optical pickup design in which a Premastering represented by a series of discrete
single laser spot is used for data re- Process of integration of digital samples. The sampling frequency
covery, tracking, and focusing, data code, error correction, and sub- must be at least twice the highest
codes, and of conversion to the CD frequency contained in the wave-
Optical Pickup data format prior to disk mastering, form.
Optical device using laser light to
convey data from a disk. Quantization SCSI (Small Computer Standard
Process of converting an infinitely Interface)
Oversampling variable amplitude of an analog Standard 8-bit parallel interface
Digital technique in which samples waveform to a finite series of discrete used to connect peripherals such as
are interpolated, typically by 2 or 4 levels expressed as binary words, a CD-ROM player to a microcom-
timesrate, to accomplishdigitalill- purer.
tering of the signal, prior to D/A performed by A/D converter.
conversion. A gentle analog anti- Serial
imaging filter is still required after Quantization Error Arrangement of data in linear se-
oversampling to remove extreme Error resulting from quantizing an quence; transmission of data through
supersonic frequencies, analog waveform to a discrete level; a communications line.
the longer the word length, the less
PAM (Pulse-Amplitude the error. Servo Mechanism
Modulation) Control device or system that reads
Method in which the amplitude of Random-Access Memory (RAM) its own output to determine the de-
pulses in a pulse train represents the Memory in which data may be read gree of further output as compared
analog information, and written regardless of address, to its input.
THE AUTHOR