You are on page 1of 30

Audio information places great The Compact Disc digital audio of unprecedented capacity, durabil-

demands on a digital storage medium, system (CD-Audio) format was de- ity, and flexibility. Thus the CD is
A 60-min musical selection, recorded veloped to meet these demands. A highly suitable for many nonaudio
in stereo with pulse-code modulation CD-Audio disk holds a specification- applications. As a result, a number
· (PCM), with a sampling rate of 44.1 defined maximum of 74 min of high- of alternative CD formats have been
kHz and 16-bit quantization, gen- fidelity music on a highly robust and developed. For example, a CD-ROM
erates over 5 billion bits. Error cor- economically manufactured disk. A disk may store over 550 Mbytes of
rection, synchronization, and mod- Compact Disc (CD) player provides software or database information, and
ulation are needed for successful access to any part of the audio pro- a CD-I disk may interactively present
storage; total capacity required is gram within a second or less. with several hours of music, along with
over 15 billion bits. In addition, a many user-convenience features. The video and text information. The CD
commercially popular music storage format is well suited to audio's stor- is only the first of many optical disk
medium demands random access, age demands [1]. storage technologies for both the
small size, convenience of use, ro- The magnitude of the task of digital consumer and the audio professional.
bustness, low cost, and ease of rep- audio storage and the magnitude of This article informally reviews the '
lication. Clearly, digital audio's re- the achievement of the CD have re- technical specifications of the CD
quirements are considerable, sulted in an optical storage medium format, in terms of both physical and

250 ,.. J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April
data specifications. Also presented ameter 15 mm, and the thickness 1.2 surface is covered with a very thin
is the encoding method used to pre- mm. The innermost diameter does (50-i00-nm) metal layer, such as
pare audio data for disk storage on not hold data; it provides a clamping aluminum, silver, or gold, and an-
the CD-Audio format. In addition, area for the player to hold the disk other thin (10-30-1xm) plastic pro-
the developing family of CD appli- to the spindle motor shaft. Data are tection layer, with the identifying
cations, including CD-ROM, CD-I, recorded on a 35.5-mm-wide area. label (5 Ixm)printed on top, as shown
CD-V, and recordable CD formats, A lead-in area rings the innermost in Fig. 3. A laser beam is used to
are described [2]. data area, and a lead-out area rings read the data. It is applied from below
the outermost area. They contain and passes through the transparent
COMPACT DISC nonaudio data, which are used to substrate and back again. The beam
[ _ O) )) PHYSICAL control the player's optical pickup, is focused on the metalized pit sur-
_j SPECIFICATION polycarbonate)
A transparentsubstrate
plastic forms
(typically
most data
face, on
embeddedinsidethe
a d4sk are read disk.
by a Since
light
of a disk's i.2-mm thickness. Data beam, playing a CD causes no more
The physical characteristics of the are physically contained in pits that wear to the recording than reading
CD are shown in Fig. 1. The disk are impressed along the disk's top causes to the words printed on this
diameter is 120 mm, the hole di- surface, as shown in Fig. 2. The pit page.

J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April 251


CD FORMATS

= lzumm I and adjusts the disk speed to keep


117mm the data rate constant.'
al j

116mm Although the CLV of any particular


' · CDis fixed,theCLVsusedondif-
= 50mm = ferent disks can range from 1.2 to
46mm LEAD
_n 1.4 m/s. In general, disks with play-
1.2mm PROTECTIVE 11 5mm I LEAD
oux ing times of less than 60 min are re-
_
LAYER \ corded at 1.4 m/s, while disks with
I-T............ I I ....... longer,playingtimesuseCLVsas low

T / Z_ CLAMPINGARE4]= 26mm _ as 1.2 m/s. The CD ptayer regulates


TRANSPAREI_iTLASERBEAM ic 33mm INFORMATION
(PIT) the disk
cally rotational
to maintain speed automati-
a constant bit rate
SUBSTRATE
DIS_(N= 1.55) of 4.32180 MHz.
Fig. 1. Physical specifications of the CD. (Courtesy of Sony Corporation of
America.)

between the disk and the pickup. The fact that the disk data surface
Compact Disc Pit Track To accomplish this, the rotation is physically separated from the
_._ formrelative
speed of a CDvelocityis
varies depending
maintained
on reading side of the substrate provides
the radial position of the pickup. Be- one of the most significant assets of
In the microscopic world on the cause each outer-track revolution the CD system. Dust and surface
data surface, the disk contains a contains more pits than each inner- damage do not lie in the focal plane
track of pits arranged in a contin- track revolution, the disk must be of the reading laser beam and hence
uous spiral running from the inner slowed down as it plays outward, to their effect is minimized. Specifi-
circumference to the outer. By maintain a constant data rate. The cally, any obstruction less than 0.5
starting from the inside, future disk rotates at a speed of 500 r/min mm becomes insignificant and causes
adoption of smaller and larger di- when the pickup is reading the inner no error in the readout.
ameter disks is facilitated, circumference, and as the pickup The polycarbonate substrate has a
The pits are amazingly small, moves outward, the rotatiorial speed refractive index of 1.55 (as opposed
considerably smaller than we bulky gradually decreases to 200 r/min, to 1.0 for air); the velocity of light
humans can appreciate. For example, Thus a constant linear velocity is slows from 3 x 105 to 1.9 x 105 km/
a pit is about 0.5 Ixm wide, or about maintained. In other words, all of s. When the velocity of light slows,
700 times smaller than a pin prick, the pits are read at the same speed, the beam is bent. Because of the
The track pitch, the distance between regardless of the circumference of bending from the refractive index,
successive tracks, is 1.6 ixm, as thatpart of the spiral. Thisis accom- the thickness of the substrate, and
shown in Fig. 4. There is a defined plished through a CLV servo system, the numerical aperture of the lens of
maximum of 20 188 revolutions The player constantly reads frame the laser pickup, the size of the laser
across the disk's signal surface of synchronization words from the data beam decreases from approximately
35.5 mm. Depending on the playing
time, a pit track might contain 3 bil-
lion pits. If unspiraled, this-track
would stretch about 3 miles (5 km).
The construction of the CD is dif-
fraction limited, that is, the dimen-
sions are as small as· permitted by \

the wave nature of hght. The track


pitch acts as a diffraction grating,
producing a rainbow of colors. In
fact, CD pits are among the smallest.,
manufactured formations [3].
Examination of a pit track would
reveal that the linear dimensions of
a given track, are the same at the be-
ginning of it_ spiral as at the end.
This means that an audio CD must
rotate with constant linear velocity
(CLV), a condition in which a uni- Fig. 2. CD pit surface. (Courtesy of University of Miami.)

252
tween pits, virtually all of its light
CD is reflected, and when it strikes a
120 mm pit, virtually all of its light is can-

flected.
i celed--hencevirtuallynoneis re-
In practice, the laser spot is larger
15 mm than required for complete cancel-
000x MAGNiFiCATiON lation between pit and land reflec-
tions, and pits are made slightly
shallower than the theoretical figure
5 _rn of one-quarter wavelength; this yields
a better tracking signal, among other
things. Typically the presence of a
bump reduces reflective power by
about 25%. In any case, the presence
of pits and land is thus read by the
laser beam; specifically, the disk
10-30 _m
surface modulates the intensity of the
light beam. Thus the data physically
encoded on the disk can be recovered
by the laser and later converted to an
METALIZEDPIT electric signal.
SURFACE

0.05-0.1 _m _ COMPACT DISC DATA

Fig. 3. Scaledrawingof CD datasurface. _ ENCODING

0.8 mm on the substrate surface to beam to interfere destructively with The music on a CD is the end result
approximately 1.7 _m at the pit sur- one another and to cancel each other, of an elaborate data transformation
face. The laser beam is thus focused Optically, if the CD pit surface is that takes place during master en-
to a point slightly larger than a pit considered a two-dimensional re- coding, and that undergoes decoding
width, flective grating, the focused laser each time the disk is played. The
The top of the pit surface is met- beam diffracts into higher orders, complete encoding process, from
alized. This reflective surface, known resulting in interference. In theory, sampling to data bits on the disk, is
as land, causes almost 90% of the when the beam strikes an area be- shown in Fig. 5. The premastered
laser light to be reflected back into
the pickup. When viewed from the 1.____,
1
perspective of the laser underside,
the pits appear as bumps. The height
of each bump is between 0.11 and -o.5tm_..,

0.13 I_m.than
smaller Thisa dimension is slightly
carefully chosen di- _'"' _ _) I
mension:the laserbeamwavelength | III DISK

790 nm). Inside the polycarbonate


in air is 780 nm (some players use .._ _ .J,. _1 ROTATION

about 500 nm. The height of the (0.833-3.054_rn)


bumps
substrate,is thus approximately
the laser wavelength oneis 3 T-11 T T-- __..___ PIT
quarter of the laser wavelength in the
/ \ II I
III /
/ -"1'_
/+
FOCUSED
LASERBEAM

Thus light striking land travels


substrate. -2p.m
a distanceof one-halfwavelength ! _
(1/4 + l/4) farther than light striking
a bump. This creates a phase differ-
ence of one-half wavelength between
the part of the beam reflected from
the bump and the part reflected from Fig.4. Dataare readthroughan intensity-modulated
laserbeam.(Courtesyof
the surrounding land. The phase dif- Sony Corporationof America.)
ference causes the two parts of the

J. Audio Eng. Soc., VoL 36, No. 4, 1988 April 253


--.--,1_. t
1 FRAME
4

' 1
I .J L.. I I I i SAMPL,NG
PE.,ODS
j
s

f '"..

110111001
I I
01110010 I 10001111 0101001011!32BITS PER SAMPLING PERIOD
I I_SYMBOLS OF 8
I I I ! BITS

C&D' , ', '1 C&D

DATABITS B2 . +8PARITY
33 DATASYMBOLS

",, t ,' , ,_ 3 CHANNEL


17 (MERGING)BITS

Ba '[ 10000100100010 xx:( 00100100100100 xxx J PER SYMBOL

·. _ .- 33 x 17 CHANNEL BITS
·. / . - + 27 SYNC BITS
. I -' ' 588 CHANNEL BITS

CHANNEL BITS B

_ lO00010010001000000_OOmO_OOmO00_ I I mOOOOOOOOO_OOOOOOOOOO_O×xx
I
SYNC

2xT_

Dd I.=..=1 ! r='"'""t.-J"_l I'=' J i

Fig. 5. CD encoding process from input samples to channel bits. (Courtesy of Philips Technical Review.)

data are typically stored on a U-matic data. Six 32-bit PCM audio sampling about one error for every 0.1 to 1.0
video cassette via a digital audio periods (alternating from 16-bit left million bits. This is impressivestor-
processor with a 44.1-kHz sam- and right channels) are grouped in a age capability, but considering that
pling rate and 16-bit linear quanti- frame, left channel preceding right, a disk will output over 4 million bits
zation. Once the audio data adhere Each 32-bit sampling period is di- per second, the need for error cor-
to these basic specifications, they are vided to yield four 8-bit audio sym- rection is obvious. With error cor-
ready to be encoded to the CD for- bols. Considerable signal processing rection, over 200 errors per second
mat. follows topreparethe audiodata for can be corrected. To achieve such
CD encoding is the process of storage on the disk surface. In par- results, the CD employs the corner-
placing audio data in a format suitable ticular, error-correction encoding stones of error correction, interleav-
for storage on the disk. A frame must be accomplished, lng to distribute errors and parity to
structureprovidesa meanstodistin- correctthem[5].
guishthedatatypes.Theinformation The algorithm used for error
contained in a CD frame prior to Compact Disc Error- correction in the CD system is the
modulation includes a 27-chafinel- Correction Encoding cross-interleaveReed-Solomoncode
bit synchronization word, an 8-bit (CIRC). The CIRC algorithm uses
subcode(labeledC andD control twocorrectioncodesfor additional
and display in Fig. 5), 192 data The raw error rate from a CD. that correcting capability, and three in-
bits, and 64 parity bits [4]. is, the rate before error correction is terleaving stages to encode data be-
Encoding begins with the audio applied, is around 10-s to 10-6, or fore they are placed on a disk and to
256 J, Audio Eng. Soc., Vol. 36, No. 4, 1988 April
decode the data during playback, sures in one word. An erasure is a
Thanks to cross-interleaving, the symbol that has been negated by the
separation of two error-correction decoders because detection has as-
codes by an interleaving stage, one certained that its value is unreliable.
Reed-Solomon code can check the The parity symbols are placed in the
accuracy of the other code. The center of the block to permit inter- Following CIRC encoding, an 8-
Reed-Solomon codes used in CIRC polation in the event of large burst bit subcode symbol is added to each
are particularly well suited for the errors, frame. The eight subcode bits are
CD system because their decoding Cross-interleaving follows the C2 designated as P, Q, R, S, T, U, V,
requirements are relatively simple encoder. The 28 symbols are dela3)ed and W. Only the P or Q bits are re-
[6], [7]. by differing periods to store the data quired in the audio format. The CD
The complete CIRC encoding in 28 different words. The C1 error- player collects subcode symbdls from
scheme is shown in Fig. 6. With this correction encoder accepts 28 sym- 98 consecutive frames to form a sub-
encoding algorithm, data from the bols (from 28 different C2 words) code block, with eight 98-bit words.
audio signal are cross-interleaved, and produces four symbols of P par- Thus the eight subcode bits (P
and two encoding stages generate 8 ity. P parity is used to correct single through W) are used as eight different
bits of parity. Error-correction en- symbol errors and to detect and flag channels, with each frame containing
coding begins with the first stage of double and triple errors for Q cor- one P bit, one Q bit, one R bit, and
interleaving, designed to assist in- rection, so on. A subcode block is complete
terpolation. A word of 24 8-bit sym- A final interleave stage introduces with its own synchronization word,
bols is applied to the CIRC encoder, a fixed delay of one symbol to alter- instruction and data, commands, and
A delay of two symbols is placed be- nate symbols. This delay causes data parity. The start of each subcode
tween even and odd samples; this fa- in the output word to be spread over block is denoted by the presence of
cilities interpolation. In addition, the two words, it prevents small random So and S_ synchronization patterns
symbols are scrambled; this facili- errors from destroying more than one in the first symbol positions of two
tares concealment when errors are symbol, even if two adjacent symbols successive blocks. So and Si are two
beyond the capabilities of the system, in one word are destroyed. Finally patterns not used in the EFM code
The C2 error-correction encoder the P and Q parity symbols are in- (explained below), and thus can be
produces four symbols of Q parity, verted to provide nonzero P and Q uniquely identified. The subcode
Q parity is designed to correct one symbols with zero data. 32 8-bit block rate is 75 Hz.
erroneous symbol or up to four era- symbols leave the CIRC encoder. The P and Q subcode bits contain

C2
SCRAMBLER C2 INTERLEAVER C1SCRAMBLER
ENCODER C1ENCODER
W1 1
W2 2
Wa 3
W4 _ 4
W5 5
- 6

13 Ik

1 framo ,_O.....-.._. 14 r-'--_ To EFM


. 15 I !_ encoder
16

W21 25
! W22 _ 26
]_W23 27
· W24 _ __ _ 28
D= 4 28

29
30
31
32

Fig. 6. CIRCencodingemploysthreeinterleavingstagesand two correctionencodingalgorithms.


CD FORMATS

information essential to accessing suppression method. Digital copy start of the lead-out area. Time and
audio data on the disk. The P channel prohibited (yes/no) is indicated, absolute time are expressed in mm-
contains a flag bit. It designates the The address information consists utes, seconds, and frames (75 frames
start of a track, as well as the lead- of 4 bits designating the three modes per second). Using indexing, up to
in and lead-out areas on a disk. Lead- for the Q data bits. Primarily mode 99 locations within tracks can be
in and lead-out signals tel1 the player I contains the number and start times marked.
where the music program begins and of tracks, mode 2 contains a catalog Mode 2 contains the catalog num-
ends. A lead-in signal consists of all number, and mode 3 contains other ber of the disk using the universal
Os appearing just prior to the begin- product codes Mode 1 stores infor- product code (UPC). Mode 2 also
ning of the music data. A start flag mation in the disk lead-inarea, pro- continues absolute time count from
appears just prior to the start of mu- gram area, and lead-out area; the data adjacent subcode blocks. Mode 3 is
sic. During the last music track, pre- format in the lead-in area differs from used to give the international standard
ceding the lead-out, a start flag ap- those in the other areas. Mode 1 in- recording code (ISRC) for identifying
pears again. That flag gives the start formation occupies at least nine out each music track. It is present only
of lead-out and the P channel remains of ten successive subcode blocks, in the program area. The ISRC num-
ar 0 for 2 to 3 seconds. Following Mode 1 lead-in information is con- bet gives the country code, the owner
that time, a P signal consisting of tained in the table of contents (TOC). code, the year of recording, and a
alternating ls and Os (at a 2-Hz rate) The TOC stores data indicating the serial number. Mode 3 also continues
appears. These various sets of iden- number of music selections (up to absolute time count. Modes 2 and 3
tity signals can be used by aba- 99) as a track number, and the starting occupy at least one out of 100 suc-
sic player to control the optical points of the tracks in disk running cessive subcode blocks and can be
pickup, time. The TOC is repeated continu- present only in the program area.
The Q channel contains four basic ously in the lead-in area, and the Modes 2 and 3 can be deleted from
kinds of information: control, ad- starting point data are repeated in the subcode if not needed.
dress. Q data, and cyclic redun- three successive subcodeblocks. The Every audio CD contains addi-
dancy check code (CRCC). Each TOCis read during disk initialization, tional data capacity in the guise of
subcode block contains 72 bits of before the disk begins playing, the subcode bits. The other six bits
Q data, as described below, and 16 In the program and lead-out areas, (R, S, T, U, V, and W), which ac-
bits for CRCC, used for error de- mode 1 contains track numbers, in- count for about 20 Mbytes of 8-bit
tection on the control, address, and dices (subdivision numbers) within storage, are available for video or
Q data information. The control in- a track, time within a track, and ab- other information.
formation flag bits handle several solute time. Time is set to zero at
player functions. The number of the beginning of each track and in- __

dicated. This distinguishes between the beginning of a pause, time de- Encoding
aaudio
two- channels
and a four-channel
(two or four)CDis re-
in- creases, ending
creases to withofzero
the end the at the end
track. At (_ Compact Disc EFM
cording. Preemphasis (on/off) is of the pause. The absolute time is
indicated. A CD track may be en- set to zero at the beginning of the After the audio, parity, and sub-
coded with preemphasis, a noise- program area and increases to the code data are assembled, the bit

16-BITSAMPLE

SAMPLE i SYMBOL 1 -- SYMBOL 2 t


2SYMBOLS s 1 ,_
oi ......... dll el ......... 4

EXAMPLE 1olol,Iot_l _l,Iq olololol, Ioi_'1,!


8-TO-14
<2>
CHANNEL
BITS 10101_10101010I
10l i_10101_ ' 01otol_loI0J_[O10[0101olbl
i

MERGING+cHANNELBITSBiTS
IsITs [ I elis i SITS [

/,
F'l. I.lotol 1Iolololol _Iolol _1ololololo Io
EFM
S,*NALJ F"-"I t' L_._J 1
EFM
Fig.7. Assignmentof EFMwordsand additionof mergingbits.

258 J.AudioEng.Soc.,Vol.36,No.4, 1988April


CD FORMATS

stream is modulated using eight-to- succeeding patterns) is added to each is thus a compromise between several
fourteen modulation (EFM), as code pattern to aid in clock synchro- conditions. Because only 256 pat-
shown in Fig. 7. Blocks of 8 data nization and suppress the signal's terns are needed, 11 of the 267 pat-
bits are translated into blocks of 14 low-frequency component. The latter terns are discarded. (Two of them
channel bits, assigning an arbitrary is accomplished by selecting merging are used for subcode synchronization
and unambiguous word of 14 bits to bits that maintain the signal's average words So and Sp)
each 8-bit word. By choosing select digital sum value at zero [9]. The The resultant NRZ channel stream
14-bit words with a large distance ratio of bits before and after modu- produces pits and lands that are at
between transitions, greater data lation is 8:17. During demodulation, least 3 channel bits (3T), and no more
density can be achieved. It would be only 14 bits will be processed; the than 11 channel bits (1 IT) long, as
inefficient to store the 8-bit symbols three merging bits are discarded, shown in Fig. 8. It is the combination
directly on the disk. In addition, 8- The 8 data bits require 2s, or 256, of these varying dimensions that
bit symbols have many similar pat- different code patterns. However the physically encodes the data. The se-
terns. For example, if I bit was 14-bit channelword can offer 16 384 lection of EFM bit patterns defines
wrong, 00001011 would be confused combinations. To achieve pits of con- the physical relationship of the pit
with 10001011. With 14-bit words, trolled length, only those combina- dimensions. The pits and the inter-
more unique patterns can be selected, tions are selected in which more than vening reflective land on the CD sur-
EFM thus provides a kind of error two but fewer than ten Os appear face do not directly designate ls and
protection [8]. continuously. In addition, unique Os. Rather, each pit edge, whether
Blocks of 14 bits are linked by patterns are sought. Only 267 corn- leading or trailing, is a 1 and all in-
three merging bits; at least two binations satisfy these criteria. With crements in between, whether inside
merging bits (always Os) are required these patterns, the minimum inver- or outside a pit, are Os, as shown in
to prevent the possibility of succes- sion width of the nonreturn to zero Fig. 9. As discussed, this is a much
sive Is between serial words (a vi- (NRZ) signal is limited by the re- more efficient storage technique than
olation of the coding scheme). The cording density, and the maximum coding the binary bits directly with
additional merging bit (either a 1 or inversion width is limited by the pits.
a 0, depending on the preceding and clock bit extraction. The conversion With EFM there are more bits to

asa
For the pas;tfew years, audio
* professionals
' capabilities,it'simprovedin sonicqualityas well.
have been pmisingYamah_ REV7 digital reverb to We added more DSP chips to boost the
the skies So there was incrediblepressure to make REV5§processing power,creating smoother reverb
its successor even better than expected, sounds and multi-effect combinations Fullband-
Introducingthe REVS.Representinga break- width extends reverb to 20 KHz.
through in the sound barrier forreverb.And a In additionto master analogEQ,the REV5
collectivesigh ofsatisfactionfrom the overachieving has three-band parametric, programmable digital
designengineersat Yamaha. EQ.Sowhen youmake individualEQsettings,
Becausenot onlyisthe REV5matured in they'rerecalledwith each program.
accommodate, but with modulation
the highest frequency in the output
3T '_'"x_':-_ signalis decreased;thereforea lower
track velocity can be utilized. Hence
longer playing time is achieved. This
4 T '_ .... ":' is a very efficient encoding method
because the number of bits trans-
ST tr'
'__::.:_
"'_.....'....
'_"'
_.
....
"' _'-' '" mitted divided by the minimum
W"'"'"
'":_' ';'_"'_:':':':":"-:'_::':_'_ length between transitions needed on
6T ...............
""_"
"'_'___'".:_'!
:_'"
--" "' the medium to convey them is high.

7X :.:'._3-::¥_:!.-"
-' *' ' '' '__:.
C-"______}
"___'"
........... "--'
''
'.
' '-.
".._..
' ' Compact Disc Frame
Assembly
';:':...?_..'3.
_';'
'...: '_::3_'.:
_'-'
"'_
' '_-
'""
' '..
';"'
:-· "'-.
':;-:_.:::::'_;_:'.

The individual frames in the re-


9T C"'";__'
'___:_i_'4-3_.8.:.:;3_::::_:-_'._}':.
""'"'
....
' · :) suiting EFM bit stream must be de-
lineated. Thus a synchronization
eT ''"_-............
:_:.
'___..
"'_' "__--''' · -'..-.-
'_''"_" '".'.......-
·' ' .....
' · ' ............
· :.,......
' .._.:.'.{!:'.3 pattern is added prior to each frame.
The synchronization word is uniquely
-,.-_-__....-..'
-_._-
'___..:'
''_.''"' · .._'' ' -'..... ·· ' · ' ·· , - . '.:'_:.:.. identifiable from any other possible
ll T ('___'
"'_____:;.:::':_"""'
"'"'
'''""
"-"'"'
'"':''"'' ' ' ' ''"' ' ' '_-;'.'f0
'_'
:_":' data configuration. Specifically the

Fig.8. Pit(and land)length;createdby EFM rangefrom 3Tto 11I Minimum 24-bit synchronization word is
pit length_0.833-0.972 ,,m; maximumpit length_3.054-3.56 t_m(1.2-1.4 100000000001000000000010 plus 3
m/s). merging bits.
The total number of channel bits

with a sonic boom.


Andwhen youdon'thave the time to make it, youmay find it _cult to come back down to
many decisions,there are 30 preset programs,plus earth. YamahaMusicCorporation,Professional
nine unique preset combinationprograms.Sixty AudioDivision,P.O.Box6600, Buena Park,CA
user-memoryslotslet yousave yourcustom effects 90622.In Canada,YamahaCanada MusicLtd., 135
And even though a lot of the features are new, Milner Avenue,Scarborough,Ontario MIS 3R1.
usingthe REV5won'tbe. Because the format isthe

sameCheck
asthe REV7yotfreused
with your friendlyYamaha
to using. Professional OYAMAHA ®
Audiodealerabout thenew REV5. Once you hear Engineering
Imagination TM
CD FORMATS

4T 10T 4T 4T 4T 7T 5T 4T

I
I I I I I I I I I
I I I_1_1 J 1 J_l
I I I I I I I I I
01 000 1 000000000 1 0001 000 1 000 1 0 0 0 00 0 1 0000 1 000 1 0

Fig. 9. Data are represented by pit/land transitions.

per frame after encoding is 588, response to trackinginformationand


comprising 24 synchronization bits, user random-access programming.
336 (12 x 2 x 14)data bits, 112 Furthermore,the pickupmustmain-
(4 x 2 x 14) error-correction bits, tain focusing and tracking even under
14 subcode bits, and 102 (34 x 3) adverse playing conditions, such as a
merging bits, When the data manip- The optical pickup must focus on, dirty disk or impact and vibration [10].
ulation is completed, the original track, and read the data spiral. The To achieve sharp focus on the data
audio bit rate of l .41 million bits per entire lens assembly, a combination surface, and to achieve intensity
second has been augmented to 4.32180 of the laser source and the reader, modulation from the pit depth prop-
million bits per second, must be small enough to glide lat- erly, it is necessary to use a laser as
erally beneath the disk, moving in the light source. CD pickups use an

k,,_ COMPACT
PLAYER DISC
DESIGN [
/ / 1

A CD player's architecture may be D,SKP,CKUP-2 SERVO _ J §.O_O_BNC%0E


considered as five functional circuit AND TRANSPORT'_"-_
I CONTROL I DECODING
elements, each working in close co- I / /
operation with the other: optical
readout, servo system, spindle motor,
control and: display, and decoding
system. The data path directs the
modulated light from the pickup
through a series of processing cir-

cuits,ultimately
yieldinga stereo
analogsignal. Thedata pathtypically .
consists of elements such as data
separator, deinterleaving RAM, error
detection, correction,

digital-to-analog
and conceal-
ment circuits, oversampling filters,
converters,and
HFSIGNAL
INPUT
FROM DISK

4 SIT DETECTION
EFM
DEMODULATION

H BUFFER

analogoutputfilters.Theservo,the i I
control, and the display system must CLOCK I I SYNCHRONIZATION
I

the disk,
direct the including
mechanical spindle
operation drive,of REGE
NERATION
1'_ TDETECGTION
AND I
autotracking,andautofocus,and I I,I ]
handle user interface with the play-
er's controls and displays. An op- CRYSTAL
erational block diagram of the data Fig. 10. Architecture of CD player, with oversampling filters.
path is shown in Fig. 10.

262 J. Audio Eng. Soc., VoL 36, No. 4, 1988 April


A1GaAs semiconductor laser irra- returns through the objective lens and from the pit track, interference cre-
diating a coherent-phase laser beam the collimator lens to pass through ates asymmetry in the beam, which
with a 780-nm wavelength (some the semitransparent mirror, and results in an intensity difference be-
manufacturers use 790 nm). strikes a wedge lens. It is then split tween the two split beams. Either
Two types of pickups commonly into two beams, adjusted to strike an photodiode pair A/B or C/D generates
employed are one-beam and three- array of four horizontally arranged a greater signal from the increased
beam designs. The optical compo- photodiodes. The outputs of all the laser light. The difference between
nents of a one-beam design are shown photodiodes are summed to provide the pairs (A + B) - (C + D) produces
in Fig. 11, along with the photodiode the data signal (A + B + C + D), an error signal that is used to maintain
array used to generate tracking and which is demodulated to yield audio the pickup on track.
focusing signals, and the data signal, data and servo control signals. In a one-beam design, the pickup
The laser light is generated from a Autofocusing is accomplished by may be mounted on a pivoting arm
semiconductor device, strikes a semi- the four photodiodes. When correct or a linear sled. The pivoting arm
transparent mirror, and is reflected focus is achieved, two sharp images describes a radial arc across the data
90 °. A collimator follows the semi- are located between photodiode pairs, surface. A coil and permanent mag-
transparent mirror. Its purpose is to When the focus varies, the focal point net are mounted around the pivotal
make the divergent light rays parallel, of the system is shifted and the split point of the arm. When the coil is
The objective lens is used to focus beams draw closer together (when energized, the pickup may be placed
the beam on the disk data surface, the disk is too far) or farther apart anywhere on the pit track and posi-
taking into account the refractive in- (when the disk is too near). The dif- tioned as corrected by the autotrack-
dex of the polycarbonate substrate ference between photodiode pairs ing system. The objective lens is
[11]. A/D and B/C forms a focus error sig- provided with a coil and a permanent
As discussed, when the laser spot hal (A + D) - (B + C), which is magnet for displacement in the di-
strikes land, the smooth interval be- used to maintain the focus of the rection of the optical axis, that is,
tween two pits, the light is almost servo-driven objective lens. autofocusing.
totally reflected. When it strikes a The photodiodes also generate er- A three-beam pickup uses a center
pit (seen as a bump by the laser), ror signals for autotracking. When beam for reading data and focusing,
destructive interference and diffrac- the laser spot strikes the center of and two secondary beams for tracking
tion cause less light to be reflected, the pit track, a symmetrical beam is only. To generate additional beams,
The intensity-modulated light beam reflected. As the laser beam wanders the light from the laser passes through

J CONTROL DISPLAY

I FUNCTIONS FUNCTIONS

MEMORY ]

Ti t

ti o
CONCEALMENT CONVERTER FILTER OUTPUT
AND DIGITAL

T
CONVERTER FILTER

J. Audio Eng. Soc., VoL 36, NO. 4, 1988 April 263


CD FORMATS

a diffraction grating, a screen with


slits spaced only a few laser wave-
lengths apart. As the beam passes
through the grating, the light dif-
fracts. When the resulting collection
DISK is againfocused,it will appearas a
) ' /_ I single, bright, centered beam with a
series of successively less intense
I o_i,_t_,,, beams on either side. It is this dif-
fraction pattern that actually strikes

_o,i,,,tor theIndisk.
many three-beam designs the
,,mi.t_,_,n,_, property of astigmatism is used to

m_,or achieve autofocusing. A cylindrical


:_,.a_.'-a_,w.
r--' lens is used to detect an out-of-focus
ONE-BEAM condition. As the distance between

P,CKUP _!::J_:. the objective lens and the reflective


_ surfaceof the diskvaries,the focal

iL! point of the optical system also


= = photodiodes changes, and the image projected by
AB e D thecylindricallenschangesitsshape,
as shown in Fig. 12. That change in
the image on a four-quadrant pho-
todiode generates the focus correction
signal.
TRACKING In three-beam pickups the two
secondary beams are used for auto-
A B e D basic
,_g,_lp_,,_,_ tracking. The central beam spot cov-

_ A beamsare alignedaboveandbelow
ersthepittrackwhilethetwotracking
TRACK
SHIFTS
LEFT B -]--_'_ _=_ t_ error and to either side of the center beam.

ONTRACK
'[-'0_ :_ C _ _ (A+ B) - (C + D) When the part
properly, beamof is
each
tracking
trackingthe beam
disk

_ O [_'_ I _ RF signal part covers the the


is alignedon mirrored land between
pit edge;the other
TRACK
SHIFTS
RIGHT _ (A+B +C + D) pit tracks. The main beam strikes a
r'""l light present four-quadrant photodiode, and the
I1 light absent tWO tracking beams strike two sep-
arate photodiodes mounted to either
side of the main photodiode.
As the three spots drift to either
side of the pit track, the amount of
light reflected from the tracking
beams varies. There is less average
FOCUSING light intensity reflected by the beam
A EC O that encountersmorepit area and
_ greaterreflected
lightintensitybythe
DISK
TOOCLOSE A _ beam that encounters less pit area.

focusing error two tracking photodiodes thus form


:_ (A+ D)- (B+ C)
DISK
IN
FOCUS a a trackingcorrectionsignal,as shown
c
D _ inFig.13.
Therelativeoutput voltagesfromthe
FI[Y]
_ In manythree-beam designsboth
DISK
TOOFAR
AWAY the trackingand the focus difference
signals are applied to a two-axis ac-
tuator, with focus/tracking coil and
permanent magnet. A circular yoke
Fig. 11.One-beampickup,with photDdiodearrayfor datarecovery,tracking, supports a bobbin containing both the
andfocusing.(Courtesy
of PhilipsInternational
N.V.) focus and the tracking coils. A con-
trol voltage from the focus circuit is

264 J. Audio Eng. Soc., VoL 36, No. 4, 1988 April


the data themselves is used to deter-
mine the correct rotating speed and
ruSK CYLINDRICAL
maintaintheproperdataoutputrate.
BEAM SPLITTER LENS __

LENS
SPOT
POLARIZATION OBJECTIVE LENS_'_ f tN
__ J PHOTODIODE
..u- LENS I
ci_ CYLINDRICAL I .__ FACE PLATE

PHOTODIODE IN FOCUS _f_N_

E
photo,,od
array
an, s
cessing circuits produce a signal re-

sine waves,calledthe EFMsignal.


This is sometimes referred to as a
radio frequency (RF) signal. A col-
lection of EFM waveforms, some-
_Dt_ A_.,
v_ semblinga
times referredseries
to asf high-frequency
an eye pattern,
..m_ DISKFAR is shown in Fig. 14. The digital data
Fig. 12.Astigmatismis usedto tracefocus-errorpattern.(Courtesyof Howard can always be recovered from the
W. Sams, Inc.) EFM signal if it can be determined
when the signal crosses the zero axis,
applied to the bobbin focus coil to relative to the pit track. The three- relative to the timing constraints
maintain proper depth of focus. The beam pickup is thus mounted in a created by the EFM encoding rules.
other axis of movement is used to sled which moves radially across the Data decoding follows a procedure
achieve tracking accuracy. When the disk. The sled is often driven by a that essentially duplicates the en-
tracking difference signal is applied linear motor, coding process in reverse order. The
to the coil, the bobbin swings around A spindle motor is used to rotate EFM signal is converted to an NRZI
a shaft to move the objective lens the disk with constant linear velocity, signal; the first data to be extracted
laterally. Thus the disk must vary its speed from the NRZI signal are synchro-
A three-beam pickup requires a depending on where the pickup is lo- nization words. This synchronization
linear movement because the three cated underneath the surface. In an- information is used to synchronize
beams must stay in a fixed position other servo loop, information from the 33 symbols of channel informa-

0 0
o°"::_oO
o°%Oo°'_'?_?_o
I"i 1-+-I 1IoI
I...."01] IoI . 1 I
CIRCUIT

TRACKINGERROR
+ 0 - SIGNAL

Fig. 13.Generationof trackingerror signalfrom two side beams.(Courtesyof HowardW. Sams,Inc.)

j. Audio Eng. Soc., Vol. 36, No. 4, 1988 April 265


CD FORMATS

3T 4T _ 6T 7T 8T 9T 10T 11T

Fig. 14.Familyof EFM waveforms,the "eyepattern,"

tion in each frame. A synchronization rection decoder C2. Given this prior frame of 32 8-bit symbols; 24 are
pulse is generated to aid in locating knowledge, and help from deinter- audio symbols and eight are parity
the zero crossing of the EFM pattern, leaving, C2 is able to correct burst symbols. Odd-numbered symbols are
and to generate a transition at those errors as well as random errors that delayed and parity symbols are in-
points, in order to produce a binary C1 was unable to correct, vetted. The delay lines have a delay
signal. The use of two correction decoders equal to the duration of one symbol.
The EFM signal is demodulated and cross-interleaving helps tackle a This deinterleaving places the au-
so that every 17-bit EFM word again particularly difficult error scenario, dio symbols back into their original
becomes 8 bits. Depending on player Interleaving distributes burst errors, order by rearranging their order as
design, demodulation is accom- perhaps caused by surface contain- read from the disk. Any sequence of
plished by logic circuitry or by a ination, over different words for errors on the disk is thus distributed
lookup table. A buffer is used to re- easier correction. However, correc- among valid data. This aids in the
move the effect of disk rotational ir- tion is difficult when a burst error C1 decoder's correction efforts, es-
regularities. Data input to the buffer coincides with a random error, per- pecially for small errors in adjoining
may be irregular, but clocking en- haps introduced by a manufacturing symbols.
sures that the buffer output is precise, defect. The C 1 decoder corrects sin- In the C 1 decoder, errors are de-
To guarantee that the buffer neither gle random errors and flags burst er- tected and corrected by the four P
overflows nor underflows, a correc- rors, so that interleaving can accom- parity symbols. Short-duration ran-
tion signal is generated and is used plish its task. EFM coding guarantees dom errors are corrected, and longer
to control the disk rotating speed, that a random error will never corrupt burst errors are passed along. Spe-
By varying the rate of data from the more than two symbols, and the even/ cifically, the C 1 decoder can correct
disk, the buffer is maintained at 50% odd interleave guarantees that a two- a symbol error in every word of 32
capacity, symbol random error will always ap- symbols. If there is more than one
Following demodulation, data are pear as single errors in two different erroneous symbol, then all 28 data
sent to a CIRC decoder for deinter- C 1 words after deinterleaving. Ran- symbols are marked with an erasure
leaving, error detection, and error dom errors are thus always correct- flag and passed on. Valid symbols
correction. It utilizes parity from two able, and the C2 decoder retains its are passed along unprocessed.
Reed-Solomon decoders as well as burst error-correction capability. The deinterleaving delays between
deinterleaving. The first error-cor- The CIRC decoding process re- the decoders enable the C2 decoder
rection decoder C1 is designed to verses the processing steps performed to correct longer burst errors. Be-
correct random errors and to detect during encoding; the complete CIRC cause the word arriving at C2 contains
burst errors. It puts a flag on all burst decoding process is shown in Fig. symbols fromC1 decoded at different
errors to alert the second error-cor- 15. The CIRC decoder accepts one times, symbols marked with an era~

266 J.Audio
l=ng.
Soc.,
VoL36,No.4, 1988April
CD FORMATS

C1 c1 c2 c2
UNSCRAMBLER DECODER DEINTERLEAVER DECODER UNSCRAMBLER

J b
' We

m) __ _ cONVERTER

FROM _:FM '_; I J TO D/A

DECODER I_o If J

J'_ D
M --

m--

si ' C2 FLAG
)2_ C1 FLAG

Fig. t5. CIRCdecodingutilizesthreedeinterleavingand two correctiondecodingalgorithms.

sure flag are now distributed among converter could result in an audible is typically used to oversample (per-
valid symbols. This helps the C2 de- click. Muting is accomplished haps at a two-, four-, or _ight-time
coder correct burst errors. Symbols smoothly by beginning attenuation rate); image components appear at
without a flag are assumed error-free many samples before the invalid data, multiples of the new sampling rate
and are passed through unprocessed, muting the invalid data, then [13]. Because the distance between
In the C2 decoder errors are corrected smoothly restoring the signal level, the baseband and the sidebands is
by the four Q parity symbols. C2 can This method of muting is often larger, a gentle analog filter can be
detect and correct single-symbol er- largely inaudible, used to remove the images, without
rors and correct up to four symbols causing phase shift or other artifacts
if flagged. Burst errors and errors that _'-'_X attributed to analog brickwall filters.

process itself (rather than in the me- Reconstruction stage of every player is an audio
dium)are corrected,
might have occurred in the encoding _) Compact Disc Signal deemphasis
Also presentcircuit.
in theSomeCDs are
audio output
In addition,C2 cancorrect symbols encoded with an audio preemphasis
miscorrected by C1 decoding. When Finally the digital data must be characteristic with time constants of
C2 cannot accomplish correction converted to a stereo analog signal. 15 and 50 I_S. On playback, deem-
(mom than four symbols are flagged), This reconstruction process requires phasis is carried out, resulting in an
24 data symbols are flagged as un- one or two digital-to-analog con- improvement in the signal-to-noise
corrected and passed on for inter- verters and low-pass filters to sup- ratio. CD tracks that are encoded with
polation, press high-frequency image eom- preemphasis are so denoted by an
The final step in error protection ponents. If the data are first converted appropriate flag in the subcode. The
is error concealment. Thus interpo- to an analog signal (with a digital- player reads the preemphasis flag and
lation and muting circuits follow the to-analog converter), they must be automatically switches in the deem-
CIRC decoder. Uncorrected words filtered with brickwall analog filters, phasis circuit when required.
are detected through flags and dealt Rather than use an analog brickwall Increasingly, CD players offer a
with, while valid data pass through filter after the signal has been con- digital output connection, adhering
unprocessed. Using error flags from verted to analog form, the digitized to the AES/EBU serial transmission
the C2 decoder, the player's sig- signal may be processed before dig- format. To permit an orderly ex-
nal processing circuits determine ital-to-analog conversion using an change of data between digital de-
whether to output the data directly, oversampling digital filter [ 12]. vices, the Audio Engineering Society
to interpolate them, or to mute the An oversampling digital filter uses has established a standard transmis-
sound, samples from the disk as input, then sion format for linearly represented
For continuous errors, muting is computes interpolation samples, digital audio data [14].
employed as a last resort; bad data digitally implementing the response The digital audio format transmits
passed on to the digital-to-analog of an analog filter. A transversal filter both left- and right-channel data us-

268 J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April


CD FORMATS

ing one digital cable. A time-sharing tended mainly as a medium for becomes 2352 bytes (24 bytes × 98
multiplexing transmission transmits professional databases and mass frames), a workable area.
or receives frames, each containing storage for computer-related appli- This 98-frame area is called a sec-
left- and right-channel data alter- cations. CD-ROM is a cost-effective tor; it forms the basic CD-ROM data
nately. The transmission rate corre- way of distributing large amounts of unit. This area is sufficiently large
sponds exactly to the source sampling information [15]. to handle numerical data without al-
frequency.°When the sampling fre- To clarify, CD-Audio is a specific tering the audio CD format. Audio
quency is 44.1 kHz, that number of application of CD technology. It is data are replaced by user data and
frames is transmitted per second. The a standard for recording PCM audio, system data; this occupies 2048
length of one channel data period is The CD-ROM standard is derived bytes. The remaining 304 bytes are
thus 11.34 Ixs. One frame consists from the CD-Audio standard, but used for additional synchronization,
of two subframes, each with 32 bits. defines a format for general data headers, mode selection, and error
Each subframe carries data for one storage. UnlikeCD-Audio, CD-ROM detection/correction, as shown in
audio channel. The first 4 bits are is not tied to any specific application. Fig. 16. Each disk is divided into
used for synchronization and for Both media use a 120-mm-diameter 330 000 sectors, and each sector is
identifying preambles. The next 24 optical disk, but with different data divided into four fields. Data are re-
bits carry audio data, with the most formats, corded with EFM, as in audio CDs.
significant bit transmitted last. Since Apart from modulation and error The first 12 bytes from the 2352-
most CD players use 16-bit words, correction overhead, a CD-ROM byte block are used as a synchroni-
only the last 16 bits in the field are holds a theoretical maximum of 682 zation word. The next 4 bytes form
used. In addition, the first 4 bits in Mbytes (8-bit byte) of user infor- a header field used for time and ad-
the field are set aside for auxiliary mation. In practice, a storage capac- dress flags. Specifically, it contains
audio or other data. The last 4 bits ity of approximately 550 Mbytes 3 address bytes and 1 mode byte. The
form a control field that carries spe- permits a better yield for manufac- address bytes store location as time:
cial information, including an audio turers. This is a large storage area, one address byte holds minutes (74
sample validity bit, a user data bit, equivalent to 3600 still video pic- min maximum), the second byte
an audio channel status bit, and a tures, 1500 floppy disks, 18 000 holds seconds (0-59 seconds), and
subframe parity bit. pieces of computer graphics, or 275 the third holds the sector number
The digital audio signal may be 000 pages of alphanumerics, de- within the second (75 sectors per
transmitted through a coaxial cable, pending on resolution, number of second). For example, an address of
a glass fiber cable, or an ordinary colors, and so on [16]. 59-20-45 denotes the 45th sector
audio phono cable. Through this in the 20th second of the 59th minute
transmission standard, different __ I on the disk. Although this repeats

manufacturers are assured of com- Memory Data Format [ subcodechannel, it facilitates address
digital
patibleinterconnectionand
audio devices from different
transfer _ Compact Disc Read-Only ]I information already found in the Q
searching.
of digital audio information. The modebyte indicatesthe nature
The CD digital audio standard is Although a CD-ROM disk is vis- of the user data. There are two data
perhaps the most sophisticated audio ually identical to an audio CD, it modes, as shown in Fig. 17. The
playbacl_ medium ever devised. Its employs a modified data format. A mode 1 format specifies that 2048
small size, economy, robustness, and CD-ROM disk automatically iden- bytes of each block are devoted to
capacity make it an ideal music car- titles itself as differing from an audio user data. The mode 2 format allows
rier. However, the utility of the CD through the Q subcode channel, for 2336 bytes of user data. The dif-
plastic disk is not limited to music. Data in a music CD are derived from ference between modes 1 and 2 is
Other formats have been derived from the b44.1-kHz sampled signal, with that mode 1 uses 288 bytes to provide
the original audio standard. 16-_it quantization. The 16 bits are an additional layer of error detection
divlded into higher and lower 8-bit and correction (EDC/ECC). This is
bytes. These data bytes (before EFM in addition to the basic CIRC error
mo_lulation) are grouped into frames correction used on all CDs. It ensures
of 24 bytes, and parity and a syn- a level of data integrity essential for
chromzatlon word are added. Whale information that does not degrade
satisfactory for music applications, gracefully, such as text and database
CD read-only memory (CD-ROM) a frame is too short for numerical binary code. Mode 2 trades this ben-
is the logical extension of the CD applications, and there is no provi- efit of data security for maximum data
format toward the much broader ap- sion for addressing. A solution was rate by making the additional 288
plication of information storage in derived from the subcode field in bytes per sector available for user
general. Rather than store music, the which subcode from 98 frames is data. The basic error-correction en-
CD format can be treated as a general summed. Specifically, in CD-ROM, coding is suitable for gracefully de-
ROM system, used for any kind of 98 24-byte frames are summed, thus grading data such as video and audio
program material. CD-ROM is in- the effective size for the d_ita area [17].

270 J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April


Compact Disc Read-Only dias, reference and technical man- is considerable and CD-I applications
Memory Architecture uals, business catalogs, computer promise to be diverse. For example,
applications dictionaries,
rectories, software, encyclope-
operating a CD-I dictionary
Thescopeof mightspecification
theCD-I contain a
systems, on-line databases, back word and its definition, as well as
Because the CD-ROM standard issues of journals, libraries of still spoken pronunciation, pictures, and
does not provide a logical file struc- pictures, parts catalogs, and so on. additional cataloging to synonyms,
ture for CD-ROM media, software It represents an entirely new tech- antonyms, word relationships,
developers have implemented their nology of information dissemina- origins, or translations into foreign
own file formats as well as extensions tion. languages. A hobbyist's book on or-
to common operating system envi- nithology could reproducebird calls.
ronments, such as MS-DOS and Music lovers could define the pa-
VMS. One group of vendors, the rameters of reproduced music using
HighSierra Group, developeda pro- "elastic music" programs. Fiction
posal for a standardCD-ROMlogical works could be provided with a lab-
file structure. When a file structure yrinth of plot deviations, differing
is defined, all information on CD- CD interactive (CD-I) is a product- each time the book is read, or steered
ROMs that follow this standard can specific application of the CD-ROM at the discretion of the reader.
be made readable as standard files in format. Rather than storing specific CD-I specifications require flexi-
all operating system environments data as on a CD-ROM, CD-I permits bility in audio and video quality lev-
[18]. storage of a simultaneous combina- els to best suit the application at hand.
While CD-ROM uses a data format tion of audio, video, graphics, text, Thus audio fidelity can be selected,
similar to that of music CDs, disks and data, all functioning in a real- as can video resolution and color
are not compatible. A CD-ROM time, interactive manner. CD-Iisthus coding. Longer playing times are
player contains laser optics, modu- a defined multimedia extension of achieved by dividing a disk into
lation, and error correction, but dig- CD-ROM. The CD-I format defines channels, the number depending on
ital-to-analog conversion and audio both hardware and software stun- the quality of the audio and video
output sections are replaced with a dards, as well as integration of di- programs. In addition, real-time re-
computer interface to output the verse, selectable functions, depend- quirements for interactivity neces-
ROM data to a host computer. Data ing on the application [19]. sitated the adoption of an executable
are transferred to the host computer
in sectors. Using the header field,
the sector address in minutes, sec-
onds, and blocks is verified by the LEFT-CHANNEL DATA RIGHT-CHANNEL DATA
host computer. The disk table of OFAUDIO
CD OFAUDIO
CD
contents can be transferred to the CD- 8BITS 8BITS 8BITS 8BITS
ROM player
quested track.and used to locate a re- 00000000 1 111111 1111111111
111111 11 --]/
Because CD-ROM is not tied to 11111111 11111111 11111111 111111111__ SYNC PATTERN
one specific operating system or data 11111111 II1111111 11111111 00000000 U (12BYTES)
processor, CD-ROM devices can be ,MINUTES SECONDS SECTORS HEADER(4BYTES)
interfaced with all existing and future 1301301 1313002 130003
computer systems. CD-ROM is lim-
ited only by the capabilities of the 130005 130006 130130?

operating
cessor in system andcomputer.
the host the micropro-
CD- DO009 130010 . I:X)011 IXX)12 J I

retrieval system. Extensive amounts


of information

aROM
CD-ROM
can be stored con-
veniently on a few disks; for example,
is thus telephone
an open_ended
book storage/
for the
United States would be feasible. As-
suming 100 million listed telephones
and 40 bytes of information associ-
I
._[ 0
37
IY20lll
1)2045

D2049
l:kn053
[
D2038
D2042
!:)2046

1)2050
[.I I)2054
II t
D2039
D2043
D2047

1)2051
1_55
D2040
D2044
I)2048

111_52
13_056
USERDATA
(2048BYTES)

atedwitheachone, the totalof 4 _ _--


EOC AND ECC
Gbytes could be stored on about 1)2329 D2330 1)2331 1:11_.332 SYSTEM DATA
seven disks. (288BYTES)
The CD-ROM forms the basis for , 1_333 132334 1)2335 D_36
a new electronic publishing medium Fig. 16. CD-ROMdata formatfor 1 sector(98 frames).(Courtesyof Denon,
applicable to book publishing, di- NipponColumbia.)
J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April 271
Mode 1 (153.6kbytes/s)
q

Header (4) Auxiliarydata (288)

Sync Block address (3) Mode UserData ECC (276)


EDC Space
Min Sec Block 1 P parity Q parity
(12) (1) (1) (1) (1) (2048) (4) (8) (172) (104)
i

EDCmerror detectioncode; ECC_error correctionCode

Mode 2 (175.2 kbytes/s)

Header (4)

Sync BloCkaddress (3) Mode All user data

Min Sec Block 2


(12) (1) (1) (1) (1) (2 3 3 6)

!--

!< -- 2.352 bytes/CD block


Scrambledarea !
This area is equivalentto 1/75s in CD audiO,i.e., 16 bits x 2 channel x 44.1 kHz x 1/75 - 18 816 bits · 2352 bytes

1 mode per track


1-99 tracks per disk
Fig. 17. CD-ROM standard provides for two data modes. One mode per track, 1 to 99 tracks per disk. (Courtesy of
Denon, Nippon Columbia.)

object code as the coding method for (ADPCM) is used for three levels of is a l-3-s pause when switching from
application software. Multimedia varying fidelity. The A Hi-Fi audio the end of one channel to the begin-
applications also call for physical level uses 8-bit ADPCM, with a ning of the next channel. The spec-
interleaving of the three basic types sampling rate of 37.8 kHz. The ifications of the audio quality levels
of data (audio, video, and text/bi- quality is similar to that of vinyl LP are summarized in Table 1 [21].
nary) to ensure synchronized pre- records. The B Mid-Fi audio level ADPCM is one of several spe-
sentation of different data types. Fi- uses 4-bit ADPCM with a 37.8-kHz cialized methods used to encode
nally, the disk must provide efficient sampling rate. The quality is similar digitized audio. It is a variation of
storage for both data that degrade to that of an FM broadcast. The C linear PCM and extremely useful in
gracefully and data that do not. The Speech audio level uses 4-bit certain applications. Because of its
former require maximum bandwidth ADPCM with an 18.9-kHz sampling ability to store digital data with fewer
while the latter require extended error rate. The quality is similar to that of bits, it is thus more efficient. On the
detection and correction, an AM broadcast. The CD-I format other hand, ADPCM requires addi-
also supports an information audio tional processing beyond regular

Compact Disc Interactive tion 4s encoded. With this level, ing.


Audio Format over 10 000 hours of phonetic audio With a proper sampling rate
materialin may
level whichbe phonetic
stored oninforma-
a disk ADPCM
PCM for can
bothperform
encoding competitively
and decod-
[20]. with linear PCM. If the signal were
The CD-I format offers five levels An audio program may be divided always near its maximum frequency,
of audio quality, to be selected ac- into multiple channels (each with ADPCM's performance would be
cording to the need for fidelity. To about 74 min of playing time), stored similar to PCM's. However, with
allow for extended video informa- in parallel across the disk playing audio signals this is almost never the
tion, data compression techniques surface. The 16-bit PCM channel is case. Since the instantaneous audio
may be used to reduce the storage stereo; the three ADPCM modes can frequency is relatively low, the signal
space required to encode the audio be recorded in either stereo or mon- is changing more slowly, amplitude
program. A CD-Audio mode uses aural; the phonetic level is monaural, changes are therefore smaller, so
linear 16-bit PCM with 44.1-kHz The number of channels increases as ADPCM's quantization error is less
sampling rate. The quality duplicates the fidelity level decreases; there are than that for PCM. Thus in theory
that of stereo audio CDs. Alterna- o always twice as many monaural ADPCM can achieve better signal
tively, 8-or4-bit adaptive delta PCM channels as stereo channels. There resolution for the same number of

272 O.AudioEng.Soc.,Vol.36, No.4, 1988April


quantization bits. ADPCM was se- with 768 x 480 pixels (NTSC) and character, giving a total of 600 mil-
lected for the CD-I format because 768 x 560 pixels (PAL). Normal lion characters on a disk, or 2 bytes
relatively fewer bits are needed to resolution corresponds to the best per character, resulting in a total of
achieve good performance. As ira- achievable resolution with present 300 million characters per disk. In
plemented, its fidelity is lower than television receivers, while double and both cases it is possible to manipulate
that of the PCM signal on an audio high resolution represent the quality text via software, for example, copy-
CD, but it is suitable for many CD- of future enhanced or digital receiv- lng text from disk to external storage.
1applications, ers.

Compact Disc Interactive


Video Format

Because video material is stored


Compact Disc Interactive
Image Encoding

Image coding provides for three


Architecture

n,er
The CD-I format permits a high
degree of real-time interactivity. The
digitally, a CD-I disk is compatible picture qualities, namely, studio data format allows for real-time
with both the NTSC (United States picture quality and two graphics switching between sectors. Thus it
and Japan) and the PAL/SECAM qualities. Natural (photographic) is possible to have sectors with dif-
(Europe) television standards. CD-I pictures use DYUV (delta YUV, ferent information contents and with
disks may be manufactured to provide where Y refers to luminance and U different physical formats. CD-I is
either 525 scanning lines (NTSC) or and V refer to chrominance) coding, not a peripheral, but a self-contained
625 scanninglines (PAL); either type One full-frame natural picture is system. To ensure universal disk/
of disk is playable on any standard transferred in just over 0.6 s. The drive compatibility, dedicated hard-
television. When a standard disk is first graphics mode is designed for ware and interfaces are specified. A
played on a corresponding tandard end-user manipulation applications; CD-I player contains a CD-ROM disk
television (525 through 525, or 625 it is based on absolute RGB (red- drive as the system's input, decoder
through 625), negligible visual dis- green-blue) 555 coding, and supports chips for text, graphics, video, audio,
tortion is produced. When a disk is either 8-bit/256 colors or 15-bit/ and microprocessor controllers. The
played through a television adhering 32 768 colors. The second graphics CD-I system uses the Motorola 68000
to another standard (525 through 625, mode is based on CLUT (color lookup microprocessor family. A CD-I
or 625 through 525), approximately table) graphics, which permit 4-bit/ player could be interfaced to a tele-
7% visual distortion results. Gen- 16-, 7-bit/128-, or 8-bit/256-color vision and stereo sound system.
erally, the eye is able to perceive 5% full-screen animation. With com- Because the CD-I format recog-
distortion in a side-by-side compar- pression, the interleaving of sound nizes 16-bit PCM data (the first audio
ison, or 10% distortion otherwisb, and CLUT picture results in the abil- quality level), a CD-I player can play
Alternatively, a CD-I disk may be ity to provide full-screen animation regular CD-Audio disks. An audio
produced witha525/625 compatible with a picture refresh rate of 17 CD player cannot play CD-I disks
video format, which is playable on frames per second. (unless connected to a CD-I con-
either a 525 or a 625 television with troller). As specified in the High
a 3.6% distortion. _x, x Sierra Group proposal, CD-ROM

quality levels for resolution and pixel Text Encoding systems. On the other hand, a mi-
coding.
The CD-IThree standards
format calls forofvarying
video _ Compact Disc Interactive crocomputer
mode 1 diskswith can aoperate
CD-ROM on drive
CD-I
resolution are supported: normal is not always capable of processing
resolution of 384 × 240 pixels Text encoding can be accomplished the information on a CD-I.
(NTSC) and 384 × 280 pixels (PAL), either as a bit-map process or by The CD-I format has inspired the
double resolution (twice the hori- character encoding. The bit-map development of alternative technol-
zontal resolution) with 768 × 240 process requires 5 bytes per char- ogles. Digitalvideo interactive (DVI)
pixels (NTSC) and 768 x 280 pixels acter, resulting in a maximum of 120 is an all-digital optical disk format
(PAL), and high resolution (twice the million characters per disk. Char- capable of reproducing full-motion,
horizontal-and the vertical resolution) acter-encoded text can use 1 byte per full-screen video, computer-gener-

Table I, CD~I sound quality levels and specifications.

Level .Encoding Samplingrate(kHz) Bits Bandwidth(kHz) Numberof channels Maximumplayingtime


CD-Audio PCM 44.1 t6 20 1 stereo 74 rain
A ADPCM 37.8 8 17 2 stereo, 4 mono 4.5 h
B ADPCM 37,8 4 17 4mono, 8stereo 9h
C ADPCM 18.9 4 8.5 8stereo, 16mono 19h
InformationPhonetic .... 10000 h

J. AudioEng.Soc.,Vol.36,No.4,1988April 273
CD FORMATS

ated video graphics, and digital audio standards [24]. the outer diameter, encoded as an
via a CD-ROM drive. A DVI disk is However, because of incompatible analog signal according to the LV
effectively a CD-ROM disk contain- analog television standards, both format. In the NTSC standard the disk
ing DVI-specific data. Its data spec- NTSC and PAL players and disks speed ranges from 2700 to 1815 r/
ification permits motion video to be must be manufactured. When the rain, to yield a scanning velocity of
combined with foreground video ob- audio CD was standardized, design- 11-12 m/s. The track pitch is 1.7
jects, text, dynamic graphics, and ers were free to design universality I_m. In the PAL standard the disk
multitrack audio, all under the user's into the product. Thus any CD disk speed varies from 2250 to 1512 r/
control. Sophisticated data com- may be played on any CD player, min, with a scanning velocity of
pression techniques permit the system Not so with CD-V. TWO primary 9.2-10.2 m/s. The track pitch is also
to utilize disk storage space more ef- world standards exist, NTSC and 1.7 Ixm.
fectively and to output relatively PAL. Because video information is The player must sense the presence
larger amounts of data more quickly, stored and reproduced in analog of either digital audio-only or video
Although data on DVI disks are for- form, a CD-V player must be con- signal and adjust its operating pa-
matted to CD-ROM specifications figured for one standard or another, rameters accordingly. Typically, the
and can be played on a CD-ROM as are CD-V disks. In other words, player's pickup would read the audio
drive (other digital media may be the video portion of a PAL CD-V disk lead-in area, skip to the video por-
used as well), special DVI decoding will be unplayable on an NTSC player tion, then return to the audio-only
technology is also required. The DVI and vice versa. Also, the analog audio portion.
format is thus incompatible with portion of the video program is in-
other interactive CD formats, such compatible because it is integrated _'"_'X

of CD-ROM [22]. CD tracks will be playable on any Video/Laservision


as The
CD-ICD-I
and format presents consid-
other implementations PAL or NTSC
into the or CD-Audio
video signal. player.
The audio-only _ Compact Disc
erable opportunities for hardware,
software, and publishing industries _x, x The 200- and 300-mm-diameter

of interactive entertainment and ed- Single CDV-LP, respectively. They contain


ucation.
to provideHowever,there
consumers with arenew
notforms
many _ Compact Disc Video audio
disks conforming
are denoted tobytheCDV-EP
CD standard
and
existing, readily transferable pro- and video conforming to the LV
grams that take advantage of the CD-V introduces an entirely new standard. Analog audio may be used
video, graphics, audio, and text ca- software format, the CDV-Single. in addition to digital audio with the
pabilities supported by this new The CDV-Single appears similar to NTSC standard. The CDV-EP plays
standard. There is thus a considerable an audio CD. However, the substrate two 20-min sides, the CDV-LP plays
amount of creative work ahead [23]. and top lacquer are gold-tinted to two 60-min sides [25], [26]. Ob-
distinguish it visually. The disk is viously these disks cannot be played

approximately 5 min of full-motion scanning speed in the NTSC format


COMPACT DISC VIDEO NTSC video (encoded as an analog is 10.1-11.4 m/s. In the PAL format
signal)
the size with
of anCD digital
audio CD, audio sound
but contains it is a8.4-9.5
on m/s [27], [28].
CD-Audio-only Because
player. The
track, as well as an extra 20 min of of their long playing times, these
CD video (CD-V)is a combination digital audio only. A PAL standard disks will contain material such as
of audio and video technology, spe- disk encodes approximately 6 min of concert videos, classical music vi-
cifically the CD and Laservision (LV) video material. CD-Audio standards deos, and feature films.
systems. From that merger comes a are used for the digital audio data, In the NTSC format, the audio
new medium, the CDV-Single, and including a scanning velocity of portions of LV disks were formerly
a new hardware product, the CD-V 1.2-1.4 m/s and a track pitch of 1.6 encoded with two FM carriers placed
combiplayer. Ixm. Because the audio portion of the at 2.3 and 2.8 MHz, below the FM
A CD-V player is a combination CDV-Single disk is placed on the in- picture signal. In the PAL format,
CD and LV player. It plays 120-mm nermost diameter (from 50 to 74 the FM carriers were placed at 648
video CDV-Singles, along with con- mm), the audio portion may be played and 1066 kHz. In either case the dig-
ventionalfull-lengthaudio-onlyCDs, on a regular CD player. Likewise, ital audio signal in CD-V disks is
and all 200- and 300-mm LV video the CD-V player may play any audio placed in the 0-1.75-MHz band be-
disks as well. Hence the name corn- CD. The video portion, on the outer low the FM picture signal carriers.
biplayer. The audio portion of a diameter (from 78 to 116 mm), is Thus NTSC disks may be encoded
combiplayer is identical to that of a read only by the CD-V player. The with both analog and digital sound.
CD player; audio quality of the CD- combined digital audio and video Digital audio NTSC CD-V disks are
V will duplicate that of an audio CD (NTSC and PAL) formats are shown playable in existing LV players, with
player. CD-V thus retains full com- in Fig. 18. analog audio circuitry. An NTSC CD-
patibility with existing CD-Audio Video information is contained on V player plays existin, g LV disks.

274 J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April


CDFORMATS

However, in the PAL format the dig- digital filtering. Features encompass from a 3/4-in U-matic machine, syn-
ital audio signal replaces the analog those of both CD and LV players, chronized with time code. The high
FMcarriers. Thus a digital audio CD- such as programmability, picture bandwidth of 10 MHz (versus 1.5
V disk may nbt have analog audio as search, fast/slow motion, stills, and MHz for CD-Audio) necessitates
well. A PAL CD-V player is able to speed increase/decrease. Although it manufacturing procedures closer to
reproduce both digital audio LVs and uses primarily constant linear veloc- those of video disks, as opposed to
existing analog audio LVs. However, ity (CLV) scanning, CD-V also pro- audio CDs. However, CDV-Single
existing LV players will not be able vides for constant angular velocity disks will be manufactured at CD
to reproduce the audio portion of new (CAV) scanning in the 200- and 300- audio plants.
digital audio CD-V disks, mm formats. Other enhancements
In addition to the CD-V combi- call for high-definition (CDV-HD) _x,

formats, a CDV-Single dedicated guage software, Dolby surround COMPACT


player is able to play only CD-Audio sound encoding, as well as use of DiSC FORMATS
player playing all four optical disk picture, CD-I controllers, dual-lan- _ RECORDABLE
and CDV-Single disks. CD-V players copy-guard encoding.
use audio circuitry identical to ex- From a manufacturing standpoint, While important to the consumer,
isting CD players, including, for ex- CDV-Single disks must be mastered playback-only CD formats are of
ample, 16-bit digital-to-analog con- on a dual-speed lathe, using video limited utility in the recording studio.
verters and four times oversampling from a 1-in C-type machine and audio Recordability is essential for most

DIGITAL AUDIO
SAMPLING An output Iow-pass filter is used to re- QUANTIZATION
I f move these image spectra. Aliasing, a [ 1
form of distortion, occurs when the input
° An analog waveform such as an frequency exceeds the half-sampling As we have seen, sampling is a
acoustic pressure function in air exists rate. When a digital audio system sam- method of taking a measurement pe-
continuouslyin time, over a continuously pies input frequencies greater than the riodically. Sampling represents the time
variableamplituderange.Suchan analog half-sampling frequency, spurious new of the measurement, quantization rep-
function may be discrete-time sampled, frequencies are "recorded" aswell. Spe- resents the value of the measurement,
Moreover,the sample points can be used cifically, if S is the sampling frequency such as the amplitude of an audio wave-
to reconstructthe original analog wave- and F is a frequency higher than half form at sample time. Whereas sampling
form. This digitization of audio forms the the sampling rate, then a new frequency is perfect within its prescribed context,
basis for the encoding and decoding of Ff is created as Ff = S - F. Furthermore, quantization is imperfect. In the same
the audio signals in any digital audio aliasing occurs at multiples of the sam- way that length, an analog quantity, can
format, including the CD. piing frequency. Forexample,all of these be measured (for example, in inches),
The Nyquist theorem states that given alias components would be produced: an audio waveform can be measured
correct bandlimitedconditions, sampling S _ F, 2S _+F, 3S -+ F.... (for example, in volts). But in the same
can be a Iossless process. Just as some Thus the input signal must be band- way that a measurement of length is
relationship is needed between a movie limited with an input Iow-pass filter. It always proneto error,quantizingis prone
camera's shutter rate and the speed of ensures that all frequencies above the to error.
movingobjects to produceusable motion half-samplingpoint are attenuatedbelow An analog waveform may be repre-
pictures, the relationship between the the amplitude resolution threshold of the sented as a series of samples; the am-
sampling frequency and audio frequen- system. To achieve a maximum audio plitude of each will yield a number that
cies must be defined. The Nyquist theo- bandwidth for a given sampling rate, fil- represents the analog value at that in-
rem defines this relationship: if samples ters with a very sharp characteristic, so- stant. By definition, an analog waveform
are taken twice as fast as the highest called brickwall filters, are employed, has an infinite number of amplitude val-
audiofrequency,the completewaveform Disallowed frequency components are ues. However, quantization selects from
can be reconstructed [1]. In other words, not allowed to enter the sampler; thus a finite number of digital values. The
S samples per second are needed to aliasing cannot occur. In other words, selected value will be only an approxi-
represent a waveform with a bandwidth when sampling, one must obey the mation to the actual. Specifically, after
of S/2 Hz. The choice of sampling fie- sampling theorem [2]. sampling, the analog staircase signal is
quency thus determines the frequency In accordance with the sampling rounded to the numerical value closest
response of the digitization system, theorem, the sampling rate must be set to the analog value. The difference be-
Sampling may be considered a mod- to a frequency at least twice that of the tween the actual values of the signal
ulation process. One result of that mod- highest input frequency. In the case of and values after quantization appears
ulation is the creation of a spectrum with the CD, the sampling frequency is 44.1 as error.
sidebands, or image frequencies, cen- kHz (a sample every 22.6757 millionths The number of quantization steps
tered on multiples of the sampling fie- of a second). This permits a flat fre- available is determined by the length of
quency. These sidebands are a con- quency response from 0 to 20 kHz, with the data word in bits. This determines
sequence of sampling and result even a filter guardband to permit complete the resolution of the digitization system.
when the Nyquist theorem is observed, attenuation. For example: 16 bits yield 216 = 65 536

276 J.Audio
Eng.
Soc.,
Vol.
36,No.
4,1988
April
professionalaudio applications. Con- ternatively, written data could be holes in a thin film. Other mecha-
siderable research effort has been destroyed, but not recorded over), nisms include lasers that cause huh-
applied to the development of re- While it may at first be uncomfortable bles or blisters to form in the medium.
cordable optical-disk formats. There to magnetically inclined minds, a Tellurium and tellurium alloys have
is little question that future genera- write-once system is very workable been selected by many manufacturers
tions of the CD standard will provide for many applications. For example, for the thin film because of their low
for both recordable and fully erasable in some applications a permanent data melting points and high sensitivity
CDs. A variety of technologies is trail is precisely what is called for. [29].
being explored. In short, a write-once optical disk In other systems an irreversible
does not necessarily present fatal phase change is used to provide an

important. Indeed, other nonerasable of the medium at the point where the
Write-OnceDisk formats have met with success before; laser strikes. In this way the reading
for example, Erasability
drawbacks. photographyis not
is aalways
write- laser may in
alteration differentiate
the index ofdata. Some
reflectivity
once technology, systems use a thin metallic recording
The simplest recordable optical A write-once optical disk may be layer that changes its physical prop-
systems are write-once. The user may implemented in a variety of ways. erty from crystalline to amorphous
record data only once, permanently, In some systems a mechanism called when it is thermally heated (to ap-
until the disk capacity is filled. (Al- ablation uses a laser writer to burn proximately 170°C) by the writing

-ch .....

PRIMER
steps (from 0000000000000000 to comparable to, or better than, the best ers, output sample-and-hold circuits,
1111111111111111 ). The more bits the analog systems, output (anti-imaging) Iow-pass filters, and
better, because the magnitude of the output amplifiers [6]. Alternatively, a
error is smaller. But there will always be digitalfilter may precededigital-to-analog
a marginof error, conversion. TheoutputsideofthePCM
Rarely the selectedstep will be exactly A PULSE-CODE MODULATION system forms the basis for the design.
at the analog value; usually it will not be SYSTEM of a CD player.
quite exact. At worst, the input analog [ I
level will be one half step away, that is,
there will be a maximum error of one Modulation is the process of encod-
half the least significant bit of the quan- lng source information prior to trans- REFERENCES
tization word. Interms of decibels, every mission or storage. In pulse-code mod- [ I
added bit yields about a 6-dB increase ulation (PCM) the amplitude of the digital
in the system's signal-to-error (S/E) ratio waveform encodes binary digits directly. [1] M. Mathews, The Technologyof
because it reduces the quantization error Because of its relatively efficient use of Computer Music (MIT Press, Cam-
by a factor of 2. The S/E relationship bandwidth, its compatibility with off- bridge, MA, 1969).
can be expressed in terms of word length the-shelf circuitry, and its ability to be [2] B. Blesser, "Elementary and Basic
as S/E (dB) = 6.02n + 1.76, where n easily multiplexed,PCM has proveditself Aspects of Digital Audio," DigitalAudio,
is the number of bits [3]. Using the for- to be an expedient means of represent- CollectedPapersfromtheAESPremiere
mula, a 16-bit system is seen to yield lng audio data for recording or repro- Conference(Rye, NY, 1982 June), pp.
an S/E ratio of about 98 dB. duction. When it comes to storage on 5-11.
Dither can help suppress the effect of magnetictape or optical disk,otherforms [3] B. Blesser, "Digitization of Audio:
quantization error. Dither is a Iow-am- of modulation may be more effective. A Comprehensive Examination of The-
plitude analog white noise added to the There is one consistent drawback to ory, Implementation, and Current Prac-
input analog signal. (Othertypes of dither PCM; like many other digital coding tice," J. Audio Eng. Soc., vol. 25, pp.
have been researched.) It is important schemes,it requiresa much wider band-
that this small amount of noise removes widththan the corresponding analogsig- 739-771 (1978 Oct.).
[4] J. Vanderkooy and S. P. Lipshitz,
the quantization artifacts from a sig- nal [5].
nal--it does not mask the artifacts, it The architecture of a fixed, linear PCM "Resolution Below the Least Significant
Bit in Digital Audio Systems with Dither,"
removesthem [4]. digitization system is comprised of a re-
Sampling and quantization are the two cording section of input amplifiers, a J. Audio Eng. Soc., vol. 32, pp. 106-
fundamental design problems for a dig- dither generator, input (anti-aliasing) Iow- 113 (1984 Mar.).
itization system. Sample rate determines pass filters, sample-and-hold circuits, [5] J. Strawn, Ed., DigitalAudio En-
bandlimiting and thus frequency re- analog-to-digital converters, a multi- gineering:An Anthology(William Kauf-
sponse, and word length determines plexer, digital processing circuits for error mann, Los Altos, CA, 1985).
signal-to-error ratio. Although bandlimited correction and modulation, and a storage [6] S. Yoshikawa, "An Introduction to
sampling is a Iossless process, quan- medium such as digital tape. On the Digital Audio," Proc. AES 3rd Interna-
tizing is one of approximation. In general, playback side are processing circuits for tional Conference, Present and Future
a sampling frequency of 44.1 or 48 kHz demodulation and error correction, a of Digital Audio (1985 June), pp. 14-
and a word length of 16 bits yield fidelity demultiplexer, digital-to-analog convert- 23.

J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April 277


CD FORMATS

tape to erasure--suddenly decreases.


AnalOg Specifically, at their Curie temper- o
0 NTSC format video
sync ,Jo ature (about 200°C), magnetic par-
Digital J1 ['"' "' ticles are easily oriented by a weak
-20 audio field.

--40 nutespotof magnetic materialto its


Alaserbeamis usedto heata mi-
heated by the finely focused laser
-6o beamare affectedby the magnetic
-al fieldfromtherecording coil,anda
very high density recording results,
dB ..... . , . I[ , Curie point.
as shown Only19(a).
in Fig. those After
few particles
the laser
1 2 3 4 5 8 7 8 9 10 pulse, the temperature decreases, and
f(MHz) the data are stored in the magnetic
(a) layer.
Analog Data readout utilizes the Kerr ef-
0 PALformat video sync Jo fect, which describes the rotation of
Digital · the plane of polarized light as it
--20 audio ' passes through a magnetized mate-
rial. Specifically, the reverse-ori-
ented particles will reflect laser light
dB differently thanthe unreversed re-
-s0 gions.To readthe disk,a laseris
focused on the disk, and the angle
-80 of rotationof itsreflection is moni-
tored, as shown in Fig. 19(b). An
, - analyzer distinguishes between ro-
i 2 a 4 5 6 7 8 9 lO tated and unrotated light, and data
f(MHz)
are recovered from a modulated sig-
(b) nal. The intensity of the reading laser
Fig. 18.CD-Vsignalformats.(a) NTSCstandard.(b) PALstandard.(Courtesy
ofPhilipsInternational
N,V.) is much lower than that of the re-
cording laser, so the recorded mag-
netic information is not affected.
laser. The phase transition alters the optical disk recorder. Again, several To erase data and rerecord, a mag-
reflectivity of the recording layer at technologies present alternatives, netic field is applied to the disk along
written spots, thus allowing laser Perhaps the most intriguing is mag- with the laser heating spot, as shown
reading of the data. In some proto- netooptical recording (sometimes in Fig. 19(c), and new data are writ-
types a layer of antimony-selenium called optically assisted magnetic ten. Both operations can be accom-
has been employed, evaporated onto recording). Magnetooptical record- plished in one pass, as they can with
a hard plastic disk substrate, ing combines the erasability of a conventionalmagneticmedia. Amag-
An alternative to metal film is the magnetic medium with the non- netooptical recorder can be made to
use of polymer dye binder bilayer wearing high-density advantages of play back conventional CDs as well.
media. Colored dyes in a plastic me- optical recording [30]. An important aspect of any re-
dium over a reflective material are The recording density of magnetic cording medium is its compatibility
written with infrared light and read vertical media is underutilized by with media from other recorders. To
with red light. Unlike metal film conventional magnetic heads. Their achieve this within the high toler-
disks, polymer dye technology can recording flux fields cannot be fo- ances of a magnetooptical disk, blank
use a flexible substrate, cused sufficiently. In other words, disks may be manufactured with
the recorded area uses a far greater prerecorded and nonerasable ad-

Magnetooptical optics a magnetic field is used to re- ware address sectoring, and uses a
Recording cord data magnetically, but it is grooved disk in which address in-
weaker
area thanthan conventional
necessary. With recording
magneto- formation is physically
dressing. The method isformed
called in the
hard-
fields. By itself, it is too weak to groove and detected by light-beam
For some applications the advan- reorient the magnetic particles, reflection. Using this system, any
rages of write-once permanence are However, a unique property of mag- magnet()optical player will automat-
negated by the need to erase and re- netic materials is utilized: as they are ically track both address and data in-
record. For example, full erasability heated, their coercivity--the field formation contained on any disk. By
would be preferable for a multitrack strength required to bring a saturated superimposing the hardware ad-
278 J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April
dressing information with the re-
corded data signal, playing time is
not sacrificed.
A magnetooptical disk would re-
tain the protective properties of other
MAGNETIC optical media. The recording layer
PARTICLES/ would be sandwiched between a
, _ transparent substrate and a protective
_ _ _ __ _ { { t _fi _ _ _ _ _ _ _ _ _ _ _ _ t _ _ _/ layer. The laser light would shine
x, through the substrate, placing surface

_ REVERSE-ORiENTEDI respect to the interioroutof


dustandscratches data. focuswith
Although
PARTICLES _ MAGNETIC several magnetic materials could be

' 'i _'X_ & ' FIELD used,


based the ultimate selection
on orientation properties,will sig-
be
WRITING
LASER nal-to-noise ratio, and long-term
stability. System designers are cur-
(a) rentlyresearchingamorphousthin-
film magnetic materials with a coer-
civity of about 2500 Oe. Tests in-
dicate that a magnetooptical disk
could be erased/recorded over 10
million times, and would retain its
data for 10 years or more.
At present a magnetooptical CD is
still an experimental system. How-
i.{._ _ _ _ _ _ __ _ __ _ _ _ _ _ _ _ _ _ _ _ {_ _ _ _ _ _ ever, the introduction of an erasable

REFLECTED READ close at hand. However,magneto-


i BEAM (REVERSED) optics is not the only system being
researched. Some systems use phase-
change technology similar to that
__ [ _ usedin
opticalm write-oncesystems;
ediumseems theyuse
inevitableand
READING
LASER materialsthat exhibit a reversible
crystalline/amorphous phase change
when recorded at one temperature and
(b) erasedat another.Typically,a high-
reflectivity (crystalline) to low-re-
flectivity (amorphous) phase change
is used to record data, and the reverse
to erase. A tellurium suboxide re-
cording layer alloyed with germa-
nium, indium, and lead has been de-
veloped. By simply increasing the
laser power to burn holes rather than
change phase, permanent recording
{ _ * { { { {{ { { { { { {* { { { { _ { _ { { * { { { * * { { may be accomplished.

I
MAGNETIC
_ FIELD
(REVERSED)

!
ERASE
LASER The CD standardincorporatesover
50 years of theoretical and practical
(c) work in the digital audio field.
Moreover, it is the first widely avail-
able consumer digital audio product.
Fig. 19.Magnetoopticaldisk.(a) Writing.(b) Reading.(e) Erasing. Its success has sparked numerous
evolutionary steps in optical storage
J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April 279
CD FORMATS

technology. First,therewasCDfor [ ]
digital audio (CD-Audio), a robust,

dium. Next, CD-ROM utilizes the


large-capacity music ofstorage
great storage capacity the CD me-REFERENCES
for
primarilynonaudioapplications, [ ]
such as software and database. Then
CD-V adds video playback to CD- The material used in this article is "An Integrated Approach to CD
Audio and merges the CD with the excerpted from Ken Pohlmann's Players, Part 1: The Optical Pick-
LV format. CD-I offers backward forthcoming book, The Compact Disc up," Electron. Compon. Appl., vol.
compatibility with CD-Audio, adding Handbook, edited by John Strawn, 6, no. 4 (1984).
audio-visual interactivity with even and published by A-R Editions Inc., [12] J. Matull, "ICs for Compact
greater capacity, due to data com- 315 W. Gorham St., Madison, WI Disc Decoders," Elec. Compon.
pression methods. The CD-Single, 53703. Appl. (1982 May).
an 80-mm disk holding 20 rain of [1] M. G. Carasso, J. B. H. Peek, [13] M. G. Goedhart, J. B. H.
digital audio, has been introduced, and J. P. Sinjou, "The Compact Disc Peek, and J. P. Sinjou, "Digital-to-
A standard for write-once disks has Digital Audio System," Philips Tech. Analog Conversion in Playing a
been announced. Finally, record- Rev., vol. 40, no. 6 (1982). Compact Disc," Philips Tech. Rev.,
able-erasable CDs, using magne- [2] K. C. Pohlmann, The Compact vol. 40, no. 6 (1982).
tooptical and other technology, have Disc Handbook (A-R Editions, [14] Audio Engineering Society,
beenprototyped. 1988). "AES RecommendedPractice for
Whatever the format, optical disk [3] K. C. Pohlmann, Principles of Digital Audio Engineering--Serial
storage as exemplified by the CD Digital Audio (Howard W. Sams, Transmission Format for Linearly
standard promises to surpass the Indianapolis, IN, 1985). Represented Digital Audio Data,"
performance levels offered by other [4] J. P. J. Heemskerk and K.A. J. Audio Eng. Soc., vol. 33, pp.
audio storage mediums. Optical disk Schouhamer Immink, "Compact Disc 975- 984 (1985 Dec.).
technology provides data reliability System Aspects and Modulation," [15] S. Lambert and S. Ropiequet,
andlongevity, large storage capacity, Philips Tech. Rev., vol. 40, no. 6 Eds., CD-ROM, The New Papyrus
random access, low cost per bit, re- (1982). (Microsoft Press, 1986).
movability, recordability, no backup [5] R. W. Hamming, Coding and [16] J. P. Roth, Essential Guide
requirements, no medium wear, and Information Theory (Prentice-Hall, to CD-ROM (Meckler Publishing,
the ability to mix digital audio data Englewood Cliffs, NJ, 1980). 1986).
with video, control, time code, and [6] H. Hoeve, J. Timmermans, [17] Denon, Nippon Cdl_mbia
other data. Certainly optical storage and L. B. Vries, "Error Correction Co., Ltd., "CD-ROM Disc Technical
is an excellent complement to other and Concealment in the Compact Information" (1986 Mar.).
contemporary audio technologies, Disc System," Philips Tech. Rev., [18] J. B. Schwerin, CD-ROM
offering all the capacity and speed vol. 40, no. 6 (1982). Standards: The Book (Learned In-
currently required. In comparison, [7] L. B. Vries, and K. Odaka, formation Ltd., 1986).
other audio storage media are show- "CIRC--The Error-Correcting Code [19] K. C. Pohlmann, "Technical
lng their age. Over the coming dec- for the Compact Disc Digital Audio Overview of the CD-I Format" Mu-
ade, any and all users of audio media System," Digital Audio, Collected sic and Digital Technology, Proc.
will have to evaluate their recording Papers from the AES Premiere Con- AES 5th Internat. Conf. (1987 May),
technology in light of the CD and ference (Rye, NY, 1982 June), pp. pp. 223-232.
opticalstorage. 178-186. [20] OpticalMediainPerspective
At the same time it is certain that [8] H. Ogawa and K. A. Schou- (Philips International B.V., 1987
other technology-driven industries hamer Immink, "EFM--The Mod- Mar.).
will endorse the CD and optical stor- ulation Method for the Compact Disc [21] A General Introduction to
age. In very direct terms, the CD thus Digital Audio System," Digital Au- CD-Interactive (Philips International
further integrates the audio industry dio, Collected Papers from the AES B.V., 1986 Oct.).
with the video and computer indus- Premiere Conference (Rye, NY, 1982 [22] D. F. Dixon, S. J. Golin, and
tries. Soon, with technology such as June), pp. 117-124. I.H. Has hfield, "DVI Video/Graph-
the CD, the distinctions between [9] K.A. Schouhamerlmminkand ics," Computer Graph. World (1987
these fields, and the expertise of the U. Gross, "Optimization of Low- July).
practitioners, will become blurred Frequency Properties of Eight-to- [23] S. Lambert and J. Sallis,
indeed. Certainly, in terms of pro- Fourteen Modulation," Radio Elec- Eds., CD-I and Interactive Videodisc
fessional challenge and new market tron. Eng., vol. 53 (1983 Feb.). Technology (Howard W. Sams, In-
potential, the audio industry wel- [10] J. R. Watkinson, "Principles dianapolis, IN, 1987).
comes the CD and anticipates the fu- of Optical Storage," Electron. Wire- [24] K. A. Schouhamer Immink,
ture evolutions of technology and less World (1985 Mar./Apr.). A.H. Hoogendijk, and J. A. Kahl-
applications it will engender. [11] Philips International B.V., man, "Digital Audio Modulation in

280 J. Audio Eng. Soc., Vol. 36, No, 4, 1988 April


the PAL and NTC Optical Video Disk ips Tech. Rev., vol. 33, no. 7 (1973).
Coding Formats," J. Audio Eng. [28] W. van den Bussche, A. H.

Soc., vol. 32, pp. 883--888 (1984


Nov.).
Hoogendijk, and J. H. Wessels,
"Signal Processing in the Philips
BIBLIOGRAPHY
[25] G. Bouwhuis and P. Burg- 'VLP' System," Philips Tech. Rev.,
stede, "The Optical Scanning System vol. 33, no. 7 (1973).
of the Philips 'VLP' Record Player," [29] E. S. Rothchild, "Optical [31] G. Bouwhuis, J. Braat, A.
Philips Tech. Rev., vol. 33, no. 7 Memory: Data Storage by 'Laser," Huijser, J. Pasman, G. van Rosma-
(1973). Byte (1984 Oct.). len, and K. Schouhamer Immink,
[26] K. Compaan and P. Kramer, [30] K. A. Schouhamer Immink Principles of Optical Disc Systems
"The Philips 'VLP' System," Philips and J. J. M. Braat, "Experiments (Adam Hilger Ltd., Bristol and Bos-
Tech. Rev., vol. 33, no. 7 (1973). Toward an Erasable Compact Disc ton, 1985).
[27] P. J. M. Janssen and P.E. Digital Audio System," J. Audio [32] J. Isailovic, Videodisc and
Day, "Control Mechanisms in the Eng. Soc., vol. 32, pp. 531-538 Optical Memory Systems (Prentice-
Philips 'VLP' Record Player," Phil- (1984 July/Aug.). Hall, Englewood Cliffs, NJ, 1985).

:i;

Access Time Anti-Aliasing Filter Bandwidth


Time interval for the laser pickup Low-pass filter used at the input Range between lower and upper
to locate and begin reading data from of digital audio recorders to attenuate limiting frequencies; width of a band
any point on the CD. frequencies above the half-sampling of frequencies; also, maximum amount
frequency, to prevent aliasing, of information (for example, bits)
A/D (Analog-to-Digital) capable of transmission or storage.
Converter Anti-Imaging Filter
Electric circuit that converts an Low-pass filter used at the output Beam
instantaneous point of an analog sig- of digital audio reproducers, such as Laser light used in the optical
nal to a digital word, represented as CD players, to attenuate frequencies pickup to recover data from the disk,
a binary number, above the half-sampling frequency as well as to provide for focusing
and to eliminate supersonic image and tracking.
AES/EBU Serial Transmission spectra present at multiples of the
Format ,: sampling frequency. Binary
Signaltransmissionformat to con- Condition in which there are two
vey digital audio data. Increasingly, Aperture Circuit possible states; for example, the bi-
CD players have this digital output Sample-and-hold circuit used to nary number system (base 2) using
the digits 0 and 1.
to transfer data to digital-input am- buffer instability in the analog signal
plifiers, andto correctfor high-frequencyroll- Bit
off following the D/A converter. Smallest amount of digital infor-
Aliasing mation; abbreviation for Binary
Unwanted frequencies created Autofocus digiT. A bit can store only two states,
when sampling a signal of a fre- Method used in CD player pickups a 0 and a 1.
quency higher than half the sampling to maintain focus of the objective lens
rate (Nyquist frequency). Anti-al- on the pit surface. Bit Rate
iasing filters in digital recordersen- Rate or frequency at which bits
sure that aliasing will not occur. Autotracking appear in a bit stream. The bit rate
Method used in CD player pickups of raw data from a CD is 4.3218
Analog to maintain tracking of the laser spot MHz.
Physical quantity or data charac- on the pit track.
terized by being continuously vari- Brickwall
able rather than discrete. Acoustical Bandlimited Refers to slope of a filter with steep
waveforms are analog in nature. Defined and finite frequency band. attenuation. Analog anti-aliasing and

282 J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April


anti-imaging filters have brickwall CD-V (Compact Disc Video) Cylindrical Lens
characteristics. System storing 5 min of analog Optical component used in some
Buffer video and digital audio plus 20 min optical pickups to create an elliptical
Memory used to provide consistent of digital audio only on a 120-mm- pattern, through astigmatism, to
rate of data flow from an inconsistent diameter disk, and longer times on generate a focusing signal.
source. A crystal-clockedbuffervir- 200- or 300-mm-diameter optical
tually eliminates wow and flutter in disks. D/A (Digital-to-Analog)
Converter
digital audio systems. CIRC (Cross-Interleave Reed- Electric circuit that converts a bi-
Burst Error Solomon Code) nary-coded word to an equivalent
Large number of data bits lost on Method of error detection and analog voltage. A CD player may
the medium due to excessive damage correction using data interleaving and have one D/A converter shared be-
or obstruction on the medium. Error the Reed-Solomon error-correction tween audio channels, or one D/A
correction can compensate for burst code algorithm; used in all CD for- per channel. The latter is preferred.
errors, to a limit, mats.
Data, Analog
Any physical representations of
information processed by an analog
circuit.

DICTIONARY Data, Digital


Anynumerical information
cessed by a digital circuit.
pro-

Diffraction Grating
............... Opticalcomponent usedin three-
beam pickups to split the laser beam
Byte Clock into secondary beams; to be used for
Group of bits operating together, Timing device that generates the tracking.
Usually 8 bits. basic periodic signal used as a source
of synchronizing signals in digital Digital
CAV (Constant Angular equipment. Crystal control ensures Use of numbers (typically binary)
Velocity) precision, to represent information.
Disk rotating at a consant number
of revolutions per second. The LP is CLV (Constant Linear Velocity) Digital Audio
a CAV system at 33V3 r/min. Disk rotating at a varying number Use of sampling and quantization
of revolutions per second to maintain techniques to store or transmit audio
CD-Audio (Compact Disc Digital a constant relative velocity between information.
Audio System) pickup and track across the disk ra-
System storing of digital audio and dius. The CD is a CLV system, ro- Digital Filter
subcode information on a 120-mm- tating at 200-500 r/min.
diameter optical disk. Any filter characteristic accom-
plished in the digital domain. Digital
Compact Disc filters may use oversampling to yield
CD-I (Compact Disc Interactive) 120-mm-diameter disk holding the characteristic needed for the anti-
System storing digital audio, audio program or other nonaudio aliasing and anti-imaging filters.
video, text, and graphics information data. The disk is made of plastic, However, digital filtering does not
interactively, with user control over with a top metalized layer, and read introduce phase distortion common
content and presentation, on a 120- by reflected laser light, to brickwall analog filters.
mm-diameter optical disk.

CD-ROM (Compact Disc Read- CRCC (Cyclic Redundancy Digital Sum Value
Only Memory) Check Code) Value representing the difference
Method of storing digitally coded Error-detection code that is re- between the lengths of the pits and
information, usually computer or corded with the audio information at fiats comprising the bit stream; min-
database related, on a 120-mm-di- a specified interval; code data derived imized to suppress low-frequency
ameter optical disk. from the audio data. content of signal.

CD-Single Cutoff Frequency Digitization


System storing 20 min of digital Frequency above or below which Any conversion of analog infor-
audio on an 80-mm-diameter disk. a filter effectively attenuates, mation into digital form.

J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April 283


couPA
cT ,SCD,C ONA.¥-;
Discrete-Time Sampling Error Concealment lng, errors in consecutive bits or
Technique of representing a Method used to repair an error in words are distributed to a wider area
waveform at discrete instants of time the audio signal. Through interpo- to guard against consecutive errors
with a set of periodic pulses; the basis lation new data are derived from ad- in the storage medium.
of digital audio, jacent error-free data blocks. An ap-
proximation to the original data is Interpolation
Disk Drive generated. Method used to conceal errors by
Mechanismfor supporting and rD- using adjacent data to determine the
taring the disk. Often magnetic Error Correction approximate value of missing data;
clamping is used to ensure a tight, Method using a coding system to error concealment technique.
nonvibrating grip on the disk. correct data errors by use of inter-
Dither leaved data or redundant data within Land
a data block. Corrected data are Reflective substrate between pits
Analog noise added to the input identical tol the original, on the CD data surface. The land
of a digital signal processing or re- provides almost complete reflection
cording chain to reduce the effect of Error Detection of the laser light.
quantizing error at low-level signals. Coding system which provides a
Dither must be added at the input method of determining errors that Laser (Light Amplification by
when the digital recording is made. occur in a digital data stream. Stimulated Emission of
Dropout Radiation)
Error condition in which bits are Eye Pattern Device that generates coherent,
incorrect or lost from a medium. A High-frequency or radio-frequency monochromatic light waves in the
disk defect or obstruction could cause data signal output from the optical light spectrum. All CD players con-
a dropout, pickup, tain a semiconductor
laser in their
optical pickup.
DVI (Digital Video Interactive) Fiberoptics
Technology using data compres- Technology of using glass fibers Low-Pass Filter
sion techniques to store 1 hour of to convey light and modulated in- Filter with a characteristic that al-
digital video and audio information formation. Some CD players have lows all frequencies below a specified
on a 120-mm-diameter optical disk. fiberoptic outputs for noise-free roll-off to pass and that attenuates
transfer of data to the amplifier, all frequencies above. Anti-aliasing
EFM (Eight-to-Fourteen and anti-imaging filters are low-pass
Modulation) Frame filters.
Encoding technique used in CD Basic unit of data representation
mastering to convert 8 bits of digital on a CD disk, containing synchro- Mastering
data to 14 bits. This defines pit nization, subcode, audio data, and Process of encoding audio and
structure, increases data density, and parity, control data on a glass master disk,
enhances tracking. The player per- prior to CD replication.
forms EFM demodulation. Index Numbers

Emphasis Delineations found in the subcode Matricing


of some CDs, marking points of ref- Production of metal impressions
Boosting of high frequencies in the
audio signal during recording, then erence in the audio program, from master or submaster CD disks,
deemphasizing or correspondingly to be used for disk molding.
attenuating them during playback. Initialization
This increases the signal-to-noise Process of reading the table of MCR (Magnetooptical
ratio. A CD with emphasis is iden- contents when a CD is first loaded Recording)
tiffed by a set emphasis bit, and the in a player. Initialization is accom- Erasable optical disk system using
player automatically performs plished automatically; most players magnetic media and laser reading/
display disk track numbers and time writing.
deemphasis, after initialization.
Erasable Optical Storage Modulation
Optical medium upon which data Interleaving Process of varying a carrier signal
can be written, read, then erased, and Process of rearranging the time according to the instantaneous value
written again; reversible medium, sequence of data. Upon deinterleav- of an input signal.

284 d. Audio Eng. Soc., Vol. 36, No. 4, 1988 April


DISCDICTIONARY
Multiplexing Parallel RandomBit Error
Process of combining independent Simultaneous presence of bits in Error in a digital data stream where
signals. Demultiplexing reverses the a data word on multiple conductors, only a few bits are lost; corrected by
process, errorcorrection.
PCM (Pulse-Code Modulation)
NRZ (Nonreturn to Zero) Method in which digital words in Read
Digital recording code in which the a bit stream represent samples of an- Nondestructive reading of data
state of the medium changes posi- alog information; basis of most dig- from a medium.
tively with a 0/1, and negatively with ital audio recordings.
a 1/0 signal change. Read-Only Memory (ROM)
Pitch Memoryfromwhichdatamayonly
NRZI (Nonreturn to Zero Distance between adjacent tracks be read, after initial storage. The CD
Inverted) of a data track. The pitch of the pit is a read-only memory.
Digital recording code in which the spiral on CDs is 1.6 lam.
state of the medium is reversed every Sampling
time a 1 is recorded and remains Process of storing the amplitude
constant when a 0 is recorded. Pits of a signal at a particular point in
Physical impressions on a disk
time, represented by a number.
Nyquist Frequency substrate storing the data. Viewed

Highest audio frequency that may as bumps by the reading laser, they Sampling Rate
be accurately sampled. The Nyquist diffract the light, decreasing the in- Frequency that an analog signal is
frequency is one half the sampling tensity of light returned to the pickup, sampled; expressed in hertz. The
frequency; for a CD system it is 22.05 sample rate used on CDs is 44 100
kHz. Player samples per second (44.1 kHz). Since
Mechanical and electrical device the CD reproduces stereo sound,
Objective Lens used to recover data from a disk. 88 200 samples are stored for each
Optical component used in laser second of sound on the disk.
pickups to focus laser light on the Polycarbonate
disk data surface. Transparent plastic material used Sampling Theorem
to form the substrate for CDs. Theorem stating that a bandlimited
One-Beam Tracking continuous waveform may be exactly
Optical pickup design in which a Premastering represented by a series of discrete
single laser spot is used for data re- Process of integration of digital samples. The sampling frequency
covery, tracking, and focusing, data code, error correction, and sub- must be at least twice the highest
codes, and of conversion to the CD frequency contained in the wave-
Optical Pickup data format prior to disk mastering, form.
Optical device using laser light to
convey data from a disk. Quantization SCSI (Small Computer Standard
Process of converting an infinitely Interface)
Oversampling variable amplitude of an analog Standard 8-bit parallel interface
Digital technique in which samples waveform to a finite series of discrete used to connect peripherals such as
are interpolated, typically by 2 or 4 levels expressed as binary words, a CD-ROM player to a microcom-
timesrate, to accomplishdigitalill- purer.
tering of the signal, prior to D/A performed by A/D converter.
conversion. A gentle analog anti- Serial
imaging filter is still required after Quantization Error Arrangement of data in linear se-
oversampling to remove extreme Error resulting from quantizing an quence; transmission of data through
supersonic frequencies, analog waveform to a discrete level; a communications line.
the longer the word length, the less
PAM (Pulse-Amplitude the error. Servo Mechanism
Modulation) Control device or system that reads
Method in which the amplitude of Random-Access Memory (RAM) its own output to determine the de-
pulses in a pulse train represents the Memory in which data may be read gree of further output as compared
analog information, and written regardless of address, to its input.

286 J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April


S/H (Sample-and-Hold Circuit) Substrate Tracking Error
Circuit that captures and retains Material upon which a device is Error condition in which the optical
an analog signal for a finite period constructed and supported. A CD is pickup deviates from the pit track,
of time. The input S/H precedes the made of a polycarbonate substrate, perhaps caused by an obstruction or
A/D converter, allowing time for disk defect.
conversion. An output S/H (aperture Table of Contents
circuit) follows the D/A converter, Nonaudio data contained in the Transversal Filter
smoothing glitches and compensating lead-in area of a CD, storing initial- Numerical filter using a shift reg-
for aperture error, ization data, such as track and time ister, multipliers, and adder to per-
information; read during disk ini- form digital filtering; circuit used in
Spiral Track tialization, many CD players.
Pit track spiral physically encoding
data on a CD, running from the in- Three-Beam Tracking Wow and Flutter
nermost diameter to the outermost. Optical pickup design in which a Variation in transport speed from
mechanical error; negligible in the
The spiral track on a disk may be 3 laser beam is split into three spots,
miles long, containing over 3 billion the center spot being used for data CD system, since it is electronically
pits. recoveryand focusing, and the sec- compensatedby buffering.
ondary spots for tracking. Write
Subcode To record data on a medium.
Data encoded on a CD that contain Track Numbers
definable information, such as track Numbering of songs or movements Write-Once Media
number, timings, copy inhibit, and on a disk. This information is con- System in which data may be written
copyright, tainedin the subcode, once, but not erased and rewritten.

THE AUTHOR

Ken C. Pohlmann is an associate professor of


music at the University of Miami in Coral Gables,
Florida, and a director of the undergraduate and
graduate Music Engineering Technology programs
there. He received B.S. and M.S degrees in
electrical engineering from the University of Illinois
in Urbana-Champaign in 1974 and 1976,
respectively. His teachers have included James
Beauchamp, Sal Martirano, and Herbert Br0n.
Professor Pohlmann is the author of Principles
of DigitalAudio (1985) and The Compact Disc
Handbook (1988). In addition, he has published
over 300 articles in popular audio magazines,
including Audio, Car Stereo Review,db, Digital
Audio and Compact Disc Review, Electronics
Australia,Mix, Sound and Vision,and Stereo
Review. He is editor-at-large of DigitalAudio and
Compact Disc Review, technical editor of Car
Stereo Review and Mix magazine, and
contributing editor of Audio and Sound and Vision.
He is chief recording engineer for the Greater
Miami Opera. He cofounded International
Business Information Systems, Inc., Miami, a
business computer export company, and
Microcomputer Arts, Inc., also in Miami, a music
synthesis design company. He also cofounded
U.S. Digital Disc Corporation, a New-York-based
company for consulting, and for manufacturing of
Compact Discs. Professor Pohlmann also works
independently as a consultant to the audio
industry.

J. Audio Eng. Soc., Vol. 36, No. 4, 1988 April 287

You might also like