Professional Documents
Culture Documents
Sound
Processes
Chapter 4 Music Distribution page 3
This publication forms part of an Open University course, TA225 The Technology of
Music. Details of this and other Open University courses can be obtained from the
Course Information and Advice Centre, PO Box 724, The Open University, Milton Keynes
MK7 6ZS, United Kingdom: tel. +44 (0)1908 653231, email general-enquiries@open.ac.uk
Alternatively, you may visit the Open University website at http://www.open.ac.uk
where you can learn more about the wide range of courses and packs offered at all
levels by The Open University.
To purchase a selection of Open University course materials visit the webshop at
www.ouw.co.uk, or contact Open University Worldwide, Michael Young Building,
Walton Hall, Milton Keynes MK7 6AA, United Kingdom for a brochure. tel. +44 (0)1908
858785; fax +44 (0)1908 858787; email ouwenq@open.ac.uk
MK7 6AA
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system,
recording or otherwise, without written permission from the publisher or a licence from the
Copyright Licensing Agency Ltd. Details of such licences (for reprographic reproduction) may be
obtained from the Copyright Licensing Agency Ltd of 90 Tottenham Court Road, London W1T 4LP.
Open University course materials may also be made available in electronic formats for use by
students of the University. All rights, including copyright and related rights and database
rights, in electronic course materials and their contents are owned by or licensed to The Open
In using electronic course materials and their contents you agree that your use will be solely
for the purposes of following an Open University course of study or otherwise as licensed by
Except as permitted above you undertake not to copy, store in any medium (including
or show in public such electronic materials in whole or in part without the prior written consent
of The Open University or in accordance with the Copyright, Designs and Patents Act 1988.
Printed in the United Kingdom by The Burlington Press, Foxton, Cambridge CB2 6SW.
1.1
3 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 3
Chapter 4
Music Distribution
CONTENTS
Aims of Chapter 4 4
1 Introduction 5
2.1 Introduction 5
2.2.1 Planning 7
2.2.3 Mix-down 9
2.2.5 Manufacture 11
2.2.6 Distribution 11
4 Digital audio 18
4.1 Introduction 18
5.1 Introduction 26
6.1 Introduction 32
6.3 MiniDisc 38
7.1 Oversampling 42
8.1 Introduction 51
8.3.1 Introduction 53
8.5.1 Introduction 62
Summary of Chapter 4 67
Learning outcomes 74
Acknowledgements 74
AIMS OF CHAPTER 4
1 INTRODUCTION
The first three chapters of this block discussed ways in which musical
performances may be recorded and stored. These include a written
score, a set of coded instructions, or an audio recording of the actual
sounds. This chapter takes the last of these methods and describes
both the events leading up to the recording and the subsequent
manufacturing and distribution processes.
At the time of writing (2004) the recorded medium is most likely to be
the digital audio compact disc (correctly abbreviated to CD-DA but
generally referred to as the CD, as I will here). Available since October
1982, the CD revolutionised the music industry and so part of this
chapter is devoted to CD technology. Other technologies for recording
and distributing music in the digital domain, including digital tape,
alternative disc systems, the Internet and wireless broadcasting, will
also be discussed.
2.1 Introduction
Music companies such as Universal Music Group, Sony Music, EMI,
Warner Music and BMG are responsible for all aspects of record
production from deciding which works merit recording to the
appearance and sound of the final product.
A commercial recording is the outcome of hours of work involving
many people. Look closely at the notes that accompany any
commercial recording and you should find somewhere a list of people
involved with the production. The booklet that came with my copy of
the Virgin Classic’s recording A Venetian Coronation 1595, illustrated
in Figure 1, lists 63 names.
Figure 1 The booklet from the Virgin Classic’s recording of A Venetian Coronation 1595
(Virgin Veritas 59006)
6 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 6
Aside from the performers and the conductor, Paul McCreesh, the
names listed include the executive producer, the music editors, the
balance engineer, the cover designer, the photographer and several
friends and advisers to the conductor. Even the organ tuner gets a
credit! Not included in the list, but equally important, are all those
responsible for organising the venue (it was recorded at Brinkburn
Priory, Northumberland, UK), providing the hospitality during the
recording, undertaking post production after the recording,
manufacturing the CD, printing the booklet, etc., etc. In reality
hundreds of people will have been involved in making this record.
Take a few moments to look at some of the booklets from your own
recordings and see how many people are involved in producing them.
Do you find that each record has a similar list? You may find that
some record labels make more information available than others.
Typically CDs tend to show more information than either audio
cassettes or vinyl LPs.
Note down any job titles that recur. Usually the job title will be
accompanied by the name of a person. Are you able to classify any of
these jobs into either artistic or engineering rôles?
Comment
Some of the jobs that I discovered by looking at labels from a random
selection of my own CDs include: producer; remix engineer;
production assistant; A&R co-ordinator (A&R is explained later in this
section); director; balance engineer; tonmeister (literally sound-
master); designer; recording engineer; digital remix engineer; executive
producer.
The job functions within the record side of the music industry may be
classified into two rôles, the artistic and the engineering, as listed in
Table 1.
the USA.
Figure 4 A BBC recording session for the Open University course A207
Not all artists expect to spend a lot of time in the recording studio.
In 1942 the American singer Bing Crosby made a recording of Irving
Berlin’s song White Christmas in eighteen minutes with only two
takes. This recording has now sold over 31 million copies.
9 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 9
2.2.3 Mix-down
Once the recording session is over the producer and engineers work on
the mix-down of the session recording to make the master recording.
The mix-down will usually be to two channel stereo although now
surround sound formats may also be made, depending on the content
of the recording and how it is to be distributed. Judgements as to the
ultimate sound will be made by the producer and engineers on an
artistic basis. In well-designed digital audio systems there should be
no loss in quality or added noise on any copies made from the session.
Digital editing allows very small blemishes in the sound to be corrected to
achieve a near-perfect recording from the original performance. (You may
recall from Chapter 3 in this block that piano rolls could be similarly
corrected.) Figure 5 shows a page from the editing notes of a recording of
the cadenza from the Sibelius Violin Concerto in D minor. The numbers
at bars, e.g. 205 and 118, represent recording takes. You can see the
reference to noise from a passing motor car (‘filter car’) during take 120
which needs attention. Note also the mixing of takes from 118 to 120, back
to 118, etc. gradually building a new performance from a collection of
takes. The really
important point
about this example is
that the editing is not
being used to piece
together an accept-
able performance
from an incompetent
artist (although that
possibility could
exist). Instead edit-
ing is being used to
create a performance
that is better than
even this highly
competent player
would be likely to
achieve in a single
take or at a live
performance.
Once the mix-down
is complete the
master recording will
be stored on a digital
recording medium
such as an 8 mm data
cartridge, digital
audio tape (DAT) or
CD. (These systems
will be introduced
later in this chapter.)
Figure 5 Part of the editing notes for editing a musical performance
10 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 10
Data specific to the medium is also added at the post production stage.
For a CD this would include information about track separation, track
numbers and the length of each track. Once all the digital audio and
associated data is finalised the master recording is stored onto digital
media such as the Exabyte 8 mm tape cartridge illustrated in Figure 6
using a special disc description protocol (DDP) file format. Exabyte
tape is a specially formulated, substantially bit-error free, tape
manufactured primarily for archiving computer data.This is preferred
by engineers as the audio data will be substantially free from errors.
DAT and CD-R
may also be used
but are not
considered ideal
as they have
relatively high
bit-error rates.
The post
production
master tape may
be referred to as
the EQ’d-master
to identify the
fact that has been
made for a
specific medium.
Figure 6 An Exabyte tape cartridge
11 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 11
2.2.5 Manufacture
Today’s music industry mass-produces recordings in enormous
quantities. For example nearly ten billion CDs were manufactured
world-wide between 1999 and 2002. And yet the duplication process,
which produces records that are identical to the master, still uses a
process similar to that invented by Emile Berliner in 1894. (You will
read more about Berliner in the final chapter of this block.) His process
used ‘stampers’, which were reverse or negative representations of a
master disc, to press a groove into a hard rubber compound. The fully
automated manufacturing process for CDs using a similar principle
but with different compounds is described in Box 1 ‘The compact disc
manufacturing process’.
2.2.6 Distribution
Record distributors are owned either by music companies or are
independent. The independent distributors handle the smaller labels
which offer the more specialist genres. Distributors look after
shipping, warehousing, inventory control and have a sales force who
get the records into the marketplace by selling to the record stores.
Take a look again at some of the inserts from recordings that you own but
now look for the record company and label. Make a list of the company
and label. Do you see that a record company can own several labels?
Comment
Table 2 lists the major music companies with examples of their record
labels. Unfortunately, the distributor’s name does not feature on the
notes for the record. To find a distributor you would need to search
the Web or refer to one of the many record magazines.
Publicity, both for specialist magazines and the record retailers will be
made available prior to the release date and advanced copies of the
recording will be sent to reviewers and broadcasting organisations.
Finally, enough records should be manufactured to ensure those of us
who want a copy are not kept waiting!
Describe in your own words the six stages necessary to get a CD into
the shops.
12 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 12
father
electroplating
6 7
mother
disk substrate
Disc duplication
reflective Figure 7(b) shows the duplication process. The disc is
aluminium made from clear polycarbonate plastic in an injection
moulding machine. The stamper presses the data into
10 metalisation the plastic material and forms the disc (9). (It takes
about 5 seconds to press each one.) Then a reflective
protective layer
of lacquer layer of aluminium is deposited over the data surface
11 lacquering (10) using an evaporation or ‘sputtering’ technique.
(This takes about 3 seconds.) Finally a layer of lacquer
is sprayed over the aluminium (11) to protect it from
abrasion and oxidation and a label is printed onto the
Figure 7 The manufacturing steps to make a lacquered surface. Following quality control checks the
CD based on Berliner’s original process discs are packaged and boxed for distribution.
13 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 13
120
sound pressure level (dB) (reference: 20 µPa)
100
80
60
40
20
by the noise in the system, i.e. when the lowest signal level becomes
14
(a)
0
� noise
time
amplitude
(b)
0
CHAPTER 4 MUSIC DISTRIBUTION
amplitude
maximum
signal level
Figure 9 (a) an audio signal enveloped in noise, (b) a clipped audio signal
Comment
The ideal dynamic range would be one that matched the human ear,
i.e. 130 dB. However, given that studio microphones limit dynamic
range to around 90 dB this would appear to be a suitable value for
consumer audio systems. Of course in an average domestic
environment it would be impossible to achieve a dynamic range
approaching this value – just think what the neighbours might say!
90 dB 90
80 dB 80
70 dB 70
gain
60 dB 60
50 dB 50
40 dB 40
30 dB 30
10 Hz 100 Hz 1000 Hz 10 000 Hz 100 000 Hz
frequency
Figure 10 The frequency response of a typical audio amplifier (a repeat of Figure 26 in
Block 1 Chapter 6)
frequency between the two cut-off frequencies must be the same for a
constant input amplitude to ensure the relationship between the
fundamental frequencies and the harmonics that comprise the sound
are preserved. Any variation will corrupt this relationship and cause
audible distortion which can change the timbre of the sound.
Run the course’s sound editing software and open the computer sound
file associated with this activity. Use the program’s graphic equaliser
filter to create, in turn, each of the three bandwidths given in Table 3.
What effect on the sound does each setting of the filter have compared
to the unfiltered sound?
Comment
I hope you noticed how the timbre of the musical instrument changed
especially when you used the very narrow bandwidth setting. You
probably noticed a lack of high frequency with the AM setting and
really very little difference at all with the FM setting when compared
to the unfiltered sound.
Recent research has shown that increasing the bandwidth beyond the
audible range may contribute to a more realistic reproduction
especially of hall acoustics. Some domestic audio systems now offer
bandwidths up to 80 kHz to support SACDs and DVD-As.
Audio systems may be described by a transfer function, introduced in
Chapter 6 of Block 1, which relates the characteristics of the output
signal to those of the input. The transfer function may also be represented
on a spectral graph where the frequency content of the input and the
output can be compared. Unless the transfer function is linear as shown
in Figure 11(a), the output signal will be distorted due to the creation
of additional harmonics as shown in Figure 11(b). The additional
harmonics, including sub-harmonics (harmonics below the lowest
frequency in the input signal), cause distortion in the output signal.
voltage
voltage
linear transfer
function
frequency frequency
(a)
unwanted additional
frequencies causing
distortion
voltage
voltage
non-linear
transfer function
frequency frequency
(b)
+ =
You may recall from Block 1, Chapter 6 that the signal-to-noise ratio is
the average audio signal power divided by the average noise power and
is usually expressed in decibels. Whilst it is usual to specify the
largest possible value, in reality the signal-to-noise ratio is very
dependant on the sound levels and the environment in which they are
being heard. For example a noisy amplifier may be adequate to feed a
loudspeaker, but totally unacceptable for use with headphones. The
former would need a much higher sound level thus masking the noise
from the amplifier.
4 DIGITAL AUDIO
4.1 Introduction
You may recall from Chapter 6 of Block 1 that a technique known as
pulse code modulation (PCM) can be used to code analogue audio in
the form of a stream of electrical pulses. The principle used is to
sample the analogue level at regular intervals, assign a binary code to
represent each level and convert the resulting codes to a serial stream
of binary bits. The advantages of using a digital code to convey and
store the audio signals will be examined in this section.
Although invented in 1937, by an Englishman, Alec Reeves, it took 30
years for technology to develop the first digital audio recording using
PCM to be demonstrated by the Japanese Broadcasting Corporation.
The general impression gained at that time was that the sound
reproduced by the digital audio tape recorder could not be matched by
any analogue machine of the day. The bit pattern recovered by playing
back the tape corresponded exactly to the bit pattern originally coded
from the analogue signal and recorded onto tape. The record and
playback systems apparently caused no audible artefacts to be added to
the digital signal – it was described at the time as an ‘ideal’ recording
system.*
Over the next few years evolving electronic and computer technologies
allowed the development of the circuits and storage methods needed to
cope with the complexities of digital audio. In 1982, fifteen years after
the first demonstration of PCM recording, Philips and Sony jointly
introduced the compact disc to the world – consumer digital audio
systems had become possible.
*Baert, L., Theunissen, L., Vergult, G., Maes, J. and Arts, J. (1995) Digital
Audio and Compact Disc Technology, Oxford: Focal Press, Butterworth-
Heinemann, p. 7.
19 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 19
How does the performance of a digital audio system compare with that
of an analogue system in terms of bandwidth, signal-to-noise ratio and
dynamic range? Well, to put it simply, as long as the digital audio system
offers a ‘window’ that is wider than necessary for the analogue signal
in all three of these aspects then no loss of quality will occur when the
digital system conveys the analogue signal. For this to happen the
digital clipping level must be above the largest analogue signal and the
digital noise level has to be below any noise in the analogue signal.
Also the high and low frequency response of the digital channel must
exceed the range of frequencies in the analogue signal. An example of
this is a recording on a CD which was mastered on an analogue tape
recorder. All the artefacts of the master tape are reliably reproduced by
the CD and nothing is added by the digital system.
20 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 20
Recording
The block diagram in Figure 13 shows (a) the coder and (b) the decoder used for recording and
playing back audio signals. The analogue signal input to the coder is amplified and bandwidth limited
by an anti-aliasing filter. This is a low-pass filter designed to stop high frequencies, i.e. those
above half the sampling rate, being passed into the quantiser (the analogue-to-digital converter).
For a typical high-quality analogue audio signal the minimum sampling rate needs to be at least 40
kHz (twice the audio bandwidth of 20 kHz). In practice rates slightly higher than twice the audio
bandwidth are chosen to ensure the filter is able stop unwanted frequencies to prevent aliasing.
(This was introduced in Section 5.3 in Chapter 6 of Block 1.)
second channel
(if stereo)
analogue
anti-aliasing A/D + error parallel
audio amplifier correction
filter converter to serial
input
storage
medium
(a) coder
error analogue
serial to D/A reconstruction
– audio
parallel correction converter filter
output
storage
medium second channel
(if stereo)
(b) decoder
Figure 13 (a) A digital recording system and (b) a digital playback system
The quantiser generates a parallel binary number, often called a code word, which represents the
amplitude value for each sample. You read in Block 1 that the greater the number of quantisation bits
used by the quantiser the better the quality of the reconstructed sound. This is because the
binary code may not be exactly the value of the sample, but the closest approximation to it.
(The greatest difference is half the interval between two quantisation levels.) The difference
between the actual and the quantised value is termed the quantisation error which leads to
quantisation noise, a form of distortion in the reconstructed analogue signal, shown in Figure 14. The
parallel code word output from
the quantiser is combined with original signal quantisation error quantised signal
the code words from the second
channel, if two channels are
being used, and error correction
data and other information to
= +
suit the digital storage medium
is added. Finally the data is
converted into a serial bit
stream using a parallel-to-serial
converter so that it can be Figure 14 Quantisation noise is the sum of the audio signal
stored or transmitted. and the quantisation error
Playback
The digital audio data is recovered as a serial stream of bits from the digital storage medium shown
in Figure 13(b). It is converted back to parallel form and then processing, including error correction,
is carried out before the two channels are separated. The binary code words are then input to a
digital-to-analogue converter. The reconstructed samples are converted at the original sampling
rate. The converter’s output is passed into a low-pass reconstruction filter which forms the final
analogue audio signal. The cut-off frequency of this filter will be slightly less than half the sampling
rate of the D/A converter. You should note here that this is a very generalised description and in
reality some of the operations may be carried out in a different order – in particular, serial-to-parallel
conversion may be performed at an earlier stage. A similar situation would also exist in the coder.
21 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 21
Listen to the audio track associated with this activity. You will hear
two versions of the same recording. I hope you can hear a difference
between the two. Can you suggest what is wrong with the second
version? (Hint: Think about sampling rates.)
Comment
The first version is the recording of a piano using the standard CD
sampling rate of 44.1 kHz. The second version is the same piece but
recorded using a sampling rate of 11.025 kHz but without lowering the
frequency of the anti-aliasing filter. (Note, a quarter of the necessary
sampling rate was chosen to emphasise the problem of aliasing).
The effect of using a reduced sampling rate is to introduce unwanted
‘alias’ frequencies into the audio signal. These arise because there are
too few samples to be able later to reconstruct the original signal from
the digital data. Instead of the high frequencies being reproduced a
false low frequency tone is reconstructed from the samples, which
distorts the sound.
Comment
Using the expression derived in Box 3, for the domestic CD player with
16 quantisation bits the signal-to-noise ratio equals 6 × 16 = 96 dB.
1000 1000
0111 0111
amplitude
amplitude
0110 0110
0101 0101
0100 0100
time time
Figure 15 Quantising a low-level signal that causes only a single bit change
Comment
Start from the expression for the theoretical signal-to-noise ratio as
derived in Box 3:
signal-to-noise ratio = 6n dB
where n is the number of quantisation bits
So, for a given signal-to-noise ratio, n = signal-to-noise ratio/6. In this
case the required signal-to-noise ratio is 144 dB, therefore the number
of quantisation bits required is 144/6 = 24.
As a domestic CD player has 16 quantisation bits 8 additional
quantisation bits are therefore necessary to increase the signal-to-noise
ratio from 96 to 144 dB.
As you can see adding extra bits is necessary but costly in terms of
increased storage capacity. The technology to provide this increase in
storage capacity did not exist at the time when the CD was developed.
The next activity considers storage capacity of a CD when additional
quantisation bits are used.
24 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 24
ACTIVITY 15
(EXPLORATORY) ................................................................
Comment
For each sample the number bits to be stored is increased from 16 to
24. If overheads are ignored then the playing time will be reduced by
the ratio of 16:24 which gives a new playing time of:
70 × (16/24) = 46.7 minutes
Again you can see that using more quantisation levels would not be a
very good idea for CDs due to the reduction in playing time. However,
adding more quantisation bits is exactly what has happened with the
higher capacity storage technologies used for SACD and DVD-A systems.
Another way to ensure that the distortion resulting from quantising a
low level signal is not generated is to add a low level ‘spoiling’ signal.
This is an idea taken from video recording technology where low level
random noise is used to improve picture quality. If a very small
amount of random noise, known as dither noise, is added before the
analogue-to-digital conversion process then the sound is improved.
This may seem quite strange, but although the overall signal-to-noise
ratio is decreased slightly, the effect of the additional noise is to
decorrelate it from the signal with the result that the audible distortion
is effectively reduced. This then allows the digital system to code
signal amplitudes of less than the amplitude of one quantisation
interval and still keep the noise levels low, as shown in Figure 16.
1000 1000
0111 0111
amplitude
amplitude
0110 0110
0101 0101
0100 0100
time time
ACTIVITY 16
(LISTENING) .....................................................................
The audio track for this activity contains two examples of the same
recording. The first has been made without the use of dither noise.
You will be able to hear audible distortion particularly as the notes
fade away. In the second example dither noise has been added
effectively reducing the distortion. In both cases the effects have been
deliberately emphasised to demonstrate how the addition of dither
noise can improve the sound of a digital audio recording. In normal
cases the dither noise is so low that it is inaudible as you will see in
the following activity.
25 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 25
In this activity you will use the course’s sound editing software to
both see and hear the amount of dither noise that could be added to a
digital audio signal on a CD. As the activity uses features of the sound
editor you have not previously used, you will find detailed steps on
how to carry out this activity in the Block 3 Companion.
Comment
In this activity you were able to see that a very low level of dither noise
is added to the signal to avoid any audible distortion when converting
from a higher to a lower number of quantisation levels. This level of
noise is inaudible, but as Activity 16 demonstrated, the audible
distortion is removed, although it is at a very low level.
Comment
Increasing the number of quantisation bits improves the signal-to-
noise ratio so reducing the lower operating level of the audio signal.
As a result of increasing the number of quantisation levels there are
more code-word values available before the maximum code-word value
is reached and clipping occurs. Thus the dynamic range is extended.
5.1 Introduction
From Activity 10 you found that a typical stereo digital audio signal
generates nearly 1.5 megabits of data every second. To store data at this
rate, recording equipment with a bandwidth approaching 2 MHz is
required which is a hundred times greater than the bandwidth of an
analogue audio recorder. Fortunately, in the early days of digital audio,
the newly developed video tape recorder (VTR), which had been
designed for recording the high bandwidth signals that television
pictures require, was able to be adapted to recording digital audio data.
In this section then we will look at the basic technology behind digital
audio recorders that use magnetic tape as the recording medium. This
will highlight some of the disadvantages of using tape, particularly in
the consumer area. Section 6 then will look at disc-based systems such
as the CD and MiniDisc, that overcome some of these disadvantages.
Can you suggest a reason why the low sample rate of 32 kHz is
included in the specification of the DAT recorder?
Comment
Broadcasters use a 15 kHz bandwidth for analogue radio
transmissions. The lower DAT recorder sampling rate exceeds this
bandwidth on replay making it suitable for broadcast use without the
need for re-sampling. Being portable the DAT recorder is ideal for
location recordings, especially interviews.
The use of high quality S-VHS video tape and a faster than normal tape
speed means that 8 channels of digital sound sampling at either 44.1 or
48 kHz using 16, 20, or 24 quantisation bits can be recorded. In
addition the system uses a special encoding system that includes
synchronisation/identification signals that not only aid editing, but
also allow recorders to be cascaded to allow the possibility of 16 or
more tracks.
The ADAT system remains popular and there are a wide range of
recorders, editors and other devices available that are compatible with
the ADAT digital input and output format and synchronisation
signals. Indeed there are now ADAT hard disk recorders that use a
computer hard disk instead of magnetic tape as the storage medium.
30 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 30
Why do you think the multitrack tape recorder was so popular with
recording companies?
Comment
The multitrack facility allows many microphones to be used to record
individual artists and instruments. The final mix-down can then be
created from the best possible tracks in the quiet of the studio and
away from all the tensions of the recording session. Different mixes
may be experimented with which would not be possible with only a
two track recording.
Why do you think tape is not an ideal medium for domestic audio use
compared with disc?
6.1 Introduction
A number of different disc storage media may be used for digital audio
data, some of which were originally developed for use in the computer
industry. Table 6 gives examples of the more popular disc media, not
all of which currently offer recording facilities.
* The playing time is similar to that of CD but offers higher quality sound. Additional
features such as multi-channel surround sound and limited video are also possible
** Whilst not originally designed for audio use, hard disks are becoming very popular
as domestic audio storage devices now that very high capacity drives are available at
relatively low cost. The storage capacity shown is for uncompressed WAV files.
Book Specification
colour
Red CD (Digital Audio compact disc)
Yellow CD ROM (Computer data compact disc)
Orange CD-R and CD-RW (Recordable compact disc)
Green CD-i (Interactive compact disc)
Blue E-CD ( Enhanced compact disc)
White VCD (Video compact disc)
Scarlet SACD (Super Audio Compact Disc)
*This is the Red book specification. CDs now play for over 80 minutes.
34 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 34
The light spot reflected from the land will have a particular intensity
as measured by an optical pickup. When a data pit is encountered the
reflected light will be scattered, so that the intensity measured by the
optical pickup will be less. These two levels of intensity are used to
retrieve the data, with low intensity being logic zero. As you can
probably imagine the optics and electronics are extremely complex
with a servo-mechanism ensuring the light spot is kept focused all the
time the disc is playing.
The data on the disc is not stored in a straight PCM format because the
bit patterns created by this coding method are not the most efficient
way to store the data. Efficient serial data storage systems, such as
those found on CDs, require the data to:
compact disc
pits
optical head tracking servo
motor
focus servo
6.3 MiniDisc
Once CD technology had become accepted by consumers the Sony
Corporation decided to attack the compact cassette market. The development
of recordable CD technologies allowed them to design a portable disc-based
system called the MiniDisc (MD). Sony released the MD system at around
the same time that Philips brought out their abortive digital compact cassette
(DCC), which attempted to provide consumers with the digital equivalent of
a standard analogue compact cassette. The two systems were in direct
competition; however, Sony’s aggressive marketing of their product together
with the advantages of a disc system for instant access and the inclusion of
additional text information caused the MD to win the day. MD uses a 6.4 cm
diameter disc (CDs are 12 cm in diameter) with pre-recorded discs using the
same recording technology as CDs. The recordable disc uses a magneto-
optical method as outlined in Box 13.
A controversial feature of the MD is that in order to use a small diameter
disc whilst still giving the same record and playback times as a CD, audio
data compression had to be used. Called ATRAC (Adaptive TRansform
Acoustic Coding), this coding system allows about five times more audio
data to be stored on the disc than would be possible without its use by
only coding sounds the ear can actually hear. The ideas behind audio data
compression will be discussed in a later section. Although the first version
of ATRAC was perceived as not being up to high fidelity standards,
subsequent versions have showed noticeable improvements, and the
current version (in 2004), is of such a quality that it needs highly trained
ears or special test sounds to distinguish any quality difference between
MD and CD recordings. In 1999 Sony produced a long playing version
(termed MDLP) which uses an ATRAC variant called ATRAC3 which
for a reduced quality allows double the recording time. A monophonic
single channel mode is also available in both standard and LP variants.
The MD system also caters for textual information about the sound data
to be stored. As well as a table of contents, and associated directory
structure, details of the music tracks can also be stored. In addition, MD
players usually contain some simple editing facilities such as moving
and deleting sections of sound, dividing up a track into two or more
separate tracks and combining tracks. A typical specification for a
MiniDisc recorder is given in Box 14 for reference only.
1 1 0 0 0 1 0 1 0 1
disc recordable minidisc
rotation cross section
objective lens analyser optical pickup 1
data decode
Figure 27 Playback from a recordable MD and audio out
showing the polarisation rotation of the conversion
laser beam caused by the magnetism in
the magnetic layer laser optical pickup 2
40 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 40
CD layer
0.6 mm
(entirely reflective)
(1 nm = 1 nanometre = 10 –9 m or
one thousand millionth of a metre)
SACD pick up CD pickup
wavelength 650 nm wavelength 780 nm
focused only on the focused only on the Figure 28 The hybrid SACD with
SACD layer CD layer two data layers
Table 10 Comparison of the SACD, DVD-A and original CD systems (for reference only)
CD SACD DVD-A
Capacity (Gbytes) 0.65 4.7 (single layer), 4.7
9.4 (dual layer)
Disc size (diameter) (cm) 12 12 12
Audio channels 2 up to 6 up to 6
Frequency response (kHz) 0.020 – 20 0 – >100 0 – 96
Sampling frequency (kHz) 44.1 2,833.4* 44.1, 88.2, 48, 96 or 192
(192 not in multi-channel mode)
Theoretical dynamic range (dB) 96 120 144
Recording time (minutes) 74 110 (2 channels, 74 or more
single layer)
Additional features text text, graphics, video text, still images
7.1 Oversampling
Referring throughout this section to Figure 29, constructing an analogue
signal from a digital signal that has been sampled at 44.1 kHz, shown as
fs1, can cause technical difficulties. To ensure none of the unwanted
frequencies which are produced during the reconstruction process reach
the analogue output, complex anti-aliasing or reconstruction low pass
filters are necessary. These unwanted frequencies are generated by
the digital-to-analogue conversion process and occur in the range
(fs1 – fm) to (fs1 + fm) and if not removed can cause harmonic distortion.
To do this a reconstruction filter with an attenuation in the order of
–80 dB at 22.05 kHz (i.e. half the sampling frequency) but still flat
at fm (i.e. 20 kHz) is needed.
Filters with this sharp response are capable of being manufactured but
tend to be very complicated and can themselves actually introduce
43 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 43
wanted audio
frequency band sharp filter
response
0 fm fs – fm fs fs + fm fs – fm fs fs + f m
1 1 1 2 2 2
Comment
1 Dropouts are random faults and of variable size and so also cause
burst errors.
2 Jitter causes random single bit errors rather than burst errors.
3 Interference between adjacent bits causes random errors rather
than burst errors
4 Electronic noise affects groups of bits causing burst errors.
Parity
The simplest and therefore the most common method of error
detection is parity checking. By adding an additional bit called a
parity bit to a code word it is possible to determine if a single bit
random error has occurred. The value of the parity bit (i.e. a zero or a
one) is determined by a rule (known as a protocol) that is applied to
the binary code. Two types of parity exist; even parity in which the
total number of one-bits in the code word (including the parity bit) is
even; odd parity which has an odd total number of one-bits.
Even parity examples: Odd parity examples:
Data = 1001001 Parity bit = 1 Data = 1001001 Parity bit = 0
Data = 1100110 Parity bit = 0 Data = 1100110 Parity bit = 1
Unfortunately with parity checking there is no way of finding which
bit in the code word is in error. Also, if an even number of bits in the
same code word is affected (happily a surprisingly rare occurrence)
then the error will go undetected. Nevertheless parity checking is still
widely used in the computer industry where its simplicity allied to
the knowledge that an error has occurred is sufficient to employ a
correction strategy.
(a) Which of the following data words have errors? You may assume
that even parity has been used and that only single bit errors have
occurred.
1 1000 0111
2 1100 0010
3 1010 1010
4 1111 0001
5 0000 0000
6 1100 1101
7 1101 1111
8 0001 0000
9 1111 1111
Describe what you think would happen if the backward error method
encounters a permanent error in the data layer of a CD.
If you don’t think media errors exist try holding a CD up to a light
source and looking through it. (You’ll have to choose one with a see-
through label.) You will need to use a bright light but DO NOT, under
any circumstances, look directly at the light source. You may be able to
see a number of tiny pin holes in the reflective surface any of which
might cause a permanent error.
47 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 47
Comment
When the backward error correction method encounters an error a
request is made to send the same data again. If the error is transitory
then it will not be there the next time it is sent and so all will be well.
However, sending the same data which includes a permanent error
simply results in the error occurring again. The error correction
mechanism will be in a continuous loop. This makes this method
unsuitable for correcting errors in CDs.
Comment
(a) The data rate will be increased as more data bits are effectively
used to store the digital audio signal.
(b) The data rate prior to adding correction data is 44 100 × 16
= 1 411 200 bit/s per channel or 2.8224 Mbit/s in total.
Assuming the additional data adds an extra third then the new
data rate will be 2.8224 + (2.8224/3) = 3.7632 Mbit/s
error
(b)
Figure 31 Example of data interleaving: (a) the interleaved block of sound data is
subjected to a burst error that affects four consecutive code words; (b) these four
words become random errors when the code words are placed in their correct order
8.1 Introduction
Digital audio transmission involves sending a signal carrying digital
audio data from one location to another. The distance between the two
locations may vary from a few centimetres to many thousands of kilo-
metres and the various ways the data can flow are described in Box 19.
Comment
Analogue transmissions include:
• Fixed telephone line (although it is converted to a digital signal at
the telephone exchange) which uses copper wires.
• AM and FM radio broadcasts using radio waves.
• Analogue TV broadcasts also using radio waves.
Digital transmissions include:
• Mobile telephone using radio waves.
• Computer networks using copper wire.
• Digital audio broadcasting (DAB) using radio waves.
• Terrestrial digital television (DVB) again using radio waves.
• Satellite television also using radio waves.
• Cable digital television using copper coaxial cable.
I suppose I know which is analogue and which is digital either by how
long I have been using the product or because I was sold the product
because it was digital or because I work in the field.
Calculate the bit rate necessary for a stereo radio broadcast using PCM
with 16 bits for each sample per channel. Use a sampling rate of 32 kHz.
53 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 53
Table 14 shows that the digital bit rates necessary to transmit high quality
audio and video signals are very much higher than the equivalent
analogue bandwidths. To take advantage of the benefits offered by digital
transmission it is necessary to reduce the data rates whilst maintaining
the sound quality.
transmission
digital audio in channel digital audio out
coder decoder
n bits/s n
/10 bits/s n bits/s
*Do not confuse these devices with digital recording coders and decoders
mentioned in Section 4.2.1. Indeed compression codecs could well be contained
within digital audio codecs.
54 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 54
Note that because of the compression factor of (in this case) 10, the
channel bandwidth can also be reduced by a factor of 10 without
affecting the digital data. Alternatively, ten times the amount of digital
data can be sent through the channel if its bandwidth is not reduced.
Can you think of one example where it is essential to keep all the
information and an example where some data may be lost without
losing the integrity of the information?
Comment
An example of lossless compression would be a computer program.
Every single bit in a computer program is essential for its operation.
The loss of a single bit of data from the program could cause the
program to fail.
An example of lossy compression could be the storage of the photographs.
Much of the information may be removed from a picture and yet the
subject remains recognisable.
In general lossless systems are not very efficient with digital audio as
the data does not lend itself well to the deterministic characteristics
these systems require. However lossless coding is used for high
quality digital audio transmission and storage in order to provide a bit-
for-bit reconstruction of the original data. For example coding in the
CD digital audio format achieves a lossless compression ratio of
around two and a half times, a much lower value than that achieved by
the lossy systems discussed in the next section.
Without altering the anti-aliasing filter or sound level, what effect does
a reduction in a) the sampling rate, and b) the number of quantisation
levels have on sound quality? (Hint: You may like to listen to the
audio track associated with Activity 12 again.)
56 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 56
psychoacoustic
(a) model
(b)
this effect by 60
masker
removing the masking
parts of a sound threshold
signal that are 40
sound is masked
more than a because it is within
threshold the masking
certain level 20 threshold
in quiet
below a loud
sound, and thus
would not be 0 inaudible signal
heard, so redu-
cing the amount 0.02 0.05 0.1 0.2 0.5 1 2 5 10 20
of information frequency (kHz)
that the sound Figure 35 The masking effect of a loud tone on
signal contains. a near frequency softer tone
You will notice from Figure 34(a) that the coder appears more
complicated than the decoder in Figure 34(b). This is an important
feature introduced by the MPEG group. Can you suggest why this
should be the case?
Comment
Every listener to compressed audio data will require a decoder whilst
coders will only be needed by those involved in the production of the
compressed audio material, which will be fewer in number. Thus by
making the decoder comparatively simple the consumer saves money.
Many software decoders are either free or merely carry a license fee.
psychoacoustic
algorithm
layer 3 which has become the most used and is colloquially known as
MP3, explained in Box 23. Table 16 summarises the three MPEG-2
audio layers. Layer 2 sets the standard for audio compression and is based
on the work of two independent groups who developed audio
compression codes for digital audio broadcasting (MUSICAM*) and
telecommunications transmission (ASPEC**).
Each layer currently allows input sampling rates of 32, 44.1 and 48 kHz
and can support output bit rates of 32, 48, 56, 64, 96, 112, 128, 192, 256
and 384 kbit/s. The audio can be mono, dual channel (e.g. two different
languages) or stereo.
*MUSICAM, Masking pattern adapted Universal Sub-band Integrated Coding And Multiplexing.
**ASPEC, Adaptive Spectral Perceptual Entropy Coding.
59 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 59
Ogg Vorbis
Ogg Vorbis is an open source non-proprietary compression format that
offers similar functionality to MP3. Developed by the Open Source
Foundation it is becoming widely supported by common media
players. If aggressive payment methods are ever sought by the MP3
patent holders current users may well turn to alternative, royalty-free
formats such as this.
local radio and television broadcasters try to cater for the needs of the
many so mostly play material that attracts the highest possible
audience. Commercial broadcasters must do this to ensure sufficient
advertising revenue. Minority audiences are often poorly catered for
even from the public broadcasters. The Internet has none of the
constraints of the broadcasting companies and for very little cost any
form of music may be made available and we can use powerful search
engines to seek out and listen to the music we want.
However there is a difficulty with providing music over the Internet.
For many people accessing the Internet from home gives an incoming
bandwidth of no better than 45 kbit/s or about 2.5 Mbits per minute.
As you have already found out, PCM digital audio needs about 80
Mbits per minute which is over 30 times faster than the home data rate
so real-time listening is impossible. Even downloading music can be
time consuming as demonstrated in the next activity.
(a) Calculate the time required to copy a 3 minute song from the
Internet onto a computer if the download speed is 45 kbit/s.
Assume the song is stored in the standard CD format (2 channels,
44.1 kHz sampling rate, 16-bits per sample).
(b) Can you suggest ways to reduce the download time?
Comment
(a) One second of standard CD-format sound will generate 2 × 44 100 ×
16 = 1 411 200 bits. The song is 3 minutes long so the total size of
the data to download is 3 × 60 × 1 411 200 = 254 016 000 bits.
The download speed is 45 kbit/s or 45 × 1024 bit/s = 46 080 bit/s.
So the time to download the 3 minute song will be 254 016 000/
46 080 = 5512.5 seconds or 92 minutes which is over 1.5 hours –
just for a 3 minute song!
(b) The download time can be reduced in two ways. Firstly the data
rate could be increased by using a broadband connection.
Secondly the amount of data could be reduced, using data
compression, before downloading.
Why do you think the music industry is concerned about Internet radio?
Comment
They are concerned that record sales will be effected. Thus they not
only prosecute illegal sites under the Act but also have an input into
what music can be played and the frequency of playing.
streaming
streaming
audio server
audio
player
software
internet buffer
audio file
When an Internet broadcast is selected the whole audio data file is not
downloaded and stored on the computer before the sound is output.
Instead a measurement of the line speed is taken and a temporary local
store, called a buffer, is opened. Audio data is streamed into the buffer
and when sufficiently full the audio output is started. As long as the
download data rate is maintained the buffer will not empty and the
audio is heard as a continuous stream. If the data stream is interrupted
for too long the buffer will empty and the audio will be muted until
the buffer is filled to a suitable level. Sound quality is variable and is
often likened to that of AM radio as you will see in Section 8.5.
Comment
A reliable and preferably fast link to the Internet is the most important
requirement for successful reception. The original specification for
Internet radio needed an ISDN 128 kbit/s link. Improvements in
compression codecs have meant that lower bit-rates can provide
satisfactory reception but popular sites get very busy slowing
download rates.
You may like to use your Internet connection to try Internet radio.
Details on how to do this are given in the Block 3 Companion.
Web page
World Service 16 kbs Radio 1
64 kbs 128 kbs
Asian Network
64 kbs
Radio 2
BBC 7
128 kbs
80 kbs
BBC 6 Music
128 kbs
Radio 3
192 kbs
1 Xtra - BBC
128 kbs
Radio 4
Radio 5 SportX 80 kbs
64 kbs Radio 5 Live (128 kbs when
Figure 41 The BBC’s 80 kbs Radio 5 SportX is
multiplex allocation in January 2004 not being broadcast)
65 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 65
Comment
As with any new service which has needed an investment from both
the suppliers and the users, digital audio broadcasting must appeal to
as wide a range of the population as possible in order for it to be a
success. The necessity for the public to purchase new equipment
means they must get value for their investment. Additional services,
rather than absolute audio quality, appears to be the route taken by the
broadcasters to attract users. It should be borne in mind that one of the
goals of the Eureka 147 consortium was to improve mobile and
portable reception. In both these cases absolute audio quality is not
paramount as the listening conditions will be far from ideal.
List the main reasons for the Internet becoming a major source of
music.
SUMMARY OF CHAPTER 4
Stationary head digital tape recorders levels. They also offer multi-channel
eventually became available when sound formats as well as stereo and, in the
technology allowed suitable heads and case of the DVD, video sequences are
tapes to be made. These machines provide possible. Copy protection technology
multitrack recording and tape editing inhibits illegal copying. Hybrid SACDs can
facilities similar to earlier analogue be played on standard CD players but
machines. (Section 5.3) without the advantages of the SACD
technology. Universal players have been
Disc storage is preferred by consumers as developed to play all digital audio disc
the random access nature of discs more standards in use. (Section 6.4)
closely matches the way the they listen to
music. (Section 5.4) Oversampling increases the sampling
frequency by two or more times in order to
allow the use of simple anti-aliasing filters
Digital audio compact discs replaced
when converting the digital audio data
analogue discs because they could support
back to its analogue form. This is usually
an improved frequency response, dynamic
achieved by interpolating sample level
range and signal-to-noise ratio. They also
points between the original 44.1 kHz
offered an increased playing time. CDs use
sample points. (Section 7.1)
a spiral track of lands and pits on a plastic
disc coated with a reflective layer to store
Single bit conversion samples the data 64
the digital data. The data is read using a
or 128 times the normal sampling
laser beam and an optical pickup that
frequency. At these very high rates the
measures the reflected laser light from the
difference between samples will only be
pits and lands on the CD. This means that
one quantisation level at most. This allows
playback involves no physical contact
the recovery of the audio signal just by
with the medium with no resultant wear
simple filtering. (Section 7.2)
through repeated playing. The discs are
not prone to sound degradation due to
physical faults, dust and mishandling. CD The ability to detect and correct or conceal
audio data uses 44.1 kHz sampling media faults caused by drop-outs, jitter,
frequency with 16 quantisation levels. At interference and noise ensures the sound
least 74 minutes of stereo sound can be is not disturbed when an error occurs. If
stored. A CD can store 650 Mbytes of data. the error cannot be corrected then it can be
(Section 6.2) concealed. If errors are not corrected or
concealed audio disturbance occurs, which
may or may not be audible. (Section 7.3)
The MiniDisc was designed to replace the
analogue compact cassette system. It was Parity is the simplest way of detecting a
the first disc-based digital audio system bit in error. It cannot detect an even number
use magneto-optical recording technology. of bits in error. Cyclic redundancy check
The recordable disc has optical and code will detect multiple bit errors.
magnetic layers. A laser beam heats a spot (Section 7.3.1)
on the magnetic layer which can be
magnetically polarised to store the value Forward error correction, which adds
of the bit as a tiny magnet. Old data is additional bits to the digital audio data, is
simply overwritten. On playback the used to correct the bit errors on digital
magnetic polarisation affects the reflection audio discs. Additional backward error
of a (less intense) laser beam allowing the correction can be used to protect against
value of the bit to be read. To give MDs shock errors in jog-proof systems.
the same playing time as CDs (74 minutes) Combinational parity is an example of
data compression (called ATRAC) is used. forward error correction. (Section 7.3.2)
The sampling frequency and quantisation
le v e l s a r e t h e s a m e a s f o r t h e C D . To ensure that consecutive blocks of data
(Section 6.3) are not destroyed by a media fault which
might not be capable of being corrected, the
Super Audio Compact Disc (SACD) and data blocks are interleaved in such a way
Audio DVD (DVD-A) systems use 12 cm that, whilst several blocks may be affected,
high density discs (4.7 Gbytes) to offer the errors are able to be corrected because
improved audio quality by increasing the they become random errors rather than
sampling frequency and quantisation burst errors. (Section 7.3.3)
69 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 69
Errors that are too gross to be corrected may different to the original. Lossy methods are
be concealed. This can be achieved by much more efficient at compressing audio
muting, using the previous data word or data. Examples of these include MP3 and
most commomnly by interpolating the data Ogg Vorbis. (Section 8.3.4 )
word value from the values of the
surrounding data words. (Section 7.3.4) Systems employing lossy data compression
should not be used for mastering recordings
Copies of audio stored as digital data are as audible degradation may occur when
perfect. The industry has developed ways mixed and re-recorded with other sources.
of protecting itself from the manufacture (Section 8.3.5)
of multiple copies. Serial copy
management system (SCMS) allows users The Internet is an ever expanding source
to make a single digital copy whilst of music. Streaming audio, whilst offering
preventing multiple copies of copyright lower quality audio due to the need for very
material being made. (Section 7.4) high data compression, allows a wide range
of musical genres to be heard by consumers
The path over which a signal is transmitted at very low production costs. (Section 8.4)
is usually referred to as a channel. The
channel bandwidth puts a limit on the bit
Broadcasting digital audio has the
rate of the digital signal and therefore the
potential for offering CD quality sound from
quality of the analogue signal. (Section 8.2)
radio receivers. In reality Digital Audio
Broadcasting in the UK currently offers
Large volumes of digital audio require high more stations with less bandwidth than are
data rates for transmission and large available in the existing AM or FM bands
storage capacities. Both of these problems but this places a restriction on the bit-
are alleviated by using data compression rates available and therefore limits the
techniques. (Section 8.3.1) audio quality of the stations. Programme
information and text services are
Lossless methods allow bit-for-bit retrieval available. DAB technology overcomes
of the digital data but do not have the high many problems associated with analogue
compression ratios of lossy systems. AM and FM systems but is incompatible
Entropy codes and run length coding are needing the listener to purchase new
examples of lossless compression. Lossless receivers. (Sections 8.5.1 and 8.5.2)
systems are not very efficient at
compressing audio data due to the random Digital Radio Mondile exploits the wide
nature of sound. (Sections 8.3.2 and 8.3.3) coverage of the medium- and short-
wavebands to broadcast highly compressed
Lossy methods rely on psychoacoustic low bit-rate digital audio signals without
algorithms to identify and remove sounds the problems of interference associated with
that should not be heard by the listener. analogue AM radio broadcasts. Again new
This means the decompressed version is receivers will be required. (Section 8.5.3)
70 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 70
Activity 2
Equalisation enables the engineer to control the relative levels of various
frequencies in the audio bandwidth. It can vary the harmonic content or
timbre of the final recorded sound.
Normalisation ensures that the sound makes best use of the available
dynamic range.
In the case of the additional track the nature of the sound of the other
tracks would be analysed before a decision was made as to whether
changes were necessary. Particularly the track preceding the new track
must be checked for level and sound as it would be unfortunate if the
listener had to make adjustments to the replay equipment just for this
additional track.
Activity 4
Music companies control both the artistic and the technical elements of
the recording. In the planning stage (1) a production team led by a
producer will make proposals on the music, artists, engineers, venue,
recording dates, etc. based around agreed budgets. The performance
generates a session recording (2) which is subsequently edited in the
mix-down stage (3) to make the recording master. This is used in the
post production stage (4) to generate an appropriately EQ’d master for
the chosen delivery medium. This master version is used in the
manufacturing process (5) to make the final product which is then
marketed and sold through distributors (6) to the record stores.
Activity 6
The effect of using a microphone with insufficient bandwidth to record
the organ would be to alter the timbre of the sound. The relationship
between the fundamental frequency and harmonics would be changed
giving a different sound to the original. It is vital to ensure any equipment
used for recording and playback has sufficient bandwidth to capture the
full range of frequencies at their correct levels.
Activity 8
The ratio of the average output signal power to noise power is:
–6 7
100/10 × 10 = 100/0.000 001 = 10 000 000 = 10 = 70 dB
Since, in terms of power, a ratio of 10 = 10 dB, 100 = 20 dB, 1000 = 30 dB etc.
Activity 9
In order to avoid the generation of any additional harmonics which would
create distortion of the output signal, the transfer function should be linear
as illustrated in Figure 11(a). A non-linear transfer function, illustrated in
Figure 11(b), adds unwanted frequencies to the original signal.
Activity 10
With a sampling frequency of 44.1 kHz, 16 bits of digital audio data will
be stored 44 100 times a second for each channel.
71 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 71
Thus for 1 second of audio, 16 × 44 100 bit/s = 705 600 bits per
channel are needed or 2 × 705 600 = 1 411 200 bits for the pair of
stereo channels.
Activity 11
The analogue signal is passed through an anti-aliasing filter to limit
the bandwidth to half the sample frequency. An analogue-to-digital
converter converts each sampled level of the analogue signal to a
binary code word using the number of binary bits determined by the
required number of quantisation levels. The second channel is added
and then each code word has the error correcting code added and is
converted to a serial form before being sent to the storage medium.
Activity 19
The conversion process adds a quantisation error to the analogue
signal which is termed quantisation noise. This error is the difference
between the actual analogue level at the time the signal is sampled and
the voltage level the nearest code word represents. This difference can
be up to half the minimum quantisation interval.
Quantisation noise can be reduced by increasing the number of bits in
the code word used to represent the level, thus reducing the minimum
quantisation interval.
Unlike the random nature of the noise experienced in analogue
systems quantisation noise is correlated to the signal, in other words it
forms a distinct pattern that is related to the signal. The ear is
sensitive to correlated noise and so another way of reducing the effect
of quantisation error is to add a small amount of noise, called dither
noise, to the original signal. This decorrelates the quantisation noise
from the digital signal making it less audible.
Activity 22
The random selection of tracks involves searching the tape by spooling
from one end to the other and then rewinding once the track is played.
This is time consuming, wears the tape, and is difficult to automate.
Discs enable fast access to tracks involving little wear on the medium
and the system is easier to automate.
Activity 23
MiniDiscs use a data compression method called ATRAC to compress
the audio data by about five times. This allows the physically smaller
MiniDisc to playback for the same time as a CD.
Activity 25
(a) Parity errors occur in 2, 4, 6, 7 and 8. There are an odd number of
1s in each.
(b) An even number of bits in error will disguise the fault as the total
number of 1s will still be even. For example changing any two bits
in the code word 1110 1000 will still keep the total number of 1s
even, e.g. 1101 1000. The two bits in error are highlighted.
72 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 72
Activity 28
Step 1 is to show the data and parity bits (in blue) of the (20,12) code
format:
1010 0
1001 1
0010 1
0011 0
Totalling the ones in the bottom parity check row (row 4) shows the
parity to be correct, (as there are an even number of ones). Performing
the same action on the other three rows shows that row 2 has a error in
it as there are an odd number of ones (there should be an even
number). Similarly totalling the ones in each column shows that the
parity check column is correct, but it also shows that column 3 has an
error (an odd number of ones again), so the bit in error is at the
intersection of row 2 and column 3, highlighted below:
1010 0
1001 1
0010 1
0011 0
Once this bit is changed all is well and the corrected code is:
10100 10111 00101 00110
Activity 29
Error concealment is used when the a data error is detected but cannot
be corrected or because it is too big. Rather than ignore the error which
would cause audio distortion the error is hidden by using
interpolation where the value of the data error is replaced by using the
data either side of the error, as shown in Figure 32(c).
Activity 30
A random error would be a single bit error which would use the
forward error correction data to correct the bit in error. There would be
no disturbance to the sound which would appear as a perfect
recording.
Activity 32
With a sampling rate of 32 kHz and a 2 channel stereo signal where
each channel is quantised using 16 bits, the bit rate will be
32 000 × 2 × 16 = 1 024 000 bits/s ≈ 1 Mbit/s
Activity 33
For a bit rate of 56 kbit/s and using 16 bit quantisation the maximum
available bandwidth would be 56 000/16 = 3.5 kHz for mono sound (or
1.75 kHz for stereo sound). Hardly suitable for high quality reproduction
but then neither is the existing analogue telephone network!
73 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 4 MUSIC DISTRIBUTION 73
Activity 35
(a) The necessary sampling rate is determined by the bandwidth of the
audio signal, so if the sampling rate is lowered the high frequency
components of the signal will not be reproduced and aliasing will
occur giving a very unpleasant and distorted sound.
(b) Reducing the number of quantisation levels reduces the dynamic
range (by 6 dB for every bit reduction in the numbers of bits used to
code the samples) and increases distortion by adding quantisation
noise.
Activity 37
The main difference between lossless and lossy compression is that in
the former no data is lost in the compression process. Lossy
compression always loses data, but the idea is that the brain has no
perception that any of the original information is missing when the
data is reconstructed.
Activity 44
The reasons for the rise in use of the Internet for music distribution
are as follows:
LEARNING OUTCOMES
After studying this chapter you should be able to:
Acknowledgements
Grateful acknowledgement is made to the following sources for
Figures 18 & 21: Sony Service Centre (Europe) NV, 1988, 1992, 1995.
Thanks also to Bill Strang for playing the piano on the recording for
Activities 7, 12 and 16 (and not minding what the Course Team did
with it).
Every effort has been made to trace all the copyright owners, but if any
Chapter 5
CONTENTS
Aims of Chapter 5 76
1 Introduction 77
2.1 Introduction 78
AIMS OF CHAPTER 5
1 INTRODUCTION
LP
45 compact cassette
CD
magnetic tape
78
This chapter opens with a brief history of the recording industry from
its beginnings at the end of the nineteenth century. Step changes in
technology will be highlighted in a story which often is as much about the
people who built the industry and the recordings they made as to the
technologies that were developed and used. During this discussion you
will be asked to appraise the different audio technologies mentioned by
comparing their frequency responses, signal-to-noise ratios and dynamic
ranges. You will also consider convenience of use of the various media
types shown in Figure 1. This will allow you to develop an insight into
why some technologies have succeeded and others fallen by the wayside.
You also will discover that absolute audio fidelity is not always
uppermost in the minds of either designers or the consumers.
78 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 78
Changes in technology not only affect the recording media and equipment
but also the way in which we listen to the music. A case study of a
personal jukebox will be used to illustrate this. This device offers the
listener immediate access to hundreds of hours of recordings as opposed
to between five and sixty minutes offered by the formats mentioned
above. There is also a brief look at surround sound technologies.
The ability to record sound and in particular digital recording
technologies have had enormous impact upon the legal aspects of the
music industry. Since the invention of the tape recorder consumers
have been able to make copies of original recordings for themselves
and others, but the relatively high cost of the equipment combined
with inferior reproduction did not overly worry the music industry’s
executives and lawyers. However, as digital copies can be near perfect
facsimiles of the original digital master recording and made with
relatively low copying equipment costs, the music industry has had to
develop ways to prevent us from making too many copies. The last
section of this chapter considers some of the legal aspects of relevance
to the music industry and introduces some of the initiatives that are
being considered and implemented to control our access to the
recording. The fluid nature of these initiatives means that, to ensure
currency, this discussion will be led through a selection of articles.
Comment
Well do you think your voice sounded like you? Probably not, because
as you may recall from Section 5.4 in Chapter 7 of Block 2 that due to
the directivity of the voice, you do not hear it as others do and so it
sounds unrealistic to you in a sound recording. However, because you
know that the technology you are using is capable of accurately
reproducing sounds I hope you have confidence that what you are
hearing accurately represents how you sound. I
79 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 79
The first sound recording of a human voice, actually reciting the nursery
rhyme Mary had a little lamb, was made over 125 years ago. Just imagine
back then the reaction of people the first time they heard the sound of a
human voice coming from a machine – especially if it was theirs!
“The phonograph [recording machine] was remarkable partly
because it did not look human – it spoke just like a person, but it
looked like a machine, a simple cylinder of tinfoil.”*
So great was this invention and so insatiable was (and still is) our need to
hear recorded sounds – especially music – that within 25 years sound
recording had become a global industry.
Before sound recording was possible few people had the opportunity
to hear music in the way we take for granted today. Apart from
expensive musical boxes and the mechanical music players that were
introduced in Chapter 3 of this block, the only way music could be
heard was in live performances. Take a moment to think how your life
would be without being able to listen to music from CDs, records,
tapes, radio, television or even the Web. How often would you listen to
music if you could only hear it by attending live performances or
making it yourself? The following activity asks you to think about
listening to music before sound recording was invented.
Comment
I thought of the following:
• places of religious worship (singing hymns, listening to the organ, etc.);
• at school (nursery rhymes, group songs and dance);
• in the home (barrel organ, musical box or player piano);
• live concerts (listening to the band in a local park, going to the
music hall, a classical concert or a musical theatre performance);
• dancing (to music from local bands).
A common thread that occurs to me is that on many occasions music
was created by people (amateurs rather than professionals) meeting
together – at church, school or the local public house for example.
Most of the music was live, with just the possibility of hearing a
mechanical instrument such as a barrel organ. I
As you will discover the technologies used within the record industry
have not always been capable of delivering sounds from systems with
ideal characteristics for recording and playback. In fact user convenience
can be as important a consideration as the fidelity of the reproduced
sound.
81 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 81
As is often the case with truly great inventions Edison was not the
only inventor working independently on recording sounds. In April
1877 a sealed letter was deposited at the Académie des Sciences in
Paris by an impoverished French poet and amateur scientist, Charles
Cros. The contents described an apparatus that:
“consists in obtaining traces of the movements to and fro of a
vibrating membrane and in using this tracing to reproduce the same
vibrations, with their intrinsic relations of duration and intensity,
either by means of the same membrane or some other one equally
adapted to produce the sounds which result from this series of
movements.”**
Unfortunately Cros could not afford to patent his idea and it was Edison
who, in the late autumn of 1877, filed for a US patent on his phonograph.
Differences existed between the two inventions as, for example, in Cros
proposing a glass disc whilst Edison actually used a tin-foil cylinder.
The sound quality in the clip of Edison speaking you have just
listened to is not very good in comparison to what we have come to
expect today. This is because the system used an acoustic recording
method, described in Box 1 ‘Sounds on cylinders’.
Run the computer animation associated with this activity. This animation
demonstrates Edison’s mechanical recording and playback process.
It is based on his original design for the phonograph which was
patented in 1878. I
What does the fact that a person had to shout into the horn of the
recording machine, as described in Box 1, tell you about the sensitivity
of Edison’s apparatus?
Comment
The fact a person had to shout indicates that the recording machine
was very insensitive. This was due to the mechanical stiffness (inertia)
of the mechanism that cut the groove into the recording medium, which in
this case was tin-foil. This had a direct effect on the frequency response
and dynamic range of mechanical recording machines. I
83 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 83
All of the above ideas have been developed in one way or another.
Which of the above do you consider have benefited the most from the
advances in digital audio technologies described in Chapter 4 of this
block?
Comment
Although all the ideas have benefited in one way or another, the
advances have been in different ways. Not all these ideas require the
wide frequency response, high dynamic range and low signal-to noise
characteristics demanded from high quality audio systems for music
reproduction. The convenience of digital voice recorders for dictation
and talking books and the ability to use multimedia in general and
particularly language education have used the advantages of digital
audio compression techniques rather than striven for ultimate sound
84 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 84
(b)
(a)
Figure 4
(a) A miniature phonograph alongside a toy doll in which it was used; (b) the digital
audio circuit board and loudspeaker from a modern talking greetings card
Given the fact that wax was easier to cut than tin-foil why do you think
wax cylinders offered an improved sound quality?
Comment
The sound quality was improved because there was less stiffness
(inertia) in the recording system. This gave greater sensitivity as well
as an improved frequency response and a lowered signal-to-noise ratio.
These improvements were only marginal compared with today’s
technologies but nevertheless were a distinct advance. I
However, the expected law suits did not arise. Firstly, the wording of the
two patents was different. Edison described his recording technique as
‘embossing or indenting’ the medium, whereas the Bell–Tainter patent
portrayed their way of recording as ‘engraving’, implying a different
approach. Secondly, a businessman named Jesse H Lippincott, who
was looking to invest cash in a new venture, offered to invest in
both inventions thus securing sole rights to sell recording machines
through his ‘North American Phonograph Company’.
The whole enterprise nearly failed for, just like Edison and Bell,
Lippincott saw the future of the recording machine for dictation in
businesses such as government bureaux and offices. Actually
shorthand-typists did not appreciate this use of the machine, seeing
it as a threat to their jobs. They even went as far as sabotaging the
machines, making them useless for dictation. After two years of
unsuccessful trading Lippincott, now in ill-health and with poor
finances, lost control of his company to Edison who still only saw
business potentials for his invention.
“He could not or would not countenance the potentialities of the
phonograph as a medium for entertainment.”*
In this activity you will assess the ‘quality’ of a hard wax cylinder
recording made in 1902 by looking at its bandwidth, dynamic range
and signal-to-noise ratio. This is the first of a number of computer
activities where you will use the course’s sound editing software to do
this. As each of the activities follows the same procedure and involves
features of the sound editor you have not used before, you will find a
single set of steps on how to carry them out in the Block 3 Companion.
You should therefore run the course’s sound editing software, open the
computer sound file associated with this activity and follow the steps in
the Block 3 Companion on how to determine the quality of a recording.
Comment
I obtained the following characteristics for this recording:
Compared with the values of a modern sound system that you should
have suggested in Activity 3 these values are poor. Do bear in mind
however that the users had no other systems by which to make
comparisons. Also, as a new technology, the phonograph was still one
of the miracles of the age. I
88 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 88
own homes.
(a) From reading Box 2, what restrictions do you consider have been
placed on the ultimate sound quality of cylinder and disc
recordings in terms of bandwidth and dynamic range?
(b) Why were these restrictions employed?
*In the USA, phonograph is the generic name given to all record playing
equipment. In the United Kingdom gramophone is more generally used,
although phonograph may be used when referring to cylinder players.
89 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 89
Comment
(a) The linear speed controls the upper bandwidth frequency. If the
cylinder or record would have revolved faster, the linear speed
would have increased and with it the upper frequency response.
In the case of discs the track spacing is dependent on the dynamic
range as louder sounds require a greater deviation of the groove
meaning an increased track spacing.
(b) In both cases the recording-time (capacity) of the recording
medium would be decreased so shortening the playing-time. A
trade-off between ultimate quality and sound quality had to be
made. I
Listen to the two audio tracks associated with this activity. The first track
contains an original recording by Emile Berliner and this is followed in
the second track by a repeat of Edison’s recording from Activity 4.
How do you think Berliner’s recording compares with that of Edison? Can
you think why both men should have chosen to recite nursery rhymes?
Comment
The recording by Berliner was taken from an original 5 in (13 cm)
diameter vulcanite disc made in 1889. I think you will agree that the
reproduction is poor compared with the recording of Edison. However
remember that Edison’s recording was made in 1927 as a demonstration
of his original invention. I wonder why both Edison and Berliner recited
nursery rhymes? Perhaps neither could think of anything more propitious
to say at the time, but bearing in mind the poor quality of the sound, it
may have been that using well-known rhymes would make the recording
more comprehensible. Maybe it was an early marketing ploy. I
(a) Which cutting method, vertical or lateral, would give the recording
engineer the opportunity to record the louder sound?
(b) Why was it advantageous to record sounds at the highest possible
level in these early recordings? I
Figure 12
An example of
Berliner’s vulcanite
disc
*Shellac is a resin derived from the secretion of the Lac beetle (Coccus lacca)
found in Malaysia.
92 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 92
Comment
As you will recall from Chapter 4 of this block, every time an analogue
recording is copied there is additional noise added to the original sound
lowering the signal-to-noise ratio. Using the original matrix will ensure
the best possible sound. Unfortunately access to the original matrix is
not always possible and may reissues of original 78 recordings use
normal mass-produced discs. The topic of remastering is covered later
in this chapter. I
Comment
I obtained the following characteristics for this recording:
The reason why the records produced from the Caruso recordings were
so popular was that he was singing popular music of the time and that
the quality of his voice suited the technology. You may recall from
Chapter 7 in Block 2 that a trained tenor uses a singer’s formant to
emphasise voice partials between 1 and 3 kHz which centres on the
frequency response of a mechanical gramophone – as you found out in
Activity 14.
Edison had little to offer in way of competition to Berliner’s ever
growing catalogue. He failed to make inroads into Europe and hence
record the popular artists of the time, who tended to live and perform
in that part of the world. Although the United States saw the origins of
the talking machine, Europe really transformed it into a musical
instrument by the choice of music and performers. Finally, in 1913
Edison introduced a disc, shown together with some of his cylinders
in Figure 15. Typically, it had the same vertical cut groove method
used on his cylinders, which made it unusually thick (6.5 mm) and of
course it needed a special Edison disc player. Despite offering a
superior sound Edison’s disc was too late – Berliner’s gramophone
records were too well established by virtue of the material they offered.
The Edison Company continued to supply cylinders and discs until
1929 when manufacture finally ceased.
Suggest some reasons why the acoustic recording process limited the
types of instruments and voices used? I
groove wall
magnetic material
audio
signal
stylus
(b) groove
Comment
The sound would be very tinny, thin and lacking in bass notes. This is
because when the disc is replayed by an electro-magnetic pickup the
voltage output at low frequencies is reduced in level but at high
frequencies is boosted. So to play any record using an electro-magnetic
pickup it is first necessary to set the replay amplifier characteristics to
match the RIAA equalisation. I
The ideal characteristic for such an RIAA replay amplifier is shown in Figure 17. The gain for frequencies
above the low corner frequency is reduced until the middle corner frequency is reached. The gain then
remains level until the high corner frequency is reached when the gain is further reduced. Since it is
not possible in practice to produce a response that has straight lines and sharp corners, Figure 17
also shows
the actual
low corner frequency response of
a practical
18 dB ideal response
17 dB
amplifier
alongside
middle corner frequency this ideal
high corner frequency
response.
This filtering
0 dB
s h o u l d
actual therefore
response restore or
–13.6 dB
equalise
the levels of
–18 dB
the audio
20 50 500 1k 2.12k 10k 20k
frequencies
f/Hz
to those of
the original
Figure 17 Response of an RIAA replay amplifier
performance.
Prior to the RIAA agreement manufacturers specified their own equalisation, examples of which
are included, along with the RIAA characteristics, in Table 1. The use of the RIAA characteristic
also produces a useful improvement in signal-to-noise ratio. By attenuating the higher frequencies
‘surface noise’ in the pickup output caused by dust and groove wear is reduced. However boosting
the lower frequencies means mechanical noise from the turntable and external electrical noise
(e.g. mains hum) can be increased.
Table 1 Equalisation characteristics required for 78 rpm and LP disc media
* These figures are relative to the amplifier gain at a frequency of 1 kHz. **These figures are not specified.
98 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 98
Comment
I obtained the following characteristics for this exceptional recording:
Characteristic Estimated value
Frequency response 50 Hz – 9 kHz although there is evidence
of harmonics above this frequency
Signal-to-noise ratio 50 dB
Dynamic range 50 dB
Can you suggest how the record buying public responded to these two
new ‘standards’?
Comment
The immediate response from the record-buying public was to stop
purchasing records until the outcome of the battle was decided! I
Fortunately for all the record companies a truce was declared by 1950,
with the 78 rpm disc the loser. The LP was adopted for classical
recordings and the 45 for popular music. In Europe the change took a
little longer, but by the end of 1952 LPs were available from European
manufacturers.
100 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 100
The LP is not quite the end of the story of the gramophone record. As
far back as 1931 the British engineer Alan Blumlein designed and
patented a stereophonic (from the Greek meaning solid sound)
recording system which used two sound channels to create a virtual
sound ‘stage’ where an individual sound source (instrument, voice,
etc.) could be located at any point between two loudspeakers placed at
the front left and front right of the listener. The location of the source
is determined by the relative intensity in the two channels. The patent
covered two possible ways of cutting the groove in the record to allow
two separate channels to be recorded. The V-L (vertical-lateral) method
combined hill-and-dale and lateral cutting systems whereas the 45/45
technique was similar except the cutter was tilted at an angle of 45° to
the surface of the disc putting the stereo signal into the groove walls,
as illustrated in Figure 19.
90° 90°
V-L 45/45
Figure 19 The V-L and 45/45 recording techniques compared
In 1958 the 45/45 standard was adopted by the industry, having been
patented in the United States by Westrex/Bell as early as 1937.
Can you think of a problem that existing record users that might find
with the introduction of stereo discs?
Comment
Non-compatibility with existing monophonic (single channel) systems
meant the need to produce both mono and stereo discs. Retailers
would have to stock both versions of the record. A stereo disc could be
damaged if played on a mono pickup which was not designed to be
compatible with stereo discs. This is because the stereo pickup needs
to move in both the horizontal and vertical directions to cope with the
45/45 movement whereas the monophonic pickup only moved
horizontally and this could potentially cause damage to the stereo
groove wall. Also not all the music information could be recovered by
having a monophonic pickup. ‘Stereo compatible’ monophonic
pickups were eventually manufactured to overcome this problem
allowing the production of monophonic discs to be phased out. I
This brings to a close the story of the record (cylinder and disc), the
main source of recorded sound for nearly a hundred years. Apart from
refining manufacturing techniques, little change to the technology took
place. There is still a demand for vinyl discs from audiophiles, who
believe the analogue sound cannot be surpassed. But it is DJs, who
have made ‘turntablism’ an art form in its own right by creating new
music by ‘scratching’ tracks from vinyl discs, that are keeping disc
record playing alive. This demonstrates a use of the phonograph not
even imagined by Edison!
101 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 101
Your answers to the first two parts of the above activity should have
described an effect known as wow. Wow is a low frequency pitch
variation which, in discs, is caused not only by the spindle hole in the
disc not being exactly centred, or by the disc being warped, but can also be
caused by slow variations in the disc motor speed. There is a second
related effect called flutter which is a higher frequency pitch variation
caused mainly by faster variations in the speed of the disc motor.
2.3.1 Introduction
Sounds, pictures, measurement data, financial statistics, personal details,
etc., can all be recorded and stored on magnetic media, i.e. materials
that are able to be magnetised to store information for future retrieval.
Construct a table of all the different types of magnetic media you think
you may have used and what you kept on each type of media. Were
you able to put any of the media to more than just one use, i.e. store
different sorts of things on that media? Do not worry at this time if you
are uncertain as to what I mean by magnetic media.
Comment
The media types I thought of are shown in Table 2. I
* There are two forms of the spelling. My convention will be that discs originally
designed for music storage will be spelt with a ‘c’, whereas disks used for computer
storage will be spelt with a ‘k’.
102 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 102
Did you notice from my list in the above activity that most magnetic
media can have more than one use? I doubt that the designers of the
compact cassette ever imagined it being used to store computer data
(as was the case in the 1980s when cassettes were used with home
computers). Magnetic media is incredibly versatile for recording and
storing information due to its convenience of use, low cost, reusability
and reliability, although not all these qualities are necessarily exploited.
For example audio and video recordings are often made only once and
kept indefinitely, whereas data on a computer disk may be changed
by the minute, as with the word-processor data file I am updating as
I write (and rewrite!) this section of the course.
Listen to the two audio tracks associated with this activity. The first
track contains a recording of the famous statement made by the Rt. Hon.
Neville Chamberlain on his return from Munich. This was recorded on
a steel tape recorder in September 1938. As a comparison, the second
the track contains a copy of the 1932 experimental disc recording that
you analysed in Activity 19.
Notice the difference in background noise between the magnetic steel
tape recording and the disc recording of 6 years earlier. I
Poor signal-to-noise ratio meant that steel tape was eventually discarded
but one of the first home magnetic recording machines, the Webster
wire recorder, described in Box 7, used thin steel wire, echoing
Poulsen’s idea.
pay-off reel
take-up reel
heads
guide guide
Soon tape recorders were in use by the American radio networks for
pre-recording their broadcasts, the entertainer Bing Crosby being one
of the greatest proponents of the technology. Recording companies
were also quick to embrace the benefits of tape – especially the ease by
which mistakes could be edited and retakes inserted. Also the ability
to record for longer periods meant less need for recording sessions to
be split into short takes. Early domestic recorders were used primarily
for playing stereo recordings, but they were costly, both in terms of the
hardware and the media, a pre-recorded stereo tape cost five times that
of the equivalent mono LP disc. The sales of pre-recorded tape
plummeted once stereo LPs became available in 1958. From that point
on domestic tape recorders, similar to the one illustrated in Figure 24
were used mainly by enthusiasts for home recording.
Disc recordings have always had a specific advantage over tape when it
comes to accessing a particular part of a recording. By describing the
different technologies used to store the sound can you suggest what
that advantage might be? I
Listen to the audio track associated with this activity. You will hear an
original live digital recording made at the Open University especially
for this activity. During the recording four different formats were
used: direct digital, an original compact cassette tape without noise
reduction, the same tape with Dolby B noise reduction and a metal
compound cassette tape again with noise reduction. Listen carefully to
the differences in the quality of the recorded sound.
Note, the differences are quite subtle and you may find them easier to
distinguish using headphones.
Comment
I expect you noticed the very intrusive tape-hiss after about 18 seconds.
I am sure you agree this is unacceptable for most music recordings
although could be tolerated for speech. The tape hiss is reduced to an
acceptable listening level after a further 25 seconds by using a Dolby B
noise reduction processor. This cuts the tape-hiss by about 10 dB but
maintains the tonal balance of the sound. This would not be possible
108 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 108
In this activity you will view the noise levels of the recording in
Activity 27 using the course’s sound editing software.
Run the course’s sound editing software and open the computer sound
file associated with this activity. Open a linear frequency analysis
window and play the sound clip. Can you see the noise levels
increasing and decreasing as explained in Activity 27? Do the levels
decrease by the amount suggested in that activity? What happens at the
very end of the recording? Note, you may need to play the sound two
or three times to answer these questions.
Comment
As the sound played, I hope you could see on the frequency analysis
display the general noise level (sometimes called the noise floor)
increasing and decreasing as explained in the comment to Activity 27.
Looking at the graph scale and estimating the actual values of the noise
level you should also see that the noise floor does vary by roughly the
amounts suggested in the previous activity. At the very end, did you
notice the noise floor reduce to the low level it was at the start of the
recording, indicating a rather sneaky final switch back to the original
digital sound? I
Comment
The effect of stretching the tape is to make it longer. This means
that it takes more time for the original length of tape to pass the
head, effectively slowing the tape speed. The pitch of the signal in
the original recording will consequently be lowered. I
3 RESTORING RECORDINGS
3.1 Introduction
You have seen that for over 125 years it has been possible to make
and store sound recordings and you have had the opportunity to
listen to some recordings from over 100 years ago. Over that time an
extraordinary number of recordings have been made. Just as an
example, way back in 1930 the UK Columbia Record Catalogue ran
to over 300 pages and they were only one of several major record
companies. Many original masters or at least very good copies of
earlier recordings still exist.
Comment
I thought of the following list of possible reasons, but this is not
exhaustive:
• a particularly fine performance;
• a performance involving the composer;
• a previously unreleased recording;
• a first performance of a particular work;
• the last performance of a noted artist;
• cheaper than recording a new performance. I
*March, I., Greenfield E., and Layton R. (2001) The Penguin Guide to Compact
Discs 2002 Edition, London, Penguin, p. 448.
111 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 111
Listen to the two audio tracks associated with this activity which
demonstrate what can be achieved with the restoration of an old recording
using modern digital techniques. The tracks contain an excerpt from the
same recording of Elgar’s Violin Concerto in B minor but taken from
different media. The first track is from the original 78 rpm disc and the
second track is the same section of the piece from the recently remastered
CD. Notice how the restoration process has reduced the background noise
without affecting the sounds of the instruments. I
Read the article associated with this activity which you will find in the
Block 3 Reader. The article is taken from the August 2002 edition of
the Gramophone magazine. It looks at some of the problems already
experienced with storing CDs.
Comment
The article raises a number of problems regarding the life-span of CDs.
To prevent some of these occurring, archivists now create discs using
precious metals such as silver or gold to try to avoid the problems
associated with aluminium. There is also a plan to put audio data
directly onto computer networked servers to make it available via the
Internet as well as providing archive resources. I
3.3.1 Introduction
The aim of the audio restoration engineer is to produce a recording which
offers to the listener an experience as close to the original performance as
possible. The problem however is that the expectations of the consumer
have risen to well beyond what can be offered by most early recordings.
Nearly silent backgrounds coupled with huge variations in sound levels
have become the norm on CDs. Due to the nature of the original recording
methods and the media used, even with the best replay equipment
available, it is impossible to reproduce the original performances
without degradation of the sound introduced by the ravages of time.
Degradation of the sound may be classified into two broad areas, global
and localised. Global degradation include background noise, wow and
flutter, and distortion. Localised degradation covers clicks, crackles
and deep scratches.
To restore a recording the following processes are used:
1 Select the best source or sources of the original performance, use
the original master recordings whenever possible.
2 Convert the recording to the digital domain applying appropriate
playback characteristics.
3 Join sections together, adjusting speed and editing as necessary.
4 Remove unwanted sounds such as such as clicks, crackles, buzzes
and hiss (collectively known as unwanted artefacts).
5 Apply equalisation (EQ).
6 Make the master record.
Actually there is stage missing between each of the steps. The most
important aspect of the restoration process is that of listening. So the ears
are the most important instrument the restoration engineer possesses.
113 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 113
Listen to the audio track associated with this Activity. This is a short
section from three different sources of Miles Davis’s Kind of Blue. In
order, the sources are: an early LP, the first CD to be released and the
latest issue of the CD. At the end of the track there is a single trumpet
note taken from each of the three sources, played in quick succession.
Comment
Could you hear the difference between the three sources? Remember
that no measuring equipment spotted the mistake – just a keen pair of
ears. Listening, which requires much practice, is all important when
restoring musical performances. I
Read the article associated with this activity which you will find in the
Block 3 Reader. The article is taken from the February 2003 edition of
the Gramophone magazine and describes the work of Ward Marston,
one of the most highly regarded restoration producers. This article
brings together the points I have mentioned above. I
116 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 116
From your reading of the article in Activity 36, what does the restoration
producer consider to be his greatest asset? I
In this activity you will have the opportunity to try your hand at
restoring a short section of a recording from a 78 rpm disc using the
facilities provided by the course’s sound editing software. Since this
activity involves features of the program you have not used before, you
should run the course’s sound editing software, open the computer
sound file associated with this activity and then follow the steps for
this activity which you will find in the Block 3 Companion.
Comment
Before leaving this activity, make sure you have saved your finished
work as you will need it in the next activity. I
In this activity you will add equalisation to the 78 rpm disc sound clip
that you restored in Activity 38. Since this activity involves a number of
detailed steps, you should run the course’s sound editing software, open
the computer sound file you saved in Activity 38 and then follow the
steps for this activity which you will find in the Block 3 Companion. I
4.1 Introduction
The front cover of this part of Block 3 shows an Apple iPod which is
an example of an audio device called a personal jukebox. A personal
jukebox offers a portable music environment for anyone wanting to
listen to music. The iPod gives immediate access to nearly 500 hours
of (compressed) digital audio files (depending on the model). Digital
audio data, either copied from the user’s own sources or (legally)
downloaded from the Internet, is stored in a preferred format in a
database on a personal computer which is then transferred to iPod. By
accessing a similar database on iPod any file may be replayed in high
quality sound in a way similar to that of the original jukeboxes
illustrated in Figure 18.
Cypress
Semiconductor
CY7C68013 display connector
(USB)
Samsung
TI TSB43AA82 K4S561633C-RL75 Synaptics, Inc. hard disk
(FireWire interface) Touchpad and display drive
(memory)
daughter card
Figure 30 Inside an iPod
120 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 120
Microprocessor
The Portalplayer PP5002 ‘Superintegration System-On-Chip’ micro-
processor is a complete ‘computer on a chip’ designed specifically to
support hard-disk audio jukeboxes by providing the following facilities:
• decoding of various standard digital audio data formats;
• audio processing effects including a 5-band graphic equaliser,
preset listening modes, bass boost, etc.;
• low power consumption and power management features;
• control of the hard disk drive;
• interfaces for the LCD panel, the keypad touch controller and the
on/off switch called the ‘hold’ switch.
Skip protection
Any rotating disk system requires the section of the disc containing
the data to be accessed by a read head which is positioned by an
actuator. A physical shock can cause the head to move from its current
position causing a break in the data being read and therefore
disturbance to the sound output. To avoid this problem the digital
audio data from the disk is stored in a temporary data memory prior to
conversion. Any shock will cause a break and/or error in the data
entering the buffer and because the buffer is filled more quickly than it
is emptied any break or error due to the shock can be corrected by
reading the data from the disk again. The buffer also acts as a power
saving device as once it is filled the hard disk may be switched off
until, at a predetermined low point, it is switched on again and the
buffer topped-up. The iPod skip memory is 32 Mbytes providing an
average of 25 minutes of playing time. At the time of writing memory
chips are still much more expensive per megabyte than hard disk
memory hence the need for both types of memory in iPod.
LCD display
Liquid crystal displays do not generate light but rely on turning
opaque and so interrupting incident light reflected from behind the
display or from a back-light which provides illumination when the
ambient lighting is too low. LCDs have a very low power consumption
except when using the back-light. The quality of any display depends
on its resolution and the iPod display is high resolution which enables
very detailed graphical pictures to be displayed
as well as text as shown in Figure 31.
Storage
The hard disk drive has a 1.8 inch format (this
refers to the diameter of the rotating magnetic
disk inside the drive assembly – the overall
width of the drive is slightly bigger than this). In
2004 the disk in iPod is available in three sizes:
15, 20 and 40 Gbytes. Directory information
reduces these sizes by about eight percent
giving the 15 Gbytes model a total storage
capacity of 13.8 Gbytes for digital audio data.
Figure 31 Close up of iPod’s display
121 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 121
Your answer to Activity 41 should have shown you that the iPod can
store a huge number of songs. This will require careful organising
preferably by suitable database software. Various programs exist, for
example iTunes. Textual information about songs can be embedded in
files storing MP3-processed sounds and this information is used to
organise files by genre, composer or artist. iPod software will be
discussed later in this section.
Battery
The size and weight of portable equipment is governed to a large
extent by the battery and its ability to supply sufficient energy to
meet the needs of the user. This is usually given as the number of
hours of continuous operation. Using the latest battery technology
allowed the iPod designers to keep the size of the player to a
minimum. The rechargeable lithium-ion (Li-ion) battery used in
iPod supplies 3.7 volts with a storage capacity of 850 milliampere
hours (i.e. it can theoretically supply 0.85 amps continuously for
one hour). iPod will run for about 8 hours between charges in normal
use and the battery should have an operational life of 2–3 years
depending on usage.
Connections
The following connections are supplied as standard:
• a dual-purpose connector for connecting the host desktop computer
which, depending on the connecting lead used, provides either a
USB interface or a fast serial interface called FireWire;
• a combined stereo headphone and wired remote control output;
• a stereo line out connection but only when attached to its Dock.
iTunes download services are only available in the USA but a well-
known soft-drinks manufacturer is hosting a download site in the UK
although not for iPod.
File format
Apple Windows
Data type computer computer
Uncompressed AIFF WAV
Compressed AAC MP3, MP3 pro
Most of the file formats in Table 4 have been described earlier in the
course, but the AAC and MP3 pro formats may be unfamiliar to you. In
essence both systems offer greater compression than existing methods by
using lower bit rates, typically 96 kbit/s or less as opposed to 128 kbit/s
with MP3, but still preserve audio quality. Advanced Audio Coding
(AAC) was developed by the Fraunhofer Institute as an development of
their MP3 format. MP3 pro was developed by Thompson Multimedia.
MP3 pro decoders can decode standard MP3 files.
Describe in your own words why the simple MP3 players were
relatively unsuccessful compared with personal jukeboxes. Use the
iPod as an example of the latter. I
123
5 SURROUNDED BY SOUND
The goal of inventors such Edison and Berliner was to reproduce the
sounds as accurately as their systems allowed. Advancements in
recording technologies increased the fidelity of the sound to such a
degree that not only could the performance be reproduced faithfully
but the ambience of the surroundings in which the performance was
taking place could also be captured.
Stereophonic systems give accurate spatial information to the listener
by positioning the performers and instruments between the two
loudspeakers. However, to experience the true effect the listening
position is quite critical and not suitable for a group of listeners. To
get a true effect the recorded sound needs to envelop the listeners
giving an aural experience close to that of being in a concert hall. A
quadraphonic system, with four independent sound sources, against
the two sources in stereo, was introduced to consumers in the late
1960s. Unfortunately the disc-based technology of the time was not
adequate and there were a number of conflicting standards which led
to the system being a commercial disaster.
Since the 1950s suitably equipped cinemas have offered 4 and 6
channel sound in the wide screen movie presentations offered by
CinemaScope and Todd-AO film formats. Simple two-channel sound
would not localise the dialogue for audiences sitting off-centre leading
to an unrealistic appearance to the performers.
Initially these multi-channel sound systems were analogue, using
magnetic tracks along the edge of the film. In the early 1970s Dolby
Laboratories developed an analogue system that used light which
passed through special optical tracks at the edge of the film onto
sensors to generate four surround sound channels, left front, centre
front, right front and rear, plus two extra low frequency effects (LFE)
channels for the bangs and crashes heard in many films.
By 1992 Dolby Laboratories had developed Dolby Digital for cinemas
which provided six discrete sound channels in what is known as a
‘5.1’ format. This provides five full-bandwidth sound channels for
front left, centre and right, plus rear left and right (these are the ‘5’)
plus a sixth limited-bandwidth LFE channel (the ‘.1’ channel).
The digital audio data is contained between the sprocket holes of the
film as illustrated in Figure 33.
To meet the needs of the consumer Dolby adapted their ‘5.1’ system for
DVD video systems under the name of Dolby Digital using digital data
compression techniques and psychoacoustic coding and called AC3.
DVD-Audio discs also support Dolby Digital sound. Figure 34 shows a
home with a typical ‘5.1’ installation. To ensure films encoded with
earlier surround systems still provide a surround sound experience in
the home, Dolby developed the four channel Pro Logic and more
recently the six channel Pro Logic II decoders, the latter able to decode
stereo into a pseudo surround sound (i.e. altering the phases of the
two channels to create sounds coming apparently from outside the
space between the two speakers as discussed in Chapter 1 of this
block).
left right
subwoofer
left right
surround surround
You have an audio system that supports ‘7.1’ surround sound. How
many audio channels will you have and what does each one do? I
125 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 125
6.1 Introduction
Who owns the music you listen to? You may think you do, after all
you bought the disc and you can to listen to the contents as often as
you want. However, although you own the medium you don’t own the
right to do whatever you want with the contents. In the UK you may
make a copy of the music for your own use – after all you are unlikely to
play the two recordings at the same time so you are simply transferring
the contents to another medium for your own convenience. For example,
if you only have a compact cassette player in your car you may copy all
or part of the contents of a CD onto compact cassette for your personal
use when in the car. However you may not copy all or part of the contents
of the disc for anyone else to listen to, nor may you borrow a disc and
copy any part of it for your own personal use.
Comment
It is quite legal to listen to a CD that you have borrowed from a friend
as long as it is the original CD and not a copy. The right to listen to the
disc has been temporally transferred to you by the owner who cannot
listen to the disc whilst you have it. I
Comment
No, not unless you wrote down, or in some way recorded the theme
before you heard it, otherwise you have no rights to it at all. I
126 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 126
Comment
A work can be performed on broadcast systems such as television,
cable and radio, in pubs, clubs, factories, shopping malls and discos or
at live concerts. I
Copyright has a limited life. The exact length of time depends upon
the nature of the material but for recorded sound it is (in 2004) 50 years
from the end of the calendar year in which the work was recorded or
released if the release date is within 50 years of it being recorded. Once
copyright expires the work goes into the public domain which means
it may be freely used by anyone. However, discovering whether a work
is truly in the public domain is sometimes quite difficult.
Read the articles and any surrounding text associated with this activity
which you will find detailed in the Block 3 Reader. These articles may
be provided in the reader booklet, or they may be found on-line –
details about this will be found in the reader. The articles contain up-
to-date information about the copyright situation with regard to music
recordings. With each article the Course Team may include a short
introduction, a list of any new terms used in the article, a conclusion
and one or more self-assessment activities. I
128 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 128
7 SUMMING UP
Listen to the two audio tracks associated with this activity. Both tracks
contain the same performance of ‘So what’ from Miles Davis’s Kind of
Blue album. The first track is the version taken from a vinyl LP record,
and the second is from the remastered CD where the pitch has been
corrected (see Box 10). The tracks each last around 9 minutes, so do
not feel you have to listen to all of both tracks unless you want to.
However, you should listen critically to at least some of both
recordings comparing in particular the quality of the recordings in
terms of frequency response, noise level, distortion and dynamic range
– but do remember that in both cases the original recording was made
on a professional analogue tape recorder of the late 1950s so the
quality can possibly never match that of today’s totally digital
recordings. I
130 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 130
SUMMARY OF CHAPTER 5
The history of the recorded music industry and so multiple master recordings had to
is as much about the people involved, the be made. However, even when a
recordings they made and the prevailing manufacturing process was developed, it
times as it is about the technology involved. was found that multiple copies of discs were
Changes in the technology of recorded much easier to produce than cylinders. The
music have not only affected the way in basic processes used to duplicate the early
which we listen to music but they have also discs are still used in today’s manufacture
had an enormous impact on the legal of CDs. (Section 2.2.4)
aspects of copyright. (Section 1)
The gramophone disc system was enhanced
Before sound could be recorded, people’s by the use of a clockwork turntable motor,
experience of listening to music was mainly a standardised speed of 78 rpm and double-
confined to live concerts, music-making in sided discs. (Section 2.2.5)
the home and mechanical music such as a
barrel organ. With the advent of recorded Discs eventually succeeded over cylinders
music, not only could performers hear how mainly due to the choice of artists and
they sounded to others, but it has shifted music that was made available on this
performer’s emphasis in performance from medium rather than the quality of sound.
immediacy to perfection. (Section 2.1) (Section 2.2.6)
High quality analogue multitrack magnetic The Apple iPod is an example of one of
tape recorders have had a huge impact on these personal jukeboxes. This device uses
the recording industry, and have only a miniature hard disk to store a huge
recently been superseded by equivalent quantity of sound in a variety of formats
digital recorders. (Section 2.3.5) and is designed to be used in conjunction
with a desktop computer. Its popularity
Many early recordings need to be preserved stems from its novel design, friendly user
as they have historical importance and interface and incorporation of easy-to-use
restoration of recordings has become an database software for the management of
important part of the recorded music the songs stored. The iPod is an example
industry. (Section 3.1) of how audio and computer technologies
are converging. (Section 4.3)
Cylinders and discs have proved to be very Surround sound systems enable listeners
stable media compared with magnetic tape to experience sounds as if they were
which suffers over time from a number of positioned within the recording
problems from flaking to print through. The environment. In general the number of
reliability of CDs has still to be proved, but channels is given by the nomenclature in
already there are signs that CDs deteriorate the description, e.g. ‘5.1’ indicates 5
faster than might be expected. (Section 3.2) surround channels plus one low frequency
effects channel. Leading the development
The audio restoration process uses both
of surround sound for consumer systems
analogue and digital technologies with the
are the Dolby Laboratories who have much
aim of extracting the best quality of the
experience in cinema surround sound
sound from the source(s) available. The best
systems. (Section 5)
sources are first selected, and converted to
digital form using the highest possible Copyright protects peoples’ creations or
sample rate and number of quantisation ‘works’ from exploitation by others, and is
levels. Editing of sections is then carried of great concern to the music industry now
out and unwanted artefacts are removed. that digital technology has made it easy for
Finally equalisation may be added to people to make multiple copies of
further improve the result. A good ear is recordings without any degradation of the
vital in every stage of the restoration sound. Protection of a work can be
process. (Section 3.3) achieved by legal and/or technical means.
The law provides protection and payments
The miniaturisation of electronic to creators and performers through a system
circuitry, the emergence of small digital of licences and royalties. Technology can
memories for laptop computers and the also secure rights to creators and artists by
development of sophisticated sound restricting copying of digital media.
compression systems such as MP3 have However, the situation is continually
all helped in the development of small changing with new legal and technological
pocket-sized devices called personal measures being introduced to try to keep
jukeboxes which are able to store a large up with the progress of sound recording
number of songs. (Section 4.2) technology. (Section 6)
132 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 132
Activity 3
The bandwidth of a system is the range of frequencies over which an
audio device responds equally (i.e. has a flat response). In the case of
an audio CD system the frequency response would need to be flat over
the range of frequencies the CD contains, i.e. 20 Hz to 20 kHz.
Dynamic range is derived from the amplitude range between the
loudest level that can be reproduced without producing distortion and
the softest level that can be reproduced without being enveloped in
noise. Dynamic range is the ratio between these two values and is
usually expressed in decibels. In the case of an audio CD system the
dynamic range needs to be at least 90 dB.
Signal-to-noise ratio gives an indication of the noise in the system.
It is expressed as a ratio of the wanted signal power to the noise power
in the system, and is usually expressed in decibels. A typical value for
digital audio equipment is 100 dB.
Activity 12
(a) Edison’s vertical (hill and dale) method used on phonograph
cylinders was more suitable for recording loud sounds as a greater
depth of deflection of the diaphragm was possible. Berliner’s
lateral groove on the gramophone disc limited deflection of the
diaphragm otherwise the groove wall would be broken. So, in
principle vertical recording gave a better dynamic range.
(b) The larger the movement of the diaphragm in the phonograph
pickup the louder the sound and so the signal-to-noise ratio was
increased thus making the system more suitable for mechanical
reproduction.
Activity 16
The frequency range of many instruments was outside that of the
mechanical transducer. Timpani and large stringed instruments just
would not be heard. Similarly, many of the small woodwind family
suffered in this way. Further, the acoustic qualities of many
instruments were too delicate to be reproduced above the surface noise
of the disc. An alto or tenor voice with either a piano or brass
instruments offered the best opportunity to obtain a good recording.
Activity 22
(a) If the hole is not in the centre the disc will turn eccentrically
causing variations in the linear speed and consequently changes in
the pitch of the replayed sound.
(b) If the disc is not flat there will be an audible noise every time the
‘bump’ is encountered. This will increase the signal-to-noise ratio.
(c) The smoother the material the better the signal-to-noise ratio as
any ‘roughness’ in the groove will be reproduced as noise. (This is
apparent when comparing the signal-to-noise ratio of 78 rpm discs
made of shellac and slate dust which is quite rough against the
smooth plastic material of the LP.)
133 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 133
Activity 25
Table 5 compares a 78 rpm disc with magnetic tape media.
Tape could offer twice the bandwidth and six times the playing time.
The frequency response of magnetic tape was between 30 Hz and 15 kHz,
and the playing time was up to 30 minutes. The frequency response of a
78 rpm record was 30 Hz to 8 kHz. The playing time was up to 5 minutes
for each side of a 12-inch (30 cm) disc.
Activity 26
On the tape the sound is recorded as a series of magnetic fluctuations
along its length. In order to get to a particular part of the recording the
tape must be wound forwards or backwards. This may take several
minutes especially if the required sound is at the other end of the tape.
On discs the sound is also recorded serially as a single spiral track.
However to find the equivalent sound on a disc is a relatively quick
operation, by simply placing the pickup at the appropriate place on the
disc surface. This takes the same time no matter where it is on the
disc. The speed and ease of access to particular songs has always given
an advantage to the disc over tape as a commercial replay medium.
Activity 34
The main problem with using any medium for archiving is that of it
deteriorating. In the case of CDs, this includes flaking or oxidisation of
the reflective layer, fogging up of the plastic disc and damage due to
mishandling or bad storage. In addition, as technology continues to
change rapidly, unless a programme of constant updating to new
media is employed, suitable replay devices will have to be kept
working to ensure the data on any of today’s media will be able to be
recovered in the future.
Activity 37
A good sense of hearing!
Activity 40
A stereo CD stores 2 channels of music using 16 quantisation levels at
a 44.1 kHz sampling rate which gives: 2 × 16 × 44 100 = 1 411 200 bits/s.
Compressed by a ratio of 11:1 gives a bit rate of: 1 411 200/11 =
128 291 bits/s.
The memory card stores 64 Mbytes of data which is 64 × 1 048 576 × 8
= 536 870 912 bits. Thus with MP3 compression the memory card can
store 67 108 864/128 291 = 4185 seconds or 69.75 minutes of sound.
134 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 134
Activity 41
The storage capacity of a HDD is about 8% less than the total. So
the capacity of a 40 Gbytes HDD is 40 – ((40 × 8)/100) = 36.8 Gbytes or
approximately 36 800 Mbytes. (This is an approximation because 1 Gbyte
is actually 1024 and not 1000 Mbytes.)
If an average song needs 4 Mbytes then the total number of songs that
can be stored is approximately 36 800/4 = 9 200 songs.
Activity 42
MP3 players were fitted with small memories allowing playing times
of around an hour. Additional memory cards could be fitted to some
players but these were relatively expensive and only held a similar
amount of music. Exchanging music files was controlled to an extent
by encrypting the data to a specific player.
Personal jukeboxes overcome the limited playing time by using
miniature hard disk drives. This gives a much larger storage ability but
requires careful management from database software on a desktop
computer.
Activity 43
‘7.1’ surround sound has eight audio channels. As with all surround
sound systems, the ‘.1’ channel is the low frequency effect channel.
The other seven are front left, front centre, front right, rear left and rear
right and side left and side right.
135 TA225 BLOCK 3 SOUND PROCESSES CHAPTER 5 THE MUSIC BUSINESS 135
LEARNING OUTCOMES
After studying this chapter you should be able to:
Acknowledgements
Grateful acknowledgement is made to the following sources for
permission to reproduce material in this chapter:
Figures 8 & 9: The Royal Scottish Museum, Edinburgh;
Figure 14: Ampex GB Limited.
We also thank Nigel Bewley (British Library Sound Archive),
Daniel Leech-Wilkinson (Kings College, London) and Robert Philip
(Open University) for advice on the restoration of recordings;
Stephen Potter for loan of the 78 rpm disc set of the Elgar Violin
Concerto and Bill Strang for playing the piano for Activities 17 and 26.
Every effort has been made to trace all the copyright owners, but if
any has been inadvertently overlooked, the publishers will be
pleased to make the necessary arrangements at the first opportunity.
137 TA225 BLOCK 3 SOUND PROCESSES INDEX 137
INDEX
multiplex (COFDM) 63
CRC see cyclic redundancy check code
coder
critical listening
compression 53
see also audio restoration
division multiplex
correction code (CIRC) 37, 38
combinational parity 38
cyclic redundancy check code (CRC) 36
communication link
cylinder (recording medium) 81, 88, 89,
broadband 60
113
tape 106
manufacture 91
bump 35
tin-foil 82, 85
CD-R 10
wax coated 85
data coding 36
disc mastering 12
disc speed 37
duplication 12
DAB see digital audio broadcasting
father 12
DASH see digital audio stationary head
manufacturing 12
metal master 12
data compression 38, 53, 53
mother 12
and master recordings 59
player specification 37
ratio 55
son 12
directions 51
specification 33, 32
Davis, M. 8, 113, 129
system description 32
Decca Company, The 99
table of types 33
decoder
compatibility
pulse code modulation 19
compression
diaphragm (in a transducer) 80, 82
pulse code modulation see pulse code signal-to-noise ratio see signal-to-noise
66, 127
Electrical and Musical Industries (EMI) 98
outline system 63
electromagnetic pickup 96
recorder specification 28
EMI see Electrical and Musical Industries
following
EQ’d master 10
digital bandwidth 21
discs 97
disc 113
error correction 27, 35, 37, 37, 51
strategies 39
distortion 13
dither noise 23
see also parity bit
dropout 33
limits to 16
FM radio 66
range
bandwidth 15
Service
signal-to-noise ratio 62
ear
System
range 13
frequency modulation (FM)
editing
of an amplifier 15
editing
full-duplex 51
Gaisberg, F. 93
LFE see low frequency effects
gold-moulding 91
linear interpolation (in error correction) 39
frequency response 95
Lippincott, J.H. 86
pickup 89
listening
quality 93
critical see critical listening
graphophone 85
lithium-ion (Li-ion) battery 119, 121
clockwork powered 86
localised degradation 112
groove 88
long-playing disc see vinyl LP
half-duplex 51
see also MP3, MPEG, Ogg Vorbis,
hearing threshold 57
loudspeaker 80
interference 33, 51
chromium dioxide see chromium
reflection interference
compact cassette see compact cassette
interference
comparison with discs 109
interleaving 38
editing see splicing
hiss 18
(ISO) 56
S-VHS 29
audio/radio 61, 66
speeds 103
specification 119
recorder 108
Organisation
multitrack see Alesis digital audio tape
jam 113
main service channel (MSC) 63
jitter 33
allocation by BBC 63
jog-proof system 36
Marconi Company, The 103
Kerr effect 39
matrix (master disc) 91
number 91
stamper 91
legal aspects
mechanical audio recorder
of music industry 78
dynamic range 82
mechanicals 126
noise transient 115
Protection Society
North American Phonograph Company 86
Melba, N. 93
merging bits 36
see also parity bit
microgroove 99
optical disk 32
dynamic range 13
optical pickup 37
Microprocessor 120
oversampling 35, 37, 32, 50
editing 38
parallel-to-serial converter 20
MDLP 38
see also pulse code modulation converter
playback 39
parity 35
recorder specification 38
checking 35
recording 39
combinational see combinational parity
system description 38
Patti, A. 93
Company 103
PCM-VTR 26
mix-down 9
perceptual coding 56
Morse Code 55
Philips Gloeilampenfabrieken 106
standards 56
pickup 89
photo resist 12
portable player 60
pro 122
electromagnetic see electromagnetic
pickup
MPEG see Moving Pictures Expert Group
analogue 108
manufacture 91
music industry 5, 50
see also 35 rpm disc, 78 rpm disc,
job functions 6
polyvinyl chloride 91
post production 10
engineer
Poulsen, V. 102
musical box 79
print through 111
muting 39
PRS see Performing Right Society
Napster 127
pulse code modulation (PCM) 18, 35
needle 88
converter 18
Nipper 92
quadraphonic 123
noise 13
quality of sound 77
digital 19
error 20, 22
Rachmaninov, S. 98
scrambling see interleaving
radio
self-clocking 35
radio frequency 52
(SCMS) 50, 51, 126
interference 33
session recording 8
reflection interference
shellac 89, 91, 99, 111
station bandwidth 63
Sibelius, J. 9
random error 33
improvement using RIAA
equalisation 97
RCA-Victor 92, 99
singer’s formant 95
record
LP see vinyl LP
single bit conversion 33, 51
record companies 11
Smith, O. 102
record distributor 11
record producer 6, 7
recorder
recording engineer 8
steel tape 102, 103
distribution 11
stereophonic (stereo) 100, 123, 129
manufacture 11
Stravinsky, I. 98
planning 7
streaming audio 61
Reeves, A. 18
super audio compact disk (SACD) 10, 15,
remastering 111
and copy protection see copy protection
WAVE 122
watermark see watermark in super audio
of America
surround sound 9, 123, 129
see also wrap angle, guard band, Pro-Logic see Dolby Pro-Logic
following Swann, D. 77
run-length coding 55
Tainter, C. 83
sampling
sampling rates
tape recorder see magnetic tape recorder
table of values 21
telegraphone 102
115 groove 33
transfer function 16
stereo/mono compatibility see
transmission medium 51
compatibility of mono and stereo vinyl
LPs
uncorrelated (noise) 23
VTR see video tape recorder
VCR 31
wire recorder 102
Verdi, G. 93
see also Webster wire recorder
VHS 31
wobble groove 39
tape 101
wow 101, 112, 115
Victor Records 92
wrap angle 28
Acknowledgement
Cover image: © 1997 Photodisc, Inc.