Volume 1


Microphone Technology
The Use of Microphones
Loudspeaker Drive Units
Loudspeaker Systems
Analog Recording
Digital Audio
Digital Audio Tape Recording
Appendix 1 – Sound System Parameters

Copyright Notice

This work is copyright © Record-Producer.com. You are licensed to make as many copies as you reasonably require for your own personal use.

Chapter 1: Microphone Technology

The microphone is the front end of almost all sound engineering activities and, as the interface between real acoustic sound travelling in air and the sound engineering medium of electronics, receives an immense amount of attention. Sometimes one could think that the status of the microphone has been raised to almost mythological proportions. It is useful therefore to put things in their proper perspective: there are a great many microphones available that are of professional quality. Almost any of them can be used in a wide variety of situations to record or broadcast sound to a professional standard. Of course different makes and types of microphone sound different from each other, but the differences don't make or break the end product, at least as far as the listener is concerned.

Now, if you want to talk about something that really will make or break the end product, that is how microphones are used. Two sound engineers using the same microphones will instinctively position and direct them differently and there can be a massive difference in sound quality. Give these two engineers other mics, whose characteristics they are familiar with, and the two sounds achieved will be identifiable according to engineer, and not so much according to microphone type.

There are two ways we can consider microphones: by construction and by directional properties. Let's look at the different ways a microphone can be made, to start off with.

Microphone Construction

There are basically three types of microphone in common use: piezoelectric, dynamic and capacitor. The piezoelectric mic, it has to be said, has evolved into a very specialized animal, but it is still commonly found under the bridge of an electro-acoustic guitar so it is worth knowing about.

Piezoelectric

The piezoelectric effect is the property of certain crystalline and ceramic materials of generating an electric voltage when pressure or a bending force is applied.
This makes them sensitive to acoustic vibrations and they can produce a voltage in response to sound. Piezo mics (or transducers as they may be called; a transducer is any device that converts one form of energy to another) are high impedance. This means that they can produce voltage but very little current. To compensate for this, a preamplifier has to be placed very close to the transducer. This will usually be inside the body of the electro-acoustic guitar. The preamp will run for ages on a 9 volt alkaline battery, but it is worth remembering that if an electro-acoustic guitar, or other instrument with a piezo transducer, sounds distorted, perhaps after a year or more of service, it is almost certainly the battery that needs replacing.

Dynamic

This is ‘dynamic’ as in ‘dynamo’. The dynamo is a device for converting rotational motion into an electric current and consists of a coil of wire that rotates inside the field of a magnet. Re-configure these components and you have a coil of wire attached to a thin, lightweight diaphragm that vibrates in response to sound. The coil in turn vibrates within the field of the magnet and a signal is generated in proportion to the acoustic vibration the mic receives. The dynamic mic is also sometimes known as the moving coil mic, since it is always the coil that moves, not the magnet, even though that would be possible.

The dynamic mic produces a signal that is healthy in both voltage and current. Remember that it is possible to exchange voltage for current, and vice versa, using a transformer. All professional dynamic mics incorporate a transformer that gives them an output impedance of somewhere around 200 ohms. This is a fairly low output impedance that can drive a cable of 100 meters or perhaps even more with little loss of high frequency signal (the resistance of a cable attenuates all frequencies equally; the capacitance of a cable provides a path between signal conductor and earth conductor through which high frequencies can ‘leak’). It is not necessary therefore to have a preamplifier close to the microphone, neither does the mic need any power to operate. Examples of dynamic mics are the famous Shure SM58 and the Electrovoice RE20.

The characteristics of the dynamic mic are primarily determined by the weight of the coil slowing down the response of the diaphragm. The sound can be good, particularly on drums, but it is not as crisp and clear as it would have to be to capture delicate sounds with complete accuracy. Dynamic microphones have always been noted for providing good value for money, but other types are now starting to challenge them on these grounds.
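The remark about cable capacitance letting high frequencies ‘leak’ can be made concrete with a rough calculation. The sketch below treats the source impedance and the cable capacitance as a simple RC low-pass filter. The figure of 100 pF per meter and the 1 megohm piezo impedance are assumptions chosen for illustration, not values from the text, and real cables and microphones will differ.

```python
import math

def cutoff_hz(source_ohms: float, cable_m: float, pf_per_m: float = 100.0) -> float:
    """-3 dB frequency of the RC low-pass formed by the source
    impedance and the total cable capacitance (assumed 100 pF/m)."""
    capacitance_f = cable_m * pf_per_m * 1e-12
    return 1.0 / (2.0 * math.pi * source_ohms * capacitance_f)

# 200 ohm dynamic mic driving 100 m of cable
dynamic = cutoff_hz(200.0, 100.0)
# Hypothetical 1 megohm piezo pickup driving only 5 m of cable
piezo = cutoff_hz(1_000_000.0, 5.0)

print(f"dynamic mic, 100 m: {dynamic:.0f} Hz")
print(f"piezo pickup, 5 m:  {piezo:.0f} Hz")
```

Under these assumptions the low impedance source keeps its cutoff far above the audio band even over 100 m, while the high impedance source loses high frequencies over a few meters, which is consistent with placing the preamplifier right next to a piezo transducer.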

Ribbon Mic

There is a variation of the dynamic mic known as the ribbon microphone. In place of the diaphragm and coil there is a thin corrugated metal ribbon. The ribbon is located in the field of a magnet. When the ribbon vibrates in response to sound it acts as a coil, albeit a coil with only one turn. Since the ribbon is very light, it has a much clearer sound than the conventional dynamic, and it is reasonable to say that many engineers could identify the sound of a ribbon mic without hesitation. If the ribbon has a problem, it is that the output of the single-turn ‘coil’ is very low. The ribbon does however also have a low impedance and provides a current which the integral transformer can step up so that the voltage output of a modern ribbon mic can be comparable with a conventional dynamic. Examples of ribbon mics are the Coles 4038 and Beyerdynamic M130.

Capacitor

The capacitor mic, formerly known as the ‘condenser mic’, works in a completely different way to the dynamic. Here, the diaphragm is paralleled by a ‘backplate’. Together they form the plates of a capacitor. A capacitor, of any type, works by storing electrical charge. Electrical charge can be thought of as a quantity of electrons (or the quantity of electrons that normally would be present, but aren't). The greater the disparity in the number of electrons present, i.e. the amount of charge, the higher will be the voltage across the terminals of the capacitor. There is the equation:

Q = C × V

or: charge = capacitance × voltage

Note that charge is abbreviated as ‘Q’, because ‘C’ is already taken by capacitance. Putting this another way round:

V = Q / C

or: voltage = charge / capacitance

Now the tricky part: capacitance varies according to the distance between the plates of the capacitor. The charge, as long as it is either continuously topped up or not allowed to leak away, stays constant. Therefore as the distance between the plates is changed by the action of acoustic vibration, the capacitance will change and so must the voltage between the plates. Tap off this voltage and you have a signal that represents the sound hitting the diaphragm of the mic.

[Figure: Sennheiser MKH 40]

The great advantage of the capacitor mic is that the diaphragm is unburdened by a coil of any sort. It is light and very responsive to the most delicate sound. The capacitor mic is therefore much more accurate and faithful to the original sound than the dynamic.
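The relationship V = Q / C can be tried out numerically. The sketch below assumes the ideal parallel-plate formula C = ε0 × area / gap, which is not given in the text, and uses hypothetical capsule dimensions and a hypothetical 48 V resting voltage purely for illustration.

```python
EPSILON_0 = 8.854e-12  # permittivity of free space, farads per meter

def plate_capacitance(area_m2: float, gap_m: float) -> float:
    """Ideal parallel-plate capacitance (an assumed simplification)."""
    return EPSILON_0 * area_m2 / gap_m

def plate_voltage(charge_c: float, area_m2: float, gap_m: float) -> float:
    """V = Q / C for a fixed charge on the plates."""
    return charge_c / plate_capacitance(area_m2, gap_m)

AREA = 1e-4        # hypothetical 1 square centimeter diaphragm
REST_GAP = 25e-6   # hypothetical 25 micrometer gap at rest

# Fix the charge so that the capsule sits at 48 V when at rest.
charge = plate_capacitance(AREA, REST_GAP) * 48.0

for gap in (24e-6, 25e-6, 26e-6):
    volts = plate_voltage(charge, AREA, gap)
    print(f"gap {gap * 1e6:.0f} um -> {volts:.2f} V")
```

With the charge held constant, the voltage in this idealized model rises and falls in direct proportion to the gap, which is the signal that is tapped off.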

Of course there is a downside too. This is that the impedance of the capsule (the part of any mic that collects the sound) is very high. Not just high, but very high. It also requires continually topping up with charge to replace that which naturally leaks away to the atmosphere. A capacitor mic therefore needs power for these two reasons: firstly to power an integral amplifier, and secondly to charge the diaphragm and backplate.

Old capacitor mics used to have bulky and inconvenient power supplies. These mics are still in widespread use so you would expect to come across them from time to time. Modern capacitor mics use phantom power. Phantom power places +48 V on both of the signal carrying conductors of the microphone cable, actually within the mixing console or remote preamplifier, and 0 V on the earth conductor. So, simply by connecting a normal mic cable, phantom power is connected automatically. That's why it is called ‘phantom’: because you don't see it! In practice this is no inconvenience at all. You have to remember to switch it on at the mixing console but that's pretty much all there is to it. Dynamic mics of professional quality are not bothered by the presence of phantom power in any way. One operational point that is important however is that the fader must be all the way down when a mic is connected to an input providing phantom power, or when phantom power is switched on. Otherwise a sharp crack of speaker-blowing proportions is produced.

A capacitor microphone often incorporates a switched -10 dB or -20 dB pad, which is an attenuator placed between the capsule and the amplifier to prevent clipping on loud signals.

Electret

The electret mic is a form of capacitor microphone. However the charge is permanently locked into the diaphragm and backplate, just as magnetic energy is locked into a magnet. Not all materials are suited to forming electrets, so it is usually considered that the choice of material forced on the manufacturer compromises sound quality. However, it has to be said that there are some very good electret mics available, most of which are back-electrets, meaning that only the backplate of the capacitor is an electret and therefore the diaphragm can be made of any suitable material.

Electret mics do still need power for the internal amplifier. However, this can take the form of a small internal battery, which is sometimes convenient. Electret mics that have the facility for battery power can also usually be phantom powered, in case the battery runs down or isn’t fitted.
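As a side note on the -10 dB and -20 dB pad values mentioned for capacitor microphones, decibel figures for voltage convert to plain ratios with the standard relation ratio = 10^(dB / 20). The short sketch below only illustrates that arithmetic; the function name is made up for the example.

```python
def db_to_voltage_ratio(db: float) -> float:
    """Convert a level change in decibels to a voltage ratio."""
    return 10 ** (db / 20)

for pad_db in (-10, -20):
    ratio = db_to_voltage_ratio(pad_db)
    print(f"{pad_db} dB pad -> signal voltage multiplied by {ratio:.3f}")
```

So a -20 dB pad passes one tenth of the capsule's signal voltage to the internal amplifier, and a -10 dB pad a little under one third.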

Directional Characteristics

The directional characteristics of microphones can be described in terms of a family of polar patterns. The polar pattern is a graph showing the sensitivity in a full 360 degree circle around the mic. I say a family of polar patterns but it really is a spectrum with omnidirectional at one extreme and figure-of-eight at the other. Cardioid and hypercardioid are simply convenient way points.

To explain these patterns further, fairly obviously an omnidirectional mic is equally sensitive all round. The figure-of-eight is equally sensitive at front and back, the only difference being that the rear produces an inverted signal, 180 degrees out of phase with the signal from the front. A cardioid is slightly less obvious. The cardioid is most sensitive at the front, but is only 6 dB down in response at an angle of 90 degrees. In fact it is only insensitive right at the back. It is not at all correct, as commonly happens, to call this a unidirectional microphone. The hypercardioid is a more tightly focussed pattern than the cardioid, at the expense of a slight rear sensitivity, known as a lobe in the response.
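The idea of a spectrum of patterns can be sketched with the usual textbook model of an ideal first-order microphone, sensitivity = a + (1 - a) × cos(angle). This formula and the particular values of a below are assumptions brought in for illustration; the text itself gives only the qualitative description and the 6 dB figure for the cardioid at 90 degrees.

```python
import math

# Assumed mixing values: 1.0 is pure pressure, 0.0 is pure pressure gradient.
PATTERNS = {
    "omnidirectional": 1.0,
    "cardioid": 0.5,
    "hypercardioid": 0.25,
    "figure-of-eight": 0.0,
}

def sensitivity(a: float, angle_deg: float) -> float:
    """Relative output; a negative value means an inverted signal."""
    return a + (1.0 - a) * math.cos(math.radians(angle_deg))

def level_db(a: float, angle_deg: float) -> float:
    """Level relative to on-axis, in dB (minus infinity at a null)."""
    magnitude = abs(sensitivity(a, angle_deg))
    return -math.inf if magnitude < 1e-12 else 20.0 * math.log10(magnitude)

for name, a in PATTERNS.items():
    levels = ", ".join(f"{angle} deg: {level_db(a, angle):.1f} dB"
                       for angle in (0, 90, 180))
    print(f"{name:16s} {levels}")
```

In this idealized model the cardioid comes out 6 dB down at the side and silent only at the rear, the figure-of-eight is silent at the sides and inverted at the rear, and the hypercardioid shows its rear lobe.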

All of this is nice in theory, but is almost never borne out in practice. Take a nominally cardioid mic for example. It may be an almost perfect cardioid at mid frequencies, but at low frequencies the pattern will spread out into omni. At high frequencies the pattern will tighten into hypercardioid. The significant knock-on effect of this is that the frequency response off-axis, in other words any direction but head on, is never flat. In fact the off-axis response of most microphones is nothing short of terrible and the best you can hope for is a smooth roll-off of response from LF to HF. Often though it is very lumpy indeed. We will see how this affects the use of microphones at another time.

Omnidirectional

Looking at directional characteristics from a more academic standpoint, the omnidirectional microphone is sensitive to the pressure of the sound wave. The diaphragm is completely enclosed, apart from a tiny slow-acting air-pressure equalizing vent, and the mic effectively compares the changing pressure of the outside air under the influence of the sound signal with the constant pressure within. Pressure acts equally in all directions, therefore the mic is equally sensitive in all directions, in theory as we said. In practice, at higher frequencies where the size of the mic starts to become significant in comparison with the wavelength, the diaphragm will be shielded from sound approaching from the rear and rearward HF response will drop.

Figure-of-Eight

At the other end of the spectrum of polar patterns the figure-of-eight microphone is sensitive to the pressure gradient of the sound wave. The diaphragm is completely open to the air at both sides. Even though it is very light and thin, there is a difference in pressure at the front and rear of the diaphragm, and the microphone is sensitive to this difference. The pressure gradient is greatest for sound arriving directly from the front or rear, and lessens as the sound source moves round to the side. When the sound source is exactly at the side of the diaphragm it produces equal pressure at front and back, therefore there is no pressure gradient and the microphone produces no output. Therefore the figure-of-eight microphone is not sensitive at the sides. (You could also imagine that a sound wave would find it hard to push the diaphragm sideways; sometimes the intuitive explanation is as meaningful as the scientific one.)

Cardioid and Hypercardioid

To produce the in-between polar patterns one could consider the omnidirectional microphone where the diaphragm is open on one side only, and the figure-of-eight microphone where the diaphragm is completely open on both sides. Allowing partial access only to one side of the diaphragm would therefore seem to be a viable means of producing the in-between patterns, and indeed it is. A cardioid or hypercardioid mic therefore provides access to the rear of the diaphragm through a carefully designed acoustic labyrinth. Unfortunately the effect of the acoustic labyrinth is difficult to equalize for all frequencies, therefore one would expect the polar response of cardioid and hypercardioid microphones to be inferior to that of omnidirectional and figure-of-eight mics.

All directional microphones exhibit a phenomenon known as the proximity effect or bass tip-up. The explanation for this is sufficiently complicated to fall outside of the required knowledge of the working sound engineer. The practical consequences are that close miking results in enhanced low frequency. This produces a signal that is not accurate, but it is often thought of as being ‘warmer’ than the more objectively accurate sound of an omnidirectional microphone.
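The point made for the omnidirectional mic, that its body starts to shield the diaphragm once its size becomes significant in comparison with the wavelength, is easier to judge with some numbers. The sketch below uses wavelength = speed of sound / frequency, with an assumed speed of sound of 343 m/s (roughly room temperature); the frequencies are arbitrary examples.

```python
SPEED_OF_SOUND = 343.0  # meters per second, assumed value for air at about 20 C

def wavelength_mm(frequency_hz: float) -> float:
    """Wavelength in millimeters for a given frequency in air."""
    return SPEED_OF_SOUND / frequency_hz * 1000.0

for frequency in (100, 1_000, 10_000, 20_000):
    print(f"{frequency:>6} Hz -> {wavelength_mm(frequency):8.1f} mm")
```

At 100 Hz the wavelength is several meters and a microphone body is negligible by comparison, whereas at 10 kHz and above the wavelength is down to a few centimeters, which is comparable with the dimensions of many microphones.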

Multipattern Microphones

There are many microphones available that can produce a selection of polar patterns. This is achieved by mounting two diaphragms back-to-back with a single central backplate. By varying the relative polarization of the diaphragms and backplate, any of the four main polar patterns can be created.

It is often thought that the best and most accurate microphones are the true omnidirectional and the true figure-of-eight, and that mimicking these patterns with a multipattern mic is less than optimal. Nevertheless, in practice multipattern mics are so versatile that they are commonly the mic of first choice for many engineers.

[Figure: AKG C414]
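One way to picture how two back-to-back diaphragms yield several patterns is to model each as an ideal cardioid facing in opposite directions and to add the rear one with a variable weight. This is a simplified model assumed for illustration; the weight k stands in loosely for the relative polarization, and the specific values of k are not taken from the text.

```python
import math

def front_cardioid(angle_deg: float) -> float:
    return 0.5 * (1.0 + math.cos(math.radians(angle_deg)))

def back_cardioid(angle_deg: float) -> float:
    return 0.5 * (1.0 - math.cos(math.radians(angle_deg)))

def combined(angle_deg: float, k: float) -> float:
    """Front capsule plus k times the rear capsule (k from -1 to +1)."""
    return front_cardioid(angle_deg) + k * back_cardioid(angle_deg)

SETTINGS = {            # assumed weights for the four main patterns
    "omnidirectional": 1.0,
    "cardioid": 0.0,
    "hypercardioid": -0.5,
    "figure-of-eight": -1.0,
}

for name, k in SETTINGS.items():
    values = ", ".join(f"{angle} deg: {combined(angle, k):+.2f}"
                       for angle in (0, 90, 180))
    print(f"{name:16s} {values}")
```

Adding the two capsules equally gives the same output in every direction, subtracting them equally gives the figure-of-eight with its inverted rear, and intermediate weights give the patterns in between.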

Special Microphone Types

Stereo Microphone

Two capsules may be combined into a single housing so that one mic can capture both left and right sides of the sound field. This is much more convenient than setting two mics on a stereo bar, but obviously less flexible. Some stereo mics use the MS principle where one cardioid capsule (M) captures the full width of the sound stage while the other figure-of-eight capsule (S) captures the side-to-side differences. The MS output can be processed to give conventional left and right signals.

[Figure: Neumann stereo microphones]

Interference Tube Microphone

This is usually known as a shotgun or rifle mic because of its similarity in appearance to a gun barrel. The slots in the barrel allow off-axis sound to cancel, giving a highly directional response. The longer the mic, the more directional it is. The sound quality of these microphones is inferior to normal mics so they are only used out of necessity.

[Figure: Sennheiser interference tube microphone]
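The processing of MS signals into left and right, mentioned under Stereo Microphone, is conventionally done with a sum and difference: left = M + S and right = M - S. The sketch below shows that arithmetic on short lists of sample values. The function name and the width control are illustrative additions, and no gain scaling is applied, although practical decoders often include some.

```python
def ms_to_lr(mid, side, width=1.0):
    """Decode mid/side sample lists to left/right by sum and difference.

    width scales the side signal: 0.0 collapses to mono, values above
    1.0 exaggerate the stereo spread (an illustrative extra control).
    """
    left = [m + width * s for m, s in zip(mid, side)]
    right = [m - width * s for m, s in zip(mid, side)]
    return left, right

mid_samples = [1.0, 0.5]
side_samples = [0.25, -0.5]

left, right = ms_to_lr(mid_samples, side_samples)
print("left: ", left)
print("right:", right)
```

Because the side signal only appears with opposite signs in the two outputs, summing left and right to mono removes it completely and leaves just the M capsule's signal.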

A close relation of the interference tube microphone is the parabolic reflector mic. This looks like a satellite dish antenna and is used for recording wildlife noises, and at sports events to capture comments from the pitch.

Boundary Effect Microphone

The original boundary effect microphone was the Crown PZM (Pressure Zone Microphone) so the boundary effect microphone is often referred to generically as the PZM. One of the main problems in the use of microphones is reflections from nearby flat surfaces entering the mic. In this mic, the capsule is mounted close to a flat metal plate, or inset into a wooden or metal plate. Instead of mounting it on a stand, it is taped to a flat surface. By mounting the capsule within around 7 mm from the surface, these reflections add to the signal in phase rather than interfering with it. The characteristic sound of the boundary effect microphone is therefore very clear (as long as there are no other nearby reflecting surfaces). The polar response is hemispherical. It can be used for many types of recording, and can also be seen in police interview rooms where obviously a clear sound has to be captured for the interview recording.

[Figure: Crown PZM microphone]

Miniature Microphone

This is sometimes known as a ‘tie-clip’ mic, although it is rarely ever clipped to the tie these days. Miniature microphones are used in television and in theater, where there is a requirement for microphones to be unobtrusive. This type of mic is usually of the electret design, which lends itself to very compact dimensions, and is almost always omnidirectional. Since the diaphragm is small and not in contact with many air molecules, the random vibration of the molecules does not cancel out as effectively as it does in a microphone with a larger diaphragm. Miniature microphones therefore have to be used close to the sound source; otherwise noise will be evident.

[Figure: Beyerdynamic MCE5]
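To see why the boundary effect microphone keeps its capsule so close to the surface, it helps to estimate where a reflection first cancels the direct sound. In a simplified model (a single perfect reflection, assumed here for illustration) the first cancellation falls where the extra path travelled by the reflection equals half a wavelength, so frequency = speed of sound / (2 × path difference). The 343 m/s figure and the 0.5 m example are assumptions.

```python
SPEED_OF_SOUND = 343.0  # meters per second, assumed value for air

def first_notch_hz(path_difference_m: float) -> float:
    """Lowest frequency at which a reflection arrives half a
    wavelength late and cancels the direct sound."""
    return SPEED_OF_SOUND / (2.0 * path_difference_m)

# Capsule 7 mm from the surface: the reflection travels at most
# about 14 mm further than the direct sound.
boundary = first_notch_hz(2 * 0.007)
# Hypothetical mic on a stand where the reflection travels 0.5 m further.
on_stand = first_notch_hz(0.5)

print(f"boundary mic: first cancellation near {boundary:.0f} Hz")
print(f"mic on stand: first cancellation near {on_stand:.0f} Hz")
```

Under these assumptions the cancellation for the stand-mounted mic lands in the middle of the audio band, while for the boundary mic it is pushed up toward the top of the band, so over most of the range the reflection reinforces the direct sound.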

Vocal Microphone

For popular music vocals it is common to use a large-diaphragm mic, often an old tube model. A large-diaphragm mic generally has a less accurate sound than a mic with a diaphragm 10-12 mm or so in diameter. The off-axis response will tend to be poor. Despite this, models such as the Neumann U87 are virtually standard in this application due to their enhanced subjective ‘warmth’ and ‘presence’.

Microphone Accessories

First in the catalogue of microphone accessories is the mic support. Supports range from table stands, short floor stands, normal boom stands and tall stands up to 4 meters for orchestral recording, to fishpoles as used by video and film sound recordists, and long booms with cable operated mic positioning used in television studios. Attaching the mic to the stand is a mount that can range from a basic plastic clip to an elastic suspension or cradle that will isolate the microphone from floor noise.

The other major accessory is the windshield or pop-shield. A windshield may be made out of foam and slipped over the mic capsule, or it may look like a miniature airship covered with wind-energy dissipating material. For blizzard conditions windshield covers are available that look as though they are made out of yeti fur. The pop-shield, on the other hand, is a fine mesh material stretched over a metal or plastic hoop, used to filter out the blast of air caused by a voice artist's or singer's ‘P’ and ‘B’ sounds.

Check Questions

• What is the piezoelectric effect?
• Where would you find a piezoelectric transducer?
• What is attached to the diaphragm of a dynamic microphone?
• What passive circuit component is incorporated in the output stage of all professional microphones? (Note that some microphones use an active circuit to imitate the action of this component.)
• Describe the sound of a dynamic microphone.
• How does a ribbon microphone differ from an ordinary dynamic microphone?
• What is the old term for 'capacitor microphone'?
• Why does the capacitor microphone have a more accurate sound than a dynamic microphone?
• Why does a capacitor microphone need to be powered (two reasons)?
• What precaution should you take when switching on phantom power?
• Can dynamic microphones of professional quality be used with phantom power switched on?
• What is a pad?
• Why does an electret microphone need to be powered?
• Describe the actual polar response of a typical nominally omnidirectional microphone.
• Describe the proximity effect.
• What is an 'acoustic labyrinth', as applied to microphones?
• Why does a boundary effect microphone give a clear sound?
• Why are large-diaphragm microphones used for popular music vocals?
• Describe the differences between wind shields and pop shields.

Chapter 2: The Use of Microphones

Use of Microphones for Speech

In sound engineering, there are commonly considered to be three classes of sound: speech or dialogue, music and effects. Each has its own considerations and requirements regarding the use of microphones. There are a number of scenarios where speech may be recorded, broadcast or amplified:

• Audio book
• Radio presentation, interview or discussion
• Television presentation, interview or discussion
• News reporting
• Sports commentary
• Film and television drama
• Theatre
• Conference

In some of these, the requirement is for speech that is as natural as possible. In an ideal world perhaps it should even sound as though a real person were in the same room. The audio book is in this category, as are many radio programs. There is a qualification however on the term ‘natural’. We have all been conditioned to expect a certain quality of sound from our stereos, hifis, radio and television receivers, and when we get it, it sounds natural, even if it isn’t in objective terms. Sometimes what we regard as a natural sound is the sound that we expect to hear via a loudspeaker, not the real acoustic sound of the human voice.

In the recording and most types of broadcasting of speech, as opposed to communications which will not be considered here, there are some definite requirements:

• No pops on ‘P’ or ‘B’ sounds
• No breath noise or ‘blasting’
• Little room ambience or reverberation
• A pleasing tone of voice

Popping and blasting can be prevented in two ways. One is to position the microphone so that it points at the mouth, but is out of the direct line of fire of the breath. So often we see microphones used actually in the line of fire of the breath that it seems as though it is simply the ‘correct’ way to use a microphone. It can be for public address, but it isn’t for broadcasting or recording. The other way is to use a pop shield. Ideally this is an open mesh stocking-type material stretched over a metal or plastic hoop. This can be positioned between the mouth and the microphone and is surprisingly effective in absorbing potential pops and blasts. Sometimes a foam windshield of the type that slips over the end of the microphone is used for this purpose. A windshield is really what it says, and is not 100% effective for pops, although its unobtrusiveness visually has value, for example, for a radio discussion where hoop-type pop shields would mar face-to-face visual communication among the participants.

The requirement for little room ambience or reverberation is handled by placing the microphone quite close to the mouth, around 30 to 40 cm. If the studio is acoustically treated, this will work fine. Special acoustic tables are also available which absorb rather than reflect sound from their surface.

‘A pleasing tone of voice’? Well, first choose your voice talent. Second, it is a fact that some microphones flatter the voice. Some work particularly well for speech, and there are some classic models such as the Electrovoice RE20 that are commonly seen in this application. Generally, one would be looking for a large-diaphragm capacitor microphone, or a quality dynamic microphone, for natural or pleasing speech for audio books or radio broadcasting.

In television broadcasting, one essential requirement is that the microphone should be out of shot or unobtrusive. The usual combination for a news anchor, for example, is to have a miniature microphone attached to the clothing in the chest area, backed up by a conventional mic on a desk stand. Often the conventional mic is held on stand-by to be brought on quickly if the miniature mic fails, as miniature mics are prone to do through constant handling. Oddly enough, the use of microphones on television varies according to geography. In France for example, it is quite common for a television presenter to hand hold a microphone very close to the mouth. Even a discussion can take place with three or four people each holding a microphone. The resultant sound quality is in accordance with French subjective requirements.

Radio microphones are commonly used in television to give freedom of movement and also freedom from cables on the floor, leaving plenty of free space for the cameras to roll around smoothly.
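The advice to work at around 30 to 40 cm to keep room ambience down can be illustrated with the inverse square law: in free space the direct sound from a small source falls by about 6 dB for each doubling of distance, while the reverberant sound in a room stays roughly constant wherever the mic is placed. Both statements are idealizations assumed for this sketch, and the distances are arbitrary examples.

```python
import math

def direct_level_change_db(reference_m: float, new_m: float) -> float:
    """Change in direct sound level when the mic moves from
    reference_m to new_m (ideal point source in free space)."""
    return 20.0 * math.log10(reference_m / new_m)

REFERENCE = 0.35  # meters, middle of the suggested 30 to 40 cm range

for distance in (0.35, 0.70, 1.40, 2.80):
    change = direct_level_change_db(REFERENCE, distance)
    print(f"{distance:.2f} m: direct sound {change:+.1f} dB relative to 0.35 m")
```

If the reverberant level is taken as fixed, every 6 dB lost from the direct sound is 6 dB by which the room becomes relatively more prominent in the recording.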

News Reporting

For news reporting, a robust microphone, perhaps a short shotgun, can be used with a general-purpose foam windshield for both the reporter and interviewee, should there be one. Such a microphone is easily pointable (the reporter isn’t a sound engineer) and brings home good results without any trouble. The sound quality of a news report may not be all that could be imagined, but a little bit of harshness or degradation sometimes, oddly, makes the report more ‘authentic’.

Sports Commentary

Sports commentary is a very particular requirement. This often takes place in a noisy environment so the microphone must be adapted to cope with this. The Coles 4104 is an example of a 1950s design that is still widely used. It is a noise-cancelling microphone that almost completely suppresses background noise, and the positioning bar on the top of the mic ensures that the commentator always holds it in the correct position (as, indeed, it is always held; sports commentators often like to move around in their commentary box as they work). The result is a mic that has a heavily compromised sound quality, but this has come to be accepted as the sound of sports commentary so it is now a requirement.

Film and Television Drama

For film and television drama, a fishpole (or boom as it is sometimes known) topped by a shotgun or rifle mic with a cylindrical windshield is the norm. The operator can position and angle the mic to get the best quality dialogue (while monitoring on headphones), while keeping the mic, and the shadow of the mic, out of shot. Sometimes in the studio a microphone might be mounted on a large floor mounted boom that can extend over several meters (we’re not in fishing country anymore). In this case the boom operator has winches to point and angle the microphone. Miniature microphones are also used in this context, often with radio transmitters. Obviously they must not be visible at all. However, concealing the mic in the costume can affect sound quality so care must be taken.

Theatre

In theatre the choice is between personal miniature microphones with radio transmitters, or area miking from the front and sides of the stage. Personal microphones allow a higher sound level before feedback since they are close to the actor’s mouth. For straight drama, it isn’t necessary to have a high sound level in the auditorium. In fact in most theatres it is perfectly acceptable for the sound of the actors’ voices to be completely unamplified. However if amplification, or reinforcement, is to be used then area miking is usually sufficient. Shotgun or rifle mics are positioned at the front of the stage (an area sometimes known for traditional reasons as ‘the floats’, therefore the mics are sometimes called ‘float mics’) to create sensitive spots on stage from which the actors can easily be heard. The drawback is that there will be positions on the stage from which the actors cannot be heard. The movements of the actors have to be planned to take account of this.

Conference

I use this term loosely to cover everything from company boardrooms to political party conferences. You will see that there can be a vast difference in scale. In the boardroom it has become common to use gooseneck microphones or boundary effect microphones that are specifically designed for that purpose. This lies beyond what we normally consider to be sound engineering and is categorized in the specialist field of sound installation.

The party conference is another matter. To achieve reasonably high sound levels the microphone has to be close to the mouth, yet the candidate, for obvious reasons, does not want to look like a microphone-swallowing rock star. Therefore the microphone has to be unobtrusive so that it can be placed fairly close to the mouth without drawing undue attention to itself (the cluster of broadcasters’ microphones in front of the lectern is another matter, but they don’t have to be so close). The AKG C747 is very suitable for this application.

You will have noticed that in this context microphones are often used in pairs. There are two schools of thought on this issue. One is that the microphones should point inwards from the front corners of the lectern. This allows the speaker to turn his or her head and still receive adequate pickup. Unfortunately, as the head moves, both microphones can pick up the sound while the sound source, the mouth, is moving towards one mic and away from the other. The Doppler effect comes into play and two slightly pitch shifted signals are momentarily mixed together. It sounds neither pleasant nor natural. The alternative approach is to mount both microphones centrally and use one as a backup. The speaker will learn, through not hearing their voice coming back through the PA system, that they can only turn so far before useful pickup is lost.

It is worth saying that in this situation, the person speaking must be able to hear their amplified voice at the right level. If their voice seems too loud, to them, they will instinctively back away from the mic. If they can’t hear their amplified voice they will assume the system isn’t working. I once saw the chairman of a large and prestigious organisation stand away from his microphone because he thought it wasn’t working. It had been, and at the right level for the audience. But unfortunately, apart from the front few rows, they were unable to hear a single unamplified word he said.
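The Doppler problem described for a pair of lectern microphones can be put into rough numbers. For a source moving directly toward a stationary microphone the received frequency is f × c / (c - v), and for a source moving away it is f × c / (c + v). The speed of sound of 343 m/s and the head speed of 0.5 m/s below are assumptions for illustration only.

```python
import math

SPEED_OF_SOUND = 343.0  # meters per second, assumed value for air

def received_hz(source_hz: float, speed_toward_mic: float) -> float:
    """Frequency picked up by a stationary mic; use a negative
    speed for a source moving away from the mic."""
    return source_hz * SPEED_OF_SOUND / (SPEED_OF_SOUND - speed_toward_mic)

def shift_cents(source_hz: float, received: float) -> float:
    """Pitch shift in cents (hundredths of a semitone)."""
    return 1200.0 * math.log2(received / source_hz)

TONE = 1000.0       # a 1 kHz component of the voice
HEAD_SPEED = 0.5    # hypothetical speed of the mouth, meters per second

toward = received_hz(TONE, HEAD_SPEED)
away = received_hz(TONE, -HEAD_SPEED)

print(f"mic approached: {toward:.2f} Hz ({shift_cents(TONE, toward):+.2f} cents)")
print(f"mic receded:    {away:.2f} Hz ({shift_cents(TONE, away):+.2f} cents)")
print(f"difference when mixed: {toward - away:.2f} Hz")
```

The shift in each microphone is only a few cents in this example, but when the two signals are mixed the small frequency difference produces slow beating and shifting cancellations, which fits the description of a sound that is neither pleasant nor natural.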

Use of Microphones for Music

The way in which microphones are used for music varies much more according to the instrument than it possibly could for speech where the source of sound is of course always the human mouth. There are two golden rules:

• Point the microphone at the sound source from the direction of the best natural listening position.
• The microphone will always be closer than a natural comfortable listening distance.

So, wherever you would normally choose to listen from is the right position for the microphone, except that the microphone has to be closer because it can’t discriminate direct sound from reflected sound in the way the human ear/brain can. It is always a good starting point to follow these two rules, but of course it may not always be possible, or a natural sound may not be wanted for whatever reason.

First, some scenarios:

• Recording
• Broadcast
• Public address
• Recording studio
• Location recording
• Concert hall
• Amplified music venue
• Theatre

The requirements of recording and broadcasting are very similar, except that broadcasting often works to a more stringent timescale, and in television broadcasting microphones must be invisible or at least unobtrusive. Broadcasters, by the way, tend to place the microphone closer than recording engineers. They need to get a quick, practical, reliable result, and a close mic position is simply safer for this purpose. Ultimate sound quality is not of such importance.

The recording studio is a very comfortable environment for microphones. The engineer is able to use any microphone he or she desires and has available. The mic may be old, large and ugly, cumbersome to use perhaps with an external power supply (not phantom) and pattern selector, prone to faults etc., but if it gets the right sound, then it will be used. Location recording is not quite so comfortable and you need to be sure that the microphones are reliable and easy to use, preferably without external power supplies and with a simple stand mount rather than a complicated elastic suspension.

As far as comfort goes, the concert hall is a reasonably good place to record in as at least they are used to the requirements of music (the owners of many good recording venues often have higher priorities – religious worship being a prominent example). There are however restrictions on the placement of microphones during a concert. Usually it is against fire regulations to have microphones among the audience, unless the mics are positioned in such a way that they don’t impede egress and cables are very securely fixed. Generally therefore there will be a stereo pair of mics slung from the ceiling, supplemented by a number of mics on stage, which are closer than the engineer would probably prefer them to be under ideal circumstances.

For amplified music, the problem is always in getting sufficient level without feedback. This necessitates that microphones are very much closer than the natural listening position, to the point that natural direction has very little meaning. The ultimate example would be a microphone clipped to the bridge or sound hole of a violin. It wouldn’t even be possible to listen from this position. In rock music PA, microphones are used as close to the singer’s lips as possible, right against the grille cloth of a guitarist’s speaker cabinet and within millimetres of the heads of the drums. In this context, the most distant mics would be the drum overhead mics, which don’t need much gain anyway. Primarily this is to achieve level without risk of feedback. However this has also come to be understood as the ‘rock music sound’ because it is what the audience expects. For string and wind instruments there are a variety of clip-on mics available. The placement of the mic is significant. There are also contact mics that pick up vibrations directly from the body of the instrument, although even these are not entirely immune to feedback.

In theatre musicals, the best option for the lead performers is to use miniature microphones with radio transmitters. The original ‘lavalier’ placement, named for Mme Lavalier who reportedly wore a large ruby from her neck, has long gone. The chest position is great for newsreaders but it suffers from the shadow of the chin and boominess caused by chest resonance. The best place for a miniature microphone is on a short boom extending from behind the ear. This actually captures a very good vocal sound. It has to be tried to be believed. Mics and booms are available in a variety of flesh colours so they are not visible to the audience beyond the second or third row. If a boom is not considered acceptable, then the mic may protrude a short distance from above the ear, or descending from the hairline. One of the biggest problems with miniature microphones in the theatre is that they become ‘sweated out’ after a number of performances and have to be replaced. Still, no-one said that it was easy going on stage.

For the orchestra in a theatre musical, clip on mics are good for string instruments. Wind instruments are generally loud enough for conventional stand mics, closely placed. So-called ‘booth singers’ can use conventional mics.

Stereo Microphone Techniques

Firstly, what is stereo? The word ‘stereophonic’ in its original meaning suggests a ‘solid’ sound image and does not specify how many microphones, channels or loudspeakers are to be used. However, it has come to mean two channels and two loudspeakers using as few or as many microphones as are necessary to get a good result. (By the way, some people complain that ‘stereophonic’, as a word, combines both Greek and Latin roots. Just as well perhaps, because if it had been exclusively Latin it would have been ‘crassophonic’!)

When recording a group of instruments or singers, it is possible to use just two or three microphones to pick up the entire ensemble in stereo, and the results can be very satisfying. There are a number of techniques:

• Coincident crossed pair
• Near-coincident crossed pair
• ORTF
• Mercury Living Presence
• Decca Tree
• Spaced omni
• MS
• Binaural

The coincident crossed pair technique traditionally uses two figure-of-eight microphones angled at 90 degrees pointing to the left and right of the sound stage (and, due to the rear pickup of the figure-of-eight mic, to the left and right of the area where the audience would be also). More practically, two cardioid microphones can be used. They would be angled at 120 degrees were it not for the drop off in high frequency response at this angle in most mics. A 110-degree angle of separation is a reasonable compromise.

This system was originally proposed in the 1930s and mathematically inclined audio engineers will claim that this gives perfect reproduction of the original sound field from a standard pair of stereo loudspeakers. However perfect the mathematics look on paper, the results do not bear out the theory. When it works, you should be able to sit in an equilateral triangle with the speakers, listen to a recording of an orchestra and pinpoint where every instrument is in the sound image. The sound can be good, and you can with effort tell where the instruments are supposed to be in the sound image. The problem is that you just don’t feel like you are in the concert hall, or wherever the recording was made.

[Figure: Coincident crossed pair]

Separating the mics by around 10 cm tears the theory into shreds, but it sounds a whole lot better. The fact that human beings do not have coincident ears might have something to do with it.

[Figure: Near-coincident crossed pair]

The ORTF system, named for the Office de Radiodiffusion Télévision Française, uses two cardioid microphones spaced at 17 cm angled outwards at 110 degrees, and is simply an extended near-coincident crossed pair.

The redeeming feature of the coincident crossed pair is that you can mix the left and right signals into mono and it still sounds fine. Mono, but fine. We call this mono compatibility and it is important in many situations – the majority of radio and television listeners still only have one speaker. The further apart the microphones are spaced, the worse the mono compatibility, although near-coincident and ORTF systems are still usable.

[Figure: ORTF]

Mercury Living Presence was one of the early stereo techniques of the 1950s, used for classical music recordings on the Mercury label. If you imagine trying to figure out how to make a stereo recording when there was no-one around to tell you how to do it, you might work out that one microphone pointing left, another pointing center and a third pointing right might be the way to do it. Record each to its own track on 35mm magnetic film, as used in cinema audio, and there you have it! Nominally omnidirectional microphones were used, but of course the early omni mics did become directional at higher frequencies. Later recordings were made to two-track stereo. These recordings stand up remarkably well today. They may have a little noise and distortion, but the sound is wonderfully clear and alive.

The same can be said of the Decca tree, used by the Decca record company. This is not dissimilar from the Mercury Living Presence system but baffles were used between the microphones in some instances to create separation, and additional microphones might be used where necessary, positioned towards the sides of the orchestra.
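The link between microphone spacing and mono compatibility, mentioned above, can be sketched with a little arithmetic. The Python below is a simplified model under stated assumptions: a distant source 30 degrees off centre, both microphones picking it up at equal level, and sound travelling at 343 m/s. Real microphones also differ in level and directivity, so treat the figures as indicative only. The function names are invented for this example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s (assumed)


def arrival_delay(spacing_m, source_angle_deg):
    """Extra travel time to the further mic for a distant source.

    spacing_m is the distance between the two mics; the angle is
    measured from straight ahead.
    """
    return spacing_m * math.sin(math.radians(source_angle_deg)) / SPEED_OF_SOUND


def first_null_hz(delay_s):
    """Lowest frequency that cancels when the two channels are summed to mono."""
    if delay_s <= 0.0:
        return math.inf  # coincident mics: no time difference, no cancellation
    return 1.0 / (2.0 * delay_s)


def mono_sum_gain_db(freq_hz, delay_s):
    """Level of the mono sum relative to a perfectly in-phase sum."""
    magnitude = abs(math.cos(math.pi * freq_hz * delay_s))
    return 20.0 * math.log10(max(magnitude, 1e-12))


for name, spacing in (("coincident", 0.0), ("ORTF, 17 cm", 0.17), ("spaced, 2 m", 2.0)):
    delay = arrival_delay(spacing, 30.0)
    print(f"{name}: first mono null at {first_null_hz(delay):.0f} Hz")
```

Under these assumptions the first cancellation for the 17 cm pair sits around 2 kHz, while for microphones two meters apart it falls to roughly 170 Hz with further notches at odd multiples above it, which is consistent with the point that wider spacing means worse mono compatibility.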

[Figure: Decca tree]

Another obvious means of deploying microphones in the early days of stereo was to place three microphones spaced apart at the front of the orchestra, much more distant from each other than in the above systems. If only two microphones are used spaced apart by perhaps as much as two meters or more, what happens on playback is that the sound seems to cluster around the loudspeakers and there is a hole in the middle of the sound image. To prevent this, a centre microphone can be mixed in at a lower level so that the ‘hole’ is filled. There is no theory on earth to explain why this works – being so dissimilar to the human hearing system – but it can work very well. The main drawback is that a recording made in such a way sounds terrible when played in mono, as explained previously.

The MS system uses a cardioid microphone to pick up an all-round mono signal, and a figure-of-eight mic to pick up the difference between left and right in the sound field. The M and S signals can be combined without too much difficulty to provide conventional left and right signals. This is of practical benefit when it is necessary to record a single performer in stereo. With a coincident crossed pair, one microphone would be pointing to the left of the performer, the other would be pointing to the right. It just seems wrong not to point a microphone directly at the performer, getting the best possible sound quality from the mic, and with the MS system you do. It is sometimes proposed as an advantage of MS that it is possible to control the width of the stereo image by adjusting the level of the S signal. This is exactly the same as adjusting the width by turning the mixing console’s panpots for the left and right signals closer to the centre. Therefore it is in reality no advantage at all.

Binaural stereo attempts to mimic the human hearing system with a dummy head (sometimes face, shoulders and chest too) with two omnidirectional microphones placed in artificial ears just like a real human head. It works well, but only on headphones. A binaural recording played on speakers doesn’t work because the two channels mix on their way to the listener, spoiling the effect. There have been a number of systems attempting to make binaural recordings work on loudspeakers but none has become popular.

In addition to the stereo miking system, it is common to mic up every section of an orchestra, whether it is a classical orchestra, film music, or the backing for a popular music track. Normally the stereo mic system, crossed pair or whatever, is considered the main source of signal, with the other microphones used to compensate for the distance to the rear of the orchestra, and to add just a little presence to instruments where appropriate. Sectional mics shouldn’t be used to compensate for poor balance due to the conductor or arranger. Sometimes however classical composers don’t get the balance quite right and it is not acceptable to change the orchestration. A little technical help is therefore called for.

Instruments

We come back to the two golden rules of microphone placement, as above. There are many books and texts that claim to tell you how and where to position microphones for all manner of instruments, but the key is to experiment and find the best position for the instrument – and player – you have in front of you. Small changes in microphone position can affect the sound quality enormously. Experience, not book learning, leads to success. It is worth looking at some specific examples:

Saxophone

There are two fairly obvious ways a saxophone can be close miked. One is close to the mouthpiece, another is close to the bell. The difference in sound quality is tremendous. The same applies to all close miking. Of the two saxophone close miking positions, neither will capture the natural sound of the instrument, if that’s what you want. Close mic positions almost never do. If you move the mic further away, up to around a meter, you will be able to capture the sound of the whole of the instrument, mouthpiece, bell, the metal of the instrument, and the holes that are covered and uncovered during the normal course of playing. Also as you move away you will capture more room ambience, and that is a compromise that has to be struck. Natural sound against room ambience.

Piano

Specifically the grand piano – it is common to place the microphone (or microphones) pointing directly at the strings, spaced apart by maybe 30 cm. Oddly enough no-one ever listens from this position and it doesn’t really capture a natural sound, but it might be the sound you want. The closer the microphones are to the higher strings, the brighter the sound will be. You can position the microphones all the way at the bass end of the instrument, and a rich full sound will be captured. It’s subjective. Move the microphones below the edge of the case and angle them so that they pick up reflected sound from the lid and a more natural sound will be discovered. You can even place a microphone under a grand piano to capture the vibration of the soundboard. It can even sound quite good, but listen out for noise from the foot pedals.

Drums

The conventional setup is one mic per drum, a mic for the hihat perhaps, and two overhead mics for the cymbals. Recording drums is an art form and experience is by far the best guide. There are some points to bear in mind:

You can’t get a good recording of a poor kit, particularly cymbals, or a kit that isn’t well set up.

Dynamic mics generally sound better for drums, capacitor mics for cymbals.

The mics have to be placed where the drummer won’t hit them, or the stands.

It is often necessary to damp the drums by taping material to the edge of the drum head to get a shorter, more controlled sound.

The kick drum should have its front head removed, or there should be a large hole cut out so that a damping blanket can be placed inside. Otherwise it will sound more like a military bass drum than the dull thud that we are used to. The choice of beater – hard or soft – is important, as is the position of the kick drum mic either just outside, or some distance inside the drum.

The snares on the underside of the snare drum may rattle when other drums are being played. Careful adjustment of the tension of the snares is necessary, and perhaps even a little damping.

Microphones should be spaced as far apart from each other as possible and directed away from other drums. Every little bit helps as the combination of two mics picking up the same drum from different distances leads to cancellation of groups of frequencies. The brute force technique is to use a noise gate on every microphone channel, and this is commonly done. Noise gates will be covered later.

Perhaps this is a brief introduction to the use of microphones, but it’s a start. And to round off I’ll give away the secret of getting good sound from your microphones: Listen!
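Going back to the MS system described earlier in this chapter, the combination of M and S into left and right is usually taken to be a simple sum and difference. The sketch below assumes that convention (left = M + S, right = M - S, with no gain normalisation) and uses an invented `width` parameter to scale the S signal. It is an illustration of the principle rather than a description of any particular decoder, and the sample values are made up.

```python
def ms_to_lr(mid, side, width=1.0):
    """Decode M and S sample lists into left and right.

    Assumed convention: left = M + width * S, right = M - width * S.
    width = 0 collapses the image to mono; width = 1 is the plain decode.
    """
    left = [m + width * s for m, s in zip(mid, side)]
    right = [m - width * s for m, s in zip(mid, side)]
    return left, right


def mono_sum(left, right):
    """Sum left and right to mono, as a single-speaker listener would hear it."""
    return [l + r for l, r in zip(left, right)]


mid = [1.0, 0.5, -0.25]     # made-up samples from the forward-facing mic
side = [0.25, -0.5, 0.125]  # made-up samples from the figure-of-eight mic

left, right = ms_to_lr(mid, side)
print(left, right)

# Whatever the width, the S signal cancels in mono and only M remains
# (doubled, since no gain normalisation is applied in this sketch).
print(mono_sum(*ms_to_lr(mid, side, width=0.5)))
```

Note that the S signal disappears completely from the mono sum, which is why an MS recording behaves like a coincident pair as far as mono compatibility is concerned.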

Check Questions

• What problem is commonly found in live sports commentary?
• What does a fishpole operator concentrate on while working?
• In theater, what is 'area miking'?
• How is feedback avoided in live sound (the simplest technique)?
• Why must the speaker at a conference hear his or her own amplified voice at the right level?
• Write down, copy if you wish, the two golden rules for microphone positioning.
• Why do microphones have to be placed closer than a natural listening position?
• Where are personal mics worn in the theater?
• What is stereo?
• Describe the coincident crossed pair.
• What is the benefit of separating the microphones (relate this to the human hearing system)?
• What is the value of mono compatibility?
• Why is it desirable to mic up every section of an orchestra independently?
• Pick an instrument other than those mentioned in the text, performed solo. Describe the effect of two alternative close miking positions.
• When you look at a grand piano, on stage, does the pianist sit on the left or the right? Why?
• Why do drums often need to be damped?

Chapter 3: Loudspeaker Drive Units

Loudspeakers are without doubt the most inadequate component of the audio signal chain. Everything else, even the microphone, is as close to the capabilities of human hearing as makes hardly any difference at all. However, amplify the signal and convert it back into sound and you will know without any hesitation whatsoever that you are listening to a loudspeaker, not a natural sound source. Loudspeakers can be categorized by method of operation and by function:

Method of operation:
• Moving coil
• Electrostatic
• Direct radiator
• Horn

Function:
• Domestic
• Hi-fi
• Studio
• PA

In this context we will use ‘PA’ to mean concert public address rather than announcement systems that are beyond the scope of this text.

The moving coil loudspeaker, or I should say ‘drive unit’ as this is only one component of the complete system, is the original and still most widely used method of converting an electric signal to sound. The components consist of a magnet, a coil of wire (sometimes called the ‘voice coil’) positioned within the field of the magnet and a diaphragm that pushes against the air. When a signal is passed through the coil, it creates a magnetic field that interacts with the field of the permanent magnet causing motion in the coil and in turn the diaphragm. It is probably fair to say that 99.999% of the loudspeakers you will ever come across use moving coil drive units.

The electrostatic loudspeaker (and this time it is a loudspeaker rather than just a drive unit) uses electrostatic attraction rather than magnetism. The electrostatic loudspeaker has the most natural sound quality, but is not capable of high sound levels. Hence it is rarely used in professional audio outside of, occasionally, classical music recording.

A moving coil drive unit can be constructed as either a direct radiator or a horn. In a direct radiator drive unit, the diaphragm pushes directly against the air. This is not very efficient as the diaphragm and the air have differing acoustic impedance, which creates a barrier for the sound to cross. A horn makes the transition from vibration in the diaphragm to vibration in the open air more gradual, therefore it is more efficient, and for a given input power the horn will be louder. Let's look at these in more detail:

Moving Coil Drive Unit

Perhaps the best place to start is a 200 mm drive unit intended for low and mid frequency reproduction. This isn't the biggest drive unit available, so why are larger drive units ever necessary? The answer is to achieve a higher sound level. A 200 mm drive unit only pushes against so much air. Increase the diameter to 300 mm or 375 mm and many more air molecules feel the impact. The next question would be, why are 300 mm or 375 mm drive units not used more often, when space is available? The answer to that is in the behavior of the diaphragm:

The diaphragm must not bend in operation otherwise it will produce distortion. It is sometimes said that the diaphragm should operate as a ‘rigid piston’.

The diaphragm could be flat and still produce sound. However, since the motor is at the center and vibrations are transmitted to the edges, the diaphragm needs to be stiff. The cone shape is the best compromise between stiffness and large diameter.

High frequencies will tend to bend the diaphragm more than low frequencies. It takes a certain time for movement of the coil to propagate to the edge of the diaphragm. Fairly obviously, at high frequencies there isn't so much time and at some frequency the diaphragm will start to deviate from the ideal rigid piston.

When the diaphragm bends, it is called break up, due to the vibration ‘breaking up’ into a number of different modes. ‘Break up’, in this context, doesn't mean severe distortion or anything like that. In fact most low frequency drive units are operated well into the break up region. It is up to the designer to ensure that the distortion created doesn't sound too unpleasant.

200 mm is a good compromise. It will produce enough level at low frequency for the average living room, and it will produce reasonably distortion-free sound up to around 4 kHz or so.

By the way, it is often thought that a larger drive unit will operate down to lower frequencies. This isn't quite the right way to look at it. Any size of drive unit will operate down to as low a frequency as you like, but you need a big drive unit to shift large quantities of air at low frequency. At high frequency, the drive unit vibrates backwards and forwards rapidly, moving air on each vibration. At low frequencies there are fewer opportunities to move air, therefore the area of the drive unit needs to be greater to achieve the desired level.

The material of the diaphragm has a significant effect on its stiffness. Early moving coil drive units used paper pulp diaphragms, which were not particularly stiff. Modern drive units use plastic diaphragms, or pulp diaphragms that have been doped to stiffen them adequately. Of course, the ultimate in stiffness would be a metal diaphragm. Unfortunately, it would be heavy and the drive unit would be less efficient. Carbon fiber diaphragms have also been used with some success. (It is worth noting that in drive units used for electric guitars, the diaphragm is designed to bend and distort. It is part of the sound of the instrument and a distortion-free sound would not meet a guitarist's requirements).

Moving up the frequency range: as we have said, the diaphragm will bend and produce distortion. Even if it didn't, there would still be the problem that a large sound source will tend to focus sound over a narrow area, becoming narrower as the frequency increases. In fact, this is the characteristic of direct radiator loudspeakers: that their angle of coverage decreases as the frequency gets higher. This is significant in PA, where a single loudspeaker has to cover a large number of people. (It is perhaps counter-intuitive that a large sound source will focus the sound, but it is certainly so. A good acoustics text will supply the explanation).

Because of these two factors, higher frequencies are handled by a smaller drive unit. A smaller diaphragm is more rigid at higher frequencies, and because it is smaller it spreads sound more widely. Often the diaphragm is dome shaped rather than conical. This is part of the designer's art and isn't of direct relevance to the sound engineer, as long as it sounds good. It might be stating the obvious at this stage, but a low frequency drive unit is commonly known as a woofer, and a high frequency drive unit as a tweeter.

In loudspeakers where a low frequency drive unit greater than 200 mm is used, it will not be possible to use the woofer up to a sufficiently high frequency to hand over directly to the tweeter. Therefore a mid frequency drive unit has to be used (sometimes known as a squawker!). The function of dividing the frequency band among the various drive units is handled by a crossover, more on which later.

Damage

There are two ways in which a moving coil drive unit may be damaged. One is to drive it at too high a level for too long. The coil will get hotter and hotter and eventually will melt at one point, breaking the circuit (‘thermal damage’). The drive unit will entirely cease to function. The other is to ‘shock’ the drive unit with a loud impulse. This can happen if a microphone is dropped, or placed too close to a theatrical pyrotechnic effect. The impulse won't contain enough energy to melt the coil, but it may break apart the turns of the coil, or shift it from its central position with respect to the magnet (‘mechanical damage’). The drive unit will still function, but the coil will scrape against the magnet producing a very harsh distorted sound. Many drive units can be repaired, but of course damage is best avoided in the first place.

One common question regarding damage to loudspeakers is this: What should the power of the amplifier be in relation to the rated power of the loudspeaker? In fact, although the power of an amplifier can be measured very accurately, the capacity of a loudspeaker to soak up this power is only an intelligent guess, at best. During the design process, the manufacturer will test drive units to destruction and arrive at a balance between a high rating (in watts) that will impress potential buyers, and a low number of complaints from people who have pushed their purchases too hard. The rating on the cabinet is therefore only a guide.

To get the best performance from a loudspeaker, the amplifier should be rated higher in terms of watts. It wouldn't be unreasonable to connect a 200 W amplifier to a 100 W speaker. It is up to the sound engineer to control the level, and it won't blow the drive units unless you push the level too high. The trick is to listen to the loudspeaker. It will tell you when it is under stress if you listen carefully enough.

Suppose, on the other hand, that a 100 W amplifier was connected to a 200 W loudspeaker (two-way, with woofer and tweeter). The sound engineer might push the level so high that the amplifier started to clip. Clipping produces high levels of high frequency distortion. In a 200 W loudspeaker, the tweeter could be rated at as little as 20-30 W, as under normal circumstances that is all it would be expected to handle. But under clipping conditions the level supplied to the tweeter could be massively higher, and it will blow.

Impedance

Drive units and complete loudspeaker systems are also rated in terms of their impedance. This is the load presented to the amplifier, where a low impedance means the amplifier will have to deliver more current, and hence ‘work harder’. A common nominal impedance is 8 ohms. ‘Nominal’ means that this is averaged over the frequency range of the drive unit or loudspeaker, and you will find that the actual impedance departs significantly from nominal according to frequency. Normally this isn't particularly significant, except in two situations:

• At some frequency the impedance drops well below the nominal impedance. The power amplifier will be called upon to deliver perhaps more power than it is capable of, causing clipping, or perhaps the amplifier might even go into protection mode to avoid damage to itself.

• The output impedance of a power amplifier is very low – just a small fraction of an ohm. You could think of the output impedance of the amplifier in series with the impedance of the loudspeaker as a potential divider. Work out the potential divider equation with R1 equal to zero and you will see that the output voltage is equal to the input voltage. However, give R1 some significant impedance, as would happen with a long run of loudspeaker cable, and you will see a voltage loss. Make R2 the loudspeaker impedance – variable with frequency – and you will now see a rather less than flat frequency response.

To be honest, the above points are not always at the forefront of the working sound engineer's mind, but they are significant and worth knowing about.
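The potential divider argument above is easy to work through numerically. The sketch below treats R1 (amplifier output impedance plus cable) and R2 (the loudspeaker) as plain resistances, which is a simplification since a real loudspeaker impedance is reactive. The 0.5 ohm cable figure and the 4, 8 and 30 ohm loudspeaker values are made-up examples, not measurements, and the function names are invented for illustration.

```python
import math


def divider_ratio(r1_ohms, r2_ohms):
    """Fraction of the amplifier voltage that reaches the loudspeaker.

    r1_ohms: series resistance (amplifier output impedance plus cable).
    r2_ohms: loudspeaker impedance at the frequency of interest.
    """
    return r2_ohms / (r1_ohms + r2_ohms)


def level_db(r1_ohms, r2_ohms):
    """The same ratio expressed as a level change in decibels."""
    return 20.0 * math.log10(divider_ratio(r1_ohms, r2_ohms))


# Hypothetical loudspeaker whose impedance varies with frequency.
example_impedances = (4.0, 8.0, 30.0)

for r1 in (0.0, 0.5):  # ideal connection, then an assumed long cable run
    for r2 in example_impedances:
        print(f"R1 = {r1} ohm, R2 = {r2} ohm: {level_db(r1, r2):+.2f} dB")
```

With R1 at zero every line reads 0 dB, whatever the loudspeaker impedance. With the assumed 0.5 ohm in series, the loss is about 1 dB where the impedance dips to 4 ohms, about half a decibel at 8 ohms and almost nothing at 30 ohms, so the frequency response follows the impedance curve of the loudspeaker.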

Check Questions

• What is the difference between the terms 'loudspeaker' and 'drive unit'?
• How does a moving coil drive unit work?
• Comment on the two qualities of an electrostatic loudspeaker.
• What is a direct radiator drive unit?
• What is the function of a horn?
• Why are drive units larger than 200 mm sometimes used?
• What is meant by the phrase 'rigid piston'?
• Why is the diaphragm of a moving coil loudspeaker normally cone shaped?
• Why does the diaphragm bend more at higher frequencies?
• What is 'break up'?
• Does breakup occur in a woofer in normal operation?
• Why should a guitar drive unit distort intentionally?
• Comment on the 'beaming' effect of a large drive unit.
• When is a separate midrange drive unit necessary?
• Comment on the two damage modes of moving coil drive units.
• If a loudspeaker is rated at 100 W, what should be the power of the amplifier, according to the text?

Chapter 4: Loudspeaker Systems

Cabinet (Enclosure)

The moving coil drive unit is as open to the air at the rear as it is to the front, hence it emits sound forwards and backwards. The backward-radiated sound causes a problem. Sound diffracts readily, particularly at low frequencies, and much of the energy will 'bend' around to the front. Since the movement of the diaphragm to the rear is in the opposite direction to the movement to the front, this leaked sound is inverted (or we can say 180 degrees out of phase) and the combination of the two will tend to cancel each other out. This occurs at frequencies where the wavelength is larger than the diameter of the drive unit. For a 200 mm drive unit the frequency at which cancellation would start to become significant is 1700 Hz, the cancellation getting worse at lower frequencies.

The simple solution to this is to mount the drive unit on a baffle. A baffle is simply a flat sheet of wood with a hole cut out for the drive unit. Amazingly, it works. But to work well down to sufficiently low frequencies it has to be extremely large. The wavelength at 50 Hz, for example, is almost 7 meters. The baffle can be folded around the drive unit to create an open back cabinet, which you will still find in use for electric guitar loudspeakers. The drawback is that the partially enclosed space creates a resonance that colors the sound.

The logical extension of the baffle and open back cabinet is to enclose the rear of the drive unit completely, creating an infinite baffle. It would now seem that the rear radiation is completely controlled. However, there are problems:

The diaphragm now has to push against the air 'spring' that is trapped inside the cabinet. This presents significant opposition to the motion of the diaphragm.

The cabinet will itself vibrate and is highly unlikely to operate anything like a rigid piston or have a flat frequency response. (Of course, this happens with the open back cabinet too).

Sound will leak through the cabinet walls anyway.
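The two figures quoted above, 1700 Hz for a 200 mm drive unit and almost 7 meters at 50 Hz, both come from the relationship wavelength = speed of sound / frequency. A minimal check in Python, assuming a speed of sound of 343 m/s (the text does not state the value it used) and with function names invented for this example:

```python
SPEED_OF_SOUND = 343.0  # m/s in air at roughly 20 degrees C (assumed)


def wavelength_m(freq_hz):
    """Wavelength in meters of a sound at the given frequency."""
    return SPEED_OF_SOUND / freq_hz


def cancellation_onset_hz(diameter_m):
    """Frequency at which the wavelength equals the drive unit diameter.

    Below this frequency the wavelength is larger than the drive unit,
    which is where the text says front/rear cancellation sets in.
    """
    return SPEED_OF_SOUND / diameter_m


print(f"200 mm drive unit: {cancellation_onset_hz(0.2):.0f} Hz")
print(f"Wavelength at 50 Hz: {wavelength_m(50.0):.2f} m")
```

The first result is 1715 Hz, which rounds to the 1700 Hz in the text, and the second is 6.86 m. It also shows why a simple baffle has to be so large: a sheet of wood comparable in size to a 50 Hz wavelength would not fit in most rooms.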

Thus. Points of order: 'Springiness' is more properly known as compliance. In the case of the bass reflex enclosure. The small plug of air in the port bounces against the compliance of the larger volume of air inside and resonates readily. Another term for 'infinite baffle' is acoustic suspension. the resonant frequency is set just at the point where an equivalently sized infinite baffle would be losing low end response. careful design of the drive unit to balance the springiness of the trapped air inside the cabinet against the springiness of the suspension can work wonders. You will occasionally hear of this as a ported or vented cabinet. The next step in cabinet design is the bass reflex enclosure.At this point it is worth saying that the bare drive unit is often used in theater sound systems where there is a need for extreme clarity in the human vocal range. properly designed.a perfect example of the principle . A Helmholtz resonator is nothing more than an enclosed volume of air connected to the outside world by a narrow tube. The bass reflex cabinet borrows the theory of the Helmholtz resonator. The infinite baffle. The only real problem is that the compromises that have to be made to make this design work result in poor low frequency response. Despite these problems. The Helmholtz resonator can be designed via a relatively simple formula to have any resonant frequency you choose. called the port. Low frequencies can be bolstered with conventional cabinet loudspeakers. The port can stick out of the enclosure as in a beer bottle .or inwards. is widely regarded as the most natural sounding type of loudspeaker (electrostatics excepted). You would need a very deep understanding of loudspeakers (starting with the Thiele-Small parameters of drive units) to be able to design a loudspeaker that would work well for studio or PA use. Electric guitar loudspeakers are not so critical. Try blowing across the top of the beer bottle (when empty) and you will see. 
the resonance of the enclosure can assist the drive unit just at the point where its output is weakening, thus extending the low frequency response usefully. There is of course a cost to this. Whereas an infinite baffle loudspeaker can be designed with a low-Q resonance, meaning essentially that when the input ceases the diaphragm returns straight away to its rest position, in a bass reflex loudspeaker the drive unit will overshoot the rest position and then return. Depending on the quality of the design, it may do this more than once, creating an audible resonance. This can result in so-called 'boomy' bass, which is generally undesirable. The competent loudspeaker designer is in control of this and a degree of boominess will be balanced against a subjectively 'good' - if not accurate - bass response. Additionally, a loudspeaker with boomy bass will tend to translate any low frequency energy into output at the resonant frequency. Thus a carefully tuned and recorded kick drum will come out as a boom at the loudspeaker's resonant frequency.

There are other cabinet designs, notably the transmission line, but these are not generally within the scope of the professional sound engineer so they will be excluded from this text.

Horns
We have covered horns to some degree already. There is a whole theory to horns that deserves consideration, but here we will simply list some of the basics: Whereas a direct radiator drive unit may be only 1% efficient (i.e. 100 W of electrical power converts to just 1 W of sound power), a horn drive unit may be up to 5% efficient. The air in the throat of the horn becomes so compressed at high levels that significant distortion is produced. However, some people - including the writer of this text! - can on occasion find the distortion quite pleasant. To make any significant difference to the efficiency of a loudspeaker at low frequencies, the length and area of the horn have to be very large. However, folded horn cabinets can be constructed that make enough of a difference to be worthwhile. These are sometimes known as 'bass bins'.
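The efficiency figures quoted for direct radiators and horns can be turned into a quick calculation. The following is only a sketch: the function names are invented for illustration, and the conversion of a power ratio to decibels (10 times the base-10 logarithm of the ratio) is the standard one rather than anything specific to this text.

```python
import math

def acoustic_power_w(electrical_power_w, efficiency_percent):
    """Sound power produced for a given electrical input and efficiency."""
    return electrical_power_w * efficiency_percent / 100

def level_gain_db(efficiency_a_percent, efficiency_b_percent):
    """Change in sound power level, in dB, when moving from efficiency A to B."""
    return 10 * math.log10(efficiency_b_percent / efficiency_a_percent)

# Figures from the text: a direct radiator at 1%, a horn at up to 5%.
direct_w = acoustic_power_w(100, 1)
horn_w = acoustic_power_w(100, 5)
gain_db = level_gain_db(1, 5)
print(direct_w, horn_w, round(gain_db, 1))
```

On these figures the horn delivers five times the sound power from the same amplifier, which works out at roughly 7 dB.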

The shape of the curvature of the horn can be one of any number of mathematical functions, or even just an arbitrary shape.

The most important application of the horn is in high quality PA systems such as those used for theater musicals. The problem in theater musicals is that the sound has to be intelligible otherwise the story won't be understood by the audience (many of whom in a London West End theater would be European tourists who wouldn't have English as their first language). Also, the whole of the auditorium has to be covered with high quality sound. If direct radiator loudspeakers were used in the theater, then people who were on-axis would receive good quality sound. Those members of the audience who were further from the 'straight ahead' position would receive lower levels at high frequency and therefore a duller sound. (There will be more information on directivity when we cover PA systems specifically.)

The solution is the constant directivity horn. With careful calculation and design it is possible to produce a constant directivity horn which has an even frequency response over an angle of up to 60 degrees, plus or minus 30 degrees or so. This means that one loudspeaker can cover a sizable section of the audience, all with pretty much the same quality of sound. This leads to the concept of the center cluster loudspeaker system that is widely used wherever intelligibility is a prime requirement in a PA system. A number of constant directivity horn loudspeakers are arrayed so that where the coverage of one is just starting to fall off, the adjacent loudspeaker takes over. Next time you are in a theater, or large place of worship, with a quality sound system, take a look at the loudspeakers. Apart from any loudspeakers that are dedicated to bass, where directionality isn't significant, there should be one cabinet pointing almost directly at you, and there should be no other loudspeaker pointing at you from any other location in the building, other than for special theatrical effects.

Crossover
The function of the crossover is to separate low, mid and high frequencies according to the number of drive units in the loudspeaker. A crossover can be passive or active. A passive crossover is generally internal to the cabinet and consists of a network of capacitors, inductors and resistors. Having no active components, it doesn't need to be powered. An active crossover on the other hand does contain transistors or ICs and requires mains power. It sits between the output of the mixing console and a number of power amplifiers, one for each division of the frequency band. A system with a three-band active crossover would require three power amplifiers.

Crossovers have two principal parameter sets: the cut off frequencies of the bands, and the slopes of the filters. It is impractical, and actually undesirable, to have a filter that allows frequencies up to, say, 4 kHz to pass and then cut off everything above that completely. So frequencies beyond the cutoff frequency (where the response has dropped by 3 dB from normal) are rolled off at a rate of 6, 12, 18 or 24 dB per octave. In other words, in the band of frequencies where the slope has kicked in, as the frequency doubles the response drops by that number of decibels. The slopes mentioned are actually the easy ones to design. A filter with a slope of, say, 9 dB per octave would be much more complex. As it happens, a slope of 6 dB per octave is useless. High frequencies would be sent to the woofer at sufficient level that there would be audible distortion due to break up. Low frequencies would be sent to the tweeter that could damage it. 12 dB/octave is workable, but most systems these days use 18 dB/octave or 24 dB/octave. There are issues with the phase response of crossover filters that vary according to slope, but this is an advanced topic that few working sound engineers would contemplate to any great extent.

Passive crossovers have a number of advantages:
• Inexpensive
• Convenient
• Usually matched by the loudspeaker manufacturer to the requirements of the drive units
And the disadvantages:
• Not practical to produce a 24 dB/octave slope
• Can waste power
• Not always accurate & component values can change over time
Likewise, active crossovers have advantages:
• Accurate
• Cutoff frequency and slope can be varied
• Power amplifier connects directly to drive unit - no wastage of power & better control over diaphragm motion
• Limiters can be built into each band to help avoid blowing drive units
And the disadvantages:
• Expensive
• It is possible to connect the crossover incorrectly and send LF to the HF driver and vice versa.
• A third-party unit would not compensate for any deficiencies in the drive units.

Some loudspeaker systems come as a package with a dedicated loudspeaker control unit. The control unit consists of three components:
• Crossover
• Equalizer to correct the response of each drive unit
• Sensing of voltage (and sometimes current) to ensure that each drive unit is maximally protected
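To put numbers on the crossover slopes discussed earlier, here is a rough sketch of how much a filter attenuates a signal a given distance above its cutoff frequency. It assumes an idealised straight-line roll-off that starts exactly at the cutoff, which ignores the 3 dB droop at the cutoff itself and the gentle 'knee' of a real filter. The function names are made up for this illustration.

```python
import math

def attenuation_db(freq_hz, cutoff_hz, slope_db_per_octave):
    """Idealised attenuation of a low-pass filter above its cutoff.

    Each doubling of frequency (one octave) adds slope_db_per_octave of loss.
    Below the cutoff this simplified model returns 0 dB.
    """
    if freq_hz <= cutoff_hz:
        return 0.0
    octaves_above = math.log2(freq_hz / cutoff_hz)
    return slope_db_per_octave * octaves_above

def power_fraction(attenuation):
    """Fraction of the original power remaining after the given loss in dB."""
    return 10 ** (-attenuation / 10)

# One octave above a 4 kHz cutoff, for each of the usual slopes.
for slope in (6, 12, 18, 24):
    loss = attenuation_db(8000, 4000, slope)
    print(slope, "dB/octave:", loss, "dB down,", power_fraction(loss), "of the power")
```

In this model a 6 dB/octave filter still passes about a quarter of the power one octave beyond the cutoff, which gives a feel for why such a gentle slope lets so much unwanted signal through to the wrong drive unit.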

Use of Loudspeakers
As mentioned earlier, there are four main usage areas of loudspeakers: domestic, hi-fi, studio and PA. We will skip non-critical domestic usage and move directly on to hi-fi. The hi-fi market is significant in that this is where we will find the very best sounding loudspeakers. The living room environment is generally fairly small, and listening levels are generally well below what we call 'rock and roll'. This means that the loudspeaker can be optimized for sound quality, and the best examples can be very satisfying to listen to with few objectionable features, although it still has to be said that moving coil loudspeakers always sound like loudspeakers and never exactly like the original sound source.

Recording studio main monitors have to be capable of higher sound levels. For one thing, the producer, engineer and musicians might just like to monitor at high level, although for the sake of their hearing they should not do this too often. Another consideration is that the acoustically treated control room will absorb a lot of the loudspeaker's energy, so that any given loudspeaker would seem quieter than it would in a typical living room. It is generally true that a loudspeaker that is optimized for high levels won't be as accurate as one that has been optimized for sound quality. PA speakers are the ultimate example of this. If you put an expensive PA loudspeaker next to a decent hi-fi loudspeaker in a head-to-head comparison at a moderate listening level, the hi-fi loudspeaker will win easily. There has been a trend over the last couple of decades for PA speakers to be smaller and hence more cost effective to set up. This has resulted in an intense design effort to make smaller loudspeakers louder. Obviously the quality suffers.

The most fascinating use of loudspeakers is the near field monitor. Near field monitors are now almost universally used in the recording studio for general monitoring purposes and for mixing. This would seem odd because twenty-five years ago anyone in the recording industry would have said that studio monitors have to be as good as possible so that the engineer can hear the mix better than anyone else ever will. That way, all the detail in the sound can be assessed properly and any faults or deficiencies picked up. Mixes were also assessed on tiny Auratone loudspeakers just to make sure they would sound good on cheap domestic systems, radios or portables.

That was until the arrival of the Yamaha NS10 - a small domestic loudspeaker with a dreadful sound. A slightly upmarket Auratone if you like. It must have found its way into the studio as a cheap domestic reference. However, someone must have used it as a primary reference for a mix, and found that by some magical and indefinable means, the NS10 made it easier to get a great mix - and not only that but a mix that would 'travel well' and sound good on any system. The success of nearfield monitoring is something of a mystery. It shouldn't work, but the fact is that it does. And since so little is quantifiable, the best recommendation for a nearfield monitor is that it has been used by many engineers to mix lots of big-selling records. That would be the Yamaha NS10 then! The NS10 and later NS10M are now no longer in production, but every manufacturer has a nearfield monitor in their range. Some actually now sound very good, although their bass response is lacking due to their small size.

Check Questions
• What problem is caused by sound coming from the rear of the drive unit?
• What is a baffle?
• How large does a baffle have to be to work well at low frequencies?
• What is an 'open back' cabinet?
• What is an 'infinite baffle' cabinet?
• What problem in an infinite baffle cabinet is caused by the trapped air inside?
• What is 'compliance'?
• What is a 'bass reflex' enclosure?
• What is the advantage of a bass reflex loudspeaker compared to an infinite baffle?
• What is the disadvantage of a bass reflex loudspeaker compared to an infinite baffle?
• Briefly describe a horn drive unit in comparison with a direct radiator drive unit.
• What is the advantage of the horn regarding efficiency?
• What is the (greater) advantage of the constant directivity horn?
• What is a 'center cluster'?
• What is meant by the 'slope' of a crossover?
• Contrast some of the principal features of active and passive crossovers.
• Comment on the use of nearfield monitors.

Chapter 5: Analog Recording
Contrary to what you might read in home recording magazines, analog recording is not dead. Top professional studios still have analog recorders because they have a sound quality that digital just can't match. This isn't really to say that they sound better, in fact their faults are easily quantifiable, but their sound is often said to be 'warm', and it is often true to say that it is easier to mix a recording made on analog than it is to mix a digital multitrack recording. The other useful feature of analog recorders is that they are universal. You can take a tape anywhere and find a machine to play it on. As digital formats become increasingly diverse, individual studios become more and more isolated with audio being subject to an often complex export process to transfer it from one studio's system to another. With tape, you just mount the reel on the recorder and press play.

History
Magnetic tape recording was invented in the early years of the Twentieth Century and became useful as a device for recording speech, as in a dictation machine, but simply for the information content - the sound quality was too poor. In essence, a tape recorder converts an electrical signal to a magnetic record of that signal. Electricity is an easy medium to work in, compared to magnetism. It is straightforward to build an electrical device that responds linearly to an input. As we saw earlier, 'linear' means without distortion - like a flat mirror (linear) compared to a funfair mirror (non-linear). Magnetic material does not respond linearly to a magnetizing force. When a small magnetizing force is applied, the material hardly responds at all. When a greater magnetizing force is applied and the initial lack of enthusiasm to become magnetized has been overcome, then it does respond fairly linearly, right up to the point where it is magnetized as much as it can be, when we say that it is 'saturated'. Unfortunately, no-one has devised a way of applying negative feedback to analog recording, which in an electrical amplifier reduces distortion tremendously.

Early tape recorders (and wire recorders) had no means of compensating for the inherent non-linearity of magnetic material, and it was left up to scientists in Germany during World War II to come up with a solution. The tape recorder was apparently used to broadcast orchestral concerts at all hours of day and night, to the consternation of opposing countries who wondered how Germany could spare the resources to have orchestras playing in the middle of the night. (Obviously, recording onto disc was possible, but the characteristic crackle always gave the game away). After hostilities had ceased, US forces brought some captured machines back home and development continued from that point. There is a lot of history to the analog recorder, which we don't need here, but it is certainly interesting as the development of the tape recorder coincides with the development of recording as we know it now.

The Sound of Analog
There are three characteristic ingredients of the analog sound:
• Distortion
• Noise
• Modulation noise

Distortion
The invention that transformed the analog tape recorder from a dictation machine to a music recording device, during the 1940s, was AC bias. Since the response of tape to a small magnetizing force is very small, and the linear region of the response only starts at higher magnetic force levels, a constant supporting magnetic force, or bias, is used to overcome this initial resistance. Prior to AC bias, DC bias was used courtesy of a simple permanent magnet. However, considerable distortion remained. AC bias uses a high frequency (~100 kHz) sine wave signal mixed in with the audio signal to 'help' the audio signal get into the linear region which is relatively distortion-free. This happens inside the recorder and no intervention is required on the part of the user. However the level of the bias signal has to be set correctly for optimum results. In traditional recording, this is the job of the recording engineer before the session starts. It has to be said that line-up is an exacting procedure and many modern recording engineers have so much else to think about (their digital transfers!) that line-up is better left to specialists.

Despite AC bias, analog recording produces a significant amount of distortion. The higher the level you attempt to record on the tape, the more the distortion. It isn't like an amplifier or digital recorder where the signal is clean right up to 0 dBFS, then harsh clipping takes place. The distortion increases gradually from barely perceptible to downright unpleasant. Most analog recordings peak at a level that will produce around 1% distortion, which is very high compared to any other type of equipment. At 3%, most engineers will be thinking about backing off. More is unacceptable. It may not sound promising to use a medium that produces so much distortion, but the fact is that it actually sounds quite pleasant! It is also different in character from vacuum tube (valve) distortion so it is an additional tool in the recording engineer's toolkit.

Noise
As well as producing more distortion than any other type of audio equipment, the analog tape recorder produces more noise too - a signal to noise ratio of around 65 dB is about the best you can hope for and represents the state of the art since tape recorders matured around the early 1970s. It is debatable whether noise is a desirable component of analog recording, but it is certainly a feature. Noise isn't really the ogre it is made out to be. If levels are set correctly to maximize the use of the available dynamic range up to the 1% or 3% distortion point, then there is no reason why it should be troublesome in the final mix, although some 'noise management' will be necessary on the part of the mix engineer.

Modulation Noise
There have been digital 'analog simulators', but to my ears, unless this aspect of the character of analog recorders is simulated, they just don't sound the same. Modulation noise is noise that changes as the signal changes, and has two causes. One is Barkhausen noise which is produced by quantization of the magnetic domains (a gross over-simplification of a phenomenon that would take too much understanding for the working sound engineer to bother with). The other - more significant - cause of modulation noise is irregularities in the speed of tape travel. These irregularities are themselves caused by eccentricity and roughness in the bearings and other rotating parts, and by the tape scraping against the static parts. We sometimes hear of the term 'scrape flutter', which creates modulation noise, and the 'flutter damper roller', which is a component used to minimize the problem.

If a 1 kHz sine wave tone is recorded onto analog tape, the output will consist of 1 kHz plus two ranges of other frequencies, some strong and consistent, others weaker and ever-changing due to random variations. These are known in radio as 'sidebands' and the concept has exactly the same meaning here. Modulation noise, subjectively, causes a 'thickening' of the signal which accounts for the fat sound of analog, compared to the more accurate, but thin sound of digital. It has even been known for engineers to artificially increase the amount of modulation noise by unbalancing one of the rollers, thus creating more and stronger sidebands containing a greater range of frequencies. Don't try it with your hard disk!
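The idea of sidebands appearing around a recorded tone can be illustrated with a few lines of code. This is a deliberately simplified sketch and not a model of any real tape machine: the speed variation is assumed to be a single steady 50 Hz wobble that shifts the 1 kHz tone by up to 25 Hz either way, whereas real flutter is largely random. All names and figures in the code are invented for the illustration.

```python
import cmath
import math

def wobbled_tone(tone_hz, wobble_hz, deviation_hz, sample_rate_hz, num_samples):
    """A sine tone whose frequency is swept up and down by a steady wobble."""
    depth = deviation_hz / wobble_hz  # modulation index
    return [
        math.sin(2 * math.pi * tone_hz * n / sample_rate_hz
                 + depth * math.sin(2 * math.pi * wobble_hz * n / sample_rate_hz))
        for n in range(num_samples)
    ]

def amplitude_at(signal, freq_hz, sample_rate_hz):
    """Amplitude of one frequency component, found by direct correlation."""
    total = sum(x * cmath.exp(-2j * math.pi * freq_hz * n / sample_rate_hz)
                for n, x in enumerate(signal))
    return 2 * abs(total) / len(signal)

SAMPLE_RATE = 8000  # one second of audio keeps every frequency on a whole cycle
tone = wobbled_tone(1000, 50, 25, SAMPLE_RATE, SAMPLE_RATE)
for freq in (900, 950, 1000, 1050, 1100):
    print(freq, "Hz:", round(amplitude_at(tone, freq, SAMPLE_RATE), 3))
```

The printout shows most of the energy still at 1 kHz, with weaker components spaced at multiples of the wobble frequency on either side. With random rather than steady speed variations, those discrete components smear into the continuous sidebands described above.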

The Anatomy of the Analog Tape Recorder
The Studer A807 pictured here is typical of a workhorse stereo analog recorder, sold mainly into the broadcast market. Let's run through the major components starting from the ones you can't see:
• Three motors, one each for the supply reel, take-up reel and capstan.
• The capstan provides the motive force that drives the tape at the correct speed. The take-up reel motor provides sufficient tension to collect the tape as it comes through. It does not itself pull the tape through. The supply reel motor is energized in the reverse direction to maintain the tension of the tape against the heads.
• The pinch wheel holds the tape against the capstan.
• The tach (short for tachometer) roller contains a device to measure the speed of the tape in play and fast wind.
• The tension arm smooths out any irregularities in tape flow.
• The flutter damper roller reduces vibrations in the tape, lessening modulation noise.
• The erase head wipes the tape clean of any previous recording.
• The record head writes the magnetic signal to the tape. It can also function as a playback head, usually with reduced high frequency response.
• The playback head plays back the recording.

Magnetic Tape
Magnetic tape comprises a base film, upon which is coated a layer of iron oxide, or more properly 'ferric' oxide. Oxide of iron is sometimes, in other contexts, known as 'rust'. Other magnetic materials have been tried, but none suits analog audio recording better than iron. The oxide is bonded to the base film by a 'binder', which also lubricates the tape as it passes through the recorder. There are two major manufacturers of analog tape (there used to be several): Quantegy (formerly known as Ampex) and Emtec (formerly known as BASF).

Tape is manufactured in a variety of widths. The widths in common use today are two-inch and half-inch. Oddly enough, metrication doesn't seem to have reached analog tape and we tend to avoid talking about 50 mm and 12.5 mm. Two-inch tape is used on twenty-four track recorders. Half-inch tape is used on stereo recorders for the final master. Other widths are still available, but they are only used in conjunction with 'legacy' equipment which is being used until it wears out and is scrapped, and for replay or remix of archive material. Quarter-inch tape was in the past very widely used as the standard stereo medium, but there is now little point in using it as it has no advantages over other options that are available. (It is also manufactured in two thicknesses - so-called 'long play' tape can fit a longer duration of recording on the same spool, at the expense of certain compromises.) A twenty-four track recorder can record - obviously - twenty-four separate tracks across the width of the tape, thus keeping instruments separate until final mixdown to stereo.

The speed at which the tape travels is significant. Higher speeds are better for capturing high frequencies as the recorded wavelength is physically longer on the tape. However, there are also irregularities (sometimes known as 'head bumps', or as 'woodles') in the bass end. The most common tape speed in professional use used to be 15 inches per second (38 cm/s), but these days it is more common to use 30 ips (76 cm/s), and not care about the massive cost in tape consumption! At 30 ips, a standard reel of tape costing up to $150 lasts about sixteen minutes.
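The effect of tape speed can be checked with some simple arithmetic. The sketch below is only illustrative: the function names are invented, and the reel length of 2500 feet is an assumption (it is not stated in the text, but it is consistent with a running time of about sixteen minutes at 30 ips).

```python
MICROMETRES_PER_INCH = 25400.0

def recorded_wavelength_um(speed_ips, freq_hz):
    """Length of tape, in micrometres, occupied by one cycle of the signal."""
    return speed_ips / freq_hz * MICROMETRES_PER_INCH

def running_time_minutes(reel_length_ft, speed_ips):
    """How long a reel lasts at a given tape speed."""
    reel_length_inches = reel_length_ft * 12
    return reel_length_inches / speed_ips / 60

ASSUMED_REEL_LENGTH_FT = 2500  # assumption, not a figure from the text

for speed in (15, 30):
    print(speed, "ips:",
          round(recorded_wavelength_um(speed, 15000), 1), "micrometres per cycle at 15 kHz,",
          round(running_time_minutes(ASSUMED_REEL_LENGTH_FT, speed), 1), "minutes per reel")
```

Doubling the speed doubles the recorded wavelength of any given frequency and halves the running time of the reel, which is the trade-off between high frequency performance and tape cost.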

Analog Recorders in Common Use
Otari MTR90 Mk III
There have been many manufacturers of analog tape recorders, but the top three historically have been Ampex, Otari and Studer. In the US, you will commonly find the Ampex MM1200 and occasionally the Ampex ATR124, which is often regarded as the best analog multitrack ever made, but Ampex only made fifty of them. All over the world you will find the Otari MTR90 (illustrated with autolocator) which is considered to be a good quality workhorse machine, and is still available to buy. The Studer range is also well respected. The Studer A80 represents the coming of age of analog multitrack recording in the 1970s. It has a sound quality which is as good as the best within a very fine margin, but operational facilities are not totally up to modern standards. For example, it will not drop out of record mode without stopping the tape. The Studer A800 is still a prized machine and is fully capable, sonically and operationally, of work to the highest professional standard. The more recent A827 and A820 are also very good, but sadly no longer manufactured.

Multitrack Recording Techniques
How to set about a multitrack recording session is a topic in itself and will be explained later. However, there are certain points of relevance to the equipment itself. The first is the necessity to be able to listen to or monitor previously recorded tracks while performing an overdub. The problem here is that there is a gap between the record head and the playback head, therefore causing a delay. If the singer, for example, sings in time with the output from the playback head, the signal will be recorded on the tape a couple of centimeters away. To get around this problem, while overdubbing, the record head is used as a playback head. In this situation we talk about taking a 'sync output' from the record head. The sync output isn't of such good sound quality since the record head is optimized for recording, nevertheless it is certainly good enough for monitoring. The playback head is used for final mixdown. Also, it is commonplace to 'bounce' several tracks, perhaps vocal harmonies, to one or two tracks (two tracks for stereo), thus freeing up tracks for further use. This has to be done using the sync output of the record head, otherwise the bounce won't be in time with the other tracks. The slight loss of quality has to be tolerated.

Another technique worth mentioning at this stage is editing. As soon as tape was invented, people were cutting it apart and sticking it back together again. In fact, with the old wire recorders, people used to weld the wire together, although the heat killed the magnetism at the join. The most basic form of tape editing is 'top and tailing'. This means cutting the tape to within 10 mm or so of the start of the audio, and splicing in a section of leader tape, usually white (about two meters). Likewise the tape is cut ten seconds or so after the end of each track and more leader inserted between tracks. At the end of the tape, red leader is joined on. No blank tape is left on the spool once top and tailing is complete.

Editing can also be used to improve a performance by cutting out the bad and splicing in the good. Even two-inch tape can be edited, in fact it is normal to record three or four takes of the backing tracks of a song, and splice together the best sections. It takes courage to cut through a twenty-four track two-inch tape though. The tape is placed in a special precision-machined aluminum editing block, and cut with a single-sided razor blade, guided by an angled slot. Splicing tape is available with exactly the right degree of stickiness to join the tape back together. When the edit is done in the right place (usually just before a loud sound), it will be inaudible.

Compared to modern disk recorders, the main limitation of tape-based multitrack - analog and digital - is that once they are recorded, all the tracks have a fixed relationship in time. In a disk recorder, it is easy to move one track backwards or forwards in time, or copy it to a new location in the song. The equivalent technique in tape-based multitrack recording is the 'spin in'. In the original sense of the term, a good version of the chorus, or whatever audio was required to be repeated, would be copied onto another tape recorder. The multitrack would be wound to where the audio was to be copied. Of course, the two machines had to be in sync, and this was the difficult part. If the two machines were identical mechanically, then a wax pencil mark could be made on corresponding rotating tape guides and the tapes backed up by the same number of revolutions. The two machines would be backed up a little way, then both set into play. At the right moment, the multitrack would be punched into record. It sounds hit and miss, but it could be made to work amazingly quickly. When the digital sampler became available, it was used in place of the second recorder.

Maintenance
There is a difference between the maintenance of an analog recorder and a digital recorder. Firstly you can do a lot of first-line maintenance on an analog machine. You can't do more than run a cleaning tape on a digital recorder. The second is that you have to do the maintenance, otherwise performance will suffer. These are the elements of maintenance:

Cleaning: the heads and all metallic parts that the tape contacts are cleaned gently with a cotton bud dipped in isopropyl alcohol. Isopropyl alcohol is only one of a number of alcohol variants, and it has good cleaning properties. It is not the same as drinking alcohol, so don't be tempted. Also, drinking alcohol - ethanol - attracts additional taxes in some countries, therefore it would not be cost-effective to use it. The pinch wheel is made of a rubbery plastic. In theory it shouldn't be cleaned with isopropyl alcohol, but it often is. You can buy special rubber cleaner from pro audio dealers but in fact you can use a mild abrasive household liquid cleaner. Just one tiny drop is enough.

Demagnetizing the heads: After a while, the metal parts will collect a residual magnetism that will partially erase any tape that is played on the machine. A special demagnetizer is used for which proper training is necessary, otherwise the condition can be made even worse.

Line-up: Line-up, or alignment, has two functions - one is to get the best out of the machine and the tape, the other is to make sure that a tape played on one recorder will play properly on any other recorder. The following parameters are aligned to specified or optimum values:
Azimuth - the heads need to be absolutely vertical with respect to the tape otherwise there will be cancellation at HF. The other adjustments of the head - zenith, wrap and height - are not so critical and therefore do not need to be checked so often.
Bias level - optimizes distortion, maximum output level and noise.
Playback level - the 1 kHz tone on a special calibration tape is played and the output aligned to the studio's electrical standard level.
High frequency playback EQ - the 10 kHz tone on the calibration tape is played and the HF EQ adjusted.
Record level - a 1 kHz tone at the studio's standard electrical level is recorded onto a blank tape and the record level adjusted for unity gain.
HF record EQ - adjusted for flat HF response.
LF record EQ - adjusted for flat LF response.

The line-up procedure used to be considered part of the engineer's day-to-day routine, but is now often left to a specialist technician.

To conclude, this is certainly far from a complete treatise on analog tape recording, but it is enough for a starting point considering that analog recorders are now quite rare. Even so, analog recording has a long history and will almost certainly have a long future ahead. In fact the machines are so simple and are infinitely maintainable - a fifteen year old Studer A800 will still be working for its living in fifteen years' time. You can't say that for digital recorders. Also, the sound of analog is very much the sound of recording, as we understand it. Does it make sense therefore to use digital emulation to achieve a pale shadow of the analog sound, or would it be better to use the real thing?

Check Questions
• Give two reasons why analog recorders are still in use in top professional studios.
• What is the function of AC bias?
• Comment on distortion in analog recording.
• What is the distortion level of peaks in an analog recording?
• Why is the concept of clipping not relevant in analog recording?
• Comment on noise in analog recording.
• Comment on modulation noise in analog recording.
• Why is the supply reel motor driven in the opposite direction to the actual rotation of the reel?
• What is the capstan?
• What is the pinch wheel?
• What is the tach roller?
• What two tape widths are in common top-level professional use?
• Name three twenty-four track analog tape recorders, make and model.
• What is 'bouncing'?
• Comment on cut and splice tape editing.
• What are the two functions of line-up?

Chapter 6: Digital Audio

Why digital? Why wasn't analog good enough? The answer starts with the analog tape recorder, which plainly isn't good enough in respect of signal to noise ratio and distortion performance. Everything else performs as well as anyone could possibly want. Well, almost anyone. Many recording engineers and producers like the sound of analog now, because it is a choice. In the days before digital, analog recording wasn't a choice - it was a necessity. You couldn't get away from the problems. Actually you could. With Dolby A and subsequently SR noise reduction, noise performance was vastly improved, to the point where it wasn't a problem at all. And if you don't have a problem with noise, you can lower the recording level to improve the distortion performance of analog tape. A recording well made with Dolby SR noise reduction can sound very good indeed. Some would say better than 16-bit digital audio, although this is from a subjective, not a scientific, point of view.

Analog recording also had the problem that when a tape was copied, the quality would deteriorate significantly. And often there were several generations of copies between original master and final product. Digital audio can be copied identically as many times as necessary (although this doesn't always work as well as you might expect. More on this in another module).

In the domestic domain, before CD there was only the vinyl record. Well, there was the compact cassette too, but that never sounded good even with Dolby B noise reduction. (Some people say that they don't like Dolby B noise reduction. The problem is that they are usually comparing an encoded recording with decoding switched on and off. The extra brightness of the Dolby B encoded - but not decoded - sound compensates for dirty and worn heads and the decoded version sounds dull in comparison!). People with long memories will know that they used to yearn for a format that wasn't plagued with the clicks, pops and crackles of vinyl. The release of the CD format was eagerly anticipated, and of course the CD has become a great success.

Done properly, digital audio recorders can greatly outperform analog in both signal to noise ratio and distortion performance. That is why they are used in both the professional and domestic domains. When the question arises of why the other parts of the signal chain have mostly been changed over to digital, any possible improvement in sound quality is hardly relevant. The only exceptions are the microphone and the loudspeaker, and we are still some way off truly digital transducers becoming available. By the time digital recording and reproduction had become properly established, digital audio in general was showing that it could offer advantages over analog in terms of price and facilities offered. Digital effects were first, as it became possible to achieve, for instance, digital reverberation for a tiny fraction of the cost of an electromechanical system. Digital mixing consoles came rather later because they require an incredible amount of processing power. Digital mixing consoles don't sound better than analog. They do however offer more facilities for the price, and have the advantage that settings can easily be stored and recalled. This is an important feature that we shall discuss more when we discuss mixing consoles.

Digital Theory

Having established the reasons we have digital audio, let's see how it works. Firstly, what do we mean by analog? Analog comes from the word analogy. If I say that electrical voltage is a similar concept to the pressure of water behind a tap (excuse me, faucet), then I am making an analogy. If I convert an acoustic sound to an electrical signal where the rise and fall in sound pressure is imitated by a similar rise and fall in voltage, then the electrical signal is an analog of the original.

An analog signal is continuous. It follows the changes of the original without any kind of subdivision. It might not be able to track the changes fast enough for complete accuracy, in which case the high frequency response will be worse than it could be. Its useful dynamic range lies between a maximum value which the analog signal cannot exceed (generally the positive and negative voltage limits of the power supply - the signal can never exceed these and will be clipped if it tries) and random variations at a very low level that we hear as noise.

Digital systems analyze the original in two ways. The first is by 'sampling' the signal a number of times every second. Any changes that happen completely between sampling periods are ignored, but if the sampling periods are close enough together, the ear won't notice. The other is by 'quantizing' the signal into a number of discrete - separately identifiable - levels. The smoothly changing analog signal is therefore turned into a stair-step approximation, since digital audio knows no 'in-between' states.
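The two operations described above, sampling and quantizing, can be sketched in a few lines. This is only an illustration under stated assumptions: the function name, the 1 kHz test tone, the 8 kHz sampling rate and the 3-bit resolution are all chosen here to make the stair-step obvious, and are not taken from any real convertor.

```python
import math

def sample_and_quantize(freq_hz, sample_rate_hz, bits, n_samples):
    """Sample a unit-amplitude sine wave and snap each sample to the nearest level."""
    levels = 2 ** bits                  # e.g. 3 bits give 8 discrete levels
    step = 2.0 / (levels - 1)           # spacing between levels across -1..+1
    out = []
    for n in range(n_samples):
        t = n / sample_rate_hz          # sampling: one measurement per sampling period
        x = math.sin(2 * math.pi * freq_hz * t)
        index = round((x + 1.0) / step) # quantizing: index of the nearest level
        out.append(index * step - 1.0)  # convert the level index back to a signal value
    return out

# Hypothetical example values: a 1 kHz tone, 8 kHz sampling, 3-bit resolution.
stair = sample_and_quantize(freq_hz=1000, sample_rate_hz=8000, bits=3, n_samples=8)
print([f"{v:+.3f}" for v in stair])
```

Each printed value is one step of the staircase. Raising `sample_rate_hz` makes the steps narrower, and raising `bits` makes them shallower.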

Sampled and quantized this coarsely, the digital signal is only a crude approximation of the original, but it can be made better by increasing the sampling frequency (sampling rate), and by increasing the number of quantization levels. Let's go deeper.

To reproduce any given frequency, the sampling frequency, or sampling rate, has to be at least twice that frequency. So to convert the full range of human hearing to digital, a sampling frequency of at least 40 kHz (twice 20 kHz) is necessary. In practice, a 'safety margin' has to be added, so we get the standard compact disc sampling frequency of 44.1 kHz (this exact figure was chosen to suit the requirements of early digital equipment), and 48 kHz which is used in broadcasting (since in the early days of digital it was easier to convert to the standard satellite sampling frequency of 32 kHz).

To reduce the quantization error between the digital signal and the original analog, more quantization levels must be used. Compact disc and DAT both use 65,536 levels. This, in digital terms, is a nice round number corresponding to 16 bits. Without going into binary arithmetic, each bit provides roughly 6 dB of signal to noise ratio. Therefore a digital audio system with 16-bit resolution has a signal to noise ratio (at least in theory) of 96 dB.

The question will arise, what happens if a digital system is presented with a frequency higher than half the sampling frequency? The answer is that a phenomenon known as aliasing will occur. What happens is that these higher frequencies are not properly encoded and are translated into spurious frequencies in the audio band. These are only distantly related to the input frequencies and absolutely unmusical (unlike harmonic distortion, which can be quite pleasant in moderation). The solution is not to allow frequencies higher than half the sampling rate (in fact less, to give a margin of safety) into the system. Therefore an 'anti-aliasing' filter is used just after the input. The design of the filters is one of the distinguishing points that make different digital systems actually sound different. Filter design is complex, particularly for filters with the steep slopes necessary to maximize frequency response without being wasteful on storage or bandwidth by having a sampling rate that is unnecessarily high.

Once the signal has been filtered, sampled and quantized, it must be coded. It might be possible to record the binary digits directly but that wouldn't offer the best advantage, and indeed might not work. In the compact disc system, the tiny pits in the aluminized audio layer themselves form the spiral that the laser follows from the start of the recording to the end. A binary '1' is coded by a transition from 'land' - the level surface - to a pit or vice-versa. A binary '0' is coded by no transition. But what if the signal was stuck on '0' for a period of time? The spiral would disappear! Hence a system of coding is used that rearranges the binary digits in such a way that they are forced to change every so often. There are other such constraints that we need not go into here.

Additionally there is the need for error correction. In any storage medium there are physical defects that would damage the data if nothing were done to prevent such damage. So additional data is added to the raw digital signal, firstly to check on replay whether the data is valid or erroneous, secondly to add a backup data stream so that if a section of data is corrupted, it can be reconstituted from other data nearby. Adding error correction involves a compromise between preserving the integrity of the digital signal, and not adding any more extra data than necessary. It is fair to say that the error correction system on CD, and on DAT, is very good.
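Two of the figures above lend themselves to a quick check: where an over-high frequency lands when it aliases, and the 6 dB per bit rule of thumb. The sketch below assumes an ideal sampler with no anti-aliasing filter at all, and both function names are made up for this illustration.

```python
def alias_frequency(f_in_hz, sample_rate_hz):
    """Apparent frequency of a pure tone after sampling with no anti-aliasing filter."""
    f = f_in_hz % sample_rate_hz        # a sampler cannot tell f from f plus a multiple of fs
    return min(f, sample_rate_hz - f)   # fold the result into the range 0 .. fs/2

def theoretical_snr_db(bits):
    """Rule of thumb from the text: roughly 6 dB of signal to noise ratio per bit."""
    return 6 * bits

print(alias_frequency(30_000, 44_100))  # a 30 kHz input reappears at 14100 Hz
print(alias_frequency(10_000, 44_100))  # a 10 kHz input is below fs/2 and is unchanged
print(theoretical_snr_db(16))           # 96
```

The 30 kHz example shows why the spurious result is unmusical: 14.1 kHz has no harmonic relationship to the input.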

But as in all things, more modern digital systems are cleverer, and better.

Analog to Digital Conversion

Filtering: removing frequencies, in the analog domain, that are higher than half the sampling rate.
Sampling: measuring the signal level once per sampling period.
Quantization: deciding which of the 65,536 levels (in a 16-bit system) is closest to the input signal level, for each sampling period.
Coding: converting the result to a binary number according to a scheme that a) incorporates error detection, b) makes provision for error correction, c) is recordable or transmissible in the chosen medium.

All of the above is known as analog to digital encoding, or A to D. The reverse process is known, fairly obviously, as decoding. To spare the details that only electronics experts need to know, the digital signal goes through a D to A convertor and out comes an analog signal. The only problem is that it now contains a strong component at the sampling frequency. Obviously this is above audibility, but it could cause severely audible distortion if allowed into any other equipment that couldn't properly handle it. To obviate this therefore, the output is filtered with what is known as a 'brickwall' filter, because of its steep slope. Once again the design of the filter does affect the sound quality, but digital tricks have now been developed to make the filter's job easier, therefore design is more straightforward.

The D to A decoder incorporates three levels of protection against damaged data:

Error correction: an error is detected in the data and completely corrected by using the additional error-correction data specifically put there for the purpose.
Error concealment: an error is detected but it is too severe to be corrected. Missing data is therefore 'interpolated' - just one of the many scientific words for 'guess' - from surrounding data and the result hopefully will be inaudible. However, if you ever get the chance to see a CD player that has correction and concealment indicator lights, you will notice that an awful lot of concealment goes on just to play an average disc. How well concealment is done is one of the factors that make different digital systems sound different.
Muting: in this case the error is so bad that the system shuts down momentarily rather than output what could be an exceedingly loud glitch.

24/96

The quest for ever better sound quality leads us to want to increase both the sampling rate and the resolution. 24-bit resolution will in theory give a signal to noise ratio of 144 dB. This will never happen in practice, but the real achievable signal to noise ratio is probably as good as anyone could reasonably ask for. Also, to play safe while recording, some of the available dynamic range may be used as additional headroom, but even so the resulting recording will be remarkably quiet. Of course, even though most of us cannot even hear up to 20 kHz, a frequency which is perfectly well catered for these days by a 44.1 or 48 kHz sampling rate, there is always a nagging doubt that this is only just good enough, and it would be worthwhile to have a really high sampling rate to put all doubt at an end.

Bandwidth

Bandwidth, in this context, is the rate of flow of data measured in kilobits per second. 1 kilobit is 1024 bits. Often, the term byte is used where 1 byte = 8 bits. The abbreviation for bit is 'b' and for byte is 'B', but these are often confused, as are the multiplier prefixes 'k' meaning x1000, and 'K' meaning x1024. The bandwidth of a single channel of 16-bit 44.1 or 48 kHz digital audio is roughly 750 Kbps. Compare this with the bandwidth of a modem (56 Kbps), ISDN2 (128 Kbps) and common ADSL Internet connections (512 Kbps). None of these systems is capable of transmitting even a single channel of uncompressed digital audio, hence the need for MP3 and similar data-reduction systems.

This, of course, affects storage requirements. It is a reasonable rule of thumb that CD-quality stereo audio requires about 10 Megabytes per minute of storage. 24-bit, 96 kHz digital audio will therefore, by simple multiplication, require 30 Megabytes per stereo minute. Of course, Megabytes are getting cheaper all the time. There is another problem however - data bandwidth. When recording onto a hard disk system, there is a certain data throughput rate beyond which the system will struggle and possibly fail to record or play back properly. A standard modern hard drive should be easily capable of achieving 24 tracks of playback under normal circumstances (the track count is affected, for one thing, by the 'edit density' - the more short segments you cut the audio into, and the more widely the data is physically separated on the disk, the harder it will be to play back). Try this at three times the data rate and the track count, or the reliability, is bound to suffer. It's worth a quick look at Digidesign's comments on hard disk specifications to maximize track count. However, disks are getting ever faster and most of the problems of this nature are in the past. Before long it will be possible to get virtually any number of tracks quite easily.

Digital Interconnection

Digital interconnection comes in a number of standards, which are summarized here:

AES/EBU
• Also known as AES3 1985 (the year it was implemented)
• Standard for professional digital audio
• Supports up to 24-bit at any sampling rate
• Transmits 2 channels on a single cable
• Uses 110 ohm balanced twisted wire pair cables usually terminated with XLR connectors
• Can use cables of length up to 100 meters
• Electrical signal level 5 volts
• Standard audio cables can be used for short distances but are not recommended as their impedance may not be the standard 110 ohm and reflections may occur at the ends of the cable
• Data transmission at 48 kHz sampling rate is 3.072 Megabit/s (64x the sampling rate)
• Self clocking but master clocking is possible

S/PDIF

• Two types:
  • Electrical
    • Uses 75 Ohm unbalanced coaxial cable with RCA phono connectors
    • Cable lengths limited to 6 meters
  • Optical
    • TOSLINK - Uses plastic fiber optic cable and same connectors as Lightpipe (below). TOSLINK is an optical data transmission technology developed by Toshiba. TOSLINK does not specify the protocol to be used
    • ST-type - Glass fiber can be used for longer lengths (1 kilometer)
• Meant for consumer products but may be seen on professional equipment
• Supports up to 24-bit/48 kHz sampling rate
• Self-clocking
• It ought to be necessary to use a format converter when connecting with AES/EBU since the electrical level is different (0.5 V) and the format of the data is different also. However, some AES/EBU inputs can recognise an S/PDIF signal
• Some of the bits within the Channel Status blocks are used for SCMS (Serial Copy Management System), to prevent consumer machines from making digital copies of digital copies

MADI
• An extension of the AES3 format (AES/EBU)
• Supports up to 24-bit/48 kHz sampling rate (higher rates are possible)
• Transmits 56 channels on a 75 Ohm video coaxial cable with BNC connectors
• Length limited to 50 meters. Fiber-optic cable can be used for longer lengths
• Data transmission rate is 100 Megabit/s
• Requires a master clock - a dedicated master synchronization signal must be applied to all transmitters and receivers

ADAT Optical
• Sometimes known as 'Lightpipe'
• Implemented on the Alesis ADAT MDM and digital devices such as mixing consoles, synthesizers and effects units
• Supports up to 24-bit/48 kHz sampling rate
• Transmits 8 channels serially on fiber-optic cable
• Distance limited to 10 meters, or up to 30 meters with glass fiber cable
• Data transmission at 48 kHz is 12 Megabit/s
• Self clocking
• Channels can be reassigned (digital patchbay function)

TDIF (Tascam Digital Interface Format)
• Implemented on Tascam's family of DA-88 recorders and other digital devices such as mixing consoles
• Supports up to 24-bit/multiple sampling rates
• Transmits 8 channels on multicore, unbalanced cables with 25-pin D-sub connectors
• Bidirectional interface: a single cable carries data in both directions
• Cable length limited to 5 meters
• Data transmission at 48 kHz sampling rate is 3 Megabit/s (like AES/EBU)
• Intended for a master clock system, although self-clocking is possible
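The bandwidth, storage and AES/EBU data rate figures quoted in this chapter can be checked with simple arithmetic. The sketch below is only a sanity check: the function names are invented for the illustration, the calculation covers raw audio data only (no error correction or subcode overhead), and it follows the text in taking 'K' as x1024 and a Megabyte as 1024 x 1024 bytes.

```python
def pcm_bit_rate(sample_rate_hz, bits, channels=1):
    """Raw audio data rate in bits per second."""
    return sample_rate_hz * bits * channels

def megabytes_per_minute(sample_rate_hz, bits, channels=2):
    """Storage for one minute of audio, with 1 Megabyte taken as 1024 * 1024 bytes."""
    bytes_per_minute = pcm_bit_rate(sample_rate_hz, bits, channels) / 8 * 60
    return bytes_per_minute / (1024 * 1024)

def aes3_line_rate(sample_rate_hz):
    """AES/EBU data rate, using the text's figure of 64x the sampling rate."""
    return 64 * sample_rate_hz

print(pcm_bit_rate(48_000, 16) / 1024)             # 750.0 Kbps for one 16-bit/48 kHz channel
print(round(megabytes_per_minute(44_100, 16), 1))  # 10.1 Megabytes per stereo minute
print(round(megabytes_per_minute(96_000, 24), 1))  # 33.0 Megabytes per stereo minute
print(aes3_line_rate(48_000) / 1e6)                # 3.072 Megabit/s
```

The 24-bit/96 kHz result comes out nearer 33 than 30 Megabytes per stereo minute, because the true ratio to CD quality is a little over three (1.5 times the bits and about 2.18 times the sampling rate). The rule of thumb of three times the data is still close enough for planning.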

Check Questions
• To which type of sound engineering equipment was digital audio first applied?
• In relation to the question above, why was this the most pressing need?
• What types of equipment are currently not available in digital form?
• Describe 'sampling rate'.
• Describe quantization.
• What is the minimum sampling rate for a digital system capable of reproduction up to 20 kHz (ignoring any 'safety margin')?
• What two sampling rates are most commonly used in digital audio?
• What is the signal to noise ratio, in theory, of a digital system with 20-bit resolution?
• What is 'aliasing'?
• Why is coding necessary? Give two reasons.
• Why does a digital to analog convertor need a filter?
• What is error correction?
• What is error concealment?
• What happens (or at least should happen) if an error is neither corrected nor concealed?
• How many Megabytes of data, approximately, are occupied by one minute of CD-quality stereo digital audio?
• Why, in a hard disk recording system, is it likely that fewer tracks can be replayed simultaneously at the 24-bit/96 kHz standard, than at the CD-quality 16-bit/44.1 kHz standard?

Chapter 7: Digital Audio Tape Recording

The original purpose of DAT (Digital Audio Tape) was to be a replacement for the Compact Cassette (or simply 'cassette', as we now know it). Since DAT was intended to be a consumer product right from the start, the cassette housing is very small, 73 x 54 mm and just 10.5 mm thick. For professional users, this is rather too small, not just because it makes the cassette easier to lose, but because there will always be a feeling that DAT could have been a better system if there had been a bit more space for the data. This would allow for error concealment to be minimized, and tracking tolerances could be such that a tape recorded on one recorder could be absolutely guaranteed to play properly on any other. This is generally the case for professional machines, but not necessarily so for semi-pro 'domestic' recorders.

[Figure: Sony professional DAT]

Having said that DAT’s size is a disadvantage for professional users, it really is amazing how it achieves what it does working at microscopic dimensions. DAT’s full title, R-DAT, indicates that the system uses a rotary head like a video recorder. Unlike analog tape, which records the signal along a track parallel to the edge of the tape, a rotary head recorder lays tracks diagonally across the width of the tape. So even though the tape speed is just 8.15 millimeters/second, the actual writing speed is a massive 3.133 meters/second. The width of each track is 13.591 millionths of a meter. Unlike an analog tape, the tracks are recorded without any guard band between them. In fact, the tracks are recorded by heads which are around 50% wider than the final track width and each new track partially overlaps the one before, erasing that section. Since the same heads are used for recording and playback, this may seem to present a problem because if the head is centred on the track it is meant to be reading, then it will also see part of the preceding track and part of the next track. Won't this result in utter confusion? Of course it doesn't, because a system originally developed for video recording is used, known as azimuth recording.

The ‘azimuth’ of a tape head refers to the angle between the head gap, where recording takes place, and the tape track itself. In an analog recorder the azimuth is always adjusted to 90 degrees, so that the head gap is at right angles to the track. In DAT, which uses two heads, one head is set at -20 degrees and the other to +20 degrees, and they lay down tracks alternately. So on playback, each head receives a strong signal from the tracks that it recorded, and the adjacent tracks, which are misaligned by 40 degrees, give such a weak signal that it can be rejected totally.

Mechanically, there is a strong similarity between a DAT recorder and a video cassette recorder. Both use a rotary head drum on which are mounted the record/playback heads. But there are differences. A video recorder uses a large head drum with the tape wrapped nearly all the way around. This is necessary so that there can always be a head in contact with the tape during the time that each video frame is built up on the screen. With digital audio, data can be read off the tape at any rate that is convenient and stored up in a buffer before being read out at a constant speed and converted to a conventional audio signal. The head drum in a DAT machine is a mere 30mm in diameter (and spins at 2000 revolutions per minute). The tape is wrapped only a quarter of the way around, which means that at times neither of the two heads is in contact with the tape, but as I said, this can be compensated for. This 90 degree wrap has its advantages:
• There is only a short length of tape in contact with the drum so high speed search can be performed with the tape still wrapped.
• Tape tension is low, giving long head and tape life.
• If an extra pair of heads is mounted on the drum, simultaneous off-tape monitoring can be performed during recording just like a three-head analogue tape recorder.

The signal that is recorded on the tape is of course digital, and very dissimilar to either analogue audio or video signals. As you know, the standard DAT format uses 16 bit sampling at a sampling frequency of 48 kHz.
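The writing speed quoted earlier in this chapter can be roughly checked from the drum dimensions given above. This is only a back-of-envelope sketch: the function name is made up, and the step of subtracting the tape speed rests on the assumption that the tape moves in the same direction as the heads, which the text does not state.

```python
import math

def drum_surface_speed(diameter_m, rpm):
    """Speed of a point on the drum's circumference, in meters per second."""
    return math.pi * diameter_m * rpm / 60.0

speed = drum_surface_speed(0.030, 2000)   # 30 mm drum at 2000 rpm, as in the text
tape_speed = 0.00815                      # 8.15 mm/s, as in the text

print(round(speed, 3))                    # 3.142 m/s at the drum surface
print(round(speed - tape_speed, 3))       # 3.133 m/s (assumes tape and heads move the same way)
print(round(speed / tape_speed))          # the heads move about 385 times faster than the tape
```

Under that assumption the result agrees with the 3.133 meters/second figure, and the last line shows how much the rotary head gains over simply pulling the tape past a fixed head.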

Sampling converts the original analog audio signal to a stream of binary numbers representing the changing level of the signal. But since the dimensions of the actual recording on the tape are so small, there is a lot of scope for errors to be made during the record/replay process, and if the wrong digit comes back from the tape it is likely to be very much more audible than a drop-out would be on analog tape. Fortunately DAT, like the Compact Disc, uses a technique called Double Reed-Solomon Encoding which adds redundant data, in fact an extra 37.5%, in such a way that errors can be detected, then either corrected completely or concealed so that they are not obvious to the ear. As an extra precaution against dropouts, another technique called interleaving is employed which scatters the data so that if one section of data is lost, then there will be enough data beyond the site of the damage which can be used to reconstruct the signal. If there is a really huge drop-out on the tape, then the DAT machine will simply mute the output rather than replay digital gibberish.

The pulse code modulated audio data is recorded in the centre section of each diagonal track across the tape. There is other data too:
• 'ATF' signals allow for Automatic Track Finding which makes sure that the heads are always precisely positioned over the centre of the track, even if the tape is slightly distorted and the track curved.
• Sub Code areas allow extra data to be recorded alongside the audio information. Not all of the capacity of the Sub Code areas is in use as yet, allowing for extra expansion of the DAT system. Those at present in use include:
  • A-time, which logs the time taken since the beginning of the tape.
  • P-time, which logs the time taken since the last Start ID.
  • Start ID marks the beginning of each item.
  • Skip ID tells the machine to go directly to the next Start ID, thus performing an ‘instant edit’.
  • End ID marks the end of the recording on the tape.
• There is also provision for SMPTE/EBU timecode.
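The idea behind interleaving, described earlier in this section, can be shown with a toy example. This is a simple illustrative interleaver, not the actual scheme used by DAT; the function names, the depth of 4 and the twelve-sample block are all assumptions made for the sketch.

```python
def interleave_order(length, depth):
    """Order in which original positions are written to tape: every depth-th item in turn."""
    return [i for start in range(depth) for i in range(start, length, depth)]

def interleave(data, depth):
    """Scatter neighbouring samples so that they are stored far apart."""
    return [data[i] for i in interleave_order(len(data), depth)]

def deinterleave(stored, depth):
    """Put each stored item back in its original position."""
    out = [None] * len(stored)
    for item, original_index in zip(stored, interleave_order(len(stored), depth)):
        out[original_index] = item
    return out

samples = list(range(12))
on_tape = interleave(samples, depth=4)   # [0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11]
on_tape[3:6] = [None, None, None]        # a drop-out wipes three adjacent items on tape
recovered = deinterleave(on_tape, depth=4)
print(recovered)                         # [0, None, 2, 3, 4, None, 6, 7, 8, None, 10, 11]
```

After de-interleaving, the three lost items are no longer adjacent. Each gap has good samples on both sides, which is exactly the situation in which concealment by interpolation can work.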

[Figure: Sony PCM 3348]

DASH

DASH stands for Digital Audio Stationary Head. The DASH specifications include matters such as the size of the tape, the tape speed and the layout of the tracks on the tape, also the modulation method and error correction strategy, among other things. The format is based on two tape widths: 1/4” (6.3 mm) and 1/2” (12.55 mm). For each tape width there are two track geometries, Normal Density and Double Density, and there are also three tape speeds, nominally Slow, Medium and Fast (a further variation is caused by each of the three speeds being slightly different according to whether 44.1 kHz or 48 kHz sampling is used). According to the above, there must be twelve combinations (two widths x two densities x three speeds), all of which conform to the DASH format. This could make life confusing, but just because a particular combination of parameters is possible, it doesn't necessarily mean that a machine will be built to accommodate it.

The original Sony 3324, and recent 24-track machines, use the normal density geometry on 1/2” tape which allows twenty-four digital audio tracks, two analog cue tracks, a control track and a timecode track. (The cue tracks are there so that audio can be made available in other than normal play speed +/- normal varispeed). The tape speed at 44.1 kHz is 70.01 cm/s.

We are now accustomed to new products and systems which offer new features yet are compatible with material produced on earlier versions, of any vintage. The 3324 is totally two-way compatible with the larger 3348 which can record forty-eight digital tracks on the same tape. To give an example, you may start a project on a 3324, and then the producer decides as the tracks fill up that he or she really needs more elbow room for overdubs. So you hire a 3348, put the twenty-four track tape on this and record another twenty-four tracks in the guard bands left by the other machine. Continuing my (hypothetical) example, when it is decided that the project is costing too much and going nowhere, the producer is sacked and another one brought in who decides that the extra twenty-four tracks are unnecessary embellishments and the original tracks, with a little touching up, are all that are required. Off goes the 3348 back to the hire company, the tape - now recorded with forty-eight tracks - is placed back on the 3324 and the original twenty-four tracks are successfully sweetened and mixed with not a murmur from the tracks that are now not wanted. This must be audio history's only example of forward as well as reverse compatibility. It shows what thinking ahead can achieve.

DASH Operation

The first thing you are likely to want to do with your new DASH machine is of course to make a recording with it, but it would be advisable to read the manual before pressing record and play. Some of the differences between digital and analog recording stem from the fact that the heads are not in the same order. On an analog recorder we are used to having three heads: erase, record and play. DASH doesn't need an erase head because the tape is always recorded to a set level of magnetism which overwrites any previous recordings without further intervention.

So the first head that the tape should come across should be the record head. Right? Wrong. The first head is a playback head, which on a basic DASH machine is followed by a record head only. If this seems incorrect, you have to remember that while analog processes take place virtually instantaneously, digital operations take a little time. So if you imagine analog overdubbing where the sync playback signal comes from the record head itself, you can see why this won't work in the digital domain. There will be a slight delay while the playback signal is processed, and another delay while the record signal is processed and put onto tape, 105 milliseconds in fact, which corresponds to about 75 mm of tape. To perform synchronous overdubs there has to be a playback head upstream of the record head, otherwise the multitrack recording process as we know it just won’t work. For most purposes two heads are enough, and a third head is available as an option if you need it, and you'll need it if you want to have confidence monitoring. (There are no combined record/playback heads, by the way, all are fixed function).

On any digital recording medium the tape has to be formatted to be used. On DAT the formatting is carried out during recording, but on DASH it is often better to do it in advance. The machine can format while recording - in Advance Record mode - and the machine will lay down timecode simultaneously, but this is best done in situations where you will be recording the whole of the tape without stopping, as would typically happen in classical sessions. If you wish, you can ‘pre format’ a tape but this obviously takes time. You can take comfort from the fact that it can be done in one quarter of real time. Since there are different ways to format a tape and make recordings, the 3324S has three different recording modes: Advance, Insert and Assemble. Advance mode is as explained above. Insert is for when you have recorded or formatted the full duration of the material and you want to go back and re-record some sections. Assemble is when you want to put the tape on, record a bit, play it back, record a bit more etc.

Converter Delay

The main text deals with some of the implications of delays caused by the process of recording digital signals onto tape and playing them back again. There is another problem caused by delays in the A/D conversion itself.
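The link between the 105 millisecond processing delay and the length of tape it represents is a one-line calculation. The sketch below assumes the 70.01 cm/s tape speed that this chapter quotes for 44.1 kHz operation; the function name is invented for the illustration.

```python
def tape_length_mm(delay_s, tape_speed_cm_per_s):
    """Length of tape that passes a fixed point during the given delay, in millimeters."""
    return delay_s * tape_speed_cm_per_s * 10.0   # cm to mm

print(round(tape_length_mm(0.105, 70.01), 1))     # 73.5
```

That is close to the figure of about 75 mm given in the text, and it is the distance by which the playback head has to sit upstream of the record head for overdubs to stay in sync.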

Imagine the situation where you are punching into a track on an analog recorder to correct a mistake. You will probably set up the monitoring so that you and the performer can hear both the output from the recorder and the signal to be recorded. The performer will play along with his part until the drop in, when the recorder will switch over to monitor the input signal. This will be returned to the console and you will hear the level go up by approximately 3dB because you are now monitoring the same signal via two paths. The convertors used in the Sony 3324S, for example, while being very high quality, have an inherent delay of about 1.7 milliseconds. So when you have made the punch in, using the monitoring arrangement described above, you will hear the input signal added to the same signal returned from the recorder but delayed by about 1.7 ms. This will cause phase cancellation and an odd sound. Fortunately, Sony have included an analog cross fade circuit which will imitate what is happening in the digital domain, but without the delay. This is a good feature. On the 3324S you can make a cross fade punch in of up to about 370 milliseconds.

Editing

DASH was designed to be a cut-and-splice editing format. Briefly, this is possible but it was found in practice that edits were often unreliable. Editing of DASH tapes is now done by copying between two machines synchronized together with an offset. Two synchronized 24-track machines are obviously more versatile in this respect than one 48-track.

Maintenance

Although an analog recorder can be, and should be, cleaned by the recording engineer in the normal course of studio activities, a DASH machine should only be cleaned by an expert, or thousands of dollars worth of damage can be caused. Cotton buds, as used for analog recorders, will clog a DASH head with their fibers. The heads can be cleaned with a special chamois-leather cleaning tool, wiping in a horizontal motion only. Likewise, an analog recorder can be aligned by a knowledgeable engineer, but alignment of a DASH machine is something that is done every six months or so by a suitably qualified engineer carrying a portable PC and a special test jig in his tool box.
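The phase cancellation described under Converter Delay can be made more concrete. When a signal is mixed with an equal-level copy of itself delayed by a fixed time, complete cancellation occurs at every frequency where the delay equals an odd number of half cycles. The sketch below assumes the two paths are at exactly equal level, which a real monitor mix may not be, and the function name is made up for the illustration.

```python
def notch_frequencies(delay_s, count):
    """First few frequencies cancelled when a signal is summed with an equally loud delayed copy."""
    # Cancellation needs the delay to be an odd number of half periods:
    # delay = (2k + 1) / (2 f), so f = (2k + 1) / (2 * delay).
    return [(2 * k + 1) / (2.0 * delay_s) for k in range(count)]

print([round(f) for f in notch_frequencies(0.0017, 3)])   # [294, 882, 1471]
```

With a 1.7 ms delay the first notch falls at around 294 Hz, well inside the range of most instruments and voices, which is why the result sounds odd rather than merely different.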

Also. remote ports. when an analog project is recorded on twin 24-track recorders. . With the aid of its human assistant it can even align the heads and tape tension. it is often considered more convenient for editing to copy the tapes to a Sony 3348. error rates.months or so by a suitably qualified engineer carrying a portable PC and a special test jig in his tool box. with confidence that tapes will be replayable after many years. The PC runs special service software which can interrogate just about every aspect of the DASH machine checking head hours. The single 3348 is far faster and more responsive than synchronized analog machines. sampler card etc etc. making the mixing process faster and smoother. Current significance The current significance of DASH is as a machine that can record onto a relatively cheap archivable medium.

MDM

The original modular digital multitrack was the Alesis ADAT. On its introduction it was considered a triumph of engineering to an affordable price point. The ADAT (Alesis Digital Audio Tape) was closely followed by the Tascam DTRS (Digital Tape Recording System) format.

[Figure: Alesis ADAT-XT (left) and Tascam DA98-HR (right)]

There are certain similarities:
• Both formats are capable of 8 tracks.
• Multiple machines can be easily synchronized to give more tracks.
• Recordings are made on commonly available video tapes: ADAT takes S-VHS tapes, DTRS takes Hi-8.
• Tape needs to be formatted before use. Formatting can take place during recording, but this is only appropriate when a continuous recording is to be made for the entire duration of the tape.
• Very maintenance-intensive. For a 24-track system, four machines (4 x 8 = 32) are necessary to account for the one that will always be on the repair bench.
• High resolution versions available (ADAT 20-bit, DTRS 24-bit, 96 kHz, 192 kHz, with reduced track count).

The differences are these:
• Maximum record time: ADAT - 60 minutes, DTRS - 108 minutes
• ADAT popular in budget music recording studios
• DTRS popular in broadcast and film post-production

One further difference is that it is probably fair to say that the ADAT has reached the end of its product life-cycle, although there are undoubtedly still plenty of them around and in use. DTRS however is still useful as a tape-based system offering a standard format and cheap storage.

Check Questions
• Was DAT originally intended as a professional or a domestic recording medium?
• What is the sampling rate of standard DAT?
• What is the resolution of standard DAT?
• What is 'azimuth recording'?
• Describe the head wheel in a DAT recorder.
• What is SCMS?
• What is the distinguishing feature of a DAT machine capable of near-simultaneous off-tape monitoring?
• What is the sub-code area of the DAT tape used for?
• What is 'interleaving'?
• What is the width of the tape used for 24-track DASH?
• What is the width of the tape used for 48-track DASH?
• Describe how 24-track and 48-track DASH machines are compatible.
• How are DASH tapes edited?
• In DASH, why does a playback head come before the record head in the tape path?
• Comment on the cleaning requirements of DASH.
• How many tracks does a modular digital multitrack (MDM) have?
• How can more tracks be obtained?
• Comment on the types of usage of ADAT and DTRS machines.

Appendix 1: Sound System Parameters

Level

A large part of sound engineering involves adjusting signal level: finding the right level or finding the right blend of levels. The level of a real sound traveling in air can be measured in µN/m² (micronewtons per square meter, or µPa, micropascals, if you prefer), or more practically in dB SPL, where 0 dB SPL corresponds to 20 µN/m². The level of a signal in electrical form can be measured in volts, naturally, or it can be measured in dB. The problem is that decibels are always a comparison between two levels. For acoustic sounds, the dB SPL works by comparing a sound level with the reference level 20 µN/m² (the threshold of hearing). Therefore we need a reference level that works for voltage.

Going back in history, early telecommunication engineers were interested in the power that they could transmit over a telephone line. They decided upon a standard reference level for power, which was 1 mW (1 milliwatt, or one thousandth of a watt). This was subsequently called 0 dBm. The ‘m’ simply indicates that any measurement in dBm is referenced to 1 mW. Today in audio circuitry, we are not too concerned about power except at the final end product, the output of the power amplifier into the loudspeaker. For the rest of the time we can happily measure signal level in voltage.

Going back into history again, standard telephone lines had a characteristic impedance of 600 ohms. (‘Characteristic impedance’ is a term hardly ever used in audio so explanation here will be omitted). The relationship between power, voltage and impedance is: P = V²/R. Working out the math we find that a power of 1 mW delivered via a 600 ohm line develops a voltage of 0.775 V. This became the standard reference level of electrical voltage, and it is still in use today.

There is a slight problem here. Over the years it became customary to refer to a voltage of 0.775 V as 0 dBm. This is not wholly correct. It is only true when the impedance is 600 ohms, which is not necessarily the case in audio circuitry. Despite this, any reference you find to 0 dBm, in practice, means 0.775 V regardless of what the impedance is.
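The step from 1 mW in 600 ohms to 0.775 V can be checked with a few lines of Python. This is only a sketch of the arithmetic from P = V²/R; the function name is made up for illustration and is not taken from any audio library.

```python
import math

def voltage_for_power(power_watts, impedance_ohms):
    """Voltage that delivers power_watts into impedance_ohms (from P = V^2 / R)."""
    return math.sqrt(power_watts * impedance_ohms)

# 0 dBm is 1 mW; the historical telephone line impedance is 600 ohms.
reference_volts = voltage_for_power(0.001, 600.0)
print(round(reference_volts, 3))  # 0.775
```

Changing the impedance changes the answer, which is exactly why 0.775 V only equals 0 dBm in a 600 ohm circuit.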

Technical sound engineers abhor inconsistencies like this, so a new unit was invented: dBu, where 0 dBu is 0.775 V, without any reference to impedance. The ‘u’ doesn't stand for anything in particular. ‘dBu’ is sometimes written ‘dBv’ (note lower case ‘v’). Confusingly there is also another reference: dBV (note upper case ‘V’), where 0 dBV is 1 volt. In summary:

0 dBm = 1 mW
0 dBu = 0.775 V
0 dBv = 0.775 V
0 dBV = 1 V

There are more:

dBr is a measurement in decibels with an arbitrary quoted reference level.
dBFS is a measurement in decibels where the reference level is the full level possible in a specific item of digital audio equipment. 0 dBFS is the maximum level and any measurement must necessarily be negative, for example -20 dBFS.

All of the above (with the exception of dBFS) refer to electrical levels. We also need levels for magnetic tape and other media. Analog recording on magnetic media is still commonplace in top level music recording, and outside of the developed countries of the world. Magnetic level is measured in nWb/m (nanowebers per meter). ‘Nano’ is the prefix meaning ‘one thousandth of a millionth’. The weber (Wb) is the unit of magnetic flux. Wb/m, as used for tape, is magnetic flux per meter of track width, often loosely called ‘flux density’. Wilhelm Weber the person (pronounced with a ‘v’ sound in Europe, with a ‘w’ sound in North America), by the way, is to magnetism what Alessandro Volta is to electricity.

There are a number of magnetic reference levels in common use. Ampex level, named for the company that developed the tape recorder from German prototypes after World War II, is 185 nWb/m. NAB (National Association of Broadcasters, in the USA) level is 200 nWb/m. DIN (Deutsche Industrie Normen, in Europe) level is 320 nWb/m. In summary:

Ampex level: 185 nWb/m
NAB level: 200 nWb/m
DIN level: 320 nWb/m

It’s worth noting that none of these reference levels is better than any other, but NAB and DIN are the most used in North America and Europe respectively.

Operating Level

An extension of the concept of level is operating level. This is the level around which you would expect your material to peak. Much of the time the actual level of your signal will be lower, sometimes higher. It’s just a figure to keep in mind as the roughly correct level for your signal.

In electrical terms, the standard operating level of professional equipment is +4 dBu. There is also a semi-professional operating level of -10 dBV. This does cause some difficulty when fully professional and semi-professional equipment is combined within the same system. Either you have to keep a close eye on level and resign yourself to making corrections often, according to what combination of equipment you happen to be using, or you have to buy a converter unit that will bring semi-pro level up to pro level.

Magnetic tape also has a standard ‘operating level’ - several of them in fact. To simplify a little, since analog magnetic tape is now a minority medium, albeit an important minority: in a studio where VU meters are used, it is common to align the VU meters so that 0 VU equals +4 dBu. Tape recorders would be aligned so that a tone at 200 nWb/m gives a reading of 0 VU. In short: 200 nWb/m on tape normally equates to +4 dBu and 0 VU. Most brands of tape can give good clean sound up to 8 dB above 200 nWb/m and even beyond, although distortion increases considerably beyond that.

Digital equipment also has an ‘operating level’, of sorts. In some studios - mainly broadcast - digital recorders such as DAT are aligned so that -18 dBFS (18 dB below maximum level) is equivalent to +4 dBu and 0 VU. This certainly allows plenty of headroom (see later), but it doesn’t fully exploit the dynamic range of DAT. Most people who record digitally record right up to the highest level they think they can get away with without risk of red lights or ‘overs’.

Gain

Gain refers to an increase or decrease in level and is measured in dB. Since gain refers to both the signal level before gain was applied and the signal level after gain is applied, the function of the decibel as a comparison between two levels holds good. The signal level from a microphone, for instance, could be around 1 mV. Apply a gain of 60 dB and it is multiplied by a thousand, giving around 1 V, enough for the mixing console to munch on. Suppose the signal then needed to be made smaller, or attenuated; a gain of -20 dB would bring it down to around 100 mV. Some engineers find it fun to play around with these numbers. Your degree of fluency in the numbers part of decibels depends on whether you want to be a technical expert, or just concentrate on the audio. There is work available for both types of engineer.

The need to make a signal bigger or smaller is fairly easy to understand, but what about making it stay the same level? What kind of gain is this? The answer is ‘unity gain’ and it is a surprisingly useful concept. Unity gain implies a change in level of 0 dB. In the analog era it was important to align a recorder so that whatever level you put in on record, you got that same level out on replay. Then, apart from being spared changes in level between record and playback, you could do things like copy tapes and edit bits and pieces together, and the level wouldn’t jump. If you hadn't aligned your machines to unity gain then the levels would be all over the place. With digital equipment, it is actually the norm for digital input and output to be of the same level, so unity gain - in the digital domain at least - tends to happen automatically.
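For anyone who does want to play around with the numbers, here is a small Python sketch of the decibel arithmetic used in this section. It assumes the usual voltage relationship (20 dB per factor of ten) and the reference levels 0 dBu = 0.775 V and 0 dBV = 1 V; the function and constant names are invented for the example.

```python
import math

DBU_REF_VOLTS = 0.775  # 0 dBu
DBV_REF_VOLTS = 1.0    # 0 dBV

def db_to_voltage_ratio(gain_db):
    """Voltage ratio corresponding to a gain in dB."""
    return 10 ** (gain_db / 20)

def voltage_ratio_to_db(ratio):
    """Gain in dB corresponding to a voltage ratio."""
    return 20 * math.log10(ratio)

def apply_gain(volts, gain_db):
    """Signal level in volts after a gain of gain_db is applied."""
    return volts * db_to_voltage_ratio(gain_db)

mic_level = 0.001                            # around 1 mV from a microphone
after_preamp = apply_gain(mic_level, 60)     # about 1 V
after_pad = apply_gain(after_preamp, -20)    # about 0.1 V (100 mV)
after_unity = apply_gain(after_pad, 0)       # unity gain: unchanged

pro_volts = DBU_REF_VOLTS * db_to_voltage_ratio(4)     # +4 dBu, about 1.23 V
semi_volts = DBV_REF_VOLTS * db_to_voltage_ratio(-10)  # -10 dBV, about 0.32 V
gap_db = voltage_ratio_to_db(pro_volts / semi_volts)   # about 11.8 dB
```

The last line shows why mixing professional and semi-professional gear is awkward: the two operating levels sit nearly 12 dB apart, not the 14 dB a careless reading of '+4' and '-10' might suggest.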

RMS and Peak Levels

How do you measure the level of an AC (alternating current) waveform? Or to put it another way, how do you measure the level of an AC waveform meaningfully? A simple peak-to-peak measurement, or peak measurement, shows the height (or amplitude) of the waveform, but it doesn't necessarily tell you how much subjective loudness potential the waveform contains. A very ‘peaky’ waveform (or a waveform with a high crest factor, as we say) might have strong peaks, but it will not tend to sound very loud. A waveform with lower peaks, but greater area between the line and the x-axis of the graph, will tend to sound louder on delivery to the listener. The most meaningful measurement of level is the root-mean-square technique. Cutting out all the math, the RMS measurement tells you the equivalent ‘heating’ capability of a signal. A waveform of level 100 Vrms would bring an electric fire element to the same temperature as a direct (DC) voltage of 100 V. A waveform of level 100 Vpeak-to-peak would be significantly less warm.

Frequency Response

It is generally accepted that the range of human hearing, taking into account a selection of real live humans of various ages, is 20 Hz to 20 kHz, and sound equipment must be able to accommodate this. It is not however sufficient to quote a frequency range. It is necessary to quote a frequency response, which is rather different. In addition, we are not looking for any old frequency response, we are looking for a ‘flat frequency response’, which means that the equipment in question responds to all frequencies, within its limits, equally, and any deviations from an equal response are defined. The correct way to describe the frequency response of a piece of equipment is this:

20 Hz to 20 kHz +0 dB/-2 dB

or this:

20 Hz to 20 kHz ±1 dB

Of course the actual numbers are just examples, but the concept of defining the allowable bounds of deviation from ruler-flatness is the key.
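To put numbers on the RMS and peak-to-peak comparison made earlier, the following Python sketch computes RMS level and crest factor for one cycle of a sampled waveform. The waveforms, the sample count and the function names are all assumptions chosen for illustration.

```python
import math

def rms(samples):
    """Root-mean-square level of a list of sample values."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def crest_factor(samples):
    """Ratio of the peak level to the RMS level."""
    return max(abs(s) for s in samples) / rms(samples)

N = 1000  # samples in exactly one cycle

# A sine wave of 100 V peak-to-peak swings between +50 V and -50 V.
sine = [50.0 * math.sin(2 * math.pi * n / N) for n in range(N)]

# A square wave with the same peaks has more area under the curve.
square = [50.0 if n < N // 2 else -50.0 for n in range(N)]

print(round(rms(sine), 1))    # 35.4
print(round(rms(square), 1))  # 50.0
```

So a sine wave of 100 V peak-to-peak has a heating effect equivalent to only about 35 V DC, while the square wave with identical peaks, and a crest factor of 1, manages the equivalent of 50 V.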


Q

Q is used in a variety of ways in electronics and audio but probably the most significant is as a measure of the ‘sharpness’ of a filter or equalizer. For example, an equalizer could be set to boost a range of frequencies around 1 kHz. A low Q would mean that a wide range of frequencies is affected. A high Q would mean that only a narrow band of frequencies around the center frequency is affected. Q is calculated thus:

Q = f0/(f2-f1)

where f0 is the center frequency of the band, and f2 and f1 are the frequencies where the response has dropped by 3 dB with respect to f0. It may be evident from this that Q is a ratio and has no units. Q doesn't stand for anything either, it’s just a letter.

The range of Q in common use in audio is from 0.1 up to around 10, although specialist devices such as feedback suppressors can vastly exceed this. Whether you need to use a low Q setting or a high Q setting depends on the nature of the problem you want to solve. If there is a troublesome frequency (for example, acoustic guitars sometimes have an irritating resonance somewhere around 150 Hz to 200 Hz), then a high Q setting of 4 or 5 will allow you to home in on the exact frequency and deal with it without affecting surrounding frequencies too much. If it is more a matter of shaping the spectrum of a sound to improve it or allow it to blend better with other signals, then a low Q of perhaps 0.3 would be more appropriate.
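As a quick worked example of the Q formula, here is a Python sketch. The -3 dB frequencies of 900 Hz and 1100 Hz are hypothetical values picked to give round numbers, and the function names are made up for the example.

```python
def q_factor(f_center_hz, f_low_hz, f_high_hz):
    """Q = f0 / (f2 - f1), where f1 and f2 are the -3 dB frequencies."""
    return f_center_hz / (f_high_hz - f_low_hz)

def bandwidth_hz(f_center_hz, q):
    """Width of the affected band (f2 - f1) for a given center frequency and Q."""
    return f_center_hz / q

# An equalizer centered on 1 kHz whose response is 3 dB down at 900 Hz and 1100 Hz.
print(q_factor(1000.0, 900.0, 1100.0))  # 5.0

# A Q of 5 aimed at a guitar resonance around 175 Hz affects a band about 35 Hz wide.
print(bandwidth_hz(175.0, 5.0))  # 35.0
```

Run the other way, a gentle Q of 0.3 at 1 kHz gives a band more than 3 kHz wide, which is why low Q settings suit broad spectral shaping.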

Noise

Noise can be described as unwanted sound, or alternatively as a non-meaningful component of a sound. Noise occurs naturally in acoustics, even in the quietest settings. Air molecules are in constant motion at any temperature above absolute zero and since sound is nothing more than the motion of air molecules, then the random intrinsic motion must produce sound - sound of a very low level, but sound none the less. We are not generally aware of this source of noise, but some microphones are. A microphone with a large diaphragm will have many molecules impinging on its surface, and the random motion of the molecules will tend to average out and be insignificant in comparison with the wanted signal. A microphone with a small diaphragm however (such as a clip-on mic) will only be in contact with comparatively few air molecules so the averaging effect will be less and the noise higher in level in comparison with the wanted signal.

When sound is converted to an electrical signal, the signal is carried by electrons. Once again, electrons are in constant random motion causing what is called Johnson noise. If the signal is carried by a large current (in a low impedance circuit), then Johnson noise can be insignificant. If the signal is carried by only a small current with relatively few electrons (in a high impedance circuit), then the noise level can be much higher. We can extend this concept to any medium that can carry or store a sound signal.

Noise is caused by variations in the consistency of the medium. One more example would be a vinyl record groove. The signal is stored as undulations in the groove, but any irregularities such as dust or scratches translate into noise on playback.

Digital audio systems are not immune to noise. When a signal is converted to digital form, it is analyzed into a certain number of levels, 65,536 in the compact disc format for example. Of course, most of the time the original signal will fall between levels, therefore the analysis is only an approximation. The inaccuracies necessarily produced are termed quantization noise.

Signal to Noise Ratio

Signal to noise ratio is one measure of how noisy a piece of equipment is. We said earlier that a common operating level is +4 dBu. If all signal were removed and the noise level at the output of the console measured, we might obtain a reading somewhere around -80 dBu. This would mean that the signal to noise ratio is 84 dB. In analog equipment, a signal to noise ratio of 80 dB or more is considered good. The worst piece of equipment as far as noise is concerned is the analog tape recorder, which can only turn in a signal to noise ratio of around 65 dB. The noise is quite audible behind low-level signals. Outside of the professional domain, a compact cassette recorder without noise reduction can only manage around 45 dB. This is only adequate when used for information content only, for instance in a dictation machine, or for music which is loud all the time and therefore masks the noise.

As we said, digital equipment suffers from noise too. Quantization noise is more grainy in comparison to analog noise and therefore subjectively more annoying. Digital equipment requires a better signal to noise ratio. In basic terms, the signal to noise ratio of any digital system can be calculated by multiplying the number of bits by six. So the compact disc format with a resolution of 16 bits has a signal to noise ratio of 16 x 6 = 96 dB, if all other parts of the system are optimized. Currently the professional standard is moving to 24-bit resolution, therefore the theoretical signal to noise ratio would be 24 x 6 = 144 dB. This is actually greater than the useful dynamic range of the human ear, but in practice this idealized figure is never attained.
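The signal to noise figures above are simple subtractions and multiplications, and can be sketched in Python as follows. The function names are invented for the example. The last function is an addition to the text: it shows where the 'six dB per bit' rule of thumb comes from, namely that each extra bit doubles the number of levels and a doubling of voltage is about 6.02 dB.

```python
import math

def signal_to_noise_db(operating_level_dbu, noise_floor_dbu):
    """Signal to noise ratio as the gap between operating level and noise floor."""
    return operating_level_dbu - noise_floor_dbu

def quantization_levels(bits):
    """Number of discrete levels available at a given resolution."""
    return 2 ** bits

def digital_snr_rule_of_thumb_db(bits):
    """The 'multiply the number of bits by six' rule from the text."""
    return 6 * bits

def level_range_db(bits):
    """Ratio of the full range of levels to a single level, in dB."""
    return 20 * math.log10(quantization_levels(bits))

print(signal_to_noise_db(4, -80))        # 84
print(quantization_levels(16))           # 65536
print(digital_snr_rule_of_thumb_db(16))  # 96
print(digital_snr_rule_of_thumb_db(24))  # 144
print(round(level_range_db(16), 1))      # 96.3
```

These are theoretical ceilings only. As the text says, real converters and the analog circuitry around them never quite reach them.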

Another way of measuring the noise performance of equipment is EIN or Equivalent Input Noise, and this is mainly of relevance to microphone preamplifiers. An example spec might be 'EIN at 70 dB gain: -125 dBu (200 ohm source)'. This means that the gain control was set to 70 dB and the noise measured at the output of the mic preamp - in this case the measurement would be -55 dBu. When the set amount of gain is subtracted from this we get the amount of noise that would have to be present at the input of a noiseless mic amp to give the same result. The '200 ohm source' bit is necessary to make the measurement meaningful. If the EIN figure does not give the source impedance, then I am afraid the measurement is useless. Perhaps it is giving the game away to say that the reason a gain of 70 dB is quoted is because mic preamps normally give their optimum EIN figures at a fairly high gain. The lower the gain at which a manufacturer dare quote the EIN, the better the mic input circuit.

Modulation Noise

Noise as discussed above is a steady-state phenomenon. It is annoying, but the ear has a way of tuning out sounds that don’t change. However, there is another type of noise that constantly changes in level, and that is modulation noise. One source of modulation noise is that which occurs in analog tape recorders. The effect is that as the signal level changes, the noise level changes. This can be irritating when the signal is such that it doesn't adequately mask the noise. A low frequency signal with few higher harmonics is probably the worst case and will demonstrate modulation noise quite clearly. Quantization noise in digital systems is also a form of modulation noise. At very low signal levels it is sometimes possible to hear the noise level going up and down with the signal.

Noise reduction systems, as mainly used in analog recording, also have the effect of creating modulation noise. Noise reduction systems work by bringing up the level of low-level signals before they are recorded, and reducing the level again on playback, at the same time reducing the level of tape noise. All of the various Dolby systems, for example, work well when properly aligned. Unfortunately, the noise level is now in a state of constant change and thereby drawing attention to itself. Some noise reduction systems have means of minimizing this effect.
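Returning briefly to the EIN example earlier in this section, the calculation is a single subtraction, sketched here in Python. The function names are made up for the example, and the conversion to volts assumes the 0 dBu = 0.775 V reference.

```python
def equivalent_input_noise_dbu(output_noise_dbu, gain_db):
    """Noise referred back to the input: measured output noise minus the gain."""
    return output_noise_dbu - gain_db

def dbu_to_volts(level_dbu):
    """Convert a level in dBu to volts, taking 0 dBu as 0.775 V."""
    return 0.775 * 10 ** (level_dbu / 20)

ein = equivalent_input_noise_dbu(-55, 70)
print(ein)  # -125

# Expressed as a voltage, -125 dBu is well under a millionth of a volt.
ein_microvolts = dbu_to_volts(ein) * 1e6  # roughly 0.44 microvolts
```

A figure like this means nothing unless the source impedance is quoted with it, as explained above.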

Where you are most likely to hear modulation noise is on a so-called Hifi VHS video recorder. The discontinuous nature of the audio track causes a low frequency fluttering noise which requires noise reduction to minimize. On some machines, this noise reduction is not wholly effective and the modulation noise created can be very irritating.

It is worth saying that signal to noise ratio should be measured with any noise reduction switched out, otherwise the comparison between peak or operating level and the artificially lowered noise floor when signal is absent gives an unfairly advantageous figure unrepresentative of the subjective sound quality of the equipment in question.

Distortion

Unfortunately, any item of sound equipment 'bends' or distorts the sound waveform to a greater or lesser extent. This produces, from any given input frequency, additional unwanted frequencies. Usually, distortion is measured as a percentage. For a mixing console or an amplifier, anything less than 0.1% is normally considered quite adequate, although once again it's the analog tape recorder that lets us down with distortion figures of anything up to 1% and above.

Distortion normally comes in two varieties: harmonic distortion and intermodulation distortion. Looking at the harmonic kind first, suppose you input a 1 kHz tone into a system. From the output you will get not only that 1 kHz tone but also a measure of 2 kHz, 3 kHz, 4 kHz etc. In fact, harmonic distortion always comes in integral multiples of the incoming frequency - rather like musical harmonics in fact. This is why distortion is sometimes desirable as an effect - it enhances musical qualities, used with taste and control of course.

[Figure: Sine wave - the simplest possible sound with no harmonics]

[Figure: The effect of even-order harmonic distortion on a sine wave]

[Figure: The effect of odd-order harmonic distortion on a sine wave]

Intermodulation distortion is not so musical in its effect. This is where two frequencies combine together in such a way as to create extra frequencies that are not musically related. For instance, if you input two frequencies, 1000 Hz and 1100 Hz, then intermodulation will produce sum and difference frequencies: 2100 Hz and 100 Hz.

A third form of distortion is clipping. This is where a signal ‘attempts’ to exceed the level boundaries imposed by the voltage limits of a piece of equipment. In vintage equipment the peaks can be rounded off. In modern circuit designs the peaks of the waveform are flattened off causing a rather unpleasant sound, or strange things can happen such as the signal completely disappearing for a second or two.
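The three kinds of distortion described in this section can be illustrated with a short Python sketch. It is deliberately simplified: it lists only the frequencies involved rather than modeling any real circuit, it covers only the basic sum and difference intermodulation products mentioned in the text, and the clipping function is the idealized 'flattened off' kind. All function names are invented for the example.

```python
def harmonic_frequencies(fundamental_hz, highest_order):
    """Harmonic distortion products: integral multiples of the input frequency."""
    return [fundamental_hz * n for n in range(2, highest_order + 1)]

def intermodulation_products(f1_hz, f2_hz):
    """Sum and difference frequencies produced by two input tones."""
    return sorted({abs(f1_hz - f2_hz), f1_hz + f2_hz})

def hard_clip(samples, limit):
    """Flatten any sample that tries to exceed +limit or -limit."""
    return [max(-limit, min(limit, s)) for s in samples]

print(harmonic_frequencies(1000, 4))         # [2000, 3000, 4000]
print(intermodulation_products(1000, 1100))  # [100, 2100]
print(hard_clip([0.5, 1.5, -2.0], 1.0))      # [0.5, 1.0, -1.0]
```

Notice that the harmonic products line up with the musical harmonic series of the 1 kHz input, while 100 Hz and 2100 Hz bear no musical relationship to either of the two input tones.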

Crosstalk

Crosstalk is defined as a leakage of signal from one signal path to another. For instance, if you have cymbals or hi-hat on one channel of your mixing console and you find they are leaking through to the adjacent channel, then you have a crosstalk problem. Crosstalk can consist of the full range of audio frequencies, in which case there is a resistive path causing the leakage. More often crosstalk is predominantly higher frequencies, which jump from one circuit track to another through capacitance. In analog tape recorders, an effect known as fringing allows low frequencies to leak into adjacent tracks on replay. The worst problem caused by crosstalk is when timecode leaks from its allocated track or channel into another signal path. Timecode - used to synchronize audio and video machines - is an audio signal which to the ear sounds like a very unpleasant screech. It only takes a little crosstalk to allow timecode to become audible.

Headroom

I have already mentioned the concept of operating level which is the 'round about' preferred level in a studio. This would typically be +4 dBu in a professional studio. But above operating level there needs to be a certain amount of headroom before the onset of clipping. This is most important in a mixing console where the level of each individual signal can vary considerably due to: 1) less than optimal setting of the gain control, 2) gain due to EQ, or perhaps 3) unexpected enthusiasm on the part of a musician. Also, when signals are mixed together, the resulting level isn't always predictable. Professional equipment can handle levels up to +20 dBu or +26 dBu, therefore there is always plenty of headroom to play with. Of course, the more headroom you allow, the worse the signal to noise ratio, so it is always something of a compromise.

In recording systems, it is common to reduce headroom to little or zero. The recording system is at the end of the signal chain and there are fewer variables. Nevertheless, it does depend on the nature of the signal source. If it is a stereo mix from a multitrack recording, then the levels are known and easily controllable, therefore hardly any headroom is required. If it is a recording of live musicians in a concert setting, then much more headroom must be allowed because of the more unpredictable level of the signal, and also because there isn't likely to be a second chance if clipping occurs.

Wow and Flutter

The era of wow and flutter is probably coming to an end, but it hasn't quite got there yet so we need some explanation. Wow and flutter are both caused by irregularities in mechanical components of analog equipment such as tape recorders and record players. Wow causes a long-term cyclic variation in pitch that is audible as such. Flutter is a faster cyclic variation in pitch that is too fast to be perceived as a rise and fall in pitch. Wow is just plain unpleasant. You will hear it most often, and at its worst, on old-style juke boxes that still use vinyl records. Flutter causes a ‘dirtying’ of the sound, which used to be thought of as wholly unwelcome. Now, when we can have flutter-free digital equipment any time we want it, old-style analog tape recorders that inevitably suffer from flutter to some extent have a characteristic sound quality that is often thought to be desirable. Wow and flutter are measured in percentage, where less than 0.1% is considered good.
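Two of the figures in the sections above lend themselves to a quick calculation, sketched here in Python. The headroom function just subtracts operating level from the maximum level the equipment can handle. The second function is an assumption added for illustration: it treats a wow and flutter percentage as a steady speed error and converts it to a pitch error in cents (hundredths of a semitone), which is not how wow and flutter meters actually weight their readings. The function names are invented for the example.

```python
import math

def headroom_db(max_level_dbu, operating_level_dbu):
    """Headroom: the gap between operating level and the onset of clipping."""
    return max_level_dbu - operating_level_dbu

def speed_error_to_cents(percent):
    """Pitch error in cents caused by a steady speed error given in percent."""
    return 1200 * math.log2(1 + percent / 100)

print(headroom_db(20, 4))  # 16
print(headroom_db(26, 4))  # 22
print(round(speed_error_to_cents(0.1), 2))  # 1.73
```

So equipment that clips at +26 dBu offers 22 dB of headroom above a +4 dBu operating level, and a speed error of 0.1% corresponds to a pitch shift of less than two cents.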

Check Questions
• What is meant by '0 dBm'?
• What is meant by '0 dBu'?
• What operating level is commonly used by semi-professional equipment?
• What does the term 'dBFS' mean?
• What level is commonly used as the reference level for analog magnetic tape in North America?
• Which has the greater heating effect: 100 V RMS or 100 V DC?
• What is meant by 'unity gain'?
• Why is it not acceptable to quote the frequency response of a piece of equipment as '20 Hz to 20 kHz'?
• What is meant by 'signal to noise ratio'?
• What is meant by 'EIN'?
• What is modulation noise?
• What is harmonic distortion?
• What is intermodulation distortion?
• What is clipping?
• What is headroom?
