Professional Documents
Culture Documents
DAS9T02 - Data Reduction
DAS9T02 - Data Reduction
- Why we use it
- How it works
│ DW-AKADEMIE │ Seite 1
>
Why we use it
│ DW-AKADEMIE │ Seite 2
>
Comparison of data rates
1 week 120 GB 30 GB 20 GB 10 GB
10 years 60 TB 15 TB 10 TB 5 TB
│ DW-AKADEMIE │ Seite 3
Capacity requirements
│ DW-AKADEMIE │ Seite 4
>
Capacity requirements
• 1 hour 120 GB 11 GB 2 GB
• HD of 500GB 4h 45 h 250 h
• low cost server 2000 GB 16 h 180 h 1000 h
• large server 5TB 40 h 450 h 2 500 h
• very large server 10 TB 80 h 900 h 5 000 h
• mass storage system 1 PB 8 000 h 80 000 h 500 000 h
│ DW-AKADEMIE │ Seite 5
>
The basic ideas of data reduction
- Digital audio contains REDUNDANCY
(meaningless data)
- e.g. 00000000000000000000000
could be expressed as ”23 x 0”
- 010111
- achieved through mathematical procedures
information
(linear PCM)
redundancy
│ DW-AKADEMIE │ Seite 6
>
Compressing Audio Files
⇓
¾ “Zipped” file using lossless
data compression sound.zip
70MB
⇓
sound.wav
¾ “Un-zipped” file, (48kHz, 24bit)
identical to the original 100MB
│ DW-AKADEMIE │ Seite 7
>
The basic ideas of data reduction
information
irrelevant information
redundant data
│ DW-AKADEMIE │ Seite 8
>
Irrelevancy in audio signals
PCM noise
inaudible
audible
signal
signal
f
│ DW-AKADEMIE │ Seite 9
>
Frequency masking
f
│ DW-AKADEMIE │ Seite 10
>
Frequency masking
f
│ DW-AKADEMIE │ Seite 11
>
Temporal masking
- low level signals after a loud sound event will not be audible
- post-masking
masking threshold
t
100ms
│ DW-AKADEMIE │ Seite 12
>
Temporal masking
│ DW-AKADEMIE │ Seite 13
Temporal masking
low level signals shortly before a loud sound event will remain inaudible
- pre-masking
masking threshold
masker
masked signal
t
20ms
│ DW-AKADEMIE │ Seite 14
>
Temporal masking
Listen to this
- The second burst is pre-masked by successive music
│ DW-AKADEMIE │ Seite 15
>
Data reduction encoding
│ DW-AKADEMIE │ Seite 16
>
Data reduction encoding
Fast
Fourier
description of a time block
Transform in the frequency domain
│ DW-AKADEMIE │ Seite 17
Data reduction encoding
│ DW-AKADEMIE │ Seite 18
Data reduction encoding
│ DW-AKADEMIE │ Seite 19
Data reduction encoding
sub-band
linear coding
filter bank
PCM or MCDT
scale factor
extraction
Fast psycho-
bit
Fourier acoustical
allocation
Transform model
│ DW-AKADEMIE │ Seite 20
Data reduction encoding
sub-band
linear coding coding
filter bank
and
PCM or MCDT
scale factor bit packing
extraction
Fast psycho-
bit
Fourier acoustical
allocation
Transform model
│ DW-AKADEMIE │ Seite 21
>
Data reduction standard
│ DW-AKADEMIE │ Seite 22
>
The MPEG standards
│ DW-AKADEMIE │ Seite 23
>
The MPEG standards
│ DW-AKADEMIE │ Seite 24
>
MPEG 1
- Coding of moving picture and associated audio for digital storage media
at up to about 1.5Mb/s
- Part 3 standardised the audio compression formats
- Three Layer were standardised
- Layer 1
- Layer 2
- Layer 3
- The three layer are downward compatible to each other
│ DW-AKADEMIE │ Seite 25
>
MPEG 1
- Layer 1
- low complexity of encoder and decoder
- low compression rate ( 4 )
- relatively high bit rates (192kb/s/ch)
- developed for Philips DCC
- outdated today
Layer 1
│ DW-AKADEMIE │ Seite 26
>
MPEG 1
- Layer 2
- medium complexity of encoder and decoder
- medium compression rate ( 6 )
- moderate bit rates ( 128kb/s/ch)
- developed for DAB
- most commonly used in the studio environment
Layer 2
Layer 1
│ DW-AKADEMIE │ Seite 27
>
MPEG 1
- Layer 3
- high complexity of encoder and decoder
- high compression rate ( 12 )
- low bit rates ( 64kb/s/ch)
- designed for signal transmission (ISDN)
- all future MPEG standards are based on Layer 3
Layer 3
Layer 2
Layer 1
│ DW-AKADEMIE │ Seite 28
>
MPEG 1
Layer 3
Layer 2
Layer 1
bit rate (kb/s/ch)
32 64 96 128 160 192 224 256
24 12 8 6 5 4 3
data reduction factor (related to 16bit/48kHz)
│ DW-AKADEMIE │ Seite 29
>
MPEG Stereo Modes
- Mono
- One channel is recorded and transmitted only
- If the input signal is stereo, the encoder will build the mono sum
- Stereo (dual mono)
- This is the true stereo mode
- Two fully independent audio channels (left and right)
will be encoded and transmitted
- Joint Stereo (intensity stereo, mid-side stereo)
- The encoder will eliminate additional redundancy of stereo signals
by coding similar signals in the left and right channel only once.
- Joint stereo provides more effective use of the bit budget
and will therefore reduce artifacts in the signal
- Joint stereo produces a less clear stereo image
│ DW-AKADEMIE │ Seite 30
Data Reduction Sound Demonstration
Layer 2 Layer 3
sub-bands 32 576
│ DW-AKADEMIE │ Seite 32
>
MPEG 2
- Low sample rate audio
- reduced sample rates, reduced audio bandwidth
- reduction of audio bandwidth is less annoying than encoding artefacts
- the compression format for Worldspace satellite radio
- multi-channel applications
- 5+1 audio channels
- used for film, video and DVD application (Europe)
- Advanced Audio Coding (AAC)
- non-backward compatible to MPEG 1
- allows very low bit rates at improved quality
- is widely used for MP3 files in the internet
- the compression format for DRM
│ DW-AKADEMIE │ Seite 33
>
Problems of data reduction
│ DW-AKADEMIE │ Seite 34
>
More Problems
│ DW-AKADEMIE │ Seite 35
>
Conclusions
- Data reduction produces high quality audio but it has its limitations
- Data reduction can be used
- to store signals more economically
- to transmit signals more economically
- to employ new transmission channels (e.g. ISDN)
- in the broadcasting environment for simple radio productions
- Data reduction should not be used
- if the signal is entitled to later sound processing
- during the production of music, drama
or any other complex audio production
- for archiving of important sound material
- if it gives no particular advantages
│ DW-AKADEMIE │ Seite 36
•