Professional Documents
Culture Documents
28/09/2007 1
Some of the following slides are extracted from:
• http://www.cs.sjsu.edu/faculty/kim/cs286/contents/
• Seminario di Ilaria Bartolini - Corso di Sistemi Informativi 2 –
Università di Bologna
www-db.deis.unibo.it/courses/SI2/slides/MMFormati.pdf
• Relazioni di Sistemi Informativi 2 – Università di Bologna
http://www-db.deis.unibo.it/courses/SI2/onlinerels.html
• Corso di L.Mainetti – Politecnico di Milano
http://www.elet.polimi.it/people/mainetti
• http://w3.westnet.gr/mp3/make_mp3s.htm
• http://grahammitchell.com/writings/vorbis_intro.html
! "
28/09/2007 2
# $ % & '(
) '( * )
# +
,
,
28/09/2007 3
!
!
"
#
!$$ %
&
!' (
) $* +$
, +$ * +$- " "
. +$- * !/
!/ * !$
0 1$$ * 1-
2 3 !- * 1-
28/09/2007 4
!
28/09/2007 5
" #
• The sound intensity is measured in Decibel
• The decibel of a generic physic quantity is defined as:
= ' - )
• Where is a reference value.
• In audio domain the Decibel is called dBspl an is
defined as:
= ' - )
• P0 is the reference level, and it is typically set at the
sound pressure in air equal to 2*10-5 Pascal
28/09/2007 6
$%
, "3 4
- % "3
" " "
"3
5
+$ %
1$ 3 3
7$ 3
!+$ -
!6$ 3
28/09/2007 7
"
&
28/09/2007 8
28/09/2007 9
" '& $ %
Phases
• Low-pass filtering:remove frequencies larger than f max
• Sampling: measure single value
• Quantization: Relate value to interval
• Encode: assign binary code
Important factors:
• Sampling rate
– number of points used to capture the sound wave in 1 second
– unit: Hertz
• Quantization depth:
– amount of information used to store the round-off amplitude of
each sample
– unit: bits
28/09/2007 10
& $%
28/09/2007 11
& $%
28/09/2007 ! 12
& $%
8 '
"3 " " 8'
+
9 "
: : 8'
+
4 ; <<!-
3
$ ++$=- 3
4> 3
? @ 3 AB #
6$$ 6<$$ 8 C *7D
28/09/2007 13
E
+ 6 <
"3 "
28/09/2007 14
E
%
7 !1
28/09/2007 15
( $#
• All that we said is true for sources with one audio
channel (“monophonic”)
28/09/2007 16
$ $
28/09/2007 17
) * $
• The WAVE (.wav) format is developed by IBM and Microsoft.
• WAV files are the default uncompressed audio in the pulse-code
modulation (PCM) format on Windows. It is recognized by almost all
computer systems.
• Lossless storage method, which keeps all the samples of an audio
track, professional users or audio experts may use the WAV format for
maximum audio quality.
• It is limited to files that are less than 6 GB in size, due to its use of a
32 bit unsigned integer to record the file size header
• A full pop song in WAVE format may take up to 40 MB of disk space
or more.
• Say you sample at 44 KHz for stereo audio; then effectively, you will
have 44 K * 2 samples. If you are using 16 bits per sample, then given
the duration of audio, you can calculate the total size of the wave file
as:
• Size [BYTE] = sampling rate * number of channels * (bits per sample /
8) * duration in seconds
• Number of samples per second = sampling rate * number of channels
28/09/2007 18
+ ,- $
28/09/2007 19
+ ,- $
• It uses a lossy compression algorithm that is designed to
greatly reduce the amount of data required to represent
the audio recording
• The compression removes certain parts of sound that
are outside the hearing range of most people. It provides
a representation of pulse-code modulation — encoded
audio in much less space than straightforward methods,
by using psychoacoustic models to discard components
less audible to human hearing, and recording the
remaining information in an efficient manner.
• Similar principles are used by JPEG, an image
compression format.
28/09/2007 20
+ ,- $ #
• “WAV” files can be converted into MP3
files using different bit-rate, that
represent the “compression ratio”
(inversely proportional) for this
compressed format
28/09/2007 21
+ ,- $
28/09/2007 22
+ ,- $
28/09/2007 23
+ ,. / / 0
28/09/2007 24
+ " $
• The Musical Instrument Digital Interface (MIDI) is a
universally adopted language to exchange musical
information between synthesizers and computers.
• At minimum, a MIDI representation of a sound includes
values for the note's pitch, length, and volume. It can
also include additional characteristics, such as attack
and delay time.
• Since a MIDI file only represents control information, it is
far more concise than formats that record the sound
directly. An advantage is very small file size.
• MIDI devices:
– Controllers: create MIDI message (Keyboards)
– MIDI synthesizers: receive messages and create sound
28/09/2007
http://www.harmony-central.com/MIDI/Doc/tutorial.html 25
* # $
28/09/2007 28
#
28/09/2007 29
Information Technology and Arts Organizations
.( "
28/09/2007 30
.2 3
28/09/2007 31
.2
A digital videocamera is connected to a computer
and acquires video with a resolution of
320x240 pixels with a color depth of 16bit per
pixel.