Professional Documents
Culture Documents
MPM12 Rhythm PDF
MPM12 Rhythm PDF
-1
0.5 1 1.5 2 2.5
time (s)
Beginning of Another one bites the dust by Queen.
-1
0.5 1 1.5 2 2.5
time (s)
Beginning of Another one bites the dust by Queen.
-1
0.5 1 1.5 2 2.5
time (s)
Beginning of Another one bites the dust by Queen.
-1
0.5 1 1.5 2 2.5
time (s)
Beginning of Another one bites the dust by Queen.
-1
0.5 1 1.5 2 2.5
time (s)
Beginning of Another one bites the dust by Queen.
-1
0.5 1 1.5 2 2.5
time (s)
Beginning of Another one bites the dust by Queen.
-1
0.5 1 1.5 2 2.5
time (s)
Beginning of Another one bites the dust by Queen.
Hi-hat level
Beat level
Kick level
1
-1
0.5 1 1.5 2 2.5
time (s)
Beginning of Another one bites the dust by Queen.
-1
2 4 6 8 10 12 14 16 18
time (s)
1
0.5
0
0 2 4 6 8 10 12 14 16
lag (s)
Beat Spectrum.
Zafar RAFII, Spring 2012 19
Foote’s Beat Spectrum
• Use
– The beat spectrum reveals the hierarchically
periodically repeating structure of the audio
Beginning of Another one bites the dust by Queen.
1
Periodicity at
0
the measure level
-1
Sub-periodicity at 2 4 6 8 10
time (s)
12 14 16 18
0.5
Sub-periodicity at
the beat level
0
0 2 4 6 8 10 12 14 16
lag (s)
Beat Spectrum.
Zafar RAFII, Spring 2012 20
Foote’s Beat Spectrum
• Calculation
– Compute the power spectrogram from the audio
using the STFT (square of magnitude spectrogram)
4
x 10 Power spectrogram
Audio 2
1
frequency (kHz)
1.5
0 1
0.5
-1
2 4 6 8 10 12 14 16 18
time (s)
2 4 6 8 10 12 14 16 18
time (s)
2 2
frequency (kHz)
frequency (kHz)
1.5 1.5
1 1
0.5 0.5
2 4 6 8 10 12 14 16 18 0 2 4 6 8 10 12 14 16
time (s) lag (s)
Spectrogram at 10 kHz Autocorrelation at 10 kHz
1
2
1 0.5
0 0
2 4 6 8 10 12 14 16 18 0 2 4 6 8 10 12 14 16
time (s) lag (s)
Zafar RAFII, Spring 2012 22
Foote’s Beat Spectrum
• Calculation
– Compute the mean of the autocorrelations (of the
rows)
4 4
x 10 Power spectrogram x 10 Autocorrelation plots
2 2
frequency (kHz)
frequency (kHz)
1.5 1.5
1 1
0.5 0.5
2 4 6 8 10 12 14 16 18 0 2 4 6 8 10 12 14 16
time (s) lag (s)
Beat spectrum
1
0.5
0
0 2 4 6 8 10 12 14 16
lag (s)
Zafar RAFII, Spring 2012 23
Foote’s Beat Spectrum
• Notes
– The first highest peak in the beat spectrum does
not always correspond to the repeating period!
– The beat spectrum does not indicate where the
beats are or when a measure starts!
This is how you find the period…
1
0.5
0
0 2 4 6 8 10 12 14 16
This is not lag (s)
Beat Spectrum.
the periodZafar RAFII, Spring 2012 24
Foote’s Beat Spectrum
• Notes
– The beat spectrum can also be built using the
similarity matrix [Foote et al., 2001]
– A beat spectrogram can also be built using
successive beat spectra [Foote et al., 2001]
Beat Spectrogram.
3
lag (s)
0
2 4 6 8 10 12 14 16 18
time (s)
Zafar RAFII, Spring 2012 25
Foote’s Beat Spectrum
• Question
– Can we use the beat spectrum for source
separation?...
– To be continued…
𝑛
𝑘=1 𝑋 𝑘,𝑗1 𝑋(𝑘,𝑗2 )
𝑆(𝑗1 , 𝑗2 ) =
𝑛 2 𝑛 2
𝑘=1 𝑋(𝑘,𝑗1 ) 𝑘=1 𝑋(𝑘,𝑗2 )
16 0.9
14 0.8
0.7
12
0.6
time (s)
10
0.5
8
0.4
6
0.3
4
0.2
2 0.1
0
2 4 6 8 10 12 14 16 18
time (s)
Zafar RAFII, Spring 2012 29
The Similarity Matrix
• Use
– Visualize time structure of an audio [Foote, 1999]
– Identify repeating/similar patterns
Similarity matrix.
Similarity between
18 1 Very
the times at
16 0.9
similar
14 0.8
2 s and 10 s. 0.7
12
0.6
time (s)
10
Region of high 0.5
self-similarity 8
0.4
around 3 s. 6
0.3
4
0.2
frequency (kHz)
1.5
0 1
0.5
-1
2 4 6 8 10 12 14 16 18
time (s)
2 4 6 8 10 12 14 16 18
time (s)
2 2
frequency (kHz)
frequency (kHz)
1.5 1.5
1 𝑋 1 𝑋
0.5 0.5
2 4 6 8 10 12 14 16 18 2 4 6 8 10 12 14 16 18
𝑋𝑗 𝑋𝑗
time (s) time (s)
𝑋𝑗 (𝑖)
𝑋𝑗 (𝑖) =
𝑛 2
𝑘=1 𝑋𝑗 (𝑘)
Zafar RAFII, Spring 2012 32
The Similarity Matrix
• Calculation
– Compute the dot product between any two pairs
of columns and save them in the similarity matrix
4 Similarity matrix
x 10 Normalized spectrogram
18
2
16
frequency (kHz)
1.5
𝑋
14
1 12
0.5
time (s)
10
8
𝑆
2 4 6 8 10 12 14 16 18
6
𝑋𝑗1 𝑋𝑗2 𝑗1
time (s)
4
2
𝑛
𝑆(𝑗1 , 𝑗2 ) = 𝑘=1 𝑋𝑗1 (𝑘)𝑋𝑗2 (𝑘) 𝑗2
2 4 6 8 10 12 14 16 18
time (s)
8
time (s)
10
6
8
4
6
2
4
2 4 6 8 10 12 14 16 18
time (s) 2