Perception: 1. Ear Physiology 2. Auditory Psychophysics 3. Pitch Perception 4. Music Perception
Lecture 3: Perception
1. Ear Physiology
2. Auditory Psychophysics
3. Pitch Perception
4. Music Perception
Dan Ellis
http://www.ee.columbia.edu/~dpwe/e4896/
2013-02-04 - 1 /24
1. Ear Physiology
[Figure: auditory pathway, labeled: outer ear, inner ear (cochlea), midbrain]
The Ear
[Figure: the ear, labeled: pinna, eardrum (tympanum), cochlea (inner ear)]
The Cochlea
Basilar Membrane (BM)
[Figure: resonant frequency (50 Hz to 16 kHz) vs. position along the cochlea (0 to 35 mm)]
http://www.wadalab.mech.tohoku.ac.jp/FEM_BM-e.html
Hair Cells
[Figure: cochlea cross-section, labeled: tectorial membrane, basilar membrane, outer hair cell (OHC), auditory nerve]
Auditory Nerve
[Figure: typical nerve signal (mV) vs. time / ms]
Nerve Responses
Onset enhancement
[Figure: spike count vs. time for a 100 ms tone burst]
Frequency selectivity
[Figure: dB SPL vs. (log) frequency, 100 Hz to 10 kHz]
Dynamic range
[Figure: spikes/sec vs. intensity / dB SPL]
One fiber: ~ 25 dB dynamic range
Hearing dynamic range > 100 dB
Auditory Models
Filterbank + nonlinearity
Sound -> outer/middle ear filtering -> cochlea filterbank -> IHC
[Figure: model output, channel (0 to 60) vs. time / s]
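The "filterbank + nonlinearity" chain can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the model used in the lecture: the two-pole resonators, the constant-Q bandwidth (center / 4), the cube-root compression, the smoothing constant, and all function names are hypothetical choices, and the outer/middle ear filtering stage is omitted.

```python
import math

def resonator(signal, center_hz, sample_rate, bandwidth_hz):
    """Two-pole resonator: a crude stand-in for one cochlear filter channel."""
    r = math.exp(-math.pi * bandwidth_hz / sample_rate)  # pole radius sets bandwidth
    theta = 2.0 * math.pi * center_hz / sample_rate      # pole angle sets center frequency
    a1, a2 = 2.0 * r * math.cos(theta), -r * r
    gain = 1.0 - r
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = gain * x + a1 * y1 + a2 * y2
        out.append(y)
        y1, y2 = y, y1
    return out

def hair_cell(channel, smooth=0.99):
    """IHC stand-in: half-wave rectify, compress, then one-pole low-pass smoothing."""
    state = 0.0
    out = []
    for y in channel:
        rectified = max(y, 0.0) ** (1.0 / 3.0)  # assumed cube-root compression
        state = smooth * state + (1.0 - smooth) * rectified
        out.append(state)
    return out

def auditory_model(signal, sample_rate, centers_hz):
    """Run every channel: band-pass filter, then the hair-cell nonlinearity."""
    return [hair_cell(resonator(signal, fc, sample_rate, fc / 4.0))
            for fc in centers_hz]

sample_rate = 16000
centers_hz = [250, 500, 1000, 2000, 4000]
tone = [math.sin(2.0 * math.pi * 1000.0 * n / sample_rate) for n in range(1600)]
channels = auditory_model(tone, sample_rate, centers_hz)
mean_level = [sum(ch) / len(ch) for ch in channels]
best_center_hz = centers_hz[mean_level.index(max(mean_level))]
print(best_center_hz)
```

For a 1 kHz tone, the channel centered nearest 1 kHz should carry the largest average output, which is the place-coding behaviour the channel-vs-time figure displays.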
2. Auditory Psychophysics
- distinction is important!
Loudness perception
Weber's law:
Loudness $L \propto I^{0.3}$
$\log(L) = 0.3 \log(I)$
$\log_{10}(L) = 0.03\,\mathrm{dB}(I)$
$\mathrm{dB}(I) = 33.3 \log_{10}(L)$
[Figure: log(L) vs. log(I)]
[Figure: log(loudness rating) vs. sound level; textbook figure: $L \propto I^{0.3}$]
Hartmann 90
tracks 19+20
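The loudness relations above can be checked numerically. The following is a small sketch assuming only the 0.3 exponent given on the slide; the function names are hypothetical.

```python
import math

EXPONENT = 0.3  # power-law exponent from the slide: L proportional to I**0.3

def loudness_ratio_from_db(delta_db):
    """Loudness ratio produced by an intensity change of delta_db decibels."""
    # log10(L) = 0.03 * dB(I)
    return 10.0 ** (EXPONENT * delta_db / 10.0)

def db_for_loudness_ratio(ratio):
    """Intensity change in dB needed to scale loudness by `ratio`."""
    # dB(I) = 33.3 * log10(L)
    return (10.0 / EXPONENT) * math.log10(ratio)

print(round(loudness_ratio_from_db(10.0), 3))
print(round(db_for_loudness_ratio(2.0), 2))
```

Under this exponent, a 10 dB increase in intensity corresponds to roughly a doubling of loudness, and the two functions are inverses of each other.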
Equal Loudness
[Figure: equal-loudness contours vs. freq / Hz, 100 to 10,000 Hz]
Masking
[Figure: masking tone and masked threshold, level / dB vs. log freq]
[Figure: masking tone with elevated masking threshold skirt, vs. freq / Bark]
Temporal effects: forward/backward masking
[Figure: masking over time / ms, 0 to 250]
tracks 23-25
Limits of Hearing
Two-interval forced-choice: X = A or B?
[Figure: stimulus intervals vs. time]
Roughly...
3. Pitch Perception
Hypothesis:
[Figure: AN excitation vs. frequency channel, with resolved harmonics marked; broader HF channels cannot resolve harmonics]
[Figure: pitch strength vs. frequency channel]
Duifhuis et al. 82
Autocorrelation
[Figure: autocorrelation per channel (freq vs. time) and summary autocorrelation vs. lag / ms (10 to 30); the common period gives the pitch]
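The idea of reading pitch from the common period in the autocorrelation can be sketched as follows. This is a simplified illustration, not the lecture's model: it autocorrelates the waveform directly rather than per cochlear channel with a summary across channels, and the search range, test signal, and function names are assumptions.

```python
import math

def autocorrelation(x, max_lag):
    """Unnormalized autocorrelation of x for lags 0..max_lag."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]

def estimate_pitch_hz(x, sample_rate, fmin=50.0, fmax=400.0):
    """Pick the lag with the largest autocorrelation inside the pitch range."""
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    r = autocorrelation(x, max_lag)
    best_lag = max(range(min_lag, max_lag + 1), key=lambda lag: r[lag])
    return sample_rate / best_lag

# Synthetic harmonic tone: 200 Hz fundamental plus two weaker harmonics.
sample_rate = 8000
f0 = 200.0
tone = [sum(math.sin(2.0 * math.pi * k * f0 * n / sample_rate) / k
            for k in (1, 2, 3))
        for n in range(800)]

pitch_hz = estimate_pitch_hz(tone, sample_rate)
print(pitch_hz)
```

Because the sum at each lag has fewer terms as the lag grows, the unnormalized autocorrelation favours the shortest common period over its multiples, which avoids octave-down errors on this clean test tone.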
Competing Cues
$\arg\max \frac{\Pr(\cdot \mid x_1)\,\Pr(\cdot \mid x_2)}{\Pr(\cdot)}$
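One way to read the arg max expression above is as a rule for fusing two cues: score each candidate by the product of its per-cue posteriors divided by its prior, then pick the highest score. The sketch below assumes a small discrete candidate set; the candidate values, probabilities, and function name are made up for illustration.

```python
def combine_cues(candidates, post_given_x1, post_given_x2, prior):
    """Return the candidate maximizing Pr(c|x1) * Pr(c|x2) / Pr(c)."""
    scores = [p1 * p2 / p0
              for p1, p2, p0 in zip(post_given_x1, post_given_x2, prior)]
    best_index = max(range(len(candidates)), key=lambda i: scores[i])
    return candidates[best_index], scores

# Hypothetical example: three candidate pitches in Hz, two cues, uniform prior.
candidates = [100, 200, 400]
prior = [1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0]
post_given_x1 = [0.5, 0.4, 0.1]   # cue 1 slightly prefers 100 Hz
post_given_x2 = [0.1, 0.6, 0.3]   # cue 2 prefers 200 Hz

best, scores = combine_cues(candidates, post_given_x1, post_given_x2, prior)
print(best)
```

Here neither cue is decisive alone, but the candidate that both cues find plausible wins once they are combined.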
4. Music Perception
Grey 75
instruments
notes
rhythm
Scene Analysis
common onset
common harmonicity
[Figure: frequency (2000 to 8000) vs. time / s]
Pierce 83
Consonance
Musical intervals
Pitch Helix
E4896 Music Signal Processing (Dan Ellis)
relate to harmonic proximity
Rhythm
Sensitive to periodicity
speech? breathing? brain?
Onsets + autocorrelation?
variations in tapping
4/4 vs 3/4
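The "Onsets + autocorrelation?" suggestion above can be sketched as a tempo estimator: autocorrelate an onset-strength envelope and take the strongest lag within a plausible beat range. This is a toy illustration under assumptions: the envelope is a synthetic impulse train rather than one derived from audio, and the frame rate, tempo range, and function names are hypothetical.

```python
def autocorrelation(x, max_lag):
    """Unnormalized autocorrelation of x for lags 0..max_lag."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]

def estimate_tempo_bpm(onset_env, frame_rate, min_bpm=60.0, max_bpm=200.0):
    """Tempo from the strongest autocorrelation lag inside the BPM range."""
    min_lag = int(round(60.0 * frame_rate / max_bpm))
    max_lag = int(round(60.0 * frame_rate / min_bpm))
    r = autocorrelation(onset_env, max_lag)
    best_lag = max(range(min_lag, max_lag + 1), key=lambda lag: r[lag])
    return 60.0 * frame_rate / best_lag

# Synthetic onset envelope: 8 s at 100 frames/s with an onset every 0.5 s.
frame_rate = 100
onset_env = [1.0 if n % 50 == 0 else 0.0 for n in range(800)]

tempo_bpm = estimate_tempo_bpm(onset_env, frame_rate)
print(tempo_bpm)
```

Real onset envelopes also produce strong peaks at multiples and subdivisions of the beat, so distinguishing, say, 4/4 from 3/4 needs more than the single best lag.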
Sequences
[Figure: tone sequence, frequency (around 1 kHz) vs. time]
TRT: 60-150 ms
f: 2 octaves
track 36
Summary