document
document
Summary
The accurate reproduction of acoustic real-world environments is becoming of increasing importance in hearing
device research and development. It is thereby often required to accurately predict the sound pressure at the
microphones of a hearing device in a simulated or recorded acoustic environment. For that reason, an extensive
set of head-related transfer functions (HRTFs) was measured in free-field with a pair of behind-the-ear (BTE)
hearing aids placed on a Head and Torso Simulator (HATS). Transfer functions to the in-ear HATS microphones
were also measured. A spherical head model was applied to extend the useable frequency range towards low
frequencies. Special care was given to preserve the phase properties of the measurements so that the HRTFs could
be widely used in phase-sensitive technical applications, including the evaluation of spatial signal processing
algorithms (i.e. directional microphones, beamformers) in hearing devices and the evaluation of sound field
resynthesis methods. The extended HRTF set can also be used for research in psychoacoustics. It is available for
download at: http://www.nal.gov.au/download/HATS_BTE_hrirDatabase.zip.
PACS no. 43.66.Pn, 43.66.Qp, 43.20.Fn, 43.66.Ts
837
ACTA ACUSTICA UNITED WITH ACUSTICA Oreinos, Buchholz: Measurement of HRTFs
Vol. 99 (2013)
Table I. The coordinates of the 1784 points on the sphere where free-space configuration. Instead, the loudspeaker magni-
the HRTFs and BTE transfer functions were measured. Resolu- tude response was equalized, with a 512-tap FIR filter,
tion of elevation range: 5◦ . Number of measurements: N using a measurement provided by our collaborators, and
the magnitude responses of the BTE microphones were
Elevation range Azimuth resolution N
equalized, with a 64-tap FIR filter, after being measured
◦
[−40 , −40 ]◦
5 ◦
1224 inside a hearing aid test box. The frequency response of
(−60◦ , −40◦ ) ∪ (40◦ , 60◦ ) 8◦ 270 the HATS ear simulators (4158C and 4159C) were already
(−70◦ , −60◦ ] ∪ [60◦ , 70◦ ) 10◦ 144 calibrated up to 20 kHz, on purchase of the product, so
(−80◦ , −70◦ ] ∪ [70◦ , 80◦ ) 15◦ 96 no further correction was necessary. The gain necessary
(−90◦ , −80◦ ] ∪ [80◦ , 90◦ ) 30◦ 48 to compensate for the in-ear microphones/amplifiers sen-
−90◦ , 90◦ 360◦ 2
sitivity was estimated by comparing the measured (0◦ , 0◦ )
HRTF to the calibration chart provided by B&K, for the
specific HATS that was used, as well as by comparing
purpose, it had to be placed on a custom-built mount that the measured HRTFs to the ITU-T P.58 recommendation
was shaped around the top of the head using a thermo- (“Head and torso simulator for telephonometry”) values
plastic material (polymorph pellets). Figure 1b shows the for HRTFs at 0◦ , 90◦ , 180◦ , 270◦ on the horizontal plane.
setup used for the upside down mounting of the HATS. For The sensitivity of the four BTE microphones/amplifiers
those measurements, the source (loudspeaker) coordinates was compensated for by minimizing the RMS error be-
(azimuth θ and elevation δ) were referenced to the upside tween their individual magnitude responses, averaged over
down HATS according to the transformations: δ = −δ and all source directions, and the corresponding averaged in-
θ = −θ, so that the role of the ears remained consistent. ear microphone responses in the frequency range of 300–
The use of asymmetric pinnae for the left and right ears of 600 Hz.
the HATS dictated this choice. If both pinnae were exactly
the same, another option would have been to transform the
coordinates according to δ = −δ and θ = θ, while ex- 2.2. Spherical head model realisation
changing the roles of left and right side microphones (in- In this work, a spherical head model was applied to extend
ear and BTE). the useable frequency range of the HRTFs, i.e. to extrapo-
In both measurement conditions (standing upright and late information to lower frequencies, while preserving the
upside down) the accurate alignment of the HATS was fa- sensitive phase information of the HA microphones. This
cilitated by using two laser beam pointers mounted on the model approximates the human head by a rigid sphere and
hoop at an arc distance of π/2. Correct alignment in all as such, it does not take into account the acoustic reso-
three axes could be attained by targeting the laser beams nances and scattering by the pinnae, at high frequencies,
to the ear canals and the tip of the nose of the HATS. That or the shadowing effects of the torso. Although it is an ide-
resulted in effectively aligning the centre of the interaural alised model, it has been suggested that it can be used to
axis with the centre of the hoop, which is the centre of the significantly improve the low frequency accuracy of mea-
coordinate system against which the measurements were sured HRTFs [15, 16, 17]. Using the Fourier-Bessel series
referenced. An observation that will prove useful in sec- expansion of the wave equation solution [18] for the in-
tion 2.2 is that the centre of the interaural axis does not cident (interior problem) and scattered (exterior problem)
coincide with the centre of the HATS’ head. sound field created by a single point source positioned at
In order to further verify the correct alignment of the (rs , θs , δs ) and imposing the boundary condition of zero
setup, all loudspeaker locations with elevation −55◦ ≤ δ ≤ total radial velocity on the rigid sphere of radius r = a, the
−40◦ were measured with the HATS standing both up- pressure on its surface can be computed as [19]
right and upside down. The corresponding transfer func-
tions were compared during the measurement process and p(r = a, θ, δ) = (1)
were found to match very well (absolute magnitude er- ∞
h−m (krs )
i−1
ror between the upright and upside down HRTF measure- (2m + 1)Pm (cos γ),
ments, averaged across all available overlapping locations, m=0 (ka)2 h−m (ka) h−0 (krs )
−55◦ ≤ δ ≤ −40◦ and 0◦ ≤ θ < 360◦ , < 1 dB up to at
least 4–5 kHz and < 2 dB up to at least 10 kHz), confirm- where h−m is the mth-order spherical Hankel function of the
ing the centred placement of the HATS in both conditions. second kind, h−m its derivative and γ is the angle between
Table I summarizes all 1784 measurement positions. the vector of the point source and that of the observation
HRTFs are typically normalized by the free-field re- point. Pm is the mth-degree Legendre polynomial defined
sponse measured with the same loudspeaker and micro- as
phones as used in the HRTF measurements, but with the 1 dm 2
microphones placed in the origin of the applied coordinate Pm (x) = (x − 1)m . (2)
2m m! dxm
system (here the centre of the hoop) and the head (HATS)
removed [1]. Unfortunately, due to time restrictions on the Equation (1) assumes waves normalized so that they have
use of the facility, the loudspeaker response couldn’t be unit amplitude and zero phase at the origin, having an im-
measured with the in-ear and the BTE microphones in a plied e+iωt time dependence.
838
Oreinos, Buchholz: Measurement of HRTFs ACTA ACUSTICA UNITED WITH ACUSTICA
Vol. 99 (2013)
Table II. Spherical head model parameters. centre of the hoop and [xic , 0, zic ] is the position vector
of the centre of the hoop (centre of the interaural axis)
Parameter Value referenced to the centre of the HATS head. This change of
Head radius 0.1 m coordinate systems is necessary so that, for example, the
Ear canal entrance locations L: (100◦ , −11◦ ) angle γ between the vector of the left ear (100◦ , −11◦ ), as
R: (−100◦ , −11◦ ) referenced to the HATS’ centre system, and the leftmost
Front BTE microphone locations L: (104◦ , 10◦ ) source point, whose coordinates expressed in the system
R: (−104◦ , 10◦ ) centred at the midpoint of the interaural axis are (90◦ , 0◦ ),
Rear BTE microphone locations L: (109◦ , 10◦ ) is correctly calculated to be zero.
R: (−109◦ , 10◦ )
“Point” source distance 1.2 m One-sided pressure spectra were calculated with the
spherical head model for each of the 6 microphone lo-
cations (see Table II) at 4097 (nfft = 8192) equidistant
frequency points between 0 Hz and fs /2. The correspond-
The spherical head model response was computed using ing real-valued head-related impulse responses (HRIRs)
the parameters shown in Table II. Given the non-spherical were then derived after first “mirroring” the resulting spec-
nature of the HATS’ head as well as the existence of tra to negative frequencies by applying the complex con-
the torso, which was disregarded by the applied spherical jugate transformation: H (−iω) = H ∗ (iω) and then apply-
head model, the parameters of TABLE II were adjusted ing an inverse (discrete) Fourier transform (nfft = 8192).
in a more heuristic, rather than rigorous, way. The head- Note that the nfft was chosen long enough to avoid time-
related values were chosen by comparison to their loca- aliasing after the successive time shifting and filtering of
tions on the HATS while iteratively minimizing the RMS the impulses.
error of the modelled and measured ILDs and ITDs. The
sound source distance was taken to be exactly the radius
of the hoop where the speaker was mounted. The result- 2.3. Combination of the head model with the mea-
ing mean absolute errors, averaged across all 1784 mea- sured data
dB
surement points, were |ILDdB meas − ILDmod | < 0.9 dB and A primary question that needed to be addressed was how
|ITDmeas − ITDmod | < 46 µs, where the ILDs were esti- to combine the measured HRTFs with the transfer func-
mated at the region 300–600 Hz. A formal optimisation tions derived from the spherical head model at low fre-
was impractical due to the computational complex nature quencies. Algazi et al. [14] have proposed cross-fading
of the spherical head model. the magnitudes of the model and the data (linearly in
It should be mentioned that alternative approaches exist dB) while either keeping the entire phase response of the
to the spherical head model, e.g. the snow-man model de- model, or re-inserting the estimated time of arrival of the
scribed by Algazi et al. [14], which are expected to provide data impulses to the minimum-phase inverse-DFT of the
a more accurate representation of the low frequency re- combined (cross-faded) magnitude responses. The latter
sponse of the HRTFs. However, the spherical head model method, called minimum-phase reconstruction [15], has
already resulted in very small mean absolute ITD and ILD been perceptually verified for sound localization. How-
errors and thus, such complex models were not further ever, the measured transfer functions of this HRTF set
considered here. will be used as inputs to hearing aids whose processing
In order to maximize the computational efficiency, is very different from human auditory processing. In par-
equation (1) was computed using a set of recursive for- ticular, multi-microphone signal enhancement techniques
mulas as described by Duda et al. [16] instead of being di- (e.g., directional microphones and adaptive beamformers)
rectly computed via the analytic expression. Given that the are highly sensitive to very small phase and amplitude
origin of the coordinate system used in the above spherical variations. Hence, the HRTFs must preserve as accurately
head model is defined in the centre of the spherical head as possible phase and amplitude relationships, especially
and thus is different from the one used in the HRTF mea- between the individual BTE hearing aid microphone pairs.
surements (see section 2.1), the source positions (radius In this regard the method proposed by Algazi et al. [14] did
rs , azimuth θs , elevation δs ) to be used for the spherical not provide satisfactory results and needed to be modified.
head model computation had to be derived from the source The combination method pursued in this study consists
positions of the HRTF data set according to of time aligning the model’s impulse responses to the cor-
y responding measured impulse responses and then com-
θs = arctan , bining them using “crossover” type low-pass and high-
x + xic pass filters with cut-off frequencies in the regions of 400–
z + zic
δs = arctan , (3) 500 Hz. First, the time of arrival (TOA) for all 1784 mea-
(x + xic )2 + y 2 sured and modelled HRIRs was estimated separately for
rs = (x + xic )2 + y 2 + (z + zic )2 , the in-ear and BTE microphones according to the method
proposed by Nam et al. [20] as the time instant of the max-
where [x, y, z] = [rs cos θs cos δs , rs sin θs cos δs , rs sin δs ] imum peak of the absolute of the cross-correlation func-
is the position vector of a given source, referenced to the tion between a given HRIR and its minimum-phase ver-
839
ACTA ACUSTICA UNITED WITH ACUSTICA Oreinos, Buchholz: Measurement of HRTFs
Vol. 99 (2013)
sion,
ITU−T P.58 recom. values
τ̂ = argmax h[n − τ]hmp [n] . (4) 20 Left in-ear mic.
τ n Right in-ear mic.
15
Magnitude (dB)
Since the BTE hearing aid microphones were only spaced
apart by dmic = 0.009 m, their maximum TOA difference 10
across all source locations was about 26 µs. Considering
5
that the applied sampling frequency was fs = 44.1 kHz
and thus, the resolution of the applied TOA estimation 0
method was only T = 1/fs ≈ 22.7 µs, the HRIRs had
to be up-sampled (a factor of 100 was chosen) before the −5
2 4
10 103 10
cross-correlation operation of equation (4) was performed. Frequency (Hz)
An alignment delay was then calculated from the esti-
mated TOA of the modelled HRIRs, τ̂mod , and the corre- Figure 2. Processed (0◦ , 0◦ ) HRTF magnitude response, extended
sponding measured HRIRs, τ̂meas , given by to low frequencies using the spherical head model, compared
against the values of the ITU-T P.58 recommendation.
[τ̂mod − τ̂meas ]
Δ= . (5)
100
otherwise be introduced by adding the low-pass and high-
To preserve even the smallest phase differences between pass HRTF components. The filtering was done in the fre-
each BTE microphone pair, the average alignment delay quency domain, as mentioned above, and an impulse re-
for each microphone pair was always applied to both mi- sponse was computed taking the inverse DFT (as in sec-
crophones. In this way any (small) TOA estimation er- tion 2.2) of the combined modelled/measured conjugate-
ror only affected the absolute timing of the BTE micro- mirrored transfer function.
phone pair signals, but the inter-microphone phase dif- The final impulse responses of the combined HRTFs
ferences provided by the spherical head model were pre- were then truncated to 256 samples using a one-sided
served. Since the combination of the measured and mod- Tukey window with a tapered-to-constant-section-ratio of
elled HRTFs was performed in the frequency domain, the 1/5. Special care was taken in the truncation process to
derived alignment delays Δ were transformed into com- maintain the very short pre-ringing that was introduced by
plex gain factors, the linear phase crossover filtering process. Finally, the im-
pulse responses were saved separately to MATLAB files
gΔ(if ) = e−i2πf Δ/fs , (6) (.mat extension), with every file having the impulse re-
sponses of all six microphones relevant to a given source
which were then multiplied with the corresponding mod- position. The final HRTF data set is publically available at
elled HRTFs. No further amplitude adjustments were re- http://www.nal.gov.au/download/HATS_BTE_hrirDatabase.zip.
quired than those already described in section 2.1.
Inspired by the design of loudspeaker crossovers, 256-
tap long linear-phase FIR filters were used to realise the 3. Results
crossover between measured and time-aligned modelled
HRTFs that resembled the magnitude spectrum of a com- Figure 2 shows the derived (0◦ , 0◦ ) HRTF response (mea-
plementary pair of asymmetric Linkwitz-Riley filters. An surement combined with the spherical head model), along
8th order low-pass filter with a cut-off frequency of 500 Hz with the corresponding ITU-T P.58 recommendation’s
and a 4th order high-pass filter with a cut-off frequency of range of values, highlighting the compliance with this
400 Hz were chosen. The 4th order HP filter slope com- standard. The HRTFs at 0◦ , 90◦ , 180◦ , 270◦ on the hor-
bined with the roll-off of the measurements effectively re- izontal plane were also compared to the relevant ITU-T
sulting in a higher order HP response, almost complimen- P.58 values (not shown here) and were similarly found to
tary to the 8th order LP filter. The different slopes and comply.
cut-off frequencies of the filters was chosen to: (1) ap- The magnitude spectra of the measured, modelled, and
ply a small gain-peak of 1–2 dB around 300–500 Hz and combined HRTFs are plotted in Figure 3 for an example
in turn to provide a smoother transition between measured source direction of (60◦ , −30◦ ). Responses for the left ear
and modelled HRTFs and (2) ensure good agreement of are shown in the left figures and for the right ear in the
the combined HRTF responses with data from the ITU- right figures. The top figures refer to the front microphone
T P.58 recommendation and the HATS’ own calibration of the BTE hearing aids and the bottom figures refer to the
chart. This filter design approach in the frequency domain in-ear microphones. In the top panels of Figure 3 it can be
controlled the steepness of the slopes of the cross-over fil- seen that the overall trend of the modelled HRTFs matches
ters and thereby limited the temporal ringing of the filters with the measured HRTF of the BTE microphone. In con-
in the time domain. Choosing a linear-phase filter design trast, the measured in-ear responses deviate significantly
minimized any phase (or comb-filter) effects that could from the model at higher frequencies. This deviation is
840
Oreinos, Buchholz: Measurement of HRTFs ACTA ACUSTICA UNITED WITH ACUSTICA
Vol. 99 (2013)
20 20 measurement
model
10 10 combined data
0 0
−10 −10
Magnitude (dB)
−20 −20
2 3 4 2 3 4
10 10 10 10 10 10
20 20
10 10
0 0
−10 −10
−20 −20
2 3 4 2 3 4
10 10 10 10 10 10
Frequency (Hz)
841
ACTA ACUSTICA UNITED WITH ACUSTICA Oreinos, Buchholz: Measurement of HRTFs
Vol. 99 (2013)
842
Oreinos, Buchholz: Measurement of HRTFs ACTA ACUSTICA UNITED WITH ACUSTICA
Vol. 99 (2013)
(a)
843
ACTA ACUSTICA UNITED WITH ACUSTICA Oreinos, Buchholz: Measurement of HRTFs
Vol. 99 (2013)
istence of measurements from both in-ear and BTE mi- function estimate. Proc. International Conference on Audi-
crophones permits realistic simulations of aided scenarios tory Display, 2003, 119–122.
where sound reaches the hearing aids of the wearer but at [9] T. Nakado, T. Nishino, K. Takeda: Head-related transfer
the same time leaks through to the eardrum (this “acous- function measurement in sagittal and frontal coordinates.
Acoust. Sci. & Tech. 29 (2008) 335–337.
tic path” will be affected by the fitting of the hearing aid,
which may include ear-molds with different vent sizes). [10] H. Kayser, S. D. Ewert, J. Anemüller, T. Rohdenburg, V.
Hohmann, B. Kollmeier: Database of multichannel in-ear
and behind-the-ear head-related and binaural room impulse
Acknowledgement responses. EURASIP, J. on Advances Signal Processing
(2009).
The authors acknowledge the financial support of the
HEARing CRC, established and supported under the Co- [11] B. P. Bovbjerg, F. Christensen, P. Minnaar, X. Chen: Mea-
suring the head-related transfer functions of an artificial
operative Research Centres Program – an initiative of the head with a high directional resolution. AES 109th Con-
Australian Government. In addition, we would like to vention, Los Angeles, USA, 2000.
thank Johahn Leung, Heather C. Kelly, and Prof. Simon [12] M. Pauli, S. Favrot, J. M. Buchholz: Improving hearing
Carlile for letting us use the Auditory Neuroscience Lab- aids through listening tests in a virtual sound environment.
oratory, Department of Physiology, University of Sydney, Hearing Journal 63 (2010) 40–44.
as well as two anonymous reviewers for their helpful sug- [13] J. M. Kates: Digital hearing aids. Plural Publishing, San
gestions. Diego, CA, 2008.
[14] V. R. Algazi, R. O. Duda, D. M. Thompson: The use of
References head-and-torso models for improved spatial sound synthe-
sis. AES 113rd Convention, Los Angeles, USA, 2002.
[1] J. Blauert: Spatial hearing: The psychophysics of human [15] D. J. Kistler, F. L. Wightman: A model of head-related
sound localization. MIT, Cambridge, MA, 1983. transfer functions based on principal components analysis
[2] E. M. Wenzel, M. Arruda, D. J. Kistler, F. L. Wightman: and minimum-phase reconstruction. J. Acoust. Soc. Am.
Localization using nonindividualized head-related transfer 91 (1992) 1637–1647.
functions. J. Acoust. Soc. Am. 94 (1993) 111–123. [16] R. O. Duda, W. L. Martens: Range dependence of the re-
[3] S. Moreau, J. Daniel, S. Bertet: 3D sound field recording sponse of a spherical head model. J. Acoust. Soc. Am. 104
with higher order ambisonics. Objective measurements and (1998) 3048–3058.
validation of spherical microphone. AES 120th Conven- [17] D. S. Brungart, W. M. Rabinowitz: Auditory localization of
tion, Paris, France, 2006. nearby sources. Head-related transfer functions. J. Acoust.
[4] D. B. Ward, T. D. Abhayapala: Reproduction of a plane- Soc. Am. 106 (1999) 1465–1479.
wave sound field using an array of loudspeakers. IEEE [18] E. G. Williams: Fourier acoustics. Academic, San Diego,
Trans. Speech Audio Process. 9 (2001) 697–707. CA, 1999.
[5] N. Epain, P. Guillon, A. Kan, R. Kosobrodov, D. Sun, [19] J. Daniel: Spatial sound encoding including near field ef-
C. Jin, A. van Schaik: Objective evaluation of a three- fect: Introducing distance coding filters and a viable, new
dimensional sound field reproduction system. Proceedings ambisonics format. AES 23rd International Conference,
of 20th International Conference on Acoustics, ICA, Syd- Copenhagen, Denmark, 2003.
ney, Australia, 2010. [20] J. Nam, J. S. Abel, J. O. Smith: A method for estimating
[6] W. G. Gardner, K. D. Martin: HRTF measurements of a interaural time difference for binaural synthesis. AES 125th
KEMAR. J. Acoust. Soc. Am. 97 (1995) 3907–3908. Convention, San Francisco, USA, 2008.
[7] V. R. Algazi, R. O. Duda, D. M. Thompson, C. Avendano: [21] R. Duraiswami, D. N. Zotkin, N. A. Gumerov: Interpola-
The CIPIC HRTF database. Proc. IEEE Workshop on Ap- tion and range extrapolation of HRTFs. Proceedings of the
plications on Signal Processing to Audio and Acoustics, ICASSP ’04, 2004.
2001, 99–102. [22] S. Busson, R. Nicol, B. F. G. Katz: Subjective investi-
[8] E. Grassi, J. Tulsi, S. Shamma: Measurement of head- gations of the interaural time difference in the horizontal
related transfer functions based on the empirical transfer plane. AES 188th Convention, Barcelona, Spain, 2005.
844