0% found this document useful (0 votes)

26 views32 pages

Evaluating Spatial Sound Systems

A Conference Presentation from the Light and Sound Interactive 2019 Conference on the objective evaluation of spatial sound reproduction systems and methods.

Uploaded by

Mark Bocko

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views32 pages

Evaluating Spatial Sound Systems

A Conference Presentation from the Light and Sound Interactive 2019 Conference on the objective evaluation of spatial sound reproduction systems and methods.

Uploaded by

Mark Bocko

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Evaluating Spatial Sound

Systems
Mark F. Bocko

Audio & Music Engineering

Audio Engineers love specs …
• Predicting which speakers will sound good …

2
How many speakers are enough?
$
NHK 22.2
$ $
$
$ $
$
$ $$ $ $ $
$ $ $ $ $
$$
$ $ $

$
Quantitatively evaluate Framework
any spatial sound Specify listening space &
1
Specify virtual acoustic
2

reproduction method in speaker placement sources to be created

any space … Compute signals driving 3

each loudspeaker
• Incorporate quantitative models of binaural (Your favorite method)

hearing into audio system design tools 4

Compute acoustic field at Compare
• Identify the computable quantities that listener (directional IR) & Assess

correspond to what listeners report they hear

(locations, spatial extent of sources, diffusiveness) Compute sound field- 5
listener interaction
(head model)
• Make the design of systems for creating spatial
audio more deterministic and less trial and error 6 7
Compute percepts Infer virtual acoustic
• Both for free space sound reproduction (binaural fusion model) source properties
• And for headphone based reproduction

4
Outline
• How the ear works – very briefly
• Meddis hair cell model

• Cross-correlation model of directional hearing

• Audio coherence and spatial hearing
• Interaural time and level differences
• Spectral coloring from source elevation
• Correlograms
• Examples
5
Human
Auditory
System

6
7
Reissner Membrane

Scala Vestibuli

Tectorial Membrane
Organ of Corti

Scala Tympani
Basilar Membrane

8
©2013 by American Physiological Society
9
Meddis Hair
5

Input Signal
0

Cell Model -5

0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05
Time (sec)
150

Cell Probability
Deflection
100

Around 3000 inner hair cells

0
along the length of the basilar
~ Firing
-50
membrane Hair
-100

-150
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05
Time (sec)

Neuron firing is
Neuronal Pulse Stream

1.2

irregular and 0.8

clustered near
0.6

0.4

signal peaks
0.2

0
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05
Time (sec)
10
Meddis Hair
5

Input Signal
0

Cell Model -5

0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05
Time (sec)
150

Cell Probability
Deflection
100

~ Firing
-50
Hair
-100

-150
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05
Time (sec)
Neuronal Pulse Stream

1.2

Spontaneous 0.8

0.6

firing rate 0.4

0.2

0
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05
Time (sec)
11
Binaural Fusion Model

ea r Low Freq
l ef t
m
Fro
High Freq

u t
tp
Ou

Site of r
Binaural ht ea
m rig
Fusion Fro
To right
cochlea
To left Represent as a bi-directional delay line
cochlea
12
Binaural fusion mechanism à 2 msec windowed cross-correlation
2 msec *
DELAY LINE FROM RIGHT EAR

DELAY LINE FROM LEFT EAR

W(T)

xr(t) t
T

t1 t2 t3
TW
W(T)
xl(t) 𝜏
T The lag where the peak in the cross-correlation
appears is the Interaural Time Difference
t

t1 - 𝜏 t2 - 𝜏 t3 - 𝜏
TW • Jeffress, L. A. (1948). A place theory of sound localization. Journal
of comparative and physiological psychology, 41(1), 35. 13
Interaural Time Difference and source direction
(in the horizontal plane)

0 50 100 Perceived ITD (direction to source) is

determined by location of the peak in the
Sl S Sr
short-time cross-correlation function
Low frequency limit of
Rayleigh diffraction around sphere

q !#
ITD = 𝑠𝑖𝑛(𝜃)
"$
30° 30°
c is the speed of sound

ITD = 0 when 𝜃 = 0
L R ITD = (3/2)*(d/c) when 𝜃 = 90°
2d
d

Note: Factor of 3/2 is due to diffraction around listeners head

14
Role of coherence in binaural hearing

3 Sec white noise bursts

S1 S2
• S1 alone
• S2 alone
• S1 + S2 the same
• S1 + S2 different

15
Demonstration of lateralization as a function of noise burst duration
• Play a series of uncorrelated stereo noise bursts of decreasing duration
(2sec 1sec 0.5sec 0.2sec 0.1sec 50msec 20msec 10msec 5msec 2msec 1msec)

Series of uncorrelated
2msec stereo noise bursts

• At about 2 msec and less, each burst is identified with a specific location
• The cross-correlation function always has a peak somewhere! But it is different each time.
• The auditory percept being computed by the brain is updated about every 2 milliseconds
16
-0.5

Auditory
10 20 30 40 50 60 70 80
Sample Number
Cross-correlation Function

“Sluggishness”
1
“L” click

Norm X-Corr
0.5

• How quickly can a listener follow time- 0

-0.5
varying binaural cues? -1

• Evidence for a 200 - 300 msec threshold

-80 -60 -40 -20 0 20 40 60 80
Lag (samples)

• Distribution of 2 msec window ITD’s has a

“memory” of 100 - 300 msec Series of L, C, R located clicks
60

30
10 msec 50 msec 100 msec 250 msec 500 msec
20

10 Your brain averages over a hundred or more 2 msec windows

0
-20 -15 -10 -5 0 5 10 15 20 25 and constructs a histogram of interaural time differences.
Histogram of ITD’s
17
Correlograms – Frequency dependent interaural time differences

u e n cy
Freq

Frequ
n cye
Del
ay
2-D (ITD & frequency) map encodes source location
Brain decodes these maps to source locations ITD
ITD à lateral position of source Stereo speaker pair – center panning
Frequency dependence à source elevation (anechoic conditions)
18
Procedure
• For a given head model …
• Compute the reference correlograms for all possible sound source directions
• Specify the multi-channel reproduction system, the influence of the room, and
the signals driving each speaker (for whatever method you choose)
• Compute the resulting correlogram
• Project the computed correlogram onto the reference set to infer the direction
• One may infer a superposition of source directions
• Specific methods
• Decompose into spherical harmonics (orthogonality helps)
• Error minimization
• Machine learning

19
So how does the method work? … assessing the effect of reverberation

Aula Carolina
(Aachen)

20
Reverberation broadens the source image

250 Trials - Stereo Loudspeakers @ +/- 30 degrees - Delta = 0 (center pan)

80
Reverb
Anechoic

50
Number of Trials

Note: Random nature of nerve

impulse stream creates a spread

of image width, even in a non-

Reverberant space
0
-30 -20 -10 0 10 20 30
21
Perceived Incident Angle of Sound Source in Degrees
Spatial Blur – experimental measurements
The model reproduces the observed angular acuity.

Spread arises from statistics of neuronal pulses.

22
Blauert, J., “Spatial Hearing: The Psychophysics of Human Sound Localization”, MIT Press 1983.
Spatial acuity with one ear!
If you don’t believe the cross-correlation model look at this!

23
Blauert, J., “Spatial Hearing: The Psychophysics of Human Sound Localization”, MIT Press 1983.
Sl Sr Modeling Stereo Reproduction

Frequency dependence of head

diffraction

𝑅"!" 𝑡, 𝜏 = 𝑅# 𝑡, 𝜏 + 𝑓 $ 𝜔 𝑅# 𝑡, 𝜏

+ 𝑓 𝜔 𝑅% 𝛿 𝜏 + 𝜏& + 𝛿(𝜏 − 𝜏& )

𝜏& = left-right ear delay

𝑅# 𝑡, 𝜏 is the cross-correlation of the Sl and Sr

L R
d
24
2
L Speaker Apparent Intended R Speaker

Stereo Sweet Spot calculation 1.5

• Compute peak of distribution of ITD’s for

1
Dl Dr
a real source at the intended location
• Compute peak of distribution of ITD’s for y
(x0,y0)
the stereo rendered intended source
0.5

• Infer the apparent source direction from

peak of ITD distribution x
0
(0,0)
• This example is for coherent sources – the
formalism also can be used with partially
coherent sources, i.e., real signals in
-0.5

reverberant spaces.

-1

-2 -1.5 -1 -0.5 0 0.5 1 25 1.5

Main Points
• Integrated a quantitative neurological model into a spatial audio analysis tool
• Randomness of auditory nerve firing events is important
• Predicts measured angular acuity
• Two time scales are in play
• Short ( ~ 2 msec) window for cross correlation in brainstem
• Longer ( ~ 100 msec) histogram “memory” (higher level processing)
• We can predict what a listener will tell you they hear
• Location and spread of sound source
• There’s a lot left to do …
• Integrate with room modeling software for a complete analysis package
• Create synthesis tools – find the designs and algorithms that best reproduce a desired spatial
sound effect
• Continue to refine auditory models
• Distance cues
26
END

27
Cochlea
28
Cross-correlation (similarity of two signals)
[x1 x2 x3] [x1 x2 x3] [x1 x2 x3] [x1 x2 x3] [x1 x2 x3]
[y1 y2 y3] [y1 y2 y3] [y1 y2 y3] [y1 y2 y3] [y1 y2 y3]
Lag -2 -1 0 1 2

Delay = 0 Two random sequences Two random sequences Delay = 30 samples

10 10

5 5

0 0

-5 -5

-10
-10
20 40 60 80 100 120 140 160 180 200 20 40 60 80 100 120 140 160 180 200

Cross-correlation
Cross-correlation
1
1

0.8
0.8

0.6
0.6

0.4
0.4

0.2
0.2

0
0

-0.2
-0.2 0 50 100 150 200 250 300 350 400
0 50 100 150 200 250 300 350 400

Signals are correlated but delayed

Uncorrelated signals
Two random sequences
10

-5

-10
20 40 60 80 100 120 140 160 180 200

Cross-correlation
0.2

0.15

0.1

0.05

-0.05

-0.1

-0.15
0 50 100 150 200 250 300 350 400

No dominant peak in cross-correlation

Precedence effect
• Law of the first wave-front …
• Direction is inferred from 1st wave-front (up to about 30-40 msec)
• Haas effect – short delays enhance “spaciousness”

0 – 2 msec delay 0 – 40 msec delay 0 – 200 msec delay

(in 20 steps) (in 20 steps) (in 20 steps)

Explained by saturation and recovery time of hair cell response.

31
Directional impulse responses
Directional Impulse Response

Track both the time of 10

-3

arrival and the direction 2.5

of each room reflection 1.5

0.5

z
-0.5
(Matlab Demo: Imp_Resp_w_Angle_3.m) -1

-1.5

-2

-2.5
2

2 0 -3
1 10
0
-3 -1 -2
10 -2
y
-3 x 32

Evaluating Spatial Sound Systems
No ratings yet
Evaluating Spatial Sound Systems
32 pages
Interactive Audio: Sound, Waves, The Ear 3D Audio
No ratings yet
Interactive Audio: Sound, Waves, The Ear 3D Audio
102 pages
Surround Sound Systems Overview
No ratings yet
Surround Sound Systems Overview
124 pages
Intro To Immersive Audio For VR-2
No ratings yet
Intro To Immersive Audio For VR-2
56 pages
Sound Localization and Auditory Cues
No ratings yet
Sound Localization and Auditory Cues
43 pages
LAB 4 Definitiu
No ratings yet
LAB 4 Definitiu
40 pages
Understanding Sound Localization Techniques
No ratings yet
Understanding Sound Localization Techniques
28 pages
Stewart Spatial Auditory 2010
No ratings yet
Stewart Spatial Auditory 2010
186 pages
Localize (Phase Ambiguity)
No ratings yet
Localize (Phase Ambiguity)
54 pages
Sound Localization Explained
No ratings yet
Sound Localization Explained
17 pages
Spatial Sound: Technologies & Psychoacoustics
No ratings yet
Spatial Sound: Technologies & Psychoacoustics
37 pages
HRTF
No ratings yet
HRTF
5 pages
Exploring Audio and Sensory Displays in VR
No ratings yet
Exploring Audio and Sensory Displays in VR
13 pages
A System For Spatial Hearing Research
No ratings yet
A System For Spatial Hearing Research
8 pages
Binaural Sound Localization Factors
No ratings yet
Binaural Sound Localization Factors
47 pages
Auditory System Biophysics Guide
No ratings yet
Auditory System Biophysics Guide
5 pages
Point-Of-View: Focalised. Focalisation Is The Camera Eye
No ratings yet
Point-Of-View: Focalised. Focalisation Is The Camera Eye
21 pages
03 Binaural
No ratings yet
03 Binaural
19 pages
Sound Localization in Human Hearing
No ratings yet
Sound Localization in Human Hearing
19 pages
Understanding Pitch and Loudness Perception
No ratings yet
Understanding Pitch and Loudness Perception
58 pages
An Introduction To The Psychology of Hearing by Brian Moore 6th Edition PDF
100% (2)
An Introduction To The Psychology of Hearing by Brian Moore 6th Edition PDF
457 pages
Moving Sound Source Synthesis For Binaural Electroacoustic Music Using Interpolated Head-Related Transfer Functions (HRTFS)
100% (1)
Moving Sound Source Synthesis For Binaural Electroacoustic Music Using Interpolated Head-Related Transfer Functions (HRTFS)
24 pages
Sound Localization and Auditory Perception
No ratings yet
Sound Localization and Auditory Perception
10 pages
Risoud - Sound Source Localization
No ratings yet
Risoud - Sound Source Localization
6 pages
Sound Localization in Virtual Auditory Space
No ratings yet
Sound Localization in Virtual Auditory Space
6 pages
Enhancing Two-Channel Stereo Sound
No ratings yet
Enhancing Two-Channel Stereo Sound
7 pages
Eaa Ws Meran Hottopica
No ratings yet
Eaa Ws Meran Hottopica
44 pages
HRTF Review
No ratings yet
HRTF Review
28 pages
Sound Localization and Auditory Analysis
No ratings yet
Sound Localization and Auditory Analysis
58 pages
Understanding Sound Perception and Speech
No ratings yet
Understanding Sound Perception and Speech
47 pages
Spatial Sound Generation and Perception by Amplitude Panning Techniques
No ratings yet
Spatial Sound Generation and Perception by Amplitude Panning Techniques
59 pages
2001 - Spatial Sound Generation and Perception - Ville Pulkki
No ratings yet
2001 - Spatial Sound Generation and Perception - Ville Pulkki
59 pages
Spatial Hearing
No ratings yet
Spatial Hearing
511 pages
Auditory System Function and Measurement
No ratings yet
Auditory System Function and Measurement
51 pages
Sound Localization in Behavioral Neuroscience
No ratings yet
Sound Localization in Behavioral Neuroscience
21 pages
Cues For Loclization
No ratings yet
Cues For Loclization
21 pages
Virtual Acoustic Distance Perception
No ratings yet
Virtual Acoustic Distance Perception
11 pages
Human Sound Localization Mechanisms
No ratings yet
Human Sound Localization Mechanisms
13 pages
Music Signal Processing: Ear & Perception
No ratings yet
Music Signal Processing: Ear & Perception
25 pages
Understanding Spatial Hearing Mechanics
No ratings yet
Understanding Spatial Hearing Mechanics
19 pages
Perceptual Aspects in Spatial Audio Processing
No ratings yet
Perceptual Aspects in Spatial Audio Processing
7 pages
HRTF vs Panning in Binaural Audio Navigation
No ratings yet
HRTF vs Panning in Binaural Audio Navigation
7 pages
David Griesinger
No ratings yet
David Griesinger
88 pages
Hearing, 2nd Edition Complete EPUB Ebook
100% (20)
Hearing, 2nd Edition Complete EPUB Ebook
16 pages
Ch. 9
No ratings yet
Ch. 9
51 pages
Sound Propagation and Spatial Hearing
No ratings yet
Sound Propagation and Spatial Hearing
6 pages
Understanding Auditory Perception Mechanisms
No ratings yet
Understanding Auditory Perception Mechanisms
24 pages
Ambisonics Theory
No ratings yet
Ambisonics Theory
10 pages
Psycho Acoustics
No ratings yet
Psycho Acoustics
64 pages
Human Auditory System Overview
No ratings yet
Human Auditory System Overview
11 pages
Perceived Spaciousness in Audio Systems
No ratings yet
Perceived Spaciousness in Audio Systems
126 pages
Real-Time Multiple Audio Beamforming System: Johan Lindqvist Martin Sollenberg
No ratings yet
Real-Time Multiple Audio Beamforming System: Johan Lindqvist Martin Sollenberg
61 pages
FR Presentation Script
No ratings yet
FR Presentation Script
5 pages
William M. Hartmann - How We Localize Sound
No ratings yet
William M. Hartmann - How We Localize Sound
7 pages
Sound Localization Using Microphone Arrays: Anish Chandak 10/12/2006 COMP 790-072 Presentation
No ratings yet
Sound Localization Using Microphone Arrays: Anish Chandak 10/12/2006 COMP 790-072 Presentation
33 pages
Physics and Biology of Audition
No ratings yet
Physics and Biology of Audition
100 pages
Acoustics and Illumination
100% (1)
Acoustics and Illumination
109 pages
Impact of Learning Styles on Cognition
No ratings yet
Impact of Learning Styles on Cognition
3 pages
Hassan Nawaz - 202321192010
No ratings yet
Hassan Nawaz - 202321192010
15 pages
6G Subliminal Spiritual Implant Removal
No ratings yet
6G Subliminal Spiritual Implant Removal
2 pages
EBSCO-FullText-12 08 2025
No ratings yet
EBSCO-FullText-12 08 2025
6 pages
Navigating Dopamine and Goal Achievement
No ratings yet
Navigating Dopamine and Goal Achievement
2 pages
Understanding Authentic Assessment Methods
No ratings yet
Understanding Authentic Assessment Methods
8 pages
Interpersonal Skills for Success
100% (1)
Interpersonal Skills for Success
47 pages
Mayer 1992
No ratings yet
Mayer 1992
8 pages
Impact of Family Pathology On Behavioura PDF
No ratings yet
Impact of Family Pathology On Behavioura PDF
12 pages
Body Image and Nutrition in Athletes
No ratings yet
Body Image and Nutrition in Athletes
1 page
Soccer Psychology Assessment
No ratings yet
Soccer Psychology Assessment
2 pages
Neuroethics: Anticipating The Future 1st Edition Judy Illes Ebook Sample Available
No ratings yet
Neuroethics: Anticipating The Future 1st Edition Judy Illes Ebook Sample Available
45 pages
Attending Behaviour
No ratings yet
Attending Behaviour
4 pages
Modafinil Borderline
No ratings yet
Modafinil Borderline
8 pages
B.Sc. Nursing 3 Year Students (2021-22) College of Nursing, GMCH, Chandigarh. Psychiatric Nursing
No ratings yet
B.Sc. Nursing 3 Year Students (2021-22) College of Nursing, GMCH, Chandigarh. Psychiatric Nursing
2 pages
Effective Candidate Selection Strategies
No ratings yet
Effective Candidate Selection Strategies
28 pages
Cognitive Development Study Guide Piaget Vygotsky
No ratings yet
Cognitive Development Study Guide Piaget Vygotsky
3 pages
Anglais AI
No ratings yet
Anglais AI
2 pages
Removal Exam
No ratings yet
Removal Exam
11 pages
Evolution of The Self: Anger Always Makes Sense
No ratings yet
Evolution of The Self: Anger Always Makes Sense
4 pages
BBC 6 Minute English-The Male Brain, The Female Brain
No ratings yet
BBC 6 Minute English-The Male Brain, The Female Brain
3 pages
Understanding Diagnosis in Mental Health
No ratings yet
Understanding Diagnosis in Mental Health
59 pages
Play With The Mind
No ratings yet
Play With The Mind
2 pages
Miss Sanober
No ratings yet
Miss Sanober
10 pages
Chapter 5
No ratings yet
Chapter 5
3 pages
PR 1
No ratings yet
PR 1
2 pages
Learning Principles and Applications - 8th Edition Secure Download
100% (13)
Learning Principles and Applications - 8th Edition Secure Download
17 pages
Ide Konten Psikologi 30 Hari
No ratings yet
Ide Konten Psikologi 30 Hari
3 pages
See112 - Remedial Instruction in Listening
No ratings yet
See112 - Remedial Instruction in Listening
15 pages
Interdisciplinary Consumer Behavior Insights
No ratings yet
Interdisciplinary Consumer Behavior Insights
20 pages

Evaluating Spatial Sound Systems

Uploaded by

Evaluating Spatial Sound Systems

Uploaded by

Evaluating Spatial Sound

Audio & Music Engineering

reproduction method in speaker placement sources to be created

any space … Compute signals driving 3

hearing into audio system design tools 4

correspond to what listeners report they hear

• Cross-correlation model of directional hearing

Around 3000 inner hair cells

irregular and 0.8

firing rate 0.4

DELAY LINE FROM LEFT EAR

0 50 100 Perceived ITD (direction to source) is

Note: Factor of 3/2 is due to diffraction around listeners head

3 Sec white noise bursts

• How quickly can a listener follow time- 0

• Evidence for a 200 - 300 msec threshold

• Distribution of 2 msec window ITD’s has a

10 Your brain averages over a hundred or more 2 msec windows

250 Trials - Stereo Loudspeakers @ +/- 30 degrees - Delta = 0 (center pan)

Note: Random nature of nerve

impulse stream creates a spread

of image width, even in a non-

Spread arises from statistics of neuronal pulses.

Frequency dependence of head

+ 𝑓 𝜔 𝑅% 𝛿 𝜏 + 𝜏& + 𝛿(𝜏 − 𝜏& )

𝜏& = left-right ear delay

𝑅# 𝑡, 𝜏 is the cross-correlation of the Sl and Sr

Stereo Sweet Spot calculation 1.5

• Compute peak of distribution of ITD’s for

• Infer the apparent source direction from

-2 -1.5 -1 -0.5 0 0.5 1 25 1.5

Delay = 0 Two random sequences Two random sequences Delay = 30 samples

Signals are correlated but delayed

No dominant peak in cross-correlation

0 – 2 msec delay 0 – 40 msec delay 0 – 200 msec delay

Explained by saturation and recovery time of hair cell response.

Track both the time of 10

arrival and the direction 2.5

of each room reflection 1.5

You might also like