Geometry of Musical Chords Explained
Geometry of Musical Chords Explained
# n
b
&
b
# n
#
b
#
b
#
b
b b
C F C G C D
7
G
7
C
7
F
7
Fs
7
F
7
As
7
Bf
7
{B, C, Df}
A B C D
Fig. 1. Efficient voice leading between transpositionally and inversionally related chords.
These progressions exploit three near-symmetries: transposition (A-B), inversion (C), and
permutation (D). Sources: classical music (A), jazz (B), Wagners Parsifal (C), Debussys
Faun (C), and contemporary atonality (D) (Soundfile S1).
9
01
00
02
03
04
05
06
70
80
90
t0
e0
[00]
11
12
13
14
15
16
17
81
91
t1
[e1]
e1
e2
e3
e4
e5
6e
7e
8e
9e
te
ee
22
23
24
25
26
27
28
92
[t2]
t2
t3
t4
5t
6t
7t
8t
9t
tt
33
34
35
36
37
38
[39] 93
49
59
69
79
89
99
44
45
46
47
[48]
48
58
68
78
88
55
56
[57]
57
67
77
[66]
66
unison
transposition
tritone
unison
minor second
minor second
major second
major second
minor third
minor third
major third
major third
perfect fourth
perfect fourth
Fig. 2. The orbifold T
2
/S
2
. C = 0, Cs = 1, etc., with Bf = t, and B = e. The left edge is
identified with the right. The voice leadings (C, Df)(Df, C) and (C, G)(Cs, Fs) are
shown; the first reflects off the singular boundary.
Table 1. Common sonorities in Western tonal music. The middle column lists the best
equal-tempered approximation to the first n pitch classes of the harmonic series. The right
column lists other good approximations. All divide the octave nearly evenly.
2 notes CG CFs
3 notes CEG CEfGf
CEfG
CEGs
4 notes CEGBf CEfGfA
CEfGfBf
CEfGBf
CEGB
5 notes CDEGBf CDEGA
CDEGB
6 notes CDEFsGBf CDEfFGBf
CDEFsGsBf
7 notes CDEFsGABf CDEFGABf
CDEfFsGABf
SUPPORTING ONLINE MATERIAL
TABLE OF CONTENTS
1. Overview of the argument 1
2. Pitch and pitch class 1
3. Comparing voice leadings 2
4. Minimal voice leadings and voice crossings 4
5. Derivation of the voice-leading orbifolds 7
6. Efficient voice leading and symmetry 8
7. Maximal evenness and minimal voice leading 11
8. A polynomial-time algorithm for finding a minimal 13
voice leading between arbitrary chords
9. Supplementary Figures and Tables 15
10. References 28
1. Overview of the argument. Our goal is to model composers judgments about the
relative sizes of voice leadings. One cannot assume that these judgments will be
consistent with any mathematical norm or metric (3). However, it is reasonable to
stipulate that a theoretical model should reflect widely recognized features of Western
music. Since musicians treat transposition and inversion as preserving musical distance, I
will require that voice-leading comparisons be invariant under transposition and inversion
of any of their individual musical voices (2). Since Western pedagogues instruct
composers to minimize voice-leading size while eschewing voice crossings (13), I
will require that crossed voice leadings be no smaller than their natural, uncrossed
alternatives (34). These minimal requirements suffice to establish a number of
interesting results about voice leading (68).
Sections 2 and 3 articulate mathematical constraints on methods of comparing
voice leadings. Section 4 shows that these constraints formalize my two musical
requirements. Section 5 derives the voice-leading orbifolds. Sections 6 and 7 relate the
structure of a chord to its voice-leading possibilities. Section 8 introduces a further
constraint on methods of comparing voice leadings, and uses it to derive a polynomial-
time algorithm for finding a minimal voice leading (not necessarily bijective) between
arbitrary chords.
2. Pitch and pitch class. Pitches are modeled as real numbers. The distance between two
pitches p, q is the absolute value of their difference, |q p|. Pitch classes are modeled as
points in the quotient space R/12Z. They are sets of real numbers {p + 12k | k Z}, with
p representing some pitch in the pitch class. We can label these sets using real numbers
in the range 0 x < 12. The elements of R/12Z form a group under addition of their
labels modulo 12Z. The distance between pitch classes a and b, written ||b a||
12Z
, is the
2
smallest nonnegative real number x such that, if p is a pitch belonging to pitch class a,
then either p + x or p x belongs to pitch class b.
Let (p
1
, p
2
, ..., p
n
)(q
1
, q
2
, ..., q
n
) be a voice leading between multisets of pitches,
and let (a
1
, a
2
, ..., a
n
)(b
1
, b
2
, ..., b
n
) be a voice leading between multisets of pitch
classes. I will say that the two voice leadings are associated if the pitches p
i
and q
i
all
belong to the pitch classes a
i
and b
i
, respectively. The displacement multiset of the voice
leading (p
1
, p
2
, ..., p
n
)(q
1
, q
2
, ..., q
n
) is the multiset of distances {|q
i
p
i
|}. Similarly, the
displacement multiset of the voice leading (a
1
, a
2
, ..., a
n
)(b
1
, b
2
, ..., b
n
) is the multiset of
distances {||b
i
a
i
||
12Z
}. I will require that a method of comparing voice leadings be
invariant under transposition and inversion of its individual voices: in any voice leading,
one can replace the voice (p, q) with (q + x, p + x) without changing the voice leadings
size. It follows that the size of a voice leading depends only on its displacement multiset.
I will also require that the size of a displacement multiset be nondecreasing in each of its
members: we can replace any member of a displacement multiset with a smaller number
without increasing the size of the resulting voice leading (3). This ensures that a voice
leading between multisets of pitch classes will be the same size as the smallest associated
voice leading between multisets of pitches.
3. Comparing voice leadings. A method of comparing voice leadings is a relation
over multisets of nonnegative real numbers that is reflexive, transitive, and total (a total
preorder; see Table S1 for more information). The relation must satisfy what I call the
distribution constraint:
{x
1
+ c, x
2
, ..., x
n
} {x
1
, x
2
+ c, ..., x
n
} {x
1
, x
2
, ..., x
n
}, for x
1
> x
2
, c > 0
(NB: since multisets are unordered, the numerical subscripts do not have ordinal
significance: x
1
is no more first than x
2
or x
n
.) The first inequality requires that the total
preorder not consider an uneven distribution of values to be smaller than a more even
distribution with the same total sum: if X is an n-member displacement multiset whose
members sum to x, then {x, 0, ..., 0} X {x/n, x/n, ..., x/n}. Thus, x semitones of
motion in a single voice yields at least as large a voice leading as x semitones of motion
distributed over multiple voices. This requirement is a weakened relative of the triangle
inequality (4). The distribution constraints second inequality requires that the size of a
multiset be nondecreasing in each of its members: increasing the size of any number in a
multiset never decreases that multisets size. If a total preorder satisfies both inequalities
strictly, I will say that it strictly satisfies the distribution constraint.
Every music-theoretical method of comparing voice leadings satisfies the
distribution constraint.
3
A. Smoothness. The size of a voice leading is the sum of the objects in the
displacement multiset (57). Thus {2, 2} > {3.999} > {1, 1, 1}.
Smoothness is sometimes called the taxicab norm. It reflects aggregate
physical distance on keyboard instruments. Smoothness satisfies the
distribution constraint non-strictly.
B. L
p
vector norms. Smoothness is analogous to the L
1
vector norm,
though the components of vectors are ordered whereas the members of
displacement multisets are not. The analogues to the L
p
vector norms
strictly satisfy the distribution constraint for finite p > 1. The Euclidean
vector norm L
2
has been used by Callender (8).
C. Semitonal and stepwise voice leadings. According to the L
vector
norm, the size of a displacement multiset is its largest member. The
musical terms semitonal voice leading and stepwise voice leading
refer to this measure of voice-leading size. Semitonal voice leadings have
an L
1
(i)
||
12Z
}. The size
of this voice leading depends on the difference between the effects of the function F and
the permutation
-1
. The voice leading will be small when F(a
i
) a
1
(i)
, for all i, and
will be trivial with strict equality.
Thus, the internal structure of any chord A, as represented by the size of the voice
leadings (a
1
, a
2
, ..., a
n
)(a
(1)
, a
(2)
, ..., a
(n)
), for all permutations , determines the
9
functions F such that A and F(A) can be linked by efficient voice leading. For example,
when the notes {a
1
, a
2
, ..., a
n
} are all close together, the voice leadings (a
1
, a
2
, ..., a
n
)
(a
(1)
, a
(2)
, ..., a
(n)
) will resemble the trivial voice leading. In this case, A and F(A) can be
linked by efficient voice leading only when F resembles the identity. By contrast, when
the notes {a
1
, a
2
, ..., a
n
} divide the octave nearly evenly, then there will be a voice leading
(a
1
, a
2
, ..., a
n
)(a
(1)
, a
(2)
, ..., a
(n)
) that resembles transposition by 12/n semitones. In
this case, A and T
12/n
(A) can be linked by efficient voice leading.
Observation B. Let G be an isometry of pitch class space, and let S
G
be invariant
under G. The size of any voice leading from A to S
G
sets an upper bound on the size of
the minimal voice leading(s) from A to G(A).
Suppose that G is an isometry. It follows that for any voice leading from A to S
G
,
with S
G
invariant under G, we can find an equally large voice leading from S
G
to G(A). If
a voice leading from A to S
G
has displacement multiset {d
1
, d
2
, ..., d
n
}, then there is some
permutation such that the minimal voice leading from A to G(A) has displacement
multiset less than or equal to {d
1
+ d
(1)
, d
2
+ d
(2)
, ..., d
n
+ d
(n)
}. The distribution
constraint tells us that no multiset {d
1
+ d
(1)
, d
2
+ d
(2)
, ..., d
n
+ d
(n)
} can be larger than
{2d
1
, 2d
2
, ..., 2d
n
}. The size of any voice leading from A to S
G
thus sets an upper bound
on the size of the minimal voice leading(s) from A to G(A). Note that the same reasoning
can be used when we require that a bijective voice leading from A to A involve the
permutation : the size of a voice leading from A to S
, with S
||a
(i)
+ x a
i
||
12Z
. Since ||a||
12Z
+ ||b||
12Z
||a + b||
12Z
, this sum is
greater than or equal to ||nx + (a
(i)
a
i
)||
12Z
= ||nx||
12Z
. By construction, ||nx||
12Z
= ||nc||
12Z
,
since x c modulo (12/n)Z, and ||nc||
12Z
= n||c||
12Z
, since we can always choose c such that
12
||c||
12Z
6/n. Thus the voice leading can be no smaller than the minimal voice leading
between E and T
x
(E).
I now prove a corollary that applies to collections of k equally spaced pitch
classes. Musically, these pitch classes represent an equal-tempered chromatic scale.
Such discrete musical universes will not always contain chords that divide the octave into
n perfectly even pieces. For example, no five-note chord in twelve-tone equal
temperament divides the octave perfectly evenly. But as Douthett and Clough have
shown (13), equal-tempered scales will always contain a unique collection of maximally
even chords dividing the octave as evenly as possible (14). These maximally even
chords are the discrete analogues to the perfectly even chords we have been considering.
In what follows, it will be convenient to relabel the pitch classes so that the
equally-spaced points in our chromatic scale have integer co-ordinates. We will therefore
temporarily abandon R/12Z in favor of R/kZ, where k is an integer.
COROLLARY. Let A and M be integer-valued n-note submultisets of
R/kZ, and let M be maximally even. Then, for any integer x, the minimal
bijective voice leading between A and T
x
(A) can be no smaller than the
minimal bijective voice leading between M and T
x
(M).
The proof is very similar to the proof of Theorem 2. Since M is maximally even
(14), we can find a (not necessarily integer-valued!) chord {e
1
, e
2
, ..., e
n
} that divides the
octave into n precisely even parts, such that the crossing-free voice-leadings between M
and T
x
(M) can be written
(
e
1
,
e
2
, ...,
e
n
)(
e
1
+ c
e
2
+ c
, ...
e
n
+ c
||
kZ
, ||
||
kZ
}, and summing to n||c||
kZ
(15). By the
distribution constraint, this multiset is as small as any integer-valued n-note multiset with
the same or greater sum. As we saw in Theorem 2, the minimal bijective voice leading
between A and T
x
(A) will have a displacement multiset summing to at least n||c||
kZ
. Hence
it can be no smaller than the minimal bijective voice leading between M and T
x
(M).
The preceding corollary can also be applied to scales that do not divide pitch class
space evenly. To see why, note that we can always devise a metric on pitch class space
such that a given scale evenly divides the octave. For example, given the C major scale
(C, D, E, F, G, A, B) we can define a metric according to which the distances CD, DE,
EF, FG, GA, AB, and BC all equal one [Fig. S11(a)]. Musicians use the term
scale step to refer to this scale-dependent unit of distance. Two subsets of the scale are
related by scalar transposition if they are related by rotation relative to this new metric
13
[Fig. S11(b)]. Since our scale evenly divides the octave according to the new metric, the
preceding corollary applies.
Relative to this metric, the familiar diatonic tertian triads {C, E, G}, {D, F, A},
{E, G, B}, {F, A, C}, {G, B, D}, {A, C, E}, and {B, D, F} are maximally even and
related by transposition [Fig. S11(b)]. It follows from our corollary that the minimal
bijective voice leadings between these tertian triads will be as small as possible: a
bijective voice leading between any pair of transpositionally related three-note diatonic
subsets can be no smaller than the minimal bijective voice leading between diatonic
tertian triads related by that transposition. Analogous facts hold for the tertian seventh
chord {C, E, G, B} and its transpositions, which are also maximally even. Our corollary
therefore generalizes a result of Agmon (16), who first noted the special voice-leading
properties of diatonic tertian triads and seventh chords.
8. A polynomial-time algorithm for finding a minimal voice leading between two
chords. Given two chords A and B, how do we find a minimal voice leading between
them? The question is nontrivial, since a minimal voice leading need not be bijective.
For example, using any of the standard methods of comparing voice leadings, the
voiceleading (0, 0, 4, 6)(10, 0, 6, 6) is smaller than any of the bijective voice leadings
between {0, 4, 6} and {6, 10, 0} (17). Adding additional voices therefore allows us to
decrease the size of the voice leading. The large number of non-bijective voice leadings
between any two chordsroughly 2
mn
, where m and n are their cardinalitiesmeans that
an exhaustive search may be impractical, particularly in time-critical applications such as
interactive computer music.
Suppose, however, that our method of comparing voice leadings is a total
preorder satisfying both the distribution constraint and what I will call the recursion
constraint:
{x
1
, x
2
, ..., x
m
} {y
1
, y
2
, ..., y
n
} implies {x
1
, x
2
, ..., x
m
, c} {y
1
, y
2
, ..., y
n
, c}
The recursion constraint mandates a straightforward relationship between the size of a
multiset and the size of its sub-multisets. Every music-theoretical method of comparing
voice leadings satisfies this constraint. When a total preorder satisfies both the
distribution and recursion constraints, we can use the technique of dynamic
programming, common in computer science, to determine a minimal voice leading
between arbitrary chords in polynomial time (order n
2
m).
Define the ascending distance from pitch class a to pitch class b as the smallest
nonnegative real number x such that, if p is a pitch belonging to pitch class a, then p + x
belongs to pitch class b. Let(a
1
, a
2
, ..., a
m
, a
m+1
= a
1
) order the notes of chord A based on
increasing ascending distance from arbitrarily chosen a
1
. (Note that I repeat the first
element a
1
as the last element of the list.) Similarly for (b
1
, b
2
, ..., b
n
, b
n+1
= b
1
). The
14
notation [a
1
, ..., a
i
][b
1
, ... b
j
] will refer to all voice leadings from {a
1
, a
2
, ..., a
i
} to
{b
1
, b
2
, ... b
j
} that can be written with both chords subscripts in nondecreasing order.
Thus [a
1
, a
2
][b
1
, b
2
, b
3
] refers to (a
1
, a
1
, a
2
)(b
1
, b
2
, b
3
), (a
1
, a
1
, a
2
, a
2
)(b
1
, b
2
, b
3
, b
3
),
and so on.
If a crossing-free voice leading contains the pair (a
i
, b
j
), with i, j > 1, then it must
contain at least one of the following: (a
i-1
, b
j
), (a
i
, b
j-1
), or (a
i-1
, b
j-1
). By the recursion
constraint, the smallest voice leading of the form [a
1
, ..., a
i
][b
1
, ..., b
j
] will be the voice
leading that adds the pair (a
i
, b
j
) to the smallest voice leading of the form [a
1
, ..., a
i-1
]
[b
1
, ..., b
j
], [a
1
, ..., a
i
][b
1
, ..., b
j-1
], or [a
1
, ..., a
i-1
][b
1
, ..., b
j-1
]. Thus, once we have fixed
the pair (a
1
, b
1
) we can recursively compute the smallest voice leading between A and B
containing that pair. We do this by creating a matrix whose entries e
i, j
record the size of
the minimal voice leading of the form [a
1
, ..., a
i
][b
1
, ..., b
j
]. It is trivial to fill in the first
row and column of the matrix. For any other entry e
i, j
, we simply add the voice (a
i
, b
j
) to
the smallest of the voice leadings in the entrys upper, left, and upper-left neighbors.
Figure S12 illustrates the technique, identifying the smallest voice leading
between the C and E major-seventh chords, {4, 7, 11, 0} and {4, 8, 11, 3}, such that the
voice leading contains the pair (4, 4). In constructing this matrix I have used the L
1
taxicab norm to measure voice-leading size. The voice leading in the bottom-right
entry, (4, 4, 7, 11, 0)(3, 4, 8, 11, 11), is one of the minimal voice leadings between the
two chords that contains (4, 4). To remove this last restriction, we would need to repeat
the calculation three more times, each time cyclically permuting the order of one of the
chords so as to fix a different initial pair. As it happens, however, the voice leading
shown in Figure S12 is minimal. This follows from the fact that the mapping in the top-
left position, (4, 4), contributes nothing to the overall size of the voice leading; we can
therefore add it to any voice leading without increasing its L
1
size.
Figure S12 includes in each entry both the numerical size of the voice leading and
the voice leading itself. With the L
1
norm this is unnecessary: we need to keep track of
the size, but not the voice leading. To determine the value of entry e
i, j
we can simply add
the distance between the pair (a
i
, b
j
) to the minimum value in the entries e
i-1, j
,
e
i, j-1
, and e
i-1, j-1
. (For the other L
p
norms we can calculate the p
th
power of the voice-
leading size in this way, taking the p
th
root before output.) Having filled in the matrix, we
can recover a minimal voice leading between the two chords by tracing back a path that
moves from the bottom-right entry to the top left, moving only up, left, and diagonally
up-and-left, such that the size of the voice leading decreases as much as possible with
each step. The entries in boldface indicate the path such a traceback algorithm would
take. Due to the circular structure of pitch-class space, the voice leading in the lower
right-hand corner of the matrix counts the pair (a
1
, b
1
) = (a
m+1
, b
n+1
) twice; this can easily
be corrected prior to output. The resulting algorithm is easy to implement and suitable
for time-critical applications such as interactive computer music.
15
G
D
A
E
B
/
C
f
F s / G f C
s
/
D
f
A
f
E
f
B
f
F
.
.
.
.
.
.
.
.
.
.
.
.
C
{5
6
}
{
t
e
}
{
3
4
}
{
8
9
}
{
1
2
}
{
6
7
}
{
e
0
}{
4
5
}
{
2
3
}
{
7
8
}
{
0
1
}
{
9
t
}
{024579e}
{
0
2
4
6
7
9
e
}
{
1
2
4
6
7
9
e
}
{
1
2
4
6
8
9
e
}
{
1
3
4
6
8
9
e
}
{
1
3
4
6
8
t
e
}
{ 1 3 5 6 8 t e }
{
1
3
5
6
8
t
0
}
{
1
3
5
7
8
t
0
}
{
2
3
5
7
8
t
0
}
{
2
3
5
7
9
t
0
}
{
2
4
5
7
9
t
0
}
Figure S1. The circle of fifths depicts minimal voice leadings between diatonic collections
(major scales). Each diatonic collection can be transformed into its neighbors by moving
one pitch class by one semitone. For example, the C major scale can be transformed into the
G major scale by moving the pitch class 5 (F) to 6 (Fs). Here and elsewhere, the letters t
and e refer to 10 (Bf) and 11 (B), respectively.
16
C c
f F
a A
[e] [E]
e E
[g] [G]
d D
fs
gs
Fs
[as]
[cs]
cs
[Cs]
Af
Ef
bf
g
G
B
ds
b
Bf
[D]
[Fs]
[Bf]
[Bf]
[Bf]
Df
Figure S2. The Tonnetz. Nineteenth-century theorists such as Hostinsky, Oettingen, and
Riemann explored a graph that is the geometric dual of the one shown here. The graph
displays efficient voice leadings among the 24 familiar major and minor triads. Uppercase
and lowercase letters indicate major and minor triads, respectively. Triads connected by
horizontal lines share both root and fifth, and can be linked by voice leading in which one
note moves by one semitone. (For example, the C major triad can be transformed into a C
minor triad by changing E to Ef.) Triads connected by a NE/SW diagonal also share two
notes and can be linked by single-semitone voice leading. (For example, the C major triad
can be transformed into an E minor triad by changing C to B.) Triads connected by a
NW/SE diagonal share two notes and can be linked by voice leading in which one note
moves by two semitones. (For example, the C major triad can be transformed into an A
minor triad by changing G to A.)
1
2
3
4
5
6
7
8
9
1
0
1
1
0
0.17
2.5
C C
s
/
D
f
D
D
s
/
E
f
E
F
F s / G f G
G
s
/
A
f
A
A
s
/
B
f
B
Figure S3. The quotient space R/12Z is a circle whose circumference is twelve units long.
The twelve familiar pitch classes of Western equal-temperament evenly divide this circle.
Since the circle is continuous, it contains a point for every conceivable pitch class. The
figure shows the locations of the pitch class 0.17, which is seventeen cents (hundredths of a
semitone) above pitch class C, and pitch class 2.5 (D quarter tone sharp), which is halfway
between D and Ef.
17
(C4, C4) (D4, D4) (E4, E4) (F4, F4) (Fs4, Fs4) (Cs4, Cs4) (Ef4, Ef4)
(D5, D4) (E5, E4) (F5, F4) (Fs5 Fs4) (Cs5, Cs4) (Ef5, Ef4)
(D4, D5) (E4, E5) (F4, F5) (Fs4, Fs5) (Cs4, Cs5) (Ef4, Ef5) (Fs3, Fs4) (Gs3 Gs4) (Bf3, Bf4) (B3, B4) (C4, C5) (G3, G4) (A3, A4)
(Gs4 Gs4) (Bf4, Bf4) (B4, B4) (C5, C5) (G4, G4) (A4, A4)
(Fs4, Fs3) (Gs4 Gs3) (Bf4, Bf3) (B4, B3) (C5, C4) (G4, G3) (A4, A3)
(C4, Cs4)
(B3, Cs4)
(D4, Ef4) (E4, F4) (F4, Fs4) (Cs4, D4) (Ef4, E4)
(C5, Cs4) (D5, Ef4) (E5, F4) (F5, Fs4) (Cs5, D4) (Ef5, E4)
(Cs4, C5) (Ef4, D5) (F4, E5) (Fs4, F5) (D4, Cs5) (E4, Ef5)
(Cs4, C4) (Ef4, D4) (F4, E4) (Fs4, F4) (D4, Cs4) (E4, Ef4)
(C4, D4) (D4, E4) (E4, Fs4) (F4, G4) (Cs4, Ef4) (Ef4, F4)
(B4, Cs4) (C5, D4) (D5, E4) (E5, Fs4) (Cs5, Ef4) (Ef5, F4)
(D4, C5) (E4, D5) (Fs4, E5) (G4, F5) (Ef4, Cs5) (F4, Ef5)
(Fs4, Gs4) (G4, A4) (A4, B4) (B4, Cs5) (Gs4, As4) (Bf4, C5)
(F4, G3) (Fs4, Gs3) (G4, A3) (A4, B3) (B4, Cs4) (Gs4, As3) (Bf4, C4)
(B3, D4) (C4, Ef4) (D4, F4) (E4, G4) (Cs4, E4) (Ef4, Gf4)
(B4, D4) (C5, Ef4) (D5, F4) (E5, G4) (Cs5, E4) (Ef5, Gf4)
(D4, B4) (Ef4, C5) (F4, D5) (G4, E5) (E4, Cs5) (Gf4, Ef5)
(D4, B3) (Ef4, C4) (F4, D4) (G4, E4) (E4, Cs4) (Gf4, Ef4)
(F4, Af4) (Fs4, A4) (Gs4, B4) (As4, Cs5) (G4, Bf4) (A4, C5)
(F4, Af3) (Fs4, A3) (Gs4, B3) (As4, Cs4) (G4, Bf3) (A4, C4)
(Bf3, D4) (B3, Ds4) (Df4, F4) (Ef4, G4) (E4, Gs4) (C4, E4) (D4, Fs4)
(B4, Ds4) (Df5, F4) (Ef5, G4) (E5, Gs4) (C5, E4) (D5, Fs4)
(Ds4, B4) (F4, Df5) (G4, Ef5) (Gs4, E5) (E4, C5) (Fs4, D5)
(Fs4, As4) (Af4, C5) (Bf4, D5) (A4, Cs5) (F4, A4) (G4, B4)
(E4, Gs3) (Fs4, As3) (Af4, C4) (Bf4, D4) (A4, Cs4) (F4, A3) (G4, B3)
(Bf3, E4) (A3, Ds4) (B3, F4) (Df4, G4) (Ef4, A4) (C4, Fs4) (D4, Gs4)
(Bf4, E4) (B4, F4) (Df5, G4) (Ef5 A4) (C5, Fs4) (D5, Gs4)
(F4, B4) (E4, Bf4) (Fs4, C5) (Gs4, D5) (G4, Cs5) (A4, Ds5)
(Ef4, A3) (F4, B3) (E4, Bf3) (Fs4, C4) (Gs4, D4) (G4, Cs4) (A4, Ds4)
(Bf3, Gf4) (A3, F4) (B3, G4) (Cs4, A4) (Gs3, E4) (C4, Af4)
(Bf3, G4) (A3, Fs4) (B3, Gs4) (Cs4, As4) (Af3, F4) (C4, A4)
(Bf4, G4) (A4, Fs4) (B4, Gs4) (Cs5, As4) (Af4, F4) (C5, A4)
(D4, Bf4)
(Bf4, Gf4) (A4, F4) (B4, G4) (Cs5, A4) (C5, Af4) (D5, Bf4) (Ds4, B3) (F4, Df4) (G4, Ef4) (Gs4, E4) (E4, C4) (Fs4, D4) (D4, Bf3)
(Bf3, Af4) (A3, G4) (B3, A4) (Cs4, B4)
(D4, C4) (E4, D4) (Fs4, E4) (G4, F4) (Ef4, Cs4) (F4, Ef4) (Cs4, B3)
(Af3, Gf4) (C4, Bf4)
(Bf4, Af4) (A4, G4) (B4, A4) (Cs5, B4) (Af4, Gf4) (C5, Bf4)
(G3, F4)
(Bf3, A4) (A3, Gs4) (B3, As4) (Af3, G4) (C4, B4) (G3, Fs4)
(Bf4, A4) (A4, Gs4) (B4, As4) (Af4, G4) (C5, B4) (G4, Fs4)
(Bf4, B4) (A4, Bf4) (B4, C5) (Af4, A4) (G4, Af4) (Fs4, G4)
(Bf4, B3) (A4, Bf3) (B4, C4) (Af4, A3) (G4, Af3) (Fs4, G3)
(Bf3, F4) (A3, E4) (B3, Fs4) (Df4, Af4) (C4, G4) (D4, A4)
(Bf4, F4) (A4, E4) (B4, Fs4) (Df5, Af4) (C5, G4) (D5, A4)
(Bf3, Ef4) (B3, E4) (Df4, Gf4) (Ef4, Af4) (C4, F4) (D4, G4)
(Bf4, Ef4) (B4, E4) (Df5, Gf4) (Ef5, Af4) (C5, F4) (D5, G4)
(Ef4, Bf4) (E4, B4) (Gf4, Df5) (Af4, Ef5) (F4, C5) (G4, D5)
(Ef4, Bf3) (E4, B3) (Gf4, Df4) (Af4, Ef4) (F4, C4) (G4, D4)
(Fs4, B4) (G4, C5) (A4, D5) (Gs4, Cs5) (E4, A4) (F4, Bf4)
(Fs4, B3) (G4, C4) (A4, D4) (Gs4, Cs4) (E4, A3) (F4, Bf3)
P
P
P
P
(a)
(b)
Figure S4. (a) A portion of the infinite plane representing ordered two-note chords of
pitches. The four quadrants are equivalent to within octave displacement and permutation.
Each is equivalent to Figure 2 in the main paper. (b) An abstract representation of the
symmetries relating the four quadrants. The lower-left quadrant is related to the upper-left
by a reflection that preserves their common border. This action permutes each dyad.
Translation of any quadrant diagonally up and right transposes the first element of each
dyad by an ascending octave. Translation of any quadrant diagonally down and right
transposes the second element of each dyad by an ascending octave. These operations
suffice to generate the infinite, periodic figure. The space can be described, metaphorically,
as wallpaper; Figure 2 in the main paper provides the pattern, Figure S4(b) shows how the
pattern is to be assembled, and Figure S4(a) shows a portion of the result. The diamond in
the center of Fig. S4(a) corresponds to the 2-torus shown in Figure S9.
18
Figure S5. The orbifold T
3
/S
3
is a prism whose two triangular faces are identified by way
of a 120 rotation. Several familiar equal-tempered chords are depicted on the figure.
Augmented triads, which divide the octave into three equal parts, are shown as dark cubes.
Minor chords are light spheres and major chords are dark spheres. Lines connecting
augmented, minor, and major chords indicate that they can be linked by voice leading in
which a single voice moves by a single semitone. Since minor and major chords divide the
octave nearly evenly, they are clustered near the center of the orbifold. Triple unisons,
which contain only one pitch class, are found on the edge of the figure.
19
(a) (b) (c)
CCC
CCC
EEE
EEE
GsGsGs
CEGs
CEGs
GsGsGs
transposition
CC
CFs CFs
CC
FsFs
FsFs
CCCC
AAAA
EfEfEfEf
FsFsFsFs
EfEfEfEf
FsFsFsFs
AAAA
CCCC
t
r
a
n
s
p
o
s
i
t
i
o
n
t
r
a
n
s
p
o
s
i
t
i
o
n
Figure S6. (a) The orbifold T
2
/S
2
is a two-dimensional prism (parallelogram) whose base (a
line segment) is glued to the opposite face. Before gluing, the base must be twisted so that
chords on the left edge match those on the right. This twist is a reflection that can be
represented as a rotation in three Euclidean dimensions. The line at the center of the figure
contains chords that divide the octave evenly. (b) The orbifold T
3
/S
3
is a three-dimensional
prism whose two triangular faces are glued together. Before gluing, rotate one face by 120,
so that the chords match. The result is the bounded interior of a twisted triangular 2-torus.
Augmented triads, which divide the octave into three equal parts, lie on the line at the center
of the figure. Major and minor chords are close to this line, as shown in Figure S5.
Rotating the prism around the central line by 120 transposes every chord by major third.
(c) The orbifold T
4
/S
4
is a four-dimensional prism whose two tetrahedral faces are glued
together. The dashed lines extend into the fourth dimension. Before identifying the two
faces, twist one so that the chords match. The twist is a reflection, as in the two-
dimensional case. Diminished seventh chords, which divide the octave into four equal
pieces, lie at the center of the orbifold. Familiar four-note tonal chords lie close to this
chord.
20
(a) (b)
p
1
p
2
r r
p
1
< p
2
, q
1
> q
2
q
1
q
2
p
1
p
2
q
1
q
2
Figure S7. (a) | r p
2
| / | r p
1
| = | q
2
r | / | q
1
r |. (b) Any line segment that crosses (p
1
, q
2
)
crosses either (p
1
, q
1
) or (p
2
, q
2
).
(a) (b)
p
p
+ x
1
x
2
{x
1
+ c, x
2
} {x
1
+ c, x
2
, x
3
}
{x
1
, x
2
, x
3
} {x
1
, x
2
+ c}
p
+ x
1
+ c
p
+ x
1
p
p
+ x
1
+ x
2
p
+ x
1
+ x
3
+ c
p
+ x
1
+ c
p
+ x
1
p
p
+ x
1
+ x
2
p
+ x
1
+ x
3
+ c
p
+ x
1
+ c
p
+ x
1
p
p
+ x
1
x
2
x
1
> x
2
, c > 0 x
3
x
2
> 0, c > 0
p
+ x
1
+ c
p
+ x
1
Figure S8. For any violation of the distribution constraint, we can find a crossed voice
leading that is smaller than its natural uncrossed alternative. The crossed voice leading at
the bottom of each column is smaller than the uncrossed voice leading above it.
21
(a) (b)
10 00 20 30 40 50 60 70 80 90 t0 e0 [00]
11 01 21 31 41 51 61 71 81 91 t1 e1 [01]
22 12 02
23 13 03
32 42 52 62 72 82 92 t2 e2 [02]
33
24 14 04 34
43 53 63 73 83 93 t3 e3 [03]
44
25 15 05 35 45
54 64 74 84 94 t4 e4 [04]
55
26 16 06 36 46 56
65 75 85 95 t5 e5 [05]
66
27 17 07 37 47 57 67
76 86 96 t6 e6 [06]
77
28 18 08 38 48 58 68 78
87 97 t7 e7 [07]
88
29 19 09 39 49 59 69 79 89
98 t8 e8 [08]
99
2t 1t 0t 3t 4t 5t 6t 7t 8t 9t
t9 e9 [09]
tt
2e 1e 0e 3e 4e 5e 6e 7e 8e 9e te
et [0t]
ee
[20] [10] [00] [30] [40] [50] [60] [70] [80] [90] [t0] [e0]
[0e]
[00]
B
A
01 00 02 03 04 05 06 07 08 09 0t 0e [00]
11 12 13 14 15 16 17 18 19 1t 1e [10]
22 23 24 25 26 27 28 29 2t 2e [20]
33 34 35 36 37 38 39 3t 3e [30]
44 45 46 47 48 49 4t 4e [40]
55 56 57 58 59 5t 5e [50]
66 67 68 69 6t 6e [60]
77 78 79 7t 7e [70]
88 89 8t 8e [80]
99 9t 9e [90]
tt te [t0]
ee [e0]
[00]
A
D
B
C
Figure S9. (a) The space of ordered two-note chords of pitch-classes is a 2-torus. To
identify points (x, y) and (y, x), we need to fold the torus along the AB diagonal. The
resulting figure, shown in (b), is a triangle with two of its sides identified. This is a Mbius
strip. To see why, cut figure (b) along the line CD and glue AC to CB. (To make this
identification in Euclidean 3-space, you will need to turn over one of the pieces of paper.)
The result is a square with opposite sides identified, as in Figure 2 of the main paper.
22
.... .
0
3
(a) the dominant-seventh chord {0, 3, 6, 8}
(b) the minor-seventh chord {0, 3, 5, 8}
(c)
3 2 4
[0] 3 6 8
.... .
0
3 2 3 4
[0] 3 5 8
.... .
a
n-2
a
n-1
x + d
n-2
x + d
n-1
x + d
0
x + d
1
a
2
a
0 a
1
.... .
a
0
2x
x
a
0
+ 2x a
0
x a
0
a
0
+ x
x x x
||
d
0
||
12Z
||
d
n-1
||
12Z
|| d
0
+ d
1
||
12Z
|| d
n-2
+ d
n-1
||
12Z
0
Figure S10. (ab) The dominant-seventh chord {0, 3, 6, 8} and minor-seventh chord
{0, 3, 5, 8} can both be arranged as cycles whose notes are 2, 3, 3, and 4 semitones apart.
To transform the dominant-seventh chord into a chord that divides the octave evenly, one
need only move pitch class 8 to pitch class 9. However, to transform the minor-seventh
chord into a chord that divides the octave evenly, one must move at least two pitch classes.
(For example, one can move 5 and 8 to 6 and 9, respectively.) Consequently, although both
chords have equally small minimal voice leadings to their T
3
forms, the first is closer to the
nearest T
3
-invariant chord according to many metrics. (c) To find a bijective voice leading
from any chord to a chord whose intervals are all equal to x, fix a
0
, move note a
1
by d
0
semitones, note a
n-1
by d
n-1
semitones, note a
2
by d
0
+ d
1
semitones, and so on. Since pitch-
class space is circular, the term d
n/2
x
]
and
x
1
The greatest integer s x and the smallest integer > x, respectively.
Table S1, part 2. A glossary of mathematical terms and symbols.
26
pitch Pitch is the perceptual correlate of fundamental frequency. Pitches
can be modeled as real numbers such that middle C is 60, the
octave has size 12, and semitones have size 1.
pitch class A set consisting of all pitches separated by an integral number of
octaves. A220 and A440 both belong to the pitch class A. Pitch
classes can be modeled as elements of the quotient space R/12Z.
chord A multiset of either pitches or pitch classes.
transposition,
translation
In both pitch and pitch-class space, transposition (or translation)
corresponds to addition by a constant value. If a is a pitch or pitch
class then a + x is the transposition of a by x semitones.
T
x
(A) The transposition of the chord A by x semitones.
transpositionally related Two chords are transpositionally related if one is the transposition
of the other.
inversion,
reflection
In both pitch and pitch-class space, inversion (reflection)
corresponds to subtraction from a constant value. If a is a pitch or
pitch class then x a is the inversion of a with index number x.
I
x
(A) The inversion that maps a to x a. I
x
maps 0 to x and vice versa.
inversionally related Two chords are inversionally related if one is the inversion of the
other.
voice leading A voice leading between two multisets {a
1
, a
2
, ..., a
m
} and
{b
1
, b
2
, ..., b
n
} is a multiset of ordered pairs (a
i
, b
j
), such that every
member of each multiset is in some pair.
(a
1
, a
2
, ..., a
n
)(b
1
, b
2
, ..., b
n
) The voice leading that contains all and only the pairs (a
i
, b
i
). Voice
leadings do not uniquely determine the chords they connect. For
example, the voice leading (0, 0, 4, 7)(11, 2, 5, 7) is both a
bijective voice leading from {0, 0, 4, 7} to {11, 2, 5, 7} and a non-
bijective voice leading from {0, 4, 7} to {11, 2, 5, 7}.
distance The distance between two pitches p and q is the absolute value of
their difference, |q p|. The distance between two pitch classes a
and b, written ||b a||
12Z
, is the smallest nonnegative real number x
such that, if p is a pitch belonging to pitch class a, then either p + x
or p x belongs to pitch class b.
ascending distance The ascending distance from pitch class a to pitch class b is the
smallest nonnegative real number x such that, if p is a pitch
belonging to pitch class a, then p + x belongs to pitch class b.
Table S1, part 3. A glossary of musical terms and symbols.
27
associated voice leading (p
1
, p
2
, ..., p
n
)(q
1
, q
2
, ..., q
n
) is associated with (a
1
, a
2
, ..., a
n
)
(b
1
, b
2
, ..., b
n
) if pitches p
i
and q
i
all belong to the pitch classes a
i
and b
i
,
respectively.
displacement multiset The multiset of distances between notes in the source chord and their
images in the target chord.
distribution constraint {x
1
+ c, x
2
, ..., x
n
} {x
1
, x
2
+ c, ..., x
n
} {x
1
, x
2
, ..., x
n
} for x
1
> x
2
and
c > 0
recursion constraint {x
1
, x
2
, ..., x
m
} {y
1
, y
2
, ..., y
n
} implies {x
1
, x
2
, ..., x
m
, c}
{y
1
, y
2
, ..., y
n
, c}
trivial voice leading A voice leading containing only pairs of the form (x, x).
bijective voice leading A voice leading from A to B such that every member of A is mapped to
exactly one member of B, and vice versa.
independent voice leading A voice leading that cannot be written in the form (a
1
, a
2
, ..., a
n
)
(a
1
+ x, a
2
+ x, ..., a
n
+ x).
parallel voice leading A voice leading that is not independent, and hence acts as a
transposition.
crossing free,
strongly crossing free
Intuitively, a voice leading is crossing free if the notes of the source
chord can be connected to those of the target along minimal-length
line-segments intersecting only at their endpoints. A voice leading
between multisets of pitches (p
1
, p
2
, ..., p
n
)(q
1
, q
2
, ..., q
n
) has no voice
crossings if p
i
> p
j
implies q
i
q
j
, for all i, j n. It is strongly crossing
free if (p
1
+ 12k
1
, p
2
+ 12k
2
, ..., p
n
+ 12k
n
)(q
1
+ 12k
1
, q
2
+ 12k
2
, ..., q
n
+ 12k
n
)
is crossing free, for all integers k
i
. Thus one cannot introduce crossings
into a strongly crossing-free voice leading simply by shifting the octave
in which its voices appear. A voice leading between multisets of pitch
classes is crossing free if it is associated with a strongly crossing-free
voice leading between multisets of pitches in which no voice moves by
more than six semitones.
voice crossing Two voices (p
1
, q
1
) and (p
2
, q
2
) cross if the voice leading (p
1
, p
2
)
(q
1
, q
2
) is not crossing free.
maximally even Let {e
0
, e
1
, ..., e
n-1
} be a subset of R/kZ that divides the octave into n
precisely even parts. {m
0
, m
1
, ..., m
n-1
} is maximally even if m
i
=
e
i
.
For any maximally even chord {m
0
, m
1
, ..., m
n-1
}, and any integer d, the
set {m
i + d (mod n)
m
i
} will contain either a single integer-valued point in
R/kZ or a pair of adjacent integer-valued points in R/kZ. The n-note
maximally even subsets of a chromatic scale are related by
transposition.
Table S1, part 4. A glossary of musical terms and symbols.
28
REFERENCES
1. R. Gauldin, Harmonic Practice in Tonal Music (Norton, New York, 2004).
2. S. Laitz, The Complete Musician (Oxford, New York, 2003).
3. D. Huron, Music Perception 19, 1 (2001).
4. It is possible to show that the triangle inequality implies the distribution constraint, given
a few minimal assumptions. Let P, Q, and R be points in R
n
, representing ordered series of
pitches, and let PQ be the voice leading that maps the i
th
component of P to the i
th
component of Q. Suppose our method of comparing voice leadings is a metric that assigns a
(real-valued) size to every displacement multiset. Let D(PR) refer to the size of the
displacement multiset associated with the voice leading PR. Suppose that D(PR)
D(PQ) + D(QR), for all P, Q, and R (the triangle inequality), and that D(PR) =
D(PQ) + D(QR) whenever Q lies on the line segment PR. It follows that the
distribution constraints first inequality will be satisfied.
5. D. Lewin, Journal of Music Theory 42, 15 (1998).
6. R. Cohn, Journal of Music Theory 42, 283 (1998).
7. J. Straus, Music Theory Spectrum 25, 305 (2003).
8. C. Callender, Music Theory Online 10 (2004).
9. R. Cohn, Journal of Music Theory 41, 1 (1997).
10. J. Douthett, P. Steinbach, Journal of Music Theory 42, 241 (1998).
11. http://music.princeton.edu/~dmitri/ChordGeometries.html.
12. Consequently, for all of the standard voice-leading metrics, we can always find minimal
bijective voice leadings from A to I
x
(A) and A to S
I
x
with displacement multisets equal to
{d
1
, d
2
, ..., d
n
} and {d
1
/2, d
2
/2, ..., d
n
/2}.
13. J. Clough, J. Douthett, Journal of Music Theory 35, 93 (1991).
14. Let {e
0
, e
1
, ..., e
n-1
} be a subset of R/kZ that divides the octave into n precisely even
parts. {m
0
, m
1
, ..., m
n-1
} is maximally even if m
i
=
e
i
. For any maximally even chord
{m
0
, m
1
, ..., m
n-1
}, and any integer d, the set {m
i + d (mod n)
m
i
} will contain either a single
integer-valued point in R/kZ or a pair of adjacent integer-valued points in R/kZ. The n-note
maximally even subsets of a chromatic scale are related by transposition. In twelve-tone
equal temperament, the maximally even chords are {0, 6}, {0, 4, 8}, {0, 3, 6, 9}, {0, 2, 4, 5, 7},
{0, 2, 4, 6, 8, 10}, their transpositions, and the complements of these chords.
15. There is an integer d such that e
i+d (mod n)
+ x = e
i
+ c. So n||c||
kZ
= ||e
i+d (mod n)
+ x e
i
||
kZ
=
||
e
i+d (mod n)
+ x
e
i
||
kZ
= ||
e
i
+ c
e
i
||
kZ
, where we use the fact that ||c||
kZ
k/2n.
16. E. Agmon, Musikometrica 3, 15 (1991).
17. Note that voice leadings do not uniquely determine the chords they connect: the voice
leading (0, 0, 4, 6)(10, 0, 6, 6) is both a non-bijective voice leading between {0, 4, 6} and
{6, 10, 0} and a bijective voice leading between {0, 0, 4, 6} and {6, 6, 10, 0}.