The Languages of Spacetime
The Languages of Spacetime
23072.274488/2022-85 1981708
Documento assinado eletronicamente por Felipe Brandão Guisoli, Usuário Externo, em 22/12/2022,
às 09:49, conforme horário oficial de Brasília, com fundamento no art. 5º do Decreto nº 10.543, de 13
de novembro de 2020.
[Link] 2/2
Agradecimentos
If the map f is a continuous and bijective between two topological spaces, whose invese is
also continuous, then f is called a homeomorphism.
The idea is that, as long as we work in the chart Ï we can pretend we are in Rn , just as the
Earth looks perfectly flat if we do not go too far. Suppose, for example, we have a function
f : U æ R. We can turn it into a function from Rn æ R using f ¶ Ï≠1 , as figure 2.3 shows.
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 17
Definition
A topological n-dimensional manifold M is a topological space such that every point has a
neighbourhood U homeomorphic to an open subset in Rn .
The manifold M is differentiable if the transition function Ï≠1 – ¶ Ï— is smooth where it is
defined.
Some topological manifolds — the differentiable ones — M can be represented as a union of
finite set of coordinate charts U , and the set of coordinate charts Âu that cover M is called an
atlas on M .
The idea is that every point in a differentiable manifold lives in some open subset U– that
looks like Rn , and that we can tell if any function on the manifold is smooth by looking at
transition functions between charts. If there is a function f : M æ R and one uses a chart
Ï– : U– æ Rn , then we say that f is smooth if
– : R æ R
f ¶ Ï≠1 n
is smooth.
But one could instead use a chart Ï— : U— æ Rn . In this case, consider V = U– fi U— the
overlap of the two charts, the grey area represented in figure 2.4. The representation of f in
this chart is
— : R æ R.
f ¶ Ï≠1 n
This function should also be smooth, for the smoothness of a function does not depend on the
chart we use.
But for that to be true, we need
Ï– ¶ Ï≠1
—
to be smooth, since 1 2 1 2
— = f ¶ Ï–
f ¶ Ï≠1 ¶ Ï– ¶ Ï≠1
≠1
— .
From now on, when we mention any manifold, we will always be referring to a smooth
manifold, as defined above.
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 18
Diffeomorphism
An isomorphism is a structure-preserving mapping between two structures of the same type
that can be reversed by an inverse mapping. An homeomorphism, as previously defined, is an
isomorphism of topological spaces. A diffeomorphism is a homeomorphism that preserves a
differential structure.
A map f from a manifold M to another manifold N can be built if one has the maps
g : M ‘æ R and h : N ‘æ R, as shown in figure 2.3, by the composition h≠1 ¶ g.
If there exists a diffeomorphism f between M and N we say that these two manifolds are
diffeomorphic. We will consider the spacetime as a 4-dimensional differentiable manifold.
2.2 Vectors
2.2.1 Introduction
One can think of a vector field in a manifold as a field of arrows, tangent to the space in each
point, as it is in Rn . If we have a direction, we can differentiate a function f in that direction.
The partial derivative of f in the direction of a vector v is, in Rn :
vf = v · Òf = v µ ˆµ f,
where we are thinking of the vector v as something whose purpose is to get a function f and
spit out another function, which is the partial derivative of f in the v direction, that’s why we
wrote it as vf (something like v is an operator acting on f ).
If we look at the first and last member we have vf = v µ ˆµ f , which holds for every function
f , so one may think that we can say that a vector field v can be written as
v = v µ ˆµ , (2.1)
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 19
which says that the vector field v can be expanded in the basis ˆµ .
Here, v is the vector field, while v µ ˆµ is something that acts on a function and give its partial
derivative. For us, a vector field on a manifold will be exactly that: entities whose main purpose
is to differentiate functions.
Definition
A vector field v on a manifold M is a function from C Œ (M ) to C Œ (M ) satisfying:
• v(–f ) = –v(f )
for – œ R and f, g œ C Œ (M ). Here, C Œ (M ) stands for the set of all complex functions infinitely
differentiable in M , as usual.
So it is an object that acts linearly on functions and obeys the Leibniz rule. If we denote
by V (M ) the set of all vector fields in a manifold M one can show that this is indeed a vector
space, as expected.
• vp (f + g) = vp (f ) + vp (g)
• vp (–f ) = –vp (f )
• vp (f g) = vp (f )g + f vp (g)
and we call the tangent vector vp at point P the function C Œ æ R that satisfies these 3
properties.
We call Tp (M ), the tangent space at P, the set of all tangent vectors at p œ M . The tangent
space is indeed a vector space, with the sum of tangent vectors and the multiplication by a
scalar defined in the natural way
• (vp + Êp )(f ) = vp (f ) + Êp (f )
• (–vp )(f ) = –vp (f )
Òf · v = vf, (2.4)
where in the last step we have written the vector v in the vector basis ˆµ :
v = v µ ˆµ ,
hence
ˆf
vf = v µ ˆµ f = v µ = Òf · v.
ˆxµ
So, we are after an object df that keeps track of the derivative of f in all directions in
the manifold M , just as the gradient does. In Rn , the gradient of f is a vector field, and the
directional derivative is calculated via a dot product, as shown before. But, taking dot products
involves a choice of metric, and manifolds, in general, do not come pre-equipped with it. So, we
will leave the choice of a metric to a further development. Hence it would be nice if, in a first
approach, the df which will generalize the gradient was not a vector field, so that it would not
be necessary to take a dot product in order to extract the directional derivative information.
We will call our df here a 1-form, and it will have the same properties as the gradient does,
so to speak, for each input vector v the operator df · v = vf spits out a scalar function, which is
the directional derivative of f in the direction of v.
So, our df , when fed with a vector v œ V (M ) (the tangent vector space in a manifold M )
will spit out a function g œ C Œ (M ), and it will do it in a linear way, such as the gradient does:
df · (v + u) = df · v + df · u ,
df · (gv) = g (df · v) , (2.5)
for g œ C Œ (Rn ).
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 22
Composition
The addition of two 1-forms Ê and µ and multiplication by a scalar (function) g is defined
via
(Ê + µ)(v) = Ê(v) + µ(v) (2.8)
and
(gÊ)(v) = gÊ(v) . (2.9)
1. Ê(v) = Ê(v ‹ ˆ‹ ) = v ‹ Ê‹
2. ʵ dxµ (v) = ʵ dxµ (v ‹ ˆ‹ ) = v ‹ ʵ (ˆ‹ dxµ ) = v ‹ ʵ (”‹µ ) = Ê‹ v ‹
One can then see the 1-forms as actually dual vectors. Just as a vector field v at M gives a
tangent vector vp at each point P of M , we can assign a cotangent vector Êp at each point P of
M . The space of all cotangent vectors at P, as mentioned before, is called Tpú M . The cotangent
vector Ê at P is rigorously defined to be a linear map from the tangent space Tp M to R.
So, if we have a vector field v on M , we can define the cotangent vector field as
f ú : W ú ‘æ V ú ,
that is defined by
(f ú Ê)(v) = Ê(f (v)). (2.12)
For this we call the cotangent vectors covariant: linear maps between vector spaces gives rise
to maps between their duals that go backwards. This is the convention used in [5], probably
because this objects transforms with the same Jacobian matrix of the linear transformation
itself, while tangent vectors transforms with its inverse, hence, they are called contravariant.
We will develop more on this shortly.
So, if „ is a linear map between the tangent spaces at two different points P and Q in M
„ : Tp M ‘æ Tq N,
v = v µ ˆµ = v Õ‹ ˆ‹Õ . (2.13)
The object v is naturally the same, but its components v µ or v Õ‹ are not, since they depend
on the choice of basis ˆµ or ˆ‹Õ where the components are written.
ˆxµ
Since ˆ‹Õ = ˆµ , then, in (2.13):
ˆxÕ‹
ˆxÕ‹ µ
v Õ‹ = v , (2.14)
ˆxµ
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 24
and the components of the vector transforms with the inverse of the Jacobian matrix of the
change of coordinates. Objects that behave this way live in the tangent bundle T M and are
called contravariant.
However, for a 1-form Ê, its components ʵ and Ê‹Õ in the two coordinates systems are
related via
Ê = ʵ dxµ = Ê‹Õ dxÕ‹ , (2.15)
ˆxÕ‹ µ
and since dxÕ‹ = dx we can see that the components of Ê are related by
ˆxµ
ˆxµ
Ê‹Õ = ʵ , (2.16)
ˆxÕ‹
which states that they transform with the Jacobian matrix of the change of coordinates. Objects
that behave this way lives in the cotangent bundle T ú M and are called covariant.
df (v) = v µ ˆµ f,
but df = fµ dxµ , then
df = ˆµ f dxµ . (2.17)
Therefore, the exterior derivative of scalar function is just its gradient in Rn .
Ê · µ = ≠µ · Ê. (2.18)
The 1-forms are given by ʵ dxµ , with the coefficients ʵ being functions.
2-forms look like
1
ʵ‹ dxµ · dx‹ ,
2
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 25
where the factor of 1/2 is inserted since dxµ · dx‹ = ≠dx‹ · dxµ . The (dxµ · dx‹ ) term is the
basis of 2-forms.
In general, a p-form looks like
1
ʵ‹...· dxµ · dx‹ · · · · · dx· ,
p!
where the product of p 1-forms dxµ · dx‹ · · · · · dx· is the basis of all p-forms.
but df = ˆ‹ f dx‹ , so
since ˆ· ˆ‹ is symmetric in [‹, · ] but dx· · dx‹ is antisymmetric in the same indices, which means
that d(dÊ) = ≠d(dÊ) and hence it vanishes.
The exterior derivative generalizes all vector derivatives in 3D. For instance, one can easily
show that
• Gradient: d : 0
(R3 ) ‘æ 1
(R3 )
• Curl: d : 1
(R3 ) ‘æ 2
(R3 )
• Divergence: d : 2
(R3 ) ‘æ 3
(R3 )
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 26
Ò ◊ (Òf ) = 0,
and
Ò · (Ò ◊ v) = 0,
and has profound consequences in physics.
ú : dx · dy ‘æ dz
ú : dy · dz ‘æ dx
ú : dz · dx ‘æ dy
then we could really see that equation (2.22) would reduce to the curl, as expected. Note
that defining this operator — the star or Hodge operator — in that way is incorporating the
right-hand rule, since we could just as well have defined
ú : dy · dx ‘æ dz
ú : dz · dy ‘æ dx
ú : dx · dz ‘æ dy
which would imply in adopting a left-hand rule.
More generally, we define the Hodge star operator in a n-dimensional manifold M
ú: p
(M ) ‘æ n≠p
(M ), (2.23)
to be the unique linear map from p-forms to (n ≠ p)-forms such that, for all Ê, µ œ p
(M ),
where ÈÊ, µÍ is the inner product of the forms, which is defined using the metric tensor as will
be discussed shortly, and vol is the volume form:
Ò
vol = det(gµ‹ )dx1 · dx2 · · · · dxn ,
2.4 Tensors
2.4.1 Definition
Having defined vectors—the geometrical objects living in T M whose basis in local coordinates
are ˆµ —we can define a new object constructed by composing those with the p-forms—the
objects in the dual space T ú M with the dual basis dxµ . Those objects are called tensors, and
we define the bundle of (r, s) tensors to be the tensor product of r copies of T M and s copies of
T úM :
ú
T
¸
M ¢ T M ˚˙¢ · · · ¢ T M˝ ¢ T
¸
M ¢ T ú M˚˙¢ · · · ¢ T ú M˝ .
r s
An object living in this space is an (r, s) tensor. The (0, 0) tensor are scalar fields. In local
coordinates, any (r, s) tensor is just a linear combination of
–
ˆµ ¢ ˆ‹ ¢ · · · ¢ ˆ‡ ¢ dx
¸
¢ dx— ˚˙
¢ · · · ¢ dx“˝ .
¸ ˚˙ ˝
r s
T = T–—...“
µ‹...‡
ˆµ ¢ ˆ‹ ¢ · · · ¢ ˆ‡ ¢ dx
¸
–
¢ dx— ˚˙
¢ · · · ¢ dx“˝,
¸ ˚˙ ˝
r s
where T–—...“
µ‹...‡
are the components of the tensor in this basis, having r upper indices and s lower
indices. This object, when we change coordinates, will transform r times in a covariant way
and s times in a contravariant way. Hence, the components T̃ of the tensor T in a different
coordinate system will be related to the components in the first coordinate system by
µ‹...‡
T̃–—...“ = T◊„...Ê
· ›...” µ
·
‹
› ... ‡
” ( ) –(
≠1 ◊
)—
≠1 „
...( ) “,
≠1 Ê
¸ ˚˙ ˝¸ ˚˙ ˝
r s
where is the Jacobian matrix of the coordinate transformation and ≠1 its inverse, as expected
since vectors transform with and 1-forms with its inverse. One can do the same thing using
any basis eµ of vector fields and its dual basis eµ of 1-forms.
One way to think about the (r, s) tensor T is as a functional that accepts r 1-forms and s
vector fields as inputs and outputs a function on M in a manner that is C Œ (M )-linear in each
input.
The metric is the object that allows one to measure distances, angles and hence establishes
the dot product in the manifold. For instance, in Minkoski spacetime the dot product of vectors
v and w is
÷(v, w) = v · w = ≠v 0 w0 + v 1 w1 + v 2 w2 + v 3 w3 = ÷µ‹ v µ w‹ ,
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 28
and we will adopt this convetion where Minkowski spacetime has signature (3,1). The signature
(m, n) of a metric tensor is the number of positive and negative eigenvalues of the symmetric
tensor ÷µ‹ written in a basis where it is diagonal. Hence, if there are m positive eingenvalues
and n negative ones, one say that this metric has signature (m, n).
A metric g on a manifold M assigns to each point P œ M a metric gp on the tangent space
Tp M . This is the object used to take inner products of tangent vectors v and w at P
g(v, w) = gµ‹ v µ w‹
ÈÊ, µÍ = g –— Ê– µ— .
Òµ v = ˆµ (v ‹ e‹ )
= (ˆµ v ‹ )e‹ + v ‹ (ˆµ e‹ )
= (ˆµ v ‹ )e‹ + v ‹ k
µ‹ ek
= (ˆµ v k + µ‹ v )ek ,
k ‹
(2.28)
where we defined ˆµ e‹ := kµ‹ ek . The symbol kµ‹ tracks how the basis vectors eµ changes from
point to point and it is called the connection, since it allow one to connect a vector in one point
to another.
There are a lot of ways to make this connection. There is, however, a unique connection
that satisfies
• Metric compatibility: Òg = 0.
• Torsion free: for any vector fields v and w we have the Lie bracket [v, w] = Òv w ≠ Òw v =
Lv w vanishing.
CHAPTER 2. MANIFOLDS, TOPOLOGY AND DIFFERENTIAL GEOMETRY 29
This connection is called the Levi-Civita connection, and will allow us to take derivatives of
any geometrical object in arbitrary spaces. For instance, for a 1-form Ê = ʵ dxµ :
Òfl Ê = ˆfl ʵ ≠ k
flµ Êk , (2.29)
Òfl T = ˆfl T–—...“
µ‹...‡
+ µ k‹...‡
flk T–—...“ + ‹ µk...‡
flk T–—...“ + ··· + ‡ µ‹...k
flk T–—...“ ≠ k µ‹...‡
fl– Tk—...“ ≠ k µ‹...‡
fl— T–k...“ ≠ k µ‹...‡
fl“ T–—...k .
¸ ˚˙ ˝ ¸ ˚˙ ˝
r s
CHAPTER 3. THE TRADITIONAL FORMULATION OF GR 31
masses. Newton’s law of gravitation states that the gravitational force between bodies of mass
m and M is given by
mG M
F =G 2 .
r
On the other hand, Newton’s second law states that the dynamics of m is governed by the
equation of motion
F = mI ẍ.
If the inertial mass mI is equivalent to the gravitational mass mG , then the dynamics of bodies
due to a gravitational field will be independent of the body itself:
mG M
F = mI ẍ = G ,
r2
M
=∆ ẍ = G 2 .
r
Hence, the acceleration of bodies due to the effect of gravity will be the same for all bodies
and there are trajectories in spacetime that dictate how bodies will move if they are under
the effect of gravity. These trajectories are a property of that region of spacetime and do not
depend on the free falling body.
Figure 3.1: Observer A sees the apple free falling, but observer B, who is also in a free fall, does
not feel the effect of gravity. For him, the apple is fluctuating over his hand.
This idea has huge consequences, such as the possibility of changing coordinates to cancel the
effect of gravity, as we will briefly show. Consider observer A, which is in a uniform gravitational
field g, studying the movement of particle C, of mass m. He then writes the equation of motion
for that particle
mẍA = mg = FA , (3.1)
where FA stands for the net force acting on particle C in the frame of reference A.
Now consider the coordinate transformation
1
xB = xA ≠ gt2
2
tB = tA = t
CHAPTER 3. THE TRADITIONAL FORMULATION OF GR 32
We are trying to relate the equation of motion in the local inertial coordinates › – to the
reference frame in coordinates xµ who is feeling the effects of gravity. Hence, we can rewrite
equation (3.3) as
A B
d2 › – d ˆ› – ˆxµ
=
d· 2 d· ˆxµ ˆ·
ˆ› – d2 xµ dxµ ˆ 2 › –
= +
ˆxµ d· 2 d· ˆxµ ˆ·
– 2 µ
ˆ› d x dxµ dx‹ ˆ 2 › –
= +
ˆxµ d· 2 d· d· ˆxµ ˆx‹
– 2 µ
ˆ› d x ˆx fl
ˆxfl ˆ 2 › – dxµ dx‹
= +
ˆxµ d· 2 ˆ› – ˆ› – ˆxµ ˆx‹ d· d·
d2 xfl µ
fl dx dx
‹
= + , (3.4)
d· 2 µ‹
d· d·
where we have defined the Christoffel symbol as
ˆxfl ˆ 2 › –
fl
µ‹ := , (3.5)
ˆ› – ˆxµ ˆx‹
ˆxfl
and, in the fourth line we multiplied both sides of the equation by , which does not change
ˆ› –
– fl
ˆ› ˆx
the left-hand side since it is equal to zero. We have also used = ”µfl .
ˆxµ ˆ› –
Therefore, from (3.3) and (3.4) we get the geodesics equation
d2 xfl fl dx
µ
dx‹
+ = 0, (3.6)
d· 2 µ‹
d· d·
which gives the curves in spacetime xfl (· ) that describe the trajectories of bodies moving under
effect of gravity. These curves are called geodesics. They are a property of the geometry of
spacetime and do not depend of the particle in motion, as previously discussed.
Taking equation (3.5) as the definition of the Christoffel symbol — the connection — one
can show that its relation to the metric is given by
1
fl
= g fl‹ (ˆµ g‹⁄ + ˆ⁄ gµ‹ ≠ ˆ‹ gµ⁄ ) . (3.7)
µ⁄
2
• The particle will be moving in low speed comparing with the speed of light:
dx dt
π . (3.8)
d· d·
ˆ· gµ‹ = 0. (3.9)
CHAPTER 3. THE TRADITIONAL FORMULATION OF GR 34
• The gravitational field is weak. Hence, we can introduce the tensor hµ‹ which represents a
low deviation of the spacetime metric gµ‹ from the Minkowski metric ÷µ‹ :
The first condition allows us to reduce the equation of motion (3.6) by neglecting some of
its components:
A B2
d2 xfl dx0
=≠ fl
00 ,
d· 2 d·
For fl = 0 the the second term vanishes since ÷ 0‹ ˆ‹ h00 = ≠”0‹ ˆ‹ h00 = ˆ0 h00 = 0, because of
d2 x0
the stationary gravitational field condition. Hence, the equation states that = 0. Therefore
d· 2
dt
= constant.
d·
Now, for fl = i = 1, 2, 3 we get
A B2
d2 xi 1 i‹ dt
≠ ÷ ˆ‹ h00 = 0,
d· 2 2 d·
A B2
dt
which, dividing by , which is just a constant, as we previously showed, leads us to
d·
d2 xi 1
= ˆi h00 .
dt 2 2
CHAPTER 3. THE TRADITIONAL FORMULATION OF GR 35
But, looking to Newton’s second law for a gravitational force written in terms of the
gravitational potential „ we have
d2 xi
= ≠Ò„,
dt2
which allow us to make the identification
h00 = ≠2„ + c,
where c is a real constant. But, since at infinity the metric must become the Minkowski metric,
we have, at infinity, g00 = ÷00 + h00 = ÷00 and, therefore, the constant c must be zero, since the
potential „ already vanishes at infinity.
Then we get an equation saying that the time-time component of the metric tensor must
satisfy:
ˆµ T µ‹ = 0,
which will be the conservation of energy for ‹ = 0 and the conservation of momentum in the i
direction for ‹ = i.
Therefore, we are looking here for an equation of the form
Ò2 g00 = ≠8fiGT00 .
This is actually a special case, written in a reference frame where the particles are at low
speed. We could write the equation in a more general way as
where the tensor Gµ‹ must have, at most, second order derivatives of the metric tensor, since
we need to recover Poissons’s equation (3.12) in the classical limit. But then: who is this tensor
Gµ‹ that we are looking for?
CHAPTER 3. THE TRADITIONAL FORMULATION OF GR 36
and here the first term in the last line vanishes since it is symmetric in [µ, ‹]. The second term
will dismember in ≠ˆµ ( ‡‹fl V‡ ) = ≠V‡ ˆµ ‡‹fl ≠ ‡‹fl ˆµ V‡ = ≠V‡ ˆµ ‡‹fl ≠ k‹fl ˆµ Vk , which, when
plugged back in the expression above will give
Now note that the term in parenthesis is symmetric in [µ, ‹], therefore it vanishes due the
antisymmetrization. Finally, the equation is reduced to
R‡flµ‹ := ˆµ ‡
‹fl ≠ ˆ‹ ‡
µfl + k ‡
‹fl µk ≠ k ‡
µfl ‹k . (3.15)
Equation (3.15) defines the Riemann tensor, a (1,3) tensor that carries the information about
the curvature of the space. One can raise or lower indices of this tensor as with any other using
the metric:
R· flµ‹ = g· ‡ R‡flµ‹ .
One can also define the Ricci tensor Rfl‹ by the contraction
R = g µ‹ Rµ‹ .
CHAPTER 3. THE TRADITIONAL FORMULATION OF GR 37
Òµ T µ‹ = 0, (3.16)
where the conservation of energy is the equation for µ = 0 and the conservation of the 3-
momentum is satisfied for µ = 1, 2, 3.
But the derivative of the Ricci tensor — and also of the Riemann tensor — is not zero,
so Rµ‹ = 8fiGTµ‹ , although it has the element which generates curvature on one side and the
curvature itself on the other, can not be the equation we are looking for. (Historically speaking,
Einstein and Grossmann dismissed it because they were unable to recover Newtonian physics in
the weak field limit[12], as discussed in section 3.2.2).
On the other hand, one can contract Bianchi’s identity
to get to 3 4
1
Òµ Rµ‹ ≠ Rg µ‹ = 0. (3.17)
2
Now, we have built a symmetric tensor
1
Gµ‹ = Rµ‹ ≠ Rgµ‹ , (3.18)
2
which is related to the curvature, has its covariant derivative vanishing and it is of second order,
since the curvature has at most second order derivatives of gµ‹ .
this would not affect the conservation laws. So, a more general equation would then be
1
Rµ‹ ≠ Rgµ‹ + gµ‹ = 8fiGTµ‹ . (3.20)
2
The parameter is called the cosmological constant, and it was added by Einstein to
give solutions for a static cosmological model. This was done before Hubble’s work about the
expansion of universe.
One can note that the addition of this new term will make appear, in the classical limit,
beyond the gravitational Newtonian force, a repulsive force proportional to and to the distance.
So, for small , this repulsive term would be relevant only for great distances.
This term, although denied by Einstein and considered by him as an error, proved to be
very important later on. It is responsible for the dark energy and it is the term that explained
the accelerated expansion of the universe later detected in 1990.
However, since our work will not enter deep into cosmology, equation (3.19) will be our main
subject.
Ô 1
”( ≠g) = Ô gg µ‹ ”g‹µ
2 ≠g
1Ô
= ≠gg µ‹ ”gµ‹
2
1Ô
=≠ ≠ggµ‹ ”g µ‹ , (3.22)
2
where, in the last step, we used ”(gµ‹ g µ‹ ) = g µ‹ ”gµ‹ + gµ‹ ”g µ‹ = 0. Hence, we are left with
⁄ ; <
1Ô
”S1 = d4 x ≠R ≠ggµ‹ ”g µ‹ . (3.23)
2
For the second term we will use the Palatini identity, which states that
”Rflµ‡‹ = Ò‡ ” fl
µ‹ ≠ Ò‹ ” fl
µ‡ .
”Rµ‹ = ”Rflµfl‹ = Òfl ” fl
µ‹ ≠ Ò‹ ” fl
µfl . (3.24)
where we used the Palatini identity (3.24) in the second line and, in the last one, we did an
integration by parts and neglected the boundary term.
Assuming that the connection is symmetric in [µ, ‹], the variation will vanish if the sym-
metrization of the integrand vanishes:
Òfl g µ‹ + Òfl g ‹µ ≠ ”fl‹ Ò‡ g µ‡ ≠ ”flµ Ò‡ g ‹‡ = 0
Òµ g µ‹ ≠ Ò‡ g ‹‡ ≠ Ò‡ g ‹‡ = 0,
CHAPTER 3. THE TRADITIONAL FORMULATION OF GR 41
Òfl g µ‹ = 0, (3.31)
which states that the covariant derivative Ò with respect to the connection gives a null
derivative of the spacetime metric, i.e., the metric is compatible with the connection.
So, the first equation of motion gives us Einstein field equation and the second equation of
motion states that our connection, previously placed as a dynamical variable, is fixed to be the
Levi Civita connection.
CHAPTER 4. THE HAMILTONIAN FORMULATION OF GR 43
Òa Gab = 0 , (4.2)
Òµ Gµ‹ = ˆµ Gµ‹ ≠ ‡ µ
µ‹ G‡ + µ k
µk G‹ = 0.
and, since
1
‡
= g ‡‹ (ˆ⁄ gµ‹ + ˆµ g⁄‹ ≠ ˆ‹ gµ⁄ ) ,
µ⁄
2
the right-hand side of (4.3) has, at least, second order time derivatives, given that the Christoffel
symbols have at most first order time derivatives and the Einstein’s tensor contains first
derivatives of those symbols. Hence, from the left-hand side of the equation we can infer that
G0‹ has at most first order time derivatives. Therefore, since we have other equations with
second order time derivatives, those four equations for G0‹ are not evolution equations: they are
constraints that the initial data must satisfy. From symmetry, the Einstein’s field equations are
a set of ten partial differential equations, of which only six are time evolution equations. The
equations G0µ = 8fiGTµ0 relate initial values of fields instead of determining how fields evolve.
If we proceed with the computation we can see that only spatial components of the metric
gab appear with their second order time derivatives. The other components do not play the
same dynamical role as gab . The g00 and g0a equations will be the constraints — they will play
the role of the lapse function and the shift vector, as we will see later.
for i = 1, 2, 3, ..., n. From the least action principle, one can get the Euler Lagrange equations
by setting ”S = 0: A B
d ˆL ˆL
i
≠ i = 0. (4.5)
dt ˆ q̇ ˆq
By the chain rule, one can expand the time derivative as
d ˆ dq i ˆ dq̇ i
= i + i
dt ˆq dt ˆ q̇ dt
and, plugging this in (4.5) one gets
A B A B
ˆ2L ˆ2L ˆL
j i
q̈ j
+ j i
q̇ j ≠ i = 0. (4.6)
ˆ q̇ ˆ q̇ ˆq ˆ q̇ ˆq
Hamiltonian equations
Let us consider the usual Legendre transformation
”H = ” q̇ i pi + q̇ i ”pi ≠ ”L(q i , q̇ i )
ˆL ˆL
= ” q̇ i pi + q̇ i ”pi ≠ i ”q i ≠ i ” q̇ i
ˆq ˆ q̇
ˆL
= ” q̇ i pi + q̇ i ”pi ≠ i ”q i ≠ pi ” q̇ i
ˆq
ˆL
= ≠ i ”q i + q̇ i ”pi
ˆq
= ≠ṗi ”q i + q̇ i ”pi
ˆH ˆH
= i ”q i + ”pi .
ˆq ˆpi
In the fourth line one can easily see that the variation ”H depends only on the variations of the
momenta pi and the position q i , not on the velocities q̇ i .
Equating the last two lines we get
A B A B
ˆH ˆH
i
+ ṗi ”q i + ≠ q̇ i ”pi = 0. (4.14)
ˆq ˆpi
For any variation ti = (”q i , ”pi ) tangent to the primary constraint surface, the equation above
shows that the vector A B
ˆH ˆH
V := + ṗi , ≠ q̇ i
(4.15)
ˆq i ˆpi
is normal to the surface, since ti Vi = 0
A basis of normal vectors to C is
A B
ˆÂs ˆÂs
vs = grad(Âs ) = , . (4.16)
ˆq i ˆpi
Then, for some functions ⁄ on the surface of primary constraints, we have
V = ⁄s v s . (4.17)
Finally, with equations (4.15), (4.16) and (4.17) one can get the equations of motion:
ˆH s ˆÂs
ṗi = ≠ + ⁄ , (4.18)
ˆq i ˆq i
ˆH ˆÂs
q̇ i = ≠ ⁄s . (4.19)
ˆpi ˆpi
CHAPTER 4. THE HAMILTONIAN FORMULATION OF GR 46
Comparing the last equations with the Hamilton’s equations of motion, those can be rewritten
as
ˆ(H ≠ ⁄s Âs ) ˆ⁄s
ṗi = ≠ ≠ Â s , (4.20)
ˆq i ˆq i
ˆ(H ≠ ⁄s Âs ) ˆ⁄s
q̇ i = + Âs , (4.21)
ˆpi ˆpi
where we can define the total Hamiltonian of the system as
Htotal = H ≠ ⁄s Âs . (4.22)
Using the Poisson bracket we can rewrite the Hamilton’s equations of motion 4.24 as:
ṗi ¥ {pi , Htotal } (4.26)
Ó Ô
q̇ i ¥ q i , Htotal , (4.27)
which actually is valid for any function F (q, p) on the phase space, as is easily seen:
dF
= i q˙i +
ˆF ˆF
Ḟ = ṗi
dt ˆq ˆpi
A B A B
ˆF ˆHtotal ˆF ˆHtotal
¥ i + ≠
ˆq ˆpi ˆpi ˆq i
= {F, Htotal } . (4.28)
The total Hamiltonian then generates the dynamical flow of the variables of the phase space in
time.
Because the primary constraints Âs are originated directly from the definition of the canonical
momenta, they need to hold during all the evolution of the system. This means that the evolution
of the system must be contained in the surface of primary constraint Âs . These are called
consistency conditions, expressed by
Â̇s ¥ {Âs , Htotal } = 0. (4.29)
These conditions can add new constraints to the evolution of the system, known as secondary
constraints. Those constraints must also satisfy the consistency conditions, which can lead to a
new generation of constraints. This process goes on until no more constraints are generated.
Opening equation (4.29) we get:
Ó Ô
Â̇s ¥ {Âs , Htotal } = Âs , H ≠ ⁄k Âk
Ó Ô
= {Âs , H} ≠ Âs , ⁄k Âk
Ó Ô
= {Âs , H} ≠ ⁄k {Âs , Âk } ≠ Âk Âs , ⁄k
¥ {Âs , H} ≠ ⁄k {Âs , Âk }
= {Âs , H} ≠ ⁄k Csk = 0, (4.30)
where we have defined
Csk := {Âs , Âk } .
If Csk is non singular the structure of the constraint system is uniquely determined: one can
solve for the ⁄k via
⁄k = Csk≠1
{Âs , H} .
In this case, no further constraints arise and we can fulfill the consistency condition. However, if
the matrix Csk is singular, we cannot determine all the ⁄k . In that case, equation (4.30) implies
the secondary constraints aforementioned. Those follow from the equations of motion, not from
the definition of the momenta as the primary constraints.
We call a constraint Âk first class with respect to all constraints if its Hamiltonian vector
field is everywhere tangent to the constraint surface C. That is, for all constraints Âk on the
constraint surface C we must have
{Âs , Âk } = 0,
and we call it second class if that Poisson bracket is nonvanishing on the constraint surface.
First class constraints generate gauge transformations, as we now show.
Consider all constraints, and consider also an arbitrary dynamical variable F , then define
the transformation
F (q, p) ‘æ F (q, p) + {F, ‘Âk } , (4.31)
where ‘ is a control parameter arbitrarily small. Due to the consistency conditions, this
transformation does not affect the Hamiltonian
H(q, p) ‘æ H(q, p) + {H, ‘Âk } ¥ H.
That is, the transformation takes solutions of the equations of motion and constraints into new
solutions. This is a gauge transformation, and that is why constraints are generators of gauge
transformations. Solutions that are related by gauge transformations are then treated as the
same solution.
Any particular choice for the total Hamiltonian will result in equations of motion written
in a specific gauge. But since the theory is invariant under gauge transformations generated
by constraints, the choice of a total Hamiltonian does not matter, and all sets of equations of
motion obtained for different gauges are equivalent.
P‹ : T M ‘æ T‹ M (4.34)
xa ‘æ ≠g(n, x)na = ≠nb xb na ,
These projections allow us to break any geometrical object X (a vector or tensor) in its
tangential (œ TÎ M ) and perpendicular (œ T‹ M ) parts:
For the dual space T ú (M ) the action of those operators is similar. The action of the projection
operator on a 1-form Ê , for instance, is:
PÎ : T ú M ‘æ TÎú M (4.35)
Êa ‘æ Êa + g(n, Ê)na = Êa + n Êb na .
b
CHAPTER 4. THE HAMILTONIAN FORMULATION OF GR 51
So, when applied to vectors tangent to t, the induced metric hab gives the same geometry
as gab .
One can then use the induced metric hab to describe projections of any geometrical object.
In coordinates, for a rank (m, n) tensor, one gets:
(PÎ T )ab11...b
...am
n
= hac11 . . . hacmm hdb11 . . . hdbnn Tdc11...d
...cm
n
. (4.41)
To study the dynamics of the canonical formulation, we consider the induced 3-metric hab as
a time-dependent 3-dimensional tensor field evolving on a family of manifolds t . Then, the
time dependent field hab will be the configuration variables of canonical gravity.
However, in order to do this, we have to define a time evolution vector field ta that specifies
the directions of time derivatives, since one will need to take time derivatives of the induced
metric or any other vector fields.
CHAPTER 4. THE HAMILTONIAN FORMULATION OF GR 52
If one has just two slices in the foliation, it is impossible to say how a field defined on them
changes, unless we can uniquely associate a point on one slice to a point on the other one. The
vector field that connects a point in one slice to its correspondent point in another one is the
time evolution vector field ta , whose integral curves are shown in the right part of figure (4.5).
To ensure that this vector field agrees with the concept of time it is required that
ta Òa t = 1, (4.42)
which states that the change of t in the direction of the time evolution vector field t is just the a
unity.
It is assumed that the spatial coordinates xb are held fixed:
ta Òa xb = 0, (4.43)
so that
ˆ
ta Òa := . (4.44)
ˆt
By introducing the shift vector N a
N a := PÎ ta = hab tb (4.45)
and the lapse function N , which is the amount of the vector field ta in the direction orthogonal
to t :
N na := ta ≠ hab tb , (4.46)
and by acting with na on both sides of equation (4.46) one gets
N na na = na ta ≠ na hab tb = na ta .
Since na na = ≠1, we get
N = ≠na ta . (4.47)
The time evolution vector field t can then be written in its normal and tangential parts
a
A geometrical object — the extrinsic curvature Kab — will naturally arise when we try to
make the induced metric evolve:
Ln hab = nc Òc hab + hac Òb nc + hbc Òa nc
= nc Òc (gab + na nb ) + Òb na + Òa nb
= nc Òc (na nb ) + Òb na + Òa nb
= n c na Òc n b + nc n b Òc n a + Òb n a + Òa n b
= (gac + na nc )Òc nb + (gbc + nb nc )Òc na
= hca Òc nb + hcb Òc na
= Kab + Kba , (4.59)
where the object Kab appears in the context of the evolution of the induced metric hab . Also,
in the third line we developed hac Òb nc = (gac + na nc )Òb nc = Òb na + na nc Òb nc = Òb na , since
nc Òb nc = 0 as shown in equation (4.61).
CHAPTER 4. THE HAMILTONIAN FORMULATION OF GR 55
Definition 4.3.2 (Extrinsic Curvature). Given any normal vector na to the surface , the
extrinsic-curvature tensor is a spatial tensor on defined by
We could also omit the first projector hdb on the definition because
This notion is captured when one splits the derivative Òu v in its normal and tangential parts
where the first term represents the normal part of it and the second one the tangential part. So,
when we parallel transport v, who lives in , in the direction of u, which also lives in , the
emergence of a normal component in this parallel transport measures exactly the curvature in
that region.
This way of thinking agrees with our previous definition of Kab , since, from the that definition,
we had
K(u, v) = ≠g(Òu v, n)
= ≠gab (Òu v a )nb
= ≠(uc Òc v a )na
= uc (Òc na )v a
= (Òc nb )uc v b , (4.64)
where in the third line we used the metric to lower the index of nb and in the fourth line we
used the fact that both v and u are spatial, then Òc (v a na ) = 0, then na Òc v a = ≠v a Òc na . In
the last line we only renamed a dummy index so it agrees with equation (4.63).
With this view, the tensor K measures how much the surface is curved in the way it
sits in M , because it says how much a vector tangent to will fail to be tangent if parallel
transported using the Levi-Civita connection Ò on M .
CHAPTER 4. THE HAMILTONIAN FORMULATION OF GR 56
In components, we have
K(u, v) = Kij ui v j
in local coordinates, where
Kij = K(ˆi , ˆj ).
From this point of view it is easy to see that this tensor is symmetric, since
Kij ≠ Kji = K(ˆi , ˆj ) ≠ K(ˆj , ˆi )
= ≠g(Òi ˆj , n) + g(Òj ˆi , n)
= ≠g(Òi ˆj ≠ Òj ˆi , n)
= ≠g([ˆi , ˆj ], n)
= ≠g(0, n)
= 0. (4.65)
The extrinsic-curvature tensor has some important properties:
1. It is symmetric:
Kab = Kba , (4.66)
as shown right above.
2. As developed in equation (4.59) and using the property above, we get that the extrinsic
curvature tensor is half of the Lie derivative of the intrinsic metric along the unit normal:
1
Kab = Ln hab . (4.67)
2
3. The extrinsic curvature tensor can be related to the intrinsic curvature hab , the shift vector
N a and the lapse function N via
1 1 2
Kab = ḣab ≠ Da Nb ≠ Db Na , (4.68)
2N
which can be proven as follows:
1
Kab = Ln hab
2
1
= [nc Òc hab + hac Òb nc + hbc Òa nc ]
2
1
= [N nc Òc hab + hac Òb (N nc ) + hbc Òa (N nc )]
2N
1
= [(tc ≠ N c )Òc hab + hac Òb (tc ≠ N c ) + hbc Òa (tc ≠ N c )]
2N
1
= Lt≠N hab
2N
1 d c
= h h Lt≠N hcd
2N a b
1 d c
= h h [Lt hcd ≠ LN hcd ]
2N a b
1 1 d c 2
= ha hb Lt hcd ≠ hda hcb LN hcd
2N
1 1 ˙ b ≠ Db N a ,
2
= hab ≠ Da N
2N
CHAPTER 4. THE HAMILTONIAN FORMULATION OF GR 57
where, from the third to the fourth line we used equation (4.48), and in the sixth line we
just smuggled in the induced metric to get the spatial part of the calculation, since Kab
is purely spatial. In the last line we just used the definition of the time derivative of a
tensor given by (4.49) and used the fact that the shift vector is spatial, then
LN hab = PÎ [N c Òc hab + hac Òb N c + hbc Òa N c ]
= PÎ [N c Òc hab ] + PÎ [hac Òb N c + hbc Òa N c ]
= N c PÎ [Òc hab ] + PÎ [(gac + na nc )Òb N c + (gbc + nb nc )Òa N c ]
= N c Dc hab + PÎ [Òb Na + Òa Nb ]
= 0 + Da Nb + Db Na
= Da Nb + Db Na .
We could also use the Ricci equation (4.71) with the relation Rab na nb = Racdd na nb to get
Rab na nb = (Kaa )2 ≠ Kab Kba + Òa v a , (4.72)
where the vector field v a is defined as
v a := ≠na Òc nc + nc Òc na .
Using the Gauss-Codazzi equations with the Ricci equations one can read the Ricci scalar R:
R = 3 R + Kab K ab ≠ (K aa )2 ≠ 2Òa v a . (4.73)
Hence, up to a divergence term, we can decompose the Ricci scalar into a potential term 3 R
and a kinetic term — quadratic in extrinsic curvature. Then, the extrinsic curvature, as shown
in equation (4.68), plays the role of a velocity of the spatial metric hab and is, thus, a candidate
for its momentum when we formulate the GR in terms of canonical variables, as we do next.
CHAPTER 4. THE HAMILTONIAN FORMULATION OF GR 58
and
”Lgrav
pa (x) = = 0. (4.76)
” Ṅ a (x)
The conjugate momenta of the induced metric hab is
”Lgrav
fi ab (x) =
” ḣab (x)
”Lgrav ”Kab
= (4.77)
”Kab ḣab (x)
”Lgrav 1
= , (4.78)
”Kab 2N
where the last line comes from equation (4.68).
So we get Ô
det h 1 ab 2
fi (x) =
ab
K ≠ K cc hab . (4.79)
16fiG
Contracting this relation with hab we get
16fiG ab
Ô fi hab = hab K ab ≠ K cc hab hab
det h
= K aa ≠ 3K cc
= ≠2K aa ,
With this last relation we can express ḣab in (4.68) in terms of its conjugate momenta fi ab :
16fiGN
ḣab = Ô (2fi ab ≠ fi cc hab ) + 2D(a Nb) . (4.82)
det h
Then we can obtain the Hamiltonian through
⁄ 1 2
H(t) = d3 x [fi ab h˙ab ] + ⁄pN + µa pa ≠ L(t), (4.83)
Using equation (4.82) to write ḣab in terms of its conjugate momenta fi ab we can write (4.83)
as
⁄ C 3 4 Ô D
16fiGN 1 N det h 3
H= d3 x Ô fiab fi ab ≠ (fi aa )2 + 2fi ab Da Nb ≠ R + ⁄pN + µa pa . (4.84)
det h 2 16fiG
Applying the consistency conditions (4.29) to the constraints we get secondary constraints:
We can now see that, putting these in (4.84), the lapse function N and the shift vector N a
play the role of Lagrange multipliers of the secondary constraints:
⁄
H= d3 x [N Cgrav + N a Cagrav + ⁄pN + µa pa ] + Hˆ , (4.89)
where the last term refers to the Hamiltonian of the boundary term.
which gives us back equation (4.82). Finally, we have also the equation of motion
Ó Ô
fi̇ ab = fi ab , Hgrav ,
gµ‹ = g(ˆµ , ˆ‹ )
= ˆµ · ˆ‹
= (eIµ eI ) · (eJ‹ eJ )
= eIµ eJ‹ ÷IJ , (5.2)
g = eT ÷ e.
In this way, we can see the tetrad as a similarity transformation that diagonalizes the metric
gµ‹ and scales it to the unit. The ÷ matrix is the euclidean metric if we are talking about 3D
space (then the basis e should be called a triad) or the Minkowski metric if we are talking about
spacetime — where the name tetrad makes more sense.
Taking the determinant of this equation we get
g = ≠e2 , (5.3)
where g stands for the determinant of the spacetime metric gµ‹ and e for the determinant of the
matrix eIµ . The minus sign comes from the determinant of the Minkowski metric.
Hence, the tetrad represents the square root of the metric and has, therefore, all the
information about the geometry of the manifold. We can thus consider the tetrad as the
fundamental description and the metric as a derived concept.
The spacetime indices are contracted with the metric gµ‹ , as usual, and the internal indices
are contracted with the flat spacetime metric ÷IJ , which, consisting of 0s and ±1s is much easier
to deal with than gµ‹ — we will see that this is the whole point of the formalism.
Thinking of e(x) as a square matrix, we can define its inverse eµI such that
CHAPTER 5. TETRADS FORMALISM AND PALATINI ACTION 63
(Òfl v)µ = ˆfl v µ + µ k
flk v , (5.5)
Da T µ1‹...µ r
1 ...‹s
= ˆa T µ1‹...µ r
1 ...‹s
+
+ Ê µ1ak T k...µ
‹1 ...‹s + ... + Ê ak T ‹1 ...‹s +
r µr µ1...k
≠ Ê k a‹1 T µk...‹
1 ...µr
s
+ ... ≠ Ê k a‹s T µ‹11...k
...µr
.
It can be easily seen that the spin connection is a 1-form since, on a curved manifold, when
we move from a point x to a nearby point x + dx it is expected that the local frame will rotate
(in Euclidean space) or Lorentz transform (in Minkowski flat spacetime), thus, an infinitesimal
translation has the effect of rotating the 1-form eI (x) infinitesimally. Hence, if we apply the
exterior derivative d to this 1-form we should get
deI = ≠Ê IJ eJ , (5.7)
for some antisymmetric ÊIJ , since the generators of rotations or Lorentz transformations are
antisymmetric. The minus sign is just a convention. Since eI is a 1-form, deI is a 2-form and
is also a 1-form.
If we evaluate the covariant derivative of the Minkowski metric we get
Êa IJ = ≠Êa JI , (5.10)
CHAPTER 5. TETRADS FORMALISM AND PALATINI ACTION 64
which attest the antisymmetry of the spin connection in its internal indices, as said before. This
means that the coefficients of the connection take values in the Lie Algebra of the Lorentz group
of that signature, as developed in appendix A.
We can built the relation between the spin connection Ê on the internal space and the Levi
Civita connection on the manifold. Remember that we can always express a vector v at point
P by a linear combination of the internal basis vectors eI or by the spacetime basis vectors on
the tangent space Tp M , the ˆµÕ s: v = v I eI = v µ ˆµ .
Also, the connection Ê on the internal space induces a connection on the tangent space Tp M
for a given tetrad e. The covariant derivative D̃ on the internal space is defined via
Òfl v = Òfl (v I eI ) =Òfl (v µ ˆµ )
eI (ˆfl v I ) + v K ÊflJ K eJ = (ˆfl v µ + v k flk )ˆµ ,
µ
eI ˆfl (v µ eIµ ) + v µ eK
µ Êfl K eJ = (ˆfl v + v
J µ k
flk )eµ eI
µ I
from where we can express the spin connection in terms of the Levi Civita connection:
1 2
Ê‹I J = eIfl ˆ‹ eflJ + eµJ fl
‹µ , (5.12)
If we calculate
DKI DIJ „J = d(d„K + Ê KJ „J ) + Ê KL (d„L + Ê LJ „J ), (5.14)
the curvature will immediately emerge. The first term gives
Then, since d2 = 0, the sum gives, maintaining only the operators in the equation:
DKI DIJ = dÊ KJ + Ê KL Ê LJ := F K
J, (5.15)
If we use equation (5.12) in (5.17) we can get the relation between the curvature 2-form F IJ
and the Riemann curvature tensor Rµ‹‡ fl
= ˆµ ‹‡ ≠ ˆ‹ µ‡ + flµ– –‹‡ ≠ fl‹– –µ‡ , which is
fl
Rµ‹‡ = eflI eJ‡ Fµ‹I J . (5.18)
FM NI J = Fµ‹J
I
eµM e‹N . (5.19)
R = FIJ ÷ IJ . (5.22)
It is also easy to see that this scalar, in the internal structure, is the same as the Ricci scalar on
the manifold:
R = FIJ ÷ IJ
= Rµ‹ eµI e‹J ÷ IJ
= Rµ‹ g µ‹
= R,
where, in the second line, we used equation (5.22) with inverses of eIµ applied.
We also may define the torsion in the local Minkowski space. First, the torsion T in the
tangent bundle is given by
T (v, u) = Òv u ≠ Òu v ≠ [v, u], (5.23)
which is, in coordinates:
fl
Tµ‹ = Òµ ˆ‹ ≠ Ò‹ ˆµ = fl
[µ‹] .
CHAPTER 5. TETRADS FORMALISM AND PALATINI ACTION 66
The protocol goes as follows: given a metric, one chooses a basis of tetrads eI satisfying equation
(5.2). Then one can use the first of Cartan’s equations to figure out the spin connection Ê, and,
finally, with the second of Cartan’s equations, one has the curvature 2-form F . This is the
easiest way of computing the Riemann curvature tensor, which can be done with the relations
between F IJ and Rµ‹‡
fl
developed in the previous section.
e1◊ = R
e2„ = R sen◊,
with all the other components vanishing. For the 1-forms, we have eI = eIµ dxµ , so we get
e1 = Rd◊,
e2 = R sen◊d„.
Now, from the first of Cartan’s structural equations we get
de1 + ÊJ1 · eJ = 0
Ê11 e1 + Ê21 e2 = 0
Ê11 Rd◊ = ≠Ê21 R sen◊d„, (5.26)
CHAPTER 5. TETRADS FORMALISM AND PALATINI ACTION 67
de2 + ÊJ2 · eJ = 0
d(R sen◊d„) + Ê12 e1 + Ê22 e2 = 0
R cos ◊(d◊ · d„) + Ê12 Rd◊ + Ê22 R sen◊d„ = 0,
(5.27)
F IJ = dÊ IJ + Ê IK · Ê KJ ,
we get directly the vanishing components R11 = R22 = 0 and the non-vanishing components:
F 12 = dÊ 12 + Ê 1K · Ê K2
= sen◊(d◊ · d„) + Ê 11 · Ê 12 + Ê 12 · Ê 22
= sen◊(d◊ · d„),
F IJ = Fµ‹
I µ ‹
J dx dx ,
F 12 = sen(◊)d◊ · d„ = Fµ‹
1 µ
2 dx dx
‹
1
= F◊„ 2 d◊ · d„,
Now, from (5.18), we can write the components of the Riemann curvature tensor,
fl
Rµ‹‡ = eflI eJ‡ Fµ‹I J
= efl1 e2‡ Fµ‹1 2 + efl2 e1‡ Fµ‹2 1 .
The only non-vanishing component, up to the symmetries of the Riemann tensor, is:
◊
R◊„„ = e◊1 e2„ F◊„1 2 + e◊2 e1„ F◊„2 1
= e◊1 e2„ F◊„1 2
= R≠1 R sen(◊)F◊„1 2
= sen2 ◊.
CHAPTER 5. TETRADS FORMALISM AND PALATINI ACTION 68
Also, from (5.19) and (5.20) we can write the component F11 of the Ricci tensor in internal
indices:
From equation (5.3) we can write det g in terms of the tetrads. And, from equation (5.18)
we can get the Ricci tensor by the usual contraction:
Rµ‡ = Rµ‹‡
‹
= Fµ‹I J e‹I eJ‡ , (5.29)
R = Rµ‡ g µ‡
= Fµ‹I J e‹I eJ‡ g µ‡
= Fµ‹I J e‹I eJ‡ ÷ M N eµM e‡N
= Fµ‹I J ÷ M J e‹I eµM
= Fµ‹IM e‹I eµM
= Fµ‹IJ e‹I eµJ , (5.30)
”e = e e‡K ”eK
‡ = ≠e e‡ ”eK ,
K ‡
(5.33)
1
where we used A[µ‹] = (Aµ‹ ≠ A‹µ ). Hence, in equation (5.34) we have
2!
1 ⁄ 4 [I
”S = d x e eµI e‹J (ˆ[µ ”Ê‹] + 2Ê[µ|K| ”Ê‹] ). (5.35)
IJ |K|J]
8fiG
The first term can be rewritten as
where, in the last step, we just left out the antisymmetrization [µ, ‹] since the expression is
already antisymmetric.
So, we have, for the second term of the action, renaming some dummy indices
⁄
≠ d4 x(2 e e‹[K eµJ] Ê‹I
K
)”ʵIJ , (5.37)
However, if we took the covariant derivative of the e eµ[I e‹J] term we would get
1 2 1 2
D‹ e eµ[I e‹J] = ˆ‹ e eµ[I e‹J] ≠ Ê‹I
K
e eµ[K e‹J] ≠ Ê‹J
K
e eµ[I e‹K] ,
Since
1
e eµ[I e‹J] = ‘IJKL ‘µ‹–— eK L
– e— ,
4
we have, in (5.39):
1 2 1 Ë È
D‹ e eµ[I e‹J] = ‘IJKL ‘µ‹–— (D‹ eK )e L
+ (D e )e
L K
4 – — ‹ — –
1
= ‘IJKL ‘µ‹–— (D‹ eK
– )e— ,
L
2
where, in the first line the two terms in the parenthesis are equivalent since the expression is
antisymmetric in [K, L] and [–, —], contributing with two factors of minus one to the last term,
making it identical do the first one. We are left with
‘µ‹–— (D‹ eK
– )e— = 0,
L
where the symbol ‘IJKL was removed since it’s action on antisymmetric rank 2 tensors is
invertible. If we act with eflL on both sides we are left with
–] = 0.
D[‹ eK (5.40)
This implies that the torsion is zero, and, since we have a metric compatibility — equation
(5.10) — we know that we are talking about the Levi Civita connection.
This implies that the tetrad is constant with respect to the covariant derivative defined via
the connection Ê. So, just as it happened in the Palatini approach for the variables gµ‹ and ,
the variation of the action with respect to the connection told us that the metric was compatible
with the covariant derivative defined by that connection. Here, the tetrad, playing the role of
the metric, is compatible with the connection Ê.
where, in the fourth line we used one of the relations from appendix B, and in the last line we
just removed the antisymmetrization in [–, —] since the expression is already antisymmetric.
CHAPTER 5. TETRADS FORMALISM AND PALATINI ACTION 72
To get the equations of motion we first vary the action with respect to the connection Ê,
which gives us (removing the constants):
⁄
”S = ‘IJKL eI · eJ · (”F KL )
⁄
= ‘IJKL eI · eJ · D(”Ê KL )
⁄ 1 2 ⁄ 1 2
= I
D ‘IJKL e · e · ”Ê J KL
≠ D ‘IJKL eI · eJ · ”Ê KL
⁄ 1 2
= ≠2 ‘IJKL DeI · eJ · ”Ê KL ,
where the boundary term was neglected from the third to the fourth line and we used the
Palatini identity (see appendix C) in the second line.
So, if we set ”S/”Ê = 0 we will get DeI = 0, which states that the connection Ê is torsion
free, as we already knew.
If we now vary the action with respect to the tetrads we will have
⁄
”S = ‘IJKL (”eJ ) · eI · F KL ,
Since ‘‡µ‹fl d4 x = dx‡ · dxµ · dx‹ · dxfl , and contracting ‘IJKL eI‡ = e e–J e—K e“L ‘‡–—“ , this leads us
to
e–J e—K e“L F KL
µ‹ ‘
‡µ‹fl
‘‡–—“ = 0.
Since J is a free index, one can act with eJ◊ to get e—K e“L F KL
µ‹ ‘
‡µ‹fl
‘‡–—“ = 0. However, e—K e“L F KL
µ‹ =
R µ‹ and
—“
1 2
‘‡µ‹fl ‘‡–—“ = ≠2 ”–[µ ”— ”“fl + ”–[fl ”— ”“‹ + ”–[‹ ”— ”“fl ,
‹] µ] µ]
which gives us 1 2
[µ [fl [‹
µ‹ ”– ”— ”“ + ”– ”— ”“ + ”– ”— ”“ = 0.
‹] µ] µ]
R—“ fl ‹ fl
Therefore,
1
R–fl ≠ R”–fl = 0,
2
fl :=
where R– Rµ– and R := Rµ . Acting with gflµ we then obtain Einstein’s field equations in
µfl µ
vacuum, as expected:
1
Rµ‹ ≠ Rgµ‹ = 0.
2
CHAPTER 5. TETRADS FORMALISM AND PALATINI ACTION 73
where we used ”F IJ = D(”Ê IJ ) in the second line and we integrated by parts and neglected the
boundary term in the next line.
From equation (5.24) it can be seen that forcing this variation to vanish leads us to the
torsion free condition:
T I = DeI = 0.
CHAPTER 5. TETRADS FORMALISM AND PALATINI ACTION 74
This can also be taken as the definition of Ê: our connection is the one that can be entirely
determined by the tetrads, given this condition. So, the only independent variable in our theory
is eI . Defining the connection in this way would obviously make it satisfy the equation of
motion — i.e. there is no variation with respect to Ê since it is not an independent variable.
This formulation is called first order, while the formulation where both e and Ê is independent
is called first order.
Finally, varying the action with respect to the tetrad gives us
CA B D
1 ⁄ 1
”S = (”eI ) · eJ · ı+ F IJ .
16fiG “
The second term vanishes since, from the first Bianchi identity (see appendix C) we have
that eJ · F IJ = D2 eJ = 0, since the connection is torsion free. So, the “ term vanishes on-shell,
i.e., when the torsion is zero.
Hence, we are left only with the first term:
1 2
eJ · ıF IJ = 0, (5.47)
which is just Einstein’s field equations in forms notation, which can be shown in a similar way
as we did in the Palatini section, using coordinate notation.
CHAPTER 6. ASHTEKAR FORMULATION OF GR 76
However, in the triad formalism, there is an additional condition to the usual spacetime
split done in the ADM formalism, which is the split in the internal directions of the tetrad in
Minkowski time and space components.
There are two ways of doing this gauge fixing. The first one is to require that ea0 = nI eaI = na
be the unit normal to the foliation, this is known as the time gauge. Here we assumed nI = ”0I
to be a timelike internal vector field. Internal Lorentz transformations that preserve nI are
reduced to spatial rotations around the fixed direction nI .
Another way of doing it, which will show to be more practical in the calculations we are
going to develop, is to open the tetrads in its spatial and time components and set the time
gauge directly from it.
First, let us consider the 1-forms eI . For the Minkowski time component:
e0a = 0, (6.3)
e0 = e0µ dxµ
= e00 dx0 + e0a dxa
= e00 dx0
= N dt, (6.4)
where we defined the lapse function e00 := N . This fixes the Minkowski time in the internal
space: the direction orthogonal to the spatial slice.
CHAPTER 6. ASHTEKAR FORMULATION OF GR 77
ei = eiµ dxµ
= ei0 dx0 + eia dxa
= N i dx0 + eia dxa , (6.5)
where Aib is an SU (2) connection, the canonical conjugate variable to the triad, which will
appear shortly in our development.
= eIµ eJ‹ F KL
fl‡ (dx · dx · dx · dx )
µ ‹ fl ‡
1 2
0 1 2 3
= ≠˜‘µ‹fl‡ eIµ eJ‹ F KL
fl‡ dx · dx · dx · dx
3
= ≠˜‘µ‹fl‡ eIµ eJ‹ F KL
fl‡ dt · d x, (6.7)
where, in the third line we used the fact the the term dxµ · dx‹ · dxfl · dx‡ is completely
anti-symmetric in µ‹fl‡, then, it can be written in terms of the Levi Civita symbol. The minus
sign comes from the fact that we will use ‘˜µ‹fl‡ to represent the Levi-Civita symbol and ‘˜µ‹fl‡ is
defined as being the Levi-Civita symbol multiplied by sign(g) = ≠1. We can then plug this into
the action in (5.45):
A B
1⁄ 1
S= ı+ eI · eJ · F IJ
4 “
I J
1⁄ 1 1
= I J
‘IJKL e · e · F KL
+ ÷IK ÷JL e · e · F
I J KL
4 2 “
⁄ ⁄ I J
1 1 1
=≠ dt d3 x ‘˜µ‹fl‡ fl‡ + ÷IK ÷JL eµ e‹ F fl‡ .
‘IJKL eIµ eJ‹ F KL I J KL
(6.8)
4 2 “
CHAPTER 6. ASHTEKAR FORMULATION OF GR 78
where we defined in the last line ‘˜abc := ‘˜0abc , and the factor of 2 in the first term came from:
hence, the second term in the second line of (6.9) is equal to the first. This was for the first
term in (6.8), but the calculation is analogous for the second term with the ÷ parameters.
Therefore, plugging (6.9) in (6.8) we are led to
I J
1⁄ ⁄
1 1
S=≠ dt d3 x ‘˜µ‹fl‡ fl‡ + ÷IK ÷JL eµ e‹ F fl‡
‘IJKL eIµ eJ‹ F KL I J KL
4 2 “
I J
1⁄ ⁄
1 1 1 1
=≠ dt d3 x ‘˜abc 2 ‘IJKL eI0 eJa F KL bc + 2 ‘IJKL ea eb F 0c + 2 ÷IK ÷JL e0 ea F bc + 2 ÷IK ÷JL ea eb F 0c
I J KL I J KL I J KL
4 2 2 “ “
I J
1⁄ ⁄
3 abc 1 1 1 1
=≠ dt d x ‘˜ ‘IJKL e0 ea F bc + ‘IJKL ea eb F 0c + ÷IK ÷JL e0 ea F bc + ÷IK ÷JL ea eb F 0c .
I J KL I J KL I J KL I J KL
2 2 2 “ “
(6.10)
We now do the same (3+1) split in the internal indices. For the first term inside the brackets
in (6.10) we get
1 1 0 i jk 1 i 0 jk 1 1
bc = ‘0ijk e0 ea F bc + ‘i0jk e0 ea F bc +
‘IJKL eI0 eJa F KL ‘ij0k ei0 eja F 0kbc + ‘ijk0 ei0 eja F k0bc
2 2 2 2 2
1 1 1
= ‘0ijk e00 eia F jkbc + ‘0ijk ei0 eja F 0kbc ≠ ‘0ijk ei0 eja F k0bc
2 2 2
1 1 1
= ‘0ijk e00 eia F jkbc + ‘0ijk ei0 eja F 0kbc + ‘0ijk ei0 eja F 0kbc
2 2 2
1
= ‘0ijk e00 eia F jkbc + ‘0ijk ei0 eja F 0kbc
2
1
= ‘ijk N eia F jkbc + ‘ijk N d eid eja F 0kbc , (6.11)
2
where the second term in the first line vanishes because of the time gauge (e0a = 0) and in
the last term we recovered the definitions of the lapse function N = e00 and the shift vector
ei0 = N i = N d eid , and we also defined ‘0ijk := ‘ijk .
CHAPTER 6. ASHTEKAR FORMULATION OF GR 79
where the first two terms in the first line vanishes due to the time gauge (e0a = 0) and in the
last term we also used ‘0ijk := ‘ijk .
For the last two terms in (6.10) we will open the indices in space and time using the metric
relations above: Y
_
_
_÷00 ÷00 = 1
_
_
]÷ ÷ = ≠”
00 ij
÷IK ÷JL = _ (6.13)
ij
_
_÷ ij ÷00 = ≠”ij
_
_
[
÷ik ÷jl = ”ik ”jl
Then, for the third term in (6.10) we get
1 1 Ë È
bc =
÷IK ÷JL eI0 eJa F KL ÷00 ÷00 e00 e0a F 00bc + ÷00 ÷jl e00 eja F 0lbc + ÷00 ÷ik ei0 e0a F k0bc + ÷ik ÷jl ei0 eja F klbc
“ “
1 Ë È
= e00 e0a F 00bc ≠ ”jl e00 eja F 0lbc ≠ ”ik ei0 e0a F k0bc + ”ik ”jl ei0 eja F klbc
“
1 Ë È
= ≠”jl e00 eja F 0lbc ≠ ”ik ei0 e0a F k0bc + ”ik ”jl ei0 eja F klbc
“
1 Ë È
= ”ik ”jl N d eid eja F klbc ≠ ”jl N eja F 0lbc , (6.14)
“
where the first term in the second line vanishes since the curvature 2-form is anti-symmetric
(F µµ = 0), in the third line the second term also vanishes due to the time gauge, and in the last
line we plugged in the definitions of N and N i = ei0 = N d eid .
Finally, for the fourth term in parenthesis in equation (6.10) we get
1 1Ë È
0c =
÷IK ÷JL eIa eJb F KL ÷00 ÷00 e0a e0b F 000c + ÷00 ÷jl e0a ejb F 0l0c + ÷ik ÷00 eia e0b F k00c + ÷ik ÷jl eia ejb F kl0c
“ “
1 Ë 0 0 00 È
= ea eb F 0c ≠ ”jl e0a ejb F 0l0c ≠ ”ik eia e0b F k00c + ”ik ”jl eia ejb F kl0c
“
1Ë È
= ≠”jl e0a ejb F 0l0c ≠ ”ik eia e0b F k00c + ”ik ”jl eia ejb F kl0c
“
1
= ”ik ”jl eia ejb F kl0c , (6.15)
“
where the first two terms in the third line vanishes due to the time gauge.
Now, plugging (6.11), (6.12), (6.14) and (6.15) in the action (6.10) leads us to
I J
1⁄ ⁄
1 1 1 1
S=≠ dt d3 x ‘˜abc bc + ‘IJKL ea eb F 0c + ÷IK ÷JL e0 ea F bc + ÷IK ÷JL ea eb F 0c
‘IJKL eI0 eJa F KL I J KL I J KL I J KL
2 2 2 “ “
1 ⁄ ⁄
=≠ dt d3 x ‘˜abc {} , (6.16)
2“
CHAPTER 6. ASHTEKAR FORMULATION OF GR 80
One can multiply it by eal and use eia eal = ”li to get to
Contracting now with ‘˜bcd and using ‘˜abc ‘˜bcd = 2”ad we are left with
‘˜bcd ‘ljk ejb ekc = 2Ẽld ,
and, renaming some dummy indices, we get
1
Ẽia = ‘˜abc ‘ijk ejb ekc . (6.24)
2
One can also show that
‘ijk ‘˜abc Ẽjb Ẽkc
eia = . (6.25)
2 det(e)
For that, let us compute eia eal :
‘ijk ‘˜abc Ẽjb Ẽkc a
eia eal = e
2 det(e) l
det(e) ijk
= ‘ ‘˜abc ebj eck eal
2
det(e) ijk
= ‘ ‘ljk det(e≠1 )
2
det(e) i
= 2”l det(e)≠1
2
= ”li ,
where in the second line we just used (6.23). In the third line we used the equation for the 3-
dimensional determinant (see Appendix B) and in the fourth line we used det(A≠1 ) = (det A)≠1 .
Some other useful identities that follow from those definitions are
‘˜abc eja = ebp ecq ‘jpq det e, (6.26)
and
‘ijk Ẽ b Ẽ c
‘˜abc eia = Ô j k , (6.27)
det Ẽ
which follows from (6.25). Those identities will be useful along further development.
1
Opening the anti-symmetrizers and using Ẽai = ‘˜abc ‘ijk eia ejb we get
2
1 Ë È
L1 = ‘ijm Ẽmc
”ik ”jl ‘kl p (ˆ0 cp ≠ ˆc 0p ) ≠ K0k Kcl + Kck K0l + 0
k l
≠ k
0
l
+
2 Ë È
c c
and using ”ik ”jl ‘ijm ‘kl p = 2”pm we are left with
1 c m 1 k l 2
L1 = Ẽpc ˆ0 p
+ Ẽm
≠ Ẽpc ˆc ‘ kl 0 c ≠ ck 0l ≠ K0k Kcl + Kck K0l +
p
0
c
Ë 2 È
+ “ Ẽk ˆ0 Kc ≠ ˆc K0k + ‘k pq (K0p cq ≠ Kcp 0q ) .
c k
Now remember from equation (6.19) that the L1 term is being integrated in space and time.
Hence, integrating by parts we get, neglecting boundary terms:
⁄ ⁄
≠ Ẽpc ˆc p0 = p c
0 ˆc Ẽp
⁄ ⁄
≠ “ Ẽkc ˆc K0k = “K0k ˆc Ẽkc .
Then, we have
1 2
L1 = Ẽpc ˆ0 p
c + p c
0 ˆc Ẽp + Ẽm
c m
‘ kl 0
k
c
l
≠ K0k Kcl +
+ “ Ẽkc ˆ0 Kck + “K0k ˆc Ẽkc + “ Ẽkc ‘k pq (K0p c
q
≠ Kcp 0 ),
q
where in the parentheses of the first line we used the fact that ‘m kl is anti-symmetric in [k, l]
CHAPTER 6. ASHTEKAR FORMULATION OF GR 84
1
and that Akl = (Akl ≠ Alk ). Grouping now some terms and relabeling some indices we get
2
1 2 1 2 1 2 1 2
L1 = Ẽkc ˆ0 k
c + “ Ẽkc ˆ0 Kck + k c
0 ˆc Ẽk + “K0k ˆc Ẽkc + Ẽkc ‘kij i
0
j
c + “Kcj ≠ Ẽkc ‘kij K0i Kcj ≠ “ j
c =
1 2 1 2 1 2 1 2
= Ẽkc ˆ0 k
c + “Kck + ˆc Ẽkc k
0 + “K0k + Ẽkc ‘kij i
0
j
c + “Kcj ≠ Ẽkc ‘kij K0i Kcj ≠ “ j
c =
I A BJ
1 2 Ó 1 2Ô 1 j
= Ẽkc ˆ0 k
c + “Kck + i
0 ˆc Ẽic + Ẽkc ‘kij j
c + “Kcj + “K0i ˆc Ẽic ≠ Ẽkc ‘kij K ≠ j
=
“ c c
1 2 Ó 1 Ó 1 2Ô
1 2Ô
= Ẽkc ˆ0 k
c + “Kck + i
0 ˆc Ẽic + Ẽkc ‘kij≠ K0i ≠“ 2 ˆc Ẽic + Ẽkc ‘kij “Kcj ≠ “ 2 jc =
j
c + “Kcj
“
1 2 Ó 1 2Ô 1 Ó Ô
= Ẽkc ˆ0 kc + “Kck + i0 ˆc Ẽic + Ẽkc ‘kij jc + “Kcj ≠ K0i ≠“ 2 ˆc Ẽic + Ẽkc ‘kij “Kcj ≠ Ẽkc ‘kij “ 2 jc =
“
1 2 Ó 1 2Ô 1 Ó 1 2Ô
= Ẽkc ˆ0 kc + “Kck + i0 ˆc Ẽic + Ẽkc ‘kij jc + “Kcj ≠ K0i ˆc Ẽic + Ẽkc ‘kij jc + “Kcj +
“
1 Ó Ô
≠ K0i ≠“ 2 ˆc Ẽic ≠ Ẽkc ‘kij jc ≠ Ẽkc ‘kij “ 2 jc ≠ ˆc Ẽic =
“
A B
1 2 1 i Ó 1 2Ô
= Ẽk ˆ0 c + “Kc + 0 ≠ K0 ˆc Ẽic + Ẽkc ‘kij jc + “Kcj +
c k k i
“
I A B A BJ
1 1
+ K0 ˆc Ẽi “ +
i c
+ Ẽk ‘ ij c “ +
c k j
=
“ “
A B A B
1 2 1 Ó 1 2Ô 1 Ó Ô
= Ẽkc ˆ0 kc + “Kck + i0 ≠ K0i ˆc Ẽic + Ẽkc ‘kij jc + “Kcj + K0i “ + ˆc Ẽic + Ẽkc ‘kij jc ,
“ “
where, from the fifth to the sixth line we added and subtracted the term ˆc Ẽic + Ẽkc ‘kij j
c inside
the last parentheses of the equation.
Now, using
Akc = kc + “Kck , (6.35)
and introducing the quantities A B
1
– :=
i
+ “ K0i , (6.36)
“
1
⁄i := i
0 ≠ K0i , (6.37)
“
indeed vanishes.
Hence, the first term is just
L1 = Ẽkc ˆ0 Akc + ⁄i Gi , (6.40)
where Gi is called the Gauss Constraint, which generates SU (2) gauge transformations as we
will discuss later.
Note that ”ik ”jl ‘kl p = ‘ijp , and, relabelling some dummy indices we can write
1 2
L2 = N d ‘˜abc eid eja ‘ijk ˆ[b k
c] + N d ‘˜abc eid eja ”ik ”jl k
[b
l
c] ≠ K[bk Kc]l +
1 2
+ N d ‘˜abc eid eja “‘ijk ˆ[b Kc]k + “‘ijk ‘kpq K[bp q
c] ,
From the definition of the Ashtekar-Barbero connection one can easily write the first term
in parentheses as ˆ[b c]k + “ˆ[b Kc]k = ˆ[b Ac]k . Also, in the last line we can write:
‘ijk ‘kpq = ÷ mk ‘ijk ‘pqm = ÷ mk (”ip ”jq ”km ≠ ”iq ”jp ”mk ) = ”ip ”jq ≠ ”iq ”jp .
and, since the entire expression is being multiplied by ‘˜abc , which is already anti-symmetric in
[bc], we can drop out the anti-symmetrizers:
Ó 1 2 Ô
L2 = N d ‘˜abc eid eja ‘ijk ˆb Akc + ”ik ”jl b
k
c
l
≠ Kbk Kcl + “(Kbi c
j
≠ Kbj c)
i
=
Ó 1 2 Ô
= N d ‘˜abc eid eja ‘ijk ˆb Akc + i j
b c ≠ Kbj Kci + “(Kbi c
j
≠ Kbj c)
i
. (6.41)
However, the curvature 2-form of the Ashtekar-Barbero connection Akc is defined, in index
notation, as
1 k 1
Fbc := ˆ[b Akc] + ‘klm Alb Am
c . (6.42)
2 2
CHAPTER 6. ASHTEKAR FORMULATION OF GR 86
5
2 6
1 2 l m
= ‘˜abc
‘ijk ˆ[b Ac] + (”il ”jm ≠ ”im ”jl )( b c + b “Kc + “Kb c + “ Kb Kc )
k l m l m l m
Ë
2 Ó 1 2 ÔÈ
2 l m
= ‘˜ ‘ijk ˆ[b Ac] + ”il ”jm lb m
abc k
c + “ l
K
b c
m
+ K l m
b c + “ K K
b c ) ,
1
where in the last line we used again the notation for anti-symmetric objects Aµ‹ = (Aµ‹ ≠ A‹µ ).
2
Therefore, subtracting ‘˜abc ”il ”jm (1 + “ 2 )Kbl Kcm from both sides we can write
; <
1 Ë Ó1 2 1 2ÔÈ
‘˜
abc
‘ijk Fbck ≠ ”il ”jm (1 + “ 2 )Kbl Kcm = ‘˜abc ‘ijk ˆ[b Akc] + ”il ”jm l m
≠ Kbl Kcm + “ l m
b Kc + Kbl m
.
2 b c c
Ó
2 p bd Ô
= N d |e| ebp Fbd
p
≠ ”il eid ebp ecq ‘pqm (1 + “ 2 )Kbl Kcm , (6.43)
where we used eid eci = ”dc in the fifth line and Aµ‹ = (Aµ‹ ≠ A‹µ )/2 in the last one.
We can also express this in terms of the densitized triad Ẽia = |e|eai :
Ó Ô
L2 = N a |e| ebp Fba
p
≠ ”il eia ebp ecq ‘pqm (1 + “ 2 )Kbl Kcm
Ó Ô
= ≠N a |e| ebp Fab
p
+ ”il eia ebp ecq ‘pqm (1 + “ 2 )Kbl Kcm
Ó Ô
= ≠N a Ẽpb Fab
p
+ ”il eia ebp (1 + “ 2 )Kbl ‘pqm Kcm Ẽqc . (6.44)
1
Therefore we can write that ‘pqm Kcm Ẽqc = ≠ Gp . Hence, in (6.44) we get
“
I A B J
1
L2 = ≠N a p
Ẽpb Fab ≠ ”il eia ebp + “ Kbl Gp .
“
The part with the Gauss constraint Gp is redundant, its content is already covered by L1 .
We are then left with
L2 = ≠N a Ẽpb Fab
p
, (6.45)
or, defining the momentum constraint as
Va := Ẽpb Fab
p
, (6.46)
we can write it as
L2 = ≠N a Va . (6.47)
This is called the vector constraint, which is related to spatial diffeomorphisms, as we will
show and discuss later.
1 1 2
≠ N eia ‘˜abc “‘ijk ≠‘jk p ˆ[b c]p + K[bj Kc]k ≠ j
[b
k
.
2 c]
N ‘imn Ẽmb
Ẽnc 1 2
L3 = ≠ Ô ”ij ˆb Kcj + ‘jpq Kbp cq +
det Ẽ
1 N ‘imn Ẽm
b
Ẽnc Ë 1 2È
+ Ô “ ‘ijk ‘jk p ˆb cp + ‘ijk bj ck ≠ Kbj Kck ,
2 det Ẽ
where we dropped the anti-symmetrizers since the expression is already anti-symmetric in [b, c].
Also, since ‘ijk ‘jk
p = 2”ip , and renaming some dummy indices, we get
N ‘imn Ẽmb
Ẽnc 1 2
L3 = ≠ Ô ”ij ˆb Kcj + ‘ijk Kbj ck +
det Ẽ
5 26
N ‘ Ẽ Ẽ c
imn b
1 1
+ Ô m n “ ”ij ˆb cj + ‘ijk bj ck ≠ Kbj Kck ,
det Ẽ 2
and, regrouping some terms:
C A BD
N ‘imn Ẽ b Ẽ c 1
L3 = Ô m n ”ij “ˆb c
j
≠ Kcj +
det Ẽ “
5 26
N ‘ Ẽm Ẽnc
imn b
1 1
≠ Ô ‘ijk Kbj k
+ “ Kbj Kck ≠ j k
.
det Ẽ
c
2 b c
CHAPTER 6. ASHTEKAR FORMULATION OF GR 88
1 j
Now, plugging Kcj = (A ≠ c)
j
into the equation we are led to
“ c
C A BD
N ‘imn Ẽ b Ẽ c 1
L3 = Ô m n ”ij “ˆb cj ≠ 2 (Ajc ≠ jc ) +
det Ẽ “
C I JD
N ‘imn Ẽm
b
Ẽnc 1 j 1 1 j
≠ Ô ‘ijk (Ab ≠ b ) ck + “
j
(A ≠ b )(Ac
j k
≠ c)
k
≠ j k
,
det Ẽ “ 2 “2 b b c
or, finally
1 k j 1 j k 1 2 1 1
≠ AA + “ j k
= (1 + “ 2 ) k j
≠ Ajb Akc .
2 c b 2 b c 2 b c
2 c b
2
Hence, we can write the Lagrangian as
N ‘imn Ẽmb
Ẽnc Ë È
L3 = ≠ Ô ˆb Aic ≠ (1 + “ 2 )(ˆb ic ) +
“ det Ẽ
5 6
N ‘imn Ẽm
b
Ẽnc 1 j k 1 2
≠ Ô ‘ijk Ab Ac ≠ (1 + “ ) c b , k j
“ det Ẽ 2 2
then,
53 4 3 46
N ‘imn Ẽ b Ẽ c 1 1
L3 = ≠ Ô m n ˆb Aic + ‘ijk Ajb Akc ≠ (1 + “ 2 ) ˆb i
+ ‘ijk k j
“ det Ẽ 2 c
2 c b
However, the expressions in parentheses are the equations in components for the curvature
2-forms of the connections Aij and ij , respectively:
1 i 1
F bc := ˆb Aic + ‘ijk Ajb Akc , (6.48)
2 2
1 i 1
R := ˆb ic + ‘ijk ck jb .
2 bc 2
Hence, the expression can be written as
N ‘imn Ẽm
b
Ẽnc Ë i È
L3 = ≠ Ô F bc ≠ (1 + “ 2 )Ribc . (6.49)
2“ det Ẽ
If one defines the scalar constraint C as
‘imn Ẽm
b
Ẽnc Ë i È
C := ≠ Ô F bc ≠ (1 + “ 2 )Ribc , (6.50)
2“ det Ẽ
then
L3 = N C, (6.51)
where C is the scalar constraint — or Hamiltonian constraint — and N is just a Lagrangian
multiplier.
‘imn Ẽm
b
Ẽnc i
• Hamiltonian constraint: C := ≠ Ô [F bc ≠ (1 + “ 2 )Ribc ]
2“ det Ẽ
We can then get the Hamiltonian
⁄ 1 2
H[Aia ; Ẽia ] = d3 x ⁄i Gi + N a Va + N C , (6.53)
with the first class constraints, which generates the expected gauge freedom: the triad rotations
and spacetime diffeomorphisms, which is discussed in the next section.
If one writes the Hamilton equations that result from this Hamiltonian, one will indeed
reproduce Einstein field equations, as expected.
CHAPTER 6. ASHTEKAR FORMULATION OF GR 90
6.8.1 Electromagnetism
Let us first develop the geometrical interpretation of the constraints for a more familiar
theory, classical electromagnetism. The Lagrangian for electromagnetism is
1⁄ 3
L= d xFµ‹ F µ‹ , (6.54)
4
where Fµ‹ = ˆµ A‹ ≠ ˆ‹ Aµ is the electromagnetic tensor and Aµ is the vector potential. It is easy
to see that the zeroth component of the vector potential will not appear with a time derivative,
since Fµ‹ vanishes for µ = ‹. Hence, only spatial derivatives of A0 appear in the Lagrangian,
which means that it does not play a dynamical role. It is, actually, a Lagrange multiplier, as we
will show later.
Hence, being Ab our dynamical variable, its conjugate momentum is given by
”L
fib = , (6.55)
” Ȧb
which, when one takes the functional derivative of (6.54), gives Ẽ b — the electric field. The
canonical variables are then Ab (x) and Ẽ b (x), where a stands for spatial coordinates, as usual:
a = 1, 2 or 3 (remember that A0 is not a dynamical variable).
When taking the functional derivative we get a density, as expected, since the Lagrangian is
a volume integral, hence, its integrand must be a scalar density. When one takes a functional
derivative, the integral disappears and the result must be a density. The Poisson bracket of
Ẽ b (x) and Ab (x) (which is not a density) is again a density, as expected
Ó Ô
Ab (x), Ẽ b (x) = ”ba ” 3 (x ≠ y).
”L
which should vanish, since fi – = = F –0 is zero for – = 0. Hence, its time evolution should
” Ȧ–
also vanish. Therefore we get ˆa E a = 0, which is Gauss’s law without the presence of charges.
CHAPTER 6. ASHTEKAR FORMULATION OF GR 91
This is a constraint, since it implies that we cannot have any E a for a electric field, but only
configurations for which the divergence is zero. We can now see how constraints are generators
of symmetries. Here, it is good to introduce the idea of a smeared constraint
⁄
G(⁄) := d3 x⁄ˆa E a ,
where the parameter ⁄ is an arbitrary smooth and differentiable function of x. Requiring that
the smeared constraint G(⁄) vanishes for all ⁄ is equivalent to requiring that the constraint
itself vanishes at all points of the manifold. This is important to do since we are dealing with
densities and distributions and not with functions itself. Since distributions behave better under
an integral, it will be in most cases easier to deal with the smeared constraint than with the
constraint itself.
Taking the Poisson bracket of the smeared constraint and the Hamiltonian, one finds out
that it vanishes Ó Ô
G(⁄), H = 0,
as expected, which means that the Hamiltonian does not change under the transformation
generated by the constraint; hence, the theory and the physics is unmodified, and this is indeed
a symmetry, as expected.
Taking the Poisson bracket of the smeared constraint with the conjugate variables we get
Ó Ô
G(⁄), E a = 0,
which means that the electric field is unchanged under the transformation generated by the
constraint. And, finally, one can compute that
Ó Ô
G(⁄), Aa = ˆa ⁄,
which means that the vector potential may change by the gradient of a function ⁄(x). We
already knew that the vector potential is defined up to the gradient of a function, which is a
gauge freedom of the theory. Therefore, we see that the constraints give rise to symmetries,
which are revealed in the gauge freedom of the system. In this context, Gauss law is called the
generator of gauge transformations, since it comes up as a constraint and it gives rise to the
gauge freedom that we have in choosing the potential vector Aµ . Moreover, A0 is the Lagrange
multiplier of the constraint ˆa E a , as we mentioned previously.
By computing the time evolution of the canonical variables Aa and E a one can recover the
rest of Maxwell equations
Ó Ô
Ȧa = Aa , H = Ea + ˆa A0
Ó Ô
Ė a = E a , H = ‘abc ˆb Bc
where it is easy to note that the evolution depends on the choice of the Lagrange multiplier A0 ,
which makes the vector potential defined only up to the gradient of a function ⁄(x), which is
the gauge symmetry of Maxwell’s theory.
6.8.2 Gravity
We may now apply the same reasoning to the Hamiltonian in (6.53) in terms of the conjugate
canonical pair Ẽia and Aia .
CHAPTER 6. ASHTEKAR FORMULATION OF GR 92
We can introduce a combination of the vector and Gauss constraints, which we call the
diffeomorphism constraint:
Ca = Va ≠ Aia Gi . (6.58)
Writing the smeared diffeomorphism constraint V as
⁄
˛) =
C(N d3 xN a Ca
and computing the Poisson bracket of this smeared constraint with a function of the canonical
coordinates f (Ẽ, A), one gets Ó Ô
C(N ˛ ), f (Ẽ, A) ≥ L ˛ f, (6.59)
N
which states that the orbit generated by the constraint in phase space is just the Lie derivative
along N a , up to a constant factor that depends on the choice of renormalization. Therefore, this
is called the diffeomorphism constraint, since it generates infinitesimal spatial diffeomorphism
transformations.
One can also smear the Gauss constraint Gi and get
⁄
G(⁄) = d3 x⁄i Gi ,
and also Ó Ô
G(⁄), E ≥ [⁄, E] (6.61)
which are the SU (2) gauge transformations.
CHAPTER 7. CONCLUSION 94
GR, so the decomposition of the metric gab in its spatial part hab will be necessary for one to
define a time parameter in order to talk about the evolution of the system, which is necessary if
one wants to build a Hamiltonian representation of the system.
This allowed us to develop a Hamiltonian formalism for GR — the ADM formalism —
where we could write a Hamiltonian as a sum of constraints multiplied by Lagrange multipliers.
This provided a Hamiltonian representation of the dynamics of the spacetime geometry. The
canonical variables here are the induced metric hab and its conjugate momenta fiab . With this
Hamiltonian it becomes possible to study the spacetime dynamics in a canonical way, using
every tool of the Hamiltonian formalism.
We then developed the tetrads formalism. Here, we replaced the metric gµ‹ as the main
dynamical variable for the tetrads eIµ and eJ‹ . Since Riemannian manifolds are locally flat,
one can always choose an orthonormal basis of vectors {e0 , e1 , e2 , e3 } for each point P on the
manifold M , and that is the starting point of this new formalism. The tetrads contain the
same geometrical information from the manifold as the metric gµ‹ , since they are related via
gµ‹ = eI‹ eJ‹ ÷IJ . Hence, by taking the determinant, we get g = ≠e2 and the tetrad is just the
square root of the metric and it has, therefore, all the information about the geometry of the
manifold. We can thus consider the tetrad as the fundamental description and the metric as a
derived concept.
We then built the two Cartan’s structural equations. The first one was deI + Ê I K eK = 0,
which allowed us to find the spin connection Ê if we had the tetrads e. The second one was
F K J = dÊ K J + Ê K L Ê L J , which allowed us to find the curvature 2-form FJK . So, given a metric,
one chooses a basis of tetrads eI , founds the spin connection and then the curvature 2-form with
both of Cartan’s structural equations.
Since we had the relation between F IJ and the Riemann curvature tensor Rµ‹‡ fl
, we were then
able to translate the Palatini action in terms of the tetrads and the curvature 2-form F. This
action led us to the same conclusions that we arrived before, only now in a different language —
the Einstein field equations and the metric compatibility equation in the notation of differential
forms. Finally, we did a slight modification in this action, by introducing a parameter “, the
Barbero-Immirzi parameter. This did not change the equations of motion, but we were led to a
more general action — the Holst action.
Finally, we developed the Hamiltonian formalism using the Holst action, which led us to the
formulation of GR in terms of some new variables — the Ashtekar-Barbero connection Aia and
the densitized triad Ẽjb . In a way, we mixed what was developed in the last two chapters, since
the main idea here consisted in developing a Hamiltonian formalism after we did the (3+1)-split
of the geometry. However we have used the Holst action instead of the Einstein-Hilbert one,
hence, our variables were in terms of the triads — the spatial part of the tetrads — and not
the 3-metric hab . Although the development was extensive, the steps followed in this part led
us to the construction of a constrained Hamiltonian for GR, which allowed us to extract some
symmetries of the system, the constraints giving rise to gauge transformations.
This puts us one step behind the quantization of gravity. In the Hamiltonian formalism,
one can promote the Poisson brackets to commutators and the conjugate variables to operators,
and, at least in theory, quantize gravity. The path followed for the canonical quantization of
gravity in this approach with the Ashtekar formulation of gravity is known as loop quantum
gravity, which can be studied in a future work.
APPENDIX C. BIANCHI AND PALATINI IDENTITIES 100
we have that
IJ
”Fµ‹ = ˆ[µ ”Ê‹]
IJ
+ Ê[µK
I KJ
”Ê‹] J
≠ Ê[µK IK
”Ê‹]
= D(”Ê IJ )µ‹
”F IJ = D(”Ê IJ )
APPENDIX D. TENSOR VS TENSOR DENSITIES 102
This quantity is the same in any coordinate system, which is why it is not a tensor. However
we can relate the tensor density à to the proper tensor A via
1 2Ê
à = |g|1/2 A
where Ê is the weight of the tensor density and g stands for the determinant of the metric.
One can show that
‘˜µ‹‡fl = |g|1/2 ‘µ‹‡fl
and thus the Levi Civita symbol is a tensor of weight +1.
For the indices (IJKL), we have not used the tilde since those indices are of the internal
flat space, and, hence, ‘IJKL is actually a tensor and not a density.
It is also possible to define the 3-dimensional symbol as ‘˜abc := ‘˜0abc , which could also be
done for 2D or 1D. The same could be done for the (IJKL) indices in the internal space.