Jump to content

Linearized gravity: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Phys (talk | contribs)
No edit summary
rm inconsistent reference to natural units; some language ce
 
(100 intermediate revisions by 63 users not shown)
Line 1: Line 1:
{{Short description|Linear perturbations to solutions of nonlinear Einstein field equations}}
'''Linearized gravity''' is an approach to [[general relativity]] in which the [[metric tensor]] is treated as a sum of a solution of [[Einstein's equations]] (usually the flat space) and a perturbation <math>h</math>.
{{General relativity sidebar |equations}}


In the theory of [[general relativity]], '''linearized gravity''' is the application of [[perturbation theory]] to the [[Metric tensor (general relativity)|metric tensor]] that describes the geometry of [[spacetime]]. As a consequence, linearized gravity is an effective method for modeling the effects of gravity when the [[gravitational field]] is weak. The usage of linearized gravity is integral to the study of [[gravitational waves]] and weak-field [[gravitational lensing]].
:<math>g=\eta+h</math>


== Weak-field approximation ==
where &eta; is the nondynamical background metric that we are perturbing about.
The [[Einstein field equation]] (EFE) describing the geometry of [[spacetime]] is given as
: <math>R_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu} = \kappa T_{\mu\nu}</math>
where <math>R_{\mu\nu}</math> is the [[Ricci tensor]], <math>R</math> is the [[Ricci scalar]], <math>T_{\mu\nu}</math> is the [[energy–momentum tensor]], <math>\kappa = 8 \pi G / c^4</math> is the [[Einstein gravitational constant]], and <math>g_{\mu\nu}</math> is the [[spacetime]] [[metric tensor]] that represents the solutions of the equation.


Although succinct when written out using [[Einstein notation]], hidden within the Ricci tensor and Ricci scalar are exceptionally nonlinear dependencies on the metric tensor that render the prospect of finding [[Exact solutions in general relativity|exact solutions]] impractical in most systems. However, when describing systems for which the [[curvature]] of spacetime is small (meaning that terms in the EFE that are [[Quadratic function|quadratic]] in <math>g_{\mu\nu}</math> do not significantly contribute to the equations of motion), one can model the solution of the field equations as being the [[Minkowski metric]]<ref group="note">This assumes that the background spacetime is flat. Perturbation theory applied in a spacetime that is already curved can work just as well when this term is replaced with the metric representing the curved background.</ref> <math>\eta_{\mu\nu}</math> plus a small perturbation term <math>h_{\mu\nu}</math>. In other words:
The perturbation is treated using the methods of [[perturbation theory]]. The adjective "linearized" means that initially we neglect all higher-order terms in the perturbation, leading to linear different equations.
: <math>g_{\mu\nu} = \eta_{\mu\nu} + h_{\mu\nu},\qquad |h_{\mu\nu}| \ll 1.</math>


In this regime, substituting the general metric <math>g_{\mu\nu}</math> for this perturbative approximation results in a simplified expression for the Ricci tensor:
This leads to signifant simplications as the full [[Einstein field equation]]s are horrendously [[nonlinear]] and coming up with solutions of it is notoriously hard and can't be done in closed form in all but the simplest cases.
: <math>R_{\mu\nu} = \frac{1}{2}(\partial_\sigma\partial_\mu h^\sigma_\nu + \partial_\sigma\partial_\nu h^\sigma_\mu - \partial_\mu\partial_\nu h - \square h_{\mu\nu}),</math>
where <math>h = \eta^{\mu\nu}h_{\mu\nu}</math> is the [[Trace (linear algebra)|trace]] of the perturbation, <math>\partial_\mu</math> denotes the partial derivative with respect to the <math>x^\mu</math> coordinate of spacetime, and <math>\square = \eta^{\mu\nu} \partial_\mu \partial_\nu</math> is the [[d'Alembert operator]].


Together with the Ricci scalar,
This method is useful to derive the Newtonian limit, including the first corrections, much like for a derivation of the existence of [[gravitational wave]]s that lead, after [[quantization]], to [[graviton]]s. This is why the conceptual approach of linearized gravity is the canonical one in [[particle physics]], [[string theory]], and more generally [[quantum field theory]] where classical (bosonic) fields (with small or zero masses) are expressed as [[coherent state]]s of particles.
: <math>R = \eta_{\mu\nu}R^{\mu\nu} = \partial_\mu\partial_\nu h^{\mu\nu} - \square h,</math>
the left side of the field equation reduces to
: <math>R_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu} = \frac{1}{2}(\partial_\sigma\partial_\mu h^\sigma_\nu + \partial_\sigma\partial_\nu h^\sigma_\mu - \partial_\mu\partial_\nu h - \square h_{\mu\nu} - \eta_{\mu\nu}\partial_\rho\partial_\lambda h^{\rho\lambda} + \eta_{\mu\nu}\square h).</math>
and thus the EFE is reduced to a linear second order [[partial differential equation]] in terms of <math>h_{\mu\nu}</math>.


=== Gauge invariance ===
This approximation is also known as the '''weak field approximation''' as it is only valid for tiny h's.
The process of decomposing the general spacetime <math>g_{\mu\nu}</math> into the Minkowski metric plus a perturbation term is not unique. This is due to that different choices for coordinates may give different forms for <math>h_{\mu\nu}</math>. In order to capture this phenomenon, the application of [[Gauge theory|gauge symmetry]] is introduced.


Gauge symmetries are a mathematical device for describing a system that does not change when the underlying coordinate system is "shifted" by an infinitesimal amount. So although the perturbation metric <math>h_{\mu\nu}</math> is not consistently defined between different coordinate systems, the overall system which it describes ''is''.
In this approximation, the gauge symmetry is associated with [[diffeomorphism]]s with small "displacements" (diffeomorphisms with huge displacements obviously violate the weak field approximation), which has the exact form (for infinitesimal transformations)


To capture this formally, the non-uniqueness of the perturbation <math>h_{\mu\nu}</math> is represented as being a consequence of the diverse collection of [[diffeomorphisms]] on spacetime that leave <math>h_{\mu\nu}</math> sufficiently small. Therefore, it is required that <math>h_{\mu\nu}</math> be defined in terms of a general set of diffeomorphisms, then select the subset of these that preserve the small scale that is required by the weak-field approximation. One may thus define <math>\phi</math> to denote an arbitrary diffeomorphism that maps the flat Minkowski spacetime to the more general spacetime represented by the metric <math>g_{\mu\nu}</math>. With this, the perturbation metric may be defined as the difference between the [[Pullback (differential geometry)|pullback]] of <math>g_{\mu\nu}</math> and the Minkowski metric:
:<math>\delta_{\vec{\xi}}h=\delta_{\vec{\xi}}g-\delta_{\vec{\xi}}\eta=\mathcal{L}_{\vec{\xi}}g=\mathcal{L}_{\vec{\xi}}\eta+\mathcal{L}_{\vec{\xi}}h=
\left[\xi_{\nu;\mu} + \xi_{\mu;\nu} + \xi^\alpha h_{\mu\nu;\alpha} + \xi^\alpha_{;\mu} h_{\alpha\nu} + \xi^\alpha_{;\nu} h_{\mu\alpha}\right]dx^\mu \otimes dx^\nu</math>
: <math>h_{\mu\nu} = (\phi^*g)_{\mu\nu} - \eta_{\mu\nu}.</math>
The diffeomorphisms <math>\phi</math> may thus be chosen such that <math>|h_{\mu\nu}| \ll 1</math>.


Given then a vector field <math>\xi^\mu</math> defined on the flat background spacetime, an additional family of diffeomorphisms <math>\psi_\epsilon</math> may be defined as those generated by <math>\xi^\mu</math> and parameterized by <math>\epsilon > 0</math>. These new diffeomorphisms will be used to represent the coordinate transformations for "infinitesimal shifts" as discussed above. Together with <math>\phi</math>, a family of perturbations is given by
where <math>\mathcal{L}</math> is the [[Lie derivative]] and we used the fact that &eta; doesn't transform (by definition). Note that we are raising and lowering the indices with respect to &eta; and not g and taking the [[covariant derivative]]s ([[Levi-Civita connection]]) with respect to &eta;. This is the standard practice in linearized gravity. The way of thinking in linearized gravity is this: the background metric &eta; IS the metric and h is a field propagating over the spacetime with this metric.
: <math>\begin{align}
h^{(\epsilon)}_{\mu\nu} &= [(\phi\circ\psi_\epsilon)^*g]_{\mu\nu} - \eta_{\mu\nu} \\
&= [\psi^*_\epsilon(\phi^*g)]_{\mu\nu} - \eta_{\mu\nu} \\
&= \psi^*_\epsilon(h + \eta)_{\mu\nu} - \eta_{\mu\nu} \\
&= (\psi^*_\epsilon h)_{\mu\nu} + \epsilon\left[\frac{(\psi^*_\epsilon\eta)_{\mu\nu} - \eta_{\mu\nu}}{\epsilon}\right].
\end{align}</math>
Therefore, in the limit <math>\epsilon\rightarrow 0</math>,
: <math>h^{(\epsilon)}_{\mu\nu} = h_{\mu\nu} + \epsilon\mathcal{L}_\xi\eta_{\mu\nu}</math>
where <math>\mathcal{L}_\xi</math> is the [[Lie derivative]] along the vector field <math>\xi_\mu</math>.


In the weak field limit, this gauge transformation simplifies to
The Lie derivative works out to yield the final ''gauge transformation'' of the perturbation metric <math>h_{\mu\nu}</math>:
: <math>h^{(\epsilon)}_{\mu\nu} = h_{\mu\nu} + \epsilon(\partial_\mu\xi_\nu + \partial_\nu\xi_\mu),</math>
which precisely define the set of perturbation metrics that describe the same physical system. In other words, it characterizes the gauge symmetry of the linearized field equations.


=== Choice of gauge ===
:<math>\delta_{\vec{\xi}}h_{\mu\nu}\approx \left(\mathcal{L}_{\vec{\xi}}\eta\right)_{\mu\nu}=\xi_{\nu;\mu} + \xi_{\mu;\nu}</math>
By exploiting gauge invariance, certain properties of the perturbation metric can be guaranteed by choosing a suitable vector field <math>\xi^\mu</math>.


==== Transverse gauge ====
{{phys-stub}}
To study how the perturbation <math>h_{\mu\nu}</math> distorts measurements of length, it is useful to define the following spatial tensor:
: <math>s_{ij} = h_{ij} - \frac{1}{3}\delta^{kl}h_{kl}\delta_{ij}</math>
(Note that the indices span only spatial components: <math>i,j\in\{1,2,3\}</math>). Thus, by using <math>s_{ij}</math>, the spatial components of the perturbation can be decomposed as
: <math>h_{ij} = s_{ij} - \Psi\delta_{ij}</math>
where <math>\Psi = \frac{1}{3}\delta^{kl}h_{kl}</math>.


The tensor <math>s_{ij}</math> is, by construction, [[trace (linear algebra)|trace]]less and is referred to as the ''strain'' since it represents the amount by which the perturbation [[Gravitational wave#Effects of passing|stretches and contracts measurements of space]]. In the context of studying [[gravitational waves|gravitational radiation]], the strain is particularly useful when utilized with the ''transverse gauge.'' This gauge is defined by choosing the spatial components of <math>\xi^\mu</math> to satisfy the relation
: <math>\nabla^2\xi^j + \frac{1}{3}\partial_j\partial_i\xi^i = -\partial_i s^{ij},</math>
then choosing the time component <math>\xi^0</math> to satisfy
: <math>\nabla^2\xi^0 = \partial_i h_{0i} + \partial_0\partial_i\xi^i.</math>

After performing the gauge transformation using the formula in the previous section, the strain becomes spatially transverse:
: <math>\partial_i s^{ij}_{(\epsilon)} = 0,</math>
with the additional property:
: <math>\partial_i h^{0i}_{(\epsilon)} = 0.</math>

==== Synchronous gauge ====
The ''synchronous gauge'' simplifies the perturbation metric by requiring that the metric not distort measurements of time. More precisely, the synchronous gauge is chosen such that the non-spatial components of <math>h^{(\epsilon)}_{\mu\nu}</math> are zero, namely
: <math>h^{(\epsilon)}_{0\nu} = 0.</math>
This can be achieved by requiring the time component of <math>\xi^\mu</math> to satisfy
: <math>\partial_0\xi^0 = -h_{00}</math>
and requiring the spatial components to satisfy
: <math>\partial_0\xi^i = \partial_i\xi^0 - h_{0i}.</math>

==== Harmonic gauge ====
The ''[[Harmonic coordinate condition|harmonic gauge]]'' (also referred to as the ''Lorenz gauge''<ref group="note">Not to be confused with Lorentz.</ref>) is selected whenever it is necessary to reduce the linearized field equations as much as possible. This can be done if the condition
: <math>\partial_\mu h^\mu_\nu = \frac{1}{2}\partial_\nu h</math>
is true. To achieve this, <math>\xi_\mu</math> is required to satisfy the relation
: <math>\square\xi_\mu = -\partial_\nu h^\nu_\mu + \frac{1}{2}\partial_\mu h.</math>

Consequently, by using the harmonic gauge, the [[Einstein tensor]] <math>G_{\mu\nu} = R_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu}</math> reduces to
: <math>G_{\mu\nu} = -\frac{1}{2}\square\left(h^{(\epsilon)}_{\mu\nu} - \frac{1}{2}h^{(\epsilon)}\eta_{\mu\nu}\right).</math>
Therefore, by writing it in terms of a "trace-reversed" metric, <math>\bar{h}^{(\epsilon)}_{\mu\nu} = h^{(\epsilon)}_{\mu\nu} - \frac{1}{2}h^{(\epsilon)}\eta_{\mu\nu}</math>, the linearized field equations reduce to
: <math>\square \bar{h}^{(\epsilon)}_{\mu\nu} = -2\kappa T_{\mu\nu}.</math>
This can be solved exactly, to produce the [[wave equation|wave solutions]] that define [[gravitational wave|gravitational radiation]].

== See also ==
{{cols}}
* [[Correspondence principle]]
* [[Gravitoelectromagnetism]]
* [[Lanczos tensor]]
* [[Parameterized post-Newtonian formalism]]
* [[Post-Newtonian expansion]]
* [[Quasinormal mode]]
{{colend}}

== Notes ==
{{reflist|group="note"|2}}

== Further reading ==
* {{cite book | author = Sean M. Carroll | title = Spacetime and Geometry, an Introduction to General Relativity | year = 2003 | publisher = Pearson | isbn = 978-0805387322}}

== External links ==
* {{wikiquote-inline}}

{{Theories of gravitation}}
{{Relativity}}

[[Category:Mathematical methods in general relativity]]
[[Category:General relativity]]
[[Category:General relativity]]

Latest revision as of 12:52, 26 August 2024

In the theory of general relativity, linearized gravity is the application of perturbation theory to the metric tensor that describes the geometry of spacetime. As a consequence, linearized gravity is an effective method for modeling the effects of gravity when the gravitational field is weak. The usage of linearized gravity is integral to the study of gravitational waves and weak-field gravitational lensing.

Weak-field approximation

[edit]

The Einstein field equation (EFE) describing the geometry of spacetime is given as

where is the Ricci tensor, is the Ricci scalar, is the energy–momentum tensor, is the Einstein gravitational constant, and is the spacetime metric tensor that represents the solutions of the equation.

Although succinct when written out using Einstein notation, hidden within the Ricci tensor and Ricci scalar are exceptionally nonlinear dependencies on the metric tensor that render the prospect of finding exact solutions impractical in most systems. However, when describing systems for which the curvature of spacetime is small (meaning that terms in the EFE that are quadratic in do not significantly contribute to the equations of motion), one can model the solution of the field equations as being the Minkowski metric[note 1] plus a small perturbation term . In other words:

In this regime, substituting the general metric for this perturbative approximation results in a simplified expression for the Ricci tensor:

where is the trace of the perturbation, denotes the partial derivative with respect to the coordinate of spacetime, and is the d'Alembert operator.

Together with the Ricci scalar,

the left side of the field equation reduces to

and thus the EFE is reduced to a linear second order partial differential equation in terms of .

Gauge invariance

[edit]

The process of decomposing the general spacetime into the Minkowski metric plus a perturbation term is not unique. This is due to that different choices for coordinates may give different forms for . In order to capture this phenomenon, the application of gauge symmetry is introduced.

Gauge symmetries are a mathematical device for describing a system that does not change when the underlying coordinate system is "shifted" by an infinitesimal amount. So although the perturbation metric is not consistently defined between different coordinate systems, the overall system which it describes is.

To capture this formally, the non-uniqueness of the perturbation is represented as being a consequence of the diverse collection of diffeomorphisms on spacetime that leave sufficiently small. Therefore, it is required that be defined in terms of a general set of diffeomorphisms, then select the subset of these that preserve the small scale that is required by the weak-field approximation. One may thus define to denote an arbitrary diffeomorphism that maps the flat Minkowski spacetime to the more general spacetime represented by the metric . With this, the perturbation metric may be defined as the difference between the pullback of and the Minkowski metric:

The diffeomorphisms may thus be chosen such that .

Given then a vector field defined on the flat background spacetime, an additional family of diffeomorphisms may be defined as those generated by and parameterized by . These new diffeomorphisms will be used to represent the coordinate transformations for "infinitesimal shifts" as discussed above. Together with , a family of perturbations is given by

Therefore, in the limit ,

where is the Lie derivative along the vector field .

The Lie derivative works out to yield the final gauge transformation of the perturbation metric :

which precisely define the set of perturbation metrics that describe the same physical system. In other words, it characterizes the gauge symmetry of the linearized field equations.

Choice of gauge

[edit]

By exploiting gauge invariance, certain properties of the perturbation metric can be guaranteed by choosing a suitable vector field .

Transverse gauge

[edit]

To study how the perturbation distorts measurements of length, it is useful to define the following spatial tensor:

(Note that the indices span only spatial components: ). Thus, by using , the spatial components of the perturbation can be decomposed as

where .

The tensor is, by construction, traceless and is referred to as the strain since it represents the amount by which the perturbation stretches and contracts measurements of space. In the context of studying gravitational radiation, the strain is particularly useful when utilized with the transverse gauge. This gauge is defined by choosing the spatial components of to satisfy the relation

then choosing the time component to satisfy

After performing the gauge transformation using the formula in the previous section, the strain becomes spatially transverse:

with the additional property:

Synchronous gauge

[edit]

The synchronous gauge simplifies the perturbation metric by requiring that the metric not distort measurements of time. More precisely, the synchronous gauge is chosen such that the non-spatial components of are zero, namely

This can be achieved by requiring the time component of to satisfy

and requiring the spatial components to satisfy

Harmonic gauge

[edit]

The harmonic gauge (also referred to as the Lorenz gauge[note 2]) is selected whenever it is necessary to reduce the linearized field equations as much as possible. This can be done if the condition

is true. To achieve this, is required to satisfy the relation

Consequently, by using the harmonic gauge, the Einstein tensor reduces to

Therefore, by writing it in terms of a "trace-reversed" metric, , the linearized field equations reduce to

This can be solved exactly, to produce the wave solutions that define gravitational radiation.

See also

[edit]

Notes

[edit]
  1. ^ This assumes that the background spacetime is flat. Perturbation theory applied in a spacetime that is already curved can work just as well when this term is replaced with the metric representing the curved background.
  2. ^ Not to be confused with Lorentz.

Further reading

[edit]
  • Sean M. Carroll (2003). Spacetime and Geometry, an Introduction to General Relativity. Pearson. ISBN 978-0805387322.
[edit]