{{Short description|Coordinate transformation that preserves the form of Hamilton's equations}}
In [[Hamiltonian mechanics]], a '''canonical transformation''' is a change of [[canonical coordinates]] <math>\ \left(\ \boldsymbol{q}, \boldsymbol{p}, t\ \right) \longmapsto \left(\ \boldsymbol{Q}, \boldsymbol{P}, t\ \right)\ </math> that preserves the form of [[Hamilton's equations]]. This is sometimes known as ''form invariance''. It need not preserve the form of the [[Hamiltonian mechanics|Hamiltonian]] itself. Canonical transformations are useful in their own right, and also form the basis for the [[Hamilton–Jacobi equation]]s (a useful method for calculating [[constant of motion|conserved quantities]]) and [[Liouville's theorem (Hamiltonian)|Liouville's theorem]] (itself the basis for classical [[statistical mechanics]]).
Since [[Lagrangian mechanics]] is based on [[generalized coordinates]], transformations of the coordinates <math>\ \boldsymbol{q} \longmapsto \boldsymbol{Q}\ </math> do not affect the form of [[Lagrangian mechanics|Lagrange's equations]] and, hence, do not affect the form of [[Hamilton's equations]] if the momentum is simultaneously changed by a [[Legendre transformation]] into
<math display="block">P_i = \frac{\partial L}{\partial \dot{Q}_i}\ ,</math>
where <math>\ (P_i, Q_i)\ </math> are canonically conjugate pairs of new momenta and positions, for <math>\ i = 1, 2, \ldots, N\ </math>, with <math>\ N\ </math> the number of degrees of freedom.
Revision as of 01:15, 24 December 2023
Therefore, coordinate transformations (also called point transformations) are a type of canonical transformation. However, the class of canonical transformations is much broader, since the old generalized coordinates, momenta and even time may be combined to form the new generalized coordinates and momenta. Canonical transformations that do not include the time explicitly are called restricted canonical transformations (many textbooks consider only this type).
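A dot over a variable or list signifies the time derivative, e.g.,
<math display="block">\dot{\boldsymbol{q}} \equiv \frac{d\boldsymbol{q}}{dt}</math>
and the equalities are read to be satisfied for all coordinates, for example:
<math display="block">\dot{\boldsymbol{p}} = -\frac{\partial f}{\partial \boldsymbol{q}} \quad \Longleftrightarrow \quad \dot{p}_i = -\frac{\partial f}{\partial q_i} \quad (i = 1, \dots, N).</math>
The dot product notation between two lists of the same number of coordinates is a shorthand for the sum of the products of corresponding components, e.g.,
<math display="block">\boldsymbol{p} \cdot \boldsymbol{q} \equiv \sum_{k=1}^{N} p_k q_k.</math>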
The dot product (also known as an "inner product") maps the two coordinate lists into one variable representing a single numerical value. The coordinates after transformation are similarly labelled with Q for transformed generalized coordinates and P for transformed generalized momentum.
Conditions for restricted canonical transformation
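Restricted canonical transformations are coordinate transformations in which the transformed coordinates <math>\boldsymbol{Q}</math> and <math>\boldsymbol{P}</math> have no explicit time dependence, i.e., <math>\boldsymbol{Q} = \boldsymbol{Q}(\boldsymbol{q}, \boldsymbol{p})</math> and <math>\boldsymbol{P} = \boldsymbol{P}(\boldsymbol{q}, \boldsymbol{p})</math>. The conditions below are equivalent to one another and can be generalized to time-dependent canonical transformations, with the exception of the bilinear invariance condition, which applies only to restricted canonical transformations.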
Indirect conditions
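The functional form of Hamilton's equations is
<math display="block">\dot{\boldsymbol{p}} = -\frac{\partial H}{\partial \boldsymbol{q}}, \qquad \dot{\boldsymbol{q}} = \frac{\partial H}{\partial \boldsymbol{p}}.</math>
In general, a transformation <math>(\boldsymbol{q}, \boldsymbol{p}) \to (\boldsymbol{Q}, \boldsymbol{P})</math> does not preserve the form of Hamilton's equations, but in the absence of time dependence in the transformation, the transformed Hamiltonian <math>K</math> (sometimes called the Kamiltonian[1]) can be assumed to differ from <math>H</math> by at most a function of time.
This choice of the Kamiltonian is supported by results of canonical transformation conditions, generalized through the use of generating functions. This essentially permits the use of the following relations in the derivation:
<math display="block">\frac{\partial K}{\partial \boldsymbol{Q}} = \frac{\partial H}{\partial \boldsymbol{Q}}, \qquad \frac{\partial K}{\partial \boldsymbol{P}} = \frac{\partial H}{\partial \boldsymbol{P}}.</math>
These equations, combined with the form of Hamilton's equations, are sufficient to derive the indirect conditions.
By definition, the transformed coordinates have analogous dynamics
<math display="block">\dot{\boldsymbol{P}} = -\frac{\partial K}{\partial \boldsymbol{Q}}, \qquad \dot{\boldsymbol{Q}} = \frac{\partial K}{\partial \boldsymbol{P}},</math>
where <math>K(\boldsymbol{Q}, \boldsymbol{P})</math> is the new Hamiltonian that is considered.
Since restricted transformations have no explicit time dependence (by definition), the time derivative of a new generalized coordinate <math>Q_m</math> is
<math display="block">\dot{Q}_m = \frac{\partial Q_m}{\partial \boldsymbol{q}} \cdot \dot{\boldsymbol{q}} + \frac{\partial Q_m}{\partial \boldsymbol{p}} \cdot \dot{\boldsymbol{p}} = \frac{\partial Q_m}{\partial \boldsymbol{q}} \cdot \frac{\partial H}{\partial \boldsymbol{p}} - \frac{\partial Q_m}{\partial \boldsymbol{p}} \cdot \frac{\partial H}{\partial \boldsymbol{q}} = \lbrace Q_m, H \rbrace,</math>
where {⋅, ⋅} is the Poisson bracket.
We also have the identity for the conjugate momentum <math>P_m</math>
<math display="block">\frac{\partial K}{\partial P_m} = \frac{\partial H}{\partial P_m} = \frac{\partial H}{\partial \boldsymbol{q}} \cdot \frac{\partial \boldsymbol{q}}{\partial P_m} + \frac{\partial H}{\partial \boldsymbol{p}} \cdot \frac{\partial \boldsymbol{p}}{\partial P_m}.</math>
If the transformation is canonical, these two must be equal, resulting in the equations
<math display="block">\left( \frac{\partial Q_m}{\partial p_n} \right)_{\boldsymbol{q}, \boldsymbol{p}} = -\left( \frac{\partial q_n}{\partial P_m} \right)_{\boldsymbol{Q}, \boldsymbol{P}}, \qquad \left( \frac{\partial Q_m}{\partial q_n} \right)_{\boldsymbol{q}, \boldsymbol{p}} = \left( \frac{\partial p_n}{\partial P_m} \right)_{\boldsymbol{Q}, \boldsymbol{P}}.</math>
The analogous argument for the generalized momenta <math>P_m</math> leads to two other sets of equations
<math display="block">\left( \frac{\partial P_m}{\partial p_n} \right)_{\boldsymbol{q}, \boldsymbol{p}} = \left( \frac{\partial q_n}{\partial Q_m} \right)_{\boldsymbol{Q}, \boldsymbol{P}}, \qquad \left( \frac{\partial P_m}{\partial q_n} \right)_{\boldsymbol{q}, \boldsymbol{p}} = -\left( \frac{\partial p_n}{\partial Q_m} \right)_{\boldsymbol{Q}, \boldsymbol{P}}.</math>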
These are the indirect conditions to check whether a given transformation is canonical.
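The indirect conditions can be checked numerically for a concrete transformation. The following is a minimal sketch, not a general-purpose tool: it assumes a one-degree-of-freedom test map to action-angle-like variables, Q = atan2(q, p) and P = (q² + p²)/2, with the inverse written out by hand, and it estimates the partial derivatives by central finite differences. The function names (`partial`, `indirect_residuals`, and so on) are illustrative choices.

```python
import math

def partial(f, args, i, h=1e-6):
    """Central finite-difference estimate of d f / d args[i]."""
    up, dn = list(args), list(args)
    up[i] += h
    dn[i] -= h
    return (f(*up) - f(*dn)) / (2 * h)

# Forward map (q, p) -> (Q, P); an assumed test case.
def Q(q, p):
    return math.atan2(q, p)

def P(q, p):
    return 0.5 * (q * q + p * p)

# Inverse map (Q, P) -> (q, p), derived by hand for this test case.
def q_of(Qv, Pv):
    return math.sqrt(2 * Pv) * math.sin(Qv)

def p_of(Qv, Pv):
    return math.sqrt(2 * Pv) * math.cos(Qv)

def indirect_residuals(q, p):
    """Residuals of the four indirect conditions at the point (q, p)."""
    old = (q, p)
    new = (Q(q, p), P(q, p))
    return [
        partial(Q, old, 0) - partial(p_of, new, 1),  # dQ/dq - dp/dP
        partial(Q, old, 1) + partial(q_of, new, 1),  # dQ/dp + dq/dP
        partial(P, old, 1) - partial(q_of, new, 0),  # dP/dp - dq/dQ
        partial(P, old, 0) + partial(p_of, new, 0),  # dP/dq + dp/dQ
    ]

residuals = indirect_residuals(0.7, 1.3)
print(residuals)
```

All four residuals should vanish up to finite-difference error if the map is canonical; a non-canonical map such as Q = 2q, P = p would leave some of them of order one.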
Symplectic condition
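Sometimes the Hamiltonian relations are represented as:
<math display="block">\dot{\eta} = J \nabla_\eta H</math>
where
<math display="block">J := \begin{pmatrix} 0 & I_n \\ -I_n & 0 \end{pmatrix}</math>
and <math>\eta = (q_1, \ldots, q_n, p_1, \ldots, p_n)^T</math>. Similarly, let <math>\varepsilon = (Q_1, \ldots, Q_n, P_1, \ldots, P_n)^T</math>.
From the relation of partial derivatives, we convert the <math>\dot{\eta} = J \nabla_\eta H</math> relation in terms of partial derivatives with new variables:
<math display="block">\dot{\eta} = J \left( M^T \nabla_\varepsilon H \right)</math>
where <math>M := \frac{\partial (\boldsymbol{Q}, \boldsymbol{P})}{\partial (\boldsymbol{q}, \boldsymbol{p})}</math> is the Jacobian matrix of the transformation.
Similarly we find:
<math display="block">\dot{\varepsilon} = M \dot{\eta} = M J M^T \nabla_\varepsilon H</math>
or, since <math>\dot{\varepsilon} = J \nabla_\varepsilon K = J \nabla_\varepsilon H</math> due to the form of the Kamiltonian:
<math display="block">M J M^T = J.</math>
It can also be shown that this condition is equivalent to satisfying the indirect conditions. The left-hand side of the above is called the Poisson matrix of <math>\varepsilon</math>, denoted as <math>\mathcal{P}(\varepsilon) = M J M^T</math>. Similarly, a Lagrange matrix of <math>\eta</math> can be constructed as <math>\mathcal{L}(\eta) = M^T J M</math>.[3] It can be shown that the symplectic condition is also equivalent to <math>M^T J M = J</math>.[4]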
The set of all matrices which satisfy symplectic conditions form a symplectic group.
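For a linear transformation the Jacobian M is constant, so the symplectic condition can be tested directly. The sketch below is illustrative only: it assumes the phase-space ordering (q₁, …, qₙ, p₁, …, pₙ), the standard block matrix J, and tests M J Mᵀ = J with plain Python lists. The helper names and the four sample matrices are assumptions made for the example.

```python
import math

def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(row) for row in zip(*A)]

def standard_J(n):
    """2n x 2n block matrix [[0, I], [-I, 0]]."""
    J = [[0.0] * (2 * n) for _ in range(2 * n)]
    for i in range(n):
        J[i][n + i] = 1.0
        J[n + i][i] = -1.0
    return J

def is_symplectic(M, tol=1e-12):
    """True if M J M^T equals J entrywise within tol."""
    n = len(M) // 2
    J = standard_J(n)
    lhs = matmul(matmul(M, J), transpose(M))
    return all(abs(lhs[i][j] - J[i][j]) <= tol
               for i in range(2 * n) for j in range(2 * n))

c, s = math.cos(0.3), math.sin(0.3)
rotation_qp = [[c, s], [-s, c]]        # rotation in the (q, p) plane
scaling = [[2.0, 0.0], [0.0, 1.0]]     # Q = 2q, P = p
# Two degrees of freedom, ordering (q1, q2, p1, p2):
rotation_q_only = [[c, -s, 0, 0], [s, c, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
rotation_point = [[c, -s, 0, 0], [s, c, 0, 0], [0, 0, c, -s], [0, 0, s, c]]

results = [is_symplectic(M) for M in
           (rotation_qp, scaling, rotation_q_only, rotation_point)]
print(results)
```

Rotating the coordinates while leaving the momenta untouched fails the test, whereas rotating both together (a point transformation) passes, which illustrates that not every volume-preserving linear map is canonical.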
Invariance of Poisson Bracket
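The Poisson bracket, which is defined as
<math display="block">\{u, v\}_\eta := \sum_{i=1}^{n} \left( \frac{\partial u}{\partial q_i} \frac{\partial v}{\partial p_i} - \frac{\partial u}{\partial p_i} \frac{\partial v}{\partial q_i} \right),</math>
can be represented in matrix form as
<math display="block">\{u, v\}_\eta := (\nabla_\eta u)^T J (\nabla_\eta v),</math>
where <math>\eta = (\boldsymbol{q}, \boldsymbol{p})</math> and <math>\varepsilon = (\boldsymbol{Q}, \boldsymbol{P})</math> are the old and new phase-space coordinates, <math>M = \partial \varepsilon / \partial \eta</math> is the Jacobian matrix and <math>J</math> is the standard symplectic matrix. Hence, using the partial derivative relations and the symplectic condition, we get:[5]
<math display="block">\{u, v\}_\eta = (\nabla_\eta u)^T J (\nabla_\eta v) = (M^T \nabla_\varepsilon u)^T J (M^T \nabla_\varepsilon v) = (\nabla_\varepsilon u)^T M J M^T (\nabla_\varepsilon v) = (\nabla_\varepsilon u)^T J (\nabla_\varepsilon v) = \{u, v\}_\varepsilon.</math>
Since the equality is expected to hold for any functions, by choice of <math>u</math> and <math>v</math> we can either recover the indirect conditions or recover the symplectic condition by showing <math>(M J M^T)_{ij} = J_{ij}</math>. Thus these conditions are equivalent to the symplectic condition.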
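Since <math>\partial \varepsilon_i / \partial \varepsilon_j = \delta_{ij}</math>, the calculated values from the formula for Poisson brackets yield <math>\{Q_i, Q_j\}_\varepsilon = 0</math>, <math>\{P_i, P_j\}_\varepsilon = 0</math> and <math>\{Q_i, P_j\}_\varepsilon = \delta_{ij}</math>.
If a matrix were defined with elements <math>\{\varepsilon_i, \varepsilon_j\}_\varepsilon</math>, using values from the above:
<math display="block">\{\varepsilon_i, \varepsilon_j\}_\varepsilon = J_{ij}.</math>
Since <math>\{\varepsilon_i, \varepsilon_j\}_\eta = (\nabla_\eta \varepsilon_i)^T J (\nabla_\eta \varepsilon_j) = (M J M^T)_{ij}</math> and, by the invariance of the Poisson bracket, <math>\{\varepsilon_i, \varepsilon_j\}_\eta = \{\varepsilon_i, \varepsilon_j\}_\varepsilon</math>, it can be expressed in matrix form as <math>M J M^T = J</math>, which is equivalent to the symplectic condition.[6]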
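The fundamental Poisson brackets give a quick numerical test of whether a map is canonical. The sketch below assumes a standard textbook-style test transformation, Q = ln(sin p / q), P = q cot p (valid for q > 0 and 0 < p < π), and evaluates brackets by central finite differences. The helper names `grad` and `poisson_bracket` are hypothetical.

```python
import math

def grad(f, x, h=1e-6):
    """Central finite-difference gradient of f at the point x (a list)."""
    g = []
    for i in range(len(x)):
        up, dn = list(x), list(x)
        up[i] += h
        dn[i] -= h
        g.append((f(up) - f(dn)) / (2 * h))
    return g

def poisson_bracket(u, v, eta):
    """{u, v} at eta = [q_1..q_n, p_1..p_n]."""
    n = len(eta) // 2
    du, dv = grad(u, eta), grad(v, eta)
    return sum(du[i] * dv[n + i] - du[n + i] * dv[i] for i in range(n))

# Assumed test transformation (one degree of freedom).
def Q(eta):
    return math.log(math.sin(eta[1]) / eta[0])

def P(eta):
    return eta[0] / math.tan(eta[1])

eta0 = [0.8, 1.1]
bracket_QP = poisson_bracket(Q, P, eta0)  # should be close to 1
bracket_QQ = poisson_bracket(Q, Q, eta0)  # vanishes by antisymmetry
print(bracket_QP, bracket_QQ)
```

For one degree of freedom, {Q, P} = 1 is the whole symplectic condition; with more degrees of freedom, every pair of new variables has to be checked.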
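The Lagrange bracket of two functions <math>u</math> and <math>v</math> is <math>[u, v]_\eta := \sum_{i=1}^{n} \left( \frac{\partial q_i}{\partial u} \frac{\partial p_i}{\partial v} - \frac{\partial p_i}{\partial u} \frac{\partial q_i}{\partial v} \right) = \left( \frac{\partial \eta}{\partial u} \right)^T J \left( \frac{\partial \eta}{\partial v} \right)</math>. If a matrix were defined as <math>\mathcal{L}_{ij}(\eta) = [\eta_i, \eta_j]_\varepsilon</math>, then its elements can be explicitly calculated to be:[3]
<math display="block">\mathcal{L}_{ij}(\eta) = \left( \frac{\partial \varepsilon}{\partial \eta_i} \right)^T J \left( \frac{\partial \varepsilon}{\partial \eta_j} \right) = (M^T J M)_{ij}.</math>
Since the symplectic condition gives <math>M^T J M = J</math>, it implies <math>[\eta_i, \eta_j]_\varepsilon = J_{ij} = [\eta_i, \eta_j]_\eta</math>, and hence for arbitrary functions <math>u</math> and <math>v</math> we have <math>[u, v]_\varepsilon = [u, v]_\eta</math>. Since the symplectic condition can be trivially recovered from this, the invariance of the Lagrange bracket serves as an equivalent condition for a canonical transformation.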
Bilinear invariance conditions
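This set of conditions applies only to restricted canonical transformations, i.e. canonical transformations that are independent of the time variable.
Consider two arbitrary, independent variations, denoted <math>d</math> and <math>\delta</math>, of the phase-space coordinates. Due to the lack of time dependence in the transformation:
<math display="block">dQ_m = \frac{\partial Q_m}{\partial \boldsymbol{q}} \cdot d\boldsymbol{q} + \frac{\partial Q_m}{\partial \boldsymbol{p}} \cdot d\boldsymbol{p}</math>
where similar equations follow for <math>dP_m</math>, <math>\delta Q_m</math> and <math>\delta P_m</math>.
Substituting the partial derivatives from the canonical transformation conditions, we can show that:
<math display="block">d\boldsymbol{q} \cdot \delta\boldsymbol{p} - \delta\boldsymbol{q} \cdot d\boldsymbol{p} = d\boldsymbol{Q} \cdot \delta\boldsymbol{P} - \delta\boldsymbol{Q} \cdot d\boldsymbol{P}.</math>
If the above is obeyed for any arbitrary variation, it would be only possible if the indirect conditions are met.[7][8]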
Liouville's theorem
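The indirect conditions allow us to prove Liouville's theorem, which states that the volume in phase space is conserved under canonical transformations, i.e.,
<math display="block">\int d\boldsymbol{q}\, d\boldsymbol{p} = \int d\boldsymbol{Q}\, d\boldsymbol{P}.</math>
By calculus, the latter integral must equal the former times the determinant of the Jacobian <math>M</math>:
<math display="block">\int d\boldsymbol{Q}\, d\boldsymbol{P} = \int \det(M)\, d\boldsymbol{q}\, d\boldsymbol{p},</math>
where
<math display="block">M := \frac{\partial (\boldsymbol{Q}, \boldsymbol{P})}{\partial (\boldsymbol{q}, \boldsymbol{p})}.</math>
Exploiting the "division" property of Jacobians yields
<math display="block">\det(M) = \det \frac{\partial (\boldsymbol{Q}, \boldsymbol{P})}{\partial (\boldsymbol{q}, \boldsymbol{P})} \left/ \det \frac{\partial (\boldsymbol{q}, \boldsymbol{p})}{\partial (\boldsymbol{q}, \boldsymbol{P})} \right. .</math>
Eliminating the repeated variables gives
<math display="block">\det(M) = \det \frac{\partial (\boldsymbol{Q})}{\partial (\boldsymbol{q})} \left/ \det \frac{\partial (\boldsymbol{p})}{\partial (\boldsymbol{P})} \right. .</math>
Application of the indirect conditions above yields <math>\det(M) = 1</math>.[9]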
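The unit Jacobian determinant can be illustrated numerically. The sketch below assumes, purely as a test case, the point transformation from planar Cartesian to polar coordinates together with its conjugate momenta (p_r, p_θ), and contrasts it with a "naive" map that changes the coordinates but leaves the Cartesian momenta untouched. The Jacobian is estimated by finite differences and the determinant by Gaussian elimination; all function names are illustrative.

```python
import math

def to_polar_canonical(s):
    """(x, y, px, py) -> (r, theta, p_r, p_theta)."""
    x, y, px, py = s
    r = math.hypot(x, y)
    return [r, math.atan2(y, x), (x * px + y * py) / r, x * py - y * px]

def to_polar_naive(s):
    """Changes the coordinates only; NOT a canonical transformation."""
    x, y, px, py = s
    return [math.hypot(x, y), math.atan2(y, x), px, py]

def jacobian(f, x, h=1e-6):
    """Matrix of d f_i / d x_j by central finite differences."""
    n = len(x)
    cols = []
    for j in range(n):
        up, dn = list(x), list(x)
        up[j] += h
        dn[j] -= h
        cols.append([(a - b) / (2 * h) for a, b in zip(f(up), f(dn))])
    return [[cols[j][i] for j in range(n)] for i in range(n)]

def det(A):
    """Determinant by Gaussian elimination with partial pivoting."""
    A = [row[:] for row in A]
    n, d = len(A), 1.0
    for k in range(n):
        piv = max(range(k, n), key=lambda i: abs(A[i][k]))
        if A[piv][k] == 0.0:
            return 0.0
        if piv != k:
            A[k], A[piv] = A[piv], A[k]
            d = -d
        d *= A[k][k]
        for i in range(k + 1, n):
            factor = A[i][k] / A[k][k]
            for j in range(k, n):
                A[i][j] -= factor * A[k][j]
    return d

state = [1.2, 0.5, 0.3, -0.7]
det_canonical = det(jacobian(to_polar_canonical, state))
det_naive = det(jacobian(to_polar_naive, state))
print(det_canonical, det_naive)
```

The canonical map gives a determinant of one, while the naive map gives 1/r, so it does not preserve phase-space volume.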
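To guarantee a valid transformation between (q, p, H) and (Q, P, K), we may resort to a direct generating function approach. Both sets of variables must obey Hamilton's principle. That is, the action integrals over the Lagrangians <math>\mathcal{L}_{qp} = \boldsymbol{p} \cdot \dot{\boldsymbol{q}} - H(\boldsymbol{q}, \boldsymbol{p}, t)</math> and <math>\mathcal{L}_{QP} = \boldsymbol{P} \cdot \dot{\boldsymbol{Q}} - K(\boldsymbol{Q}, \boldsymbol{P}, t)</math>, obtained from the respective Hamiltonians via an ("inverse") Legendre transformation, must both be stationary (so that the Euler–Lagrange equations lead to Hamilton's equations in both sets of variables):
<math display="block">\delta \int_{t_1}^{t_2} \left[ \boldsymbol{p} \cdot \dot{\boldsymbol{q}} - H(\boldsymbol{q}, \boldsymbol{p}, t) \right] dt = 0, \qquad \delta \int_{t_1}^{t_2} \left[ \boldsymbol{P} \cdot \dot{\boldsymbol{Q}} - K(\boldsymbol{Q}, \boldsymbol{P}, t) \right] dt = 0.</math>
One way for both variational equalities to be satisfied is to have
<math display="block">\lambda \left[ \boldsymbol{p} \cdot \dot{\boldsymbol{q}} - H(\boldsymbol{q}, \boldsymbol{p}, t) \right] = \boldsymbol{P} \cdot \dot{\boldsymbol{Q}} - K(\boldsymbol{Q}, \boldsymbol{P}, t) + \frac{dG}{dt}.</math>
Lagrangians are not unique: one can always multiply by a constant λ and add a total time derivative dG/dt and still obtain the same equations of motion (as discussed on Wikibooks). In general, the scaling factor λ is set equal to one; canonical transformations for which λ ≠ 1 are called extended canonical transformations. The term dG/dt is kept, since otherwise the problem would be rendered trivial and there would be little freedom for the new canonical variables to differ from the old ones.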
Here G is a generating function of one old canonical coordinate (q or p), one new canonical coordinate (Q or P) and (possibly) the time t. Thus, there are four basic types of generating functions (although mixtures of these four types can exist), depending on the choice of variables. As will be shown below, the generating function will define a transformation from old to new canonical coordinates, and any such transformation is guaranteed to be canonical.
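The four basic generating functions and their properties are tabulated below and discussed in detail in the following sections:
{| class="wikitable"
|+ Properties of four basic canonical transformations[10]
|-
! Generating function !! Generating function derivatives !! Transformed Hamiltonian !! Trivial cases
|-
| <math>G = G_1(q, Q, t)</math> || <math>p = \frac{\partial G_1}{\partial q}, \quad P = -\frac{\partial G_1}{\partial Q}</math> || <math>K = H + \frac{\partial G_1}{\partial t}</math> || <math>G_1 = qQ, \quad Q = p, \quad P = -q</math>
|-
| <math>G = G_2(q, P, t) - QP</math> || <math>p = \frac{\partial G_2}{\partial q}, \quad Q = \frac{\partial G_2}{\partial P}</math> || <math>K = H + \frac{\partial G_2}{\partial t}</math> || <math>G_2 = qP, \quad Q = q, \quad P = p</math>
|-
| <math>G = G_3(p, Q, t) + qp</math> || <math>q = -\frac{\partial G_3}{\partial p}, \quad P = -\frac{\partial G_3}{\partial Q}</math> || <math>K = H + \frac{\partial G_3}{\partial t}</math> || <math>G_3 = pQ, \quad Q = -q, \quad P = -p</math>
|-
| <math>G = G_4(p, P, t) + qp - QP</math> || <math>q = -\frac{\partial G_4}{\partial p}, \quad Q = \frac{\partial G_4}{\partial P}</math> || <math>K = H + \frac{\partial G_4}{\partial t}</math> || <math>G_4 = pP, \quad Q = p, \quad P = -q</math>
|}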
Type 1 generating function
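The type 1 generating function G1 depends only on the old and new generalized coordinates
<math display="block">G \equiv G_1(\boldsymbol{q}, \boldsymbol{Q}, t).</math>
To derive the implicit transformation, we expand the defining equation above
<math display="block">\boldsymbol{p} \cdot \dot{\boldsymbol{q}} - H(\boldsymbol{q}, \boldsymbol{p}, t) = \boldsymbol{P} \cdot \dot{\boldsymbol{Q}} - K(\boldsymbol{Q}, \boldsymbol{P}, t) + \frac{\partial G_1}{\partial t} + \frac{\partial G_1}{\partial \boldsymbol{q}} \cdot \dot{\boldsymbol{q}} + \frac{\partial G_1}{\partial \boldsymbol{Q}} \cdot \dot{\boldsymbol{Q}}.</math>
Since the new and old coordinates are each independent, the following 2N + 1 equations must hold
<math display="block">\boldsymbol{p} = \frac{\partial G_1}{\partial \boldsymbol{q}}, \qquad \boldsymbol{P} = -\frac{\partial G_1}{\partial \boldsymbol{Q}}, \qquad K = H + \frac{\partial G_1}{\partial t}.</math>
These equations define the transformation as follows: The first set of N equations
<math display="block">\boldsymbol{p} = \frac{\partial G_1}{\partial \boldsymbol{q}}</math>
define relations between the new generalized coordinates Q and the old canonical coordinates (q, p). Ideally, one can invert these relations to obtain formulae for each Qk as a function of the old canonical coordinates. Substitution of these formulae for the Q coordinates into the second set of N equations
<math display="block">\boldsymbol{P} = -\frac{\partial G_1}{\partial \boldsymbol{Q}}</math>
yields analogous formulae for the new generalized momenta P in terms of the old canonical coordinates (q, p). We then invert both sets of formulae to obtain the old canonical coordinates (q, p) as functions of the new canonical coordinates (Q, P). Substitution of the inverted formulae into the final equation
<math display="block">K = H + \frac{\partial G_1}{\partial t}</math>
yields a formula for K as a function of the new canonical coordinates (Q, P).
In practice, this procedure is easier than it sounds, because the generating function is usually simple. For example, let
<math display="block">G_1 \equiv \boldsymbol{q} \cdot \boldsymbol{Q}.</math>
This results in swapping the generalized coordinates for the momenta and vice versa
<math display="block">\boldsymbol{p} = \frac{\partial G_1}{\partial \boldsymbol{q}} = \boldsymbol{Q}, \qquad \boldsymbol{P} = -\frac{\partial G_1}{\partial \boldsymbol{Q}} = -\boldsymbol{q},</math>
and K = H. This example illustrates how independent the coordinates and momenta are in the Hamiltonian formulation; they are equivalent variables.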
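A less trivial type 1 example may help to show the inversion steps at work. The following worked sketch assumes a one-dimensional harmonic oscillator with mass m and angular frequency ω (symbols introduced here for illustration) and the commonly used generating function G₁ = (mω/2) q² cot Q. It is one illustrative choice, not the only one.

```latex
\begin{align*}
H &= \frac{p^2}{2m} + \frac{m\omega^2 q^2}{2},
  \qquad G_1(q, Q) = \frac{m\omega}{2}\, q^2 \cot Q, \\[4pt]
p &= \frac{\partial G_1}{\partial q} = m\omega\, q \cot Q,
  \qquad
P = -\frac{\partial G_1}{\partial Q} = \frac{m\omega\, q^2}{2 \sin^2 Q}, \\[4pt]
q &= \sqrt{\frac{2P}{m\omega}}\, \sin Q,
  \qquad
p = \sqrt{2 m \omega P}\, \cos Q, \\[4pt]
K &= H + \frac{\partial G_1}{\partial t}
   = \omega P \cos^2 Q + \omega P \sin^2 Q = \omega P .
\end{align*}
```

Because K does not depend on Q, the new momentum P is constant and Q advances uniformly at rate ∂K/∂P = ω, which is the usual motivation for this choice of generating function.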
Type 2 generating function
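The type 2 generating function G2 depends only on the old generalized coordinates and the new generalized momenta
<math display="block">G \equiv -\boldsymbol{Q} \cdot \boldsymbol{P} + G_2(\boldsymbol{q}, \boldsymbol{P}, t),</math>
where the <math>-\boldsymbol{Q} \cdot \boldsymbol{P}</math> terms represent a Legendre transformation to change the right-hand side of the equation below. To derive the implicit transformation, we expand the defining equation above
<math display="block">\boldsymbol{p} \cdot \dot{\boldsymbol{q}} - H(\boldsymbol{q}, \boldsymbol{p}, t) = -\boldsymbol{Q} \cdot \dot{\boldsymbol{P}} - K(\boldsymbol{Q}, \boldsymbol{P}, t) + \frac{\partial G_2}{\partial t} + \frac{\partial G_2}{\partial \boldsymbol{q}} \cdot \dot{\boldsymbol{q}} + \frac{\partial G_2}{\partial \boldsymbol{P}} \cdot \dot{\boldsymbol{P}}.</math>
Since the old coordinates and new momenta are each independent, the following 2N + 1 equations must hold
<math display="block">\boldsymbol{p} = \frac{\partial G_2}{\partial \boldsymbol{q}}, \qquad \boldsymbol{Q} = \frac{\partial G_2}{\partial \boldsymbol{P}}, \qquad K = H + \frac{\partial G_2}{\partial t}.</math>
These equations define the transformation as follows: The first set of N equations
<math display="block">\boldsymbol{p} = \frac{\partial G_2}{\partial \boldsymbol{q}}</math>
define relations between the new generalized momenta P and the old canonical coordinates (q, p). Ideally, one can invert these relations to obtain formulae for each Pk as a function of the old canonical coordinates. Substitution of these formulae for the P coordinates into the second set of N equations
<math display="block">\boldsymbol{Q} = \frac{\partial G_2}{\partial \boldsymbol{P}}</math>
yields analogous formulae for the new generalized coordinates Q in terms of the old canonical coordinates (q, p). We then invert both sets of formulae to obtain the old canonical coordinates (q, p) as functions of the new canonical coordinates (Q, P). Substitution of the inverted formulae into the final equation
<math display="block">K = H + \frac{\partial G_2}{\partial t}</math>
yields a formula for K as a function of the new canonical coordinates (Q, P).
In practice, this procedure is easier than it sounds, because the generating function is usually simple. For example, let
<math display="block">G_2 \equiv \boldsymbol{g}(\boldsymbol{q}; t) \cdot \boldsymbol{P},</math>
where g is a set of N functions. This results in a point transformation of the generalized coordinates
<math display="block">\boldsymbol{Q} = \frac{\partial G_2}{\partial \boldsymbol{P}} = \boldsymbol{g}(\boldsymbol{q}; t).</math>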
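The point-transformation example can be made concrete. With G₂ = g(q)·P the first set of equations reads p = (∂g/∂q)ᵀ P, a linear system for the new momenta. The sketch below assumes, as a test case, the time-independent map g from planar Cartesian to polar coordinates; it builds ∂g/∂q by finite differences and solves the 2×2 system with Cramer's rule. The names `g`, `jacobian` and `new_momenta` are illustrative.

```python
import math

def g(q):
    """Assumed point transformation: (x, y) -> (r, theta)."""
    x, y = q
    return [math.hypot(x, y), math.atan2(y, x)]

def jacobian(f, x, h=1e-6):
    """A[i][j] = d f_i / d x_j by central finite differences."""
    n = len(x)
    cols = []
    for j in range(n):
        up, dn = list(x), list(x)
        up[j] += h
        dn[j] -= h
        cols.append([(a - b) / (2 * h) for a, b in zip(f(up), f(dn))])
    return [[cols[j][i] for j in range(n)] for i in range(n)]

def new_momenta(q, p):
    """Solve p = A^T P for P, where A = dg/dq (2x2 case, Cramer's rule)."""
    A = jacobian(g, q)
    a, b = A[0][0], A[1][0]   # first row of A^T
    c, d = A[0][1], A[1][1]   # second row of A^T
    det = a * d - b * c
    return [(p[0] * d - b * p[1]) / det, (a * p[1] - c * p[0]) / det]

q0, p0 = [1.2, 0.5], [0.3, -0.7]
Q_new = g(q0)                 # Q = dG2/dP = g(q)
P_new = new_momenta(q0, p0)   # from p = dG2/dq
print(Q_new, P_new)
```

For this choice of g the result reproduces the familiar radial momentum (x pₓ + y p_y)/r and the angular momentum x p_y − y pₓ.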
Type 3 generating function
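The type 3 generating function G3 depends only on the old generalized momenta and the new generalized coordinates
<math display="block">G \equiv \boldsymbol{q} \cdot \boldsymbol{p} + G_3(\boldsymbol{p}, \boldsymbol{Q}, t),</math>
where the <math>\boldsymbol{q} \cdot \boldsymbol{p}</math> terms represent a Legendre transformation to change the left-hand side of the equation below. To derive the implicit transformation, we expand the defining equation above
<math display="block">-\boldsymbol{q} \cdot \dot{\boldsymbol{p}} - H(\boldsymbol{q}, \boldsymbol{p}, t) = \boldsymbol{P} \cdot \dot{\boldsymbol{Q}} - K(\boldsymbol{Q}, \boldsymbol{P}, t) + \frac{\partial G_3}{\partial t} + \frac{\partial G_3}{\partial \boldsymbol{p}} \cdot \dot{\boldsymbol{p}} + \frac{\partial G_3}{\partial \boldsymbol{Q}} \cdot \dot{\boldsymbol{Q}}.</math>
Since the old momenta and new coordinates are each independent, the following 2N + 1 equations must hold
<math display="block">\boldsymbol{q} = -\frac{\partial G_3}{\partial \boldsymbol{p}}, \qquad \boldsymbol{P} = -\frac{\partial G_3}{\partial \boldsymbol{Q}}, \qquad K = H + \frac{\partial G_3}{\partial t}.</math>
These equations define the transformation as follows: The first set of N equations
<math display="block">\boldsymbol{q} = -\frac{\partial G_3}{\partial \boldsymbol{p}}</math>
define relations between the new generalized coordinates Q and the old canonical coordinates (q, p). Ideally, one can invert these relations to obtain formulae for each Qk as a function of the old canonical coordinates. Substitution of these formulae for the Q coordinates into the second set of N equations
<math display="block">\boldsymbol{P} = -\frac{\partial G_3}{\partial \boldsymbol{Q}}</math>
yields analogous formulae for the new generalized momenta P in terms of the old canonical coordinates (q, p). We then invert both sets of formulae to obtain the old canonical coordinates (q, p) as functions of the new canonical coordinates (Q, P). Substitution of the inverted formulae into the final equation
<math display="block">K = H + \frac{\partial G_3}{\partial t}</math>
yields a formula for K as a function of the new canonical coordinates (Q, P).
In practice, this procedure is easier than it sounds, because the generating function is usually simple.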
Type 4 generating function
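The type 4 generating function G4 depends only on the old and new generalized momenta
<math display="block">G \equiv \boldsymbol{q} \cdot \boldsymbol{p} - \boldsymbol{Q} \cdot \boldsymbol{P} + G_4(\boldsymbol{p}, \boldsymbol{P}, t),</math>
where the <math>\boldsymbol{q} \cdot \boldsymbol{p} - \boldsymbol{Q} \cdot \boldsymbol{P}</math> terms represent a Legendre transformation to change both sides of the equation below. To derive the implicit transformation, we expand the defining equation above
<math display="block">-\boldsymbol{q} \cdot \dot{\boldsymbol{p}} - H(\boldsymbol{q}, \boldsymbol{p}, t) = -\boldsymbol{Q} \cdot \dot{\boldsymbol{P}} - K(\boldsymbol{Q}, \boldsymbol{P}, t) + \frac{\partial G_4}{\partial t} + \frac{\partial G_4}{\partial \boldsymbol{p}} \cdot \dot{\boldsymbol{p}} + \frac{\partial G_4}{\partial \boldsymbol{P}} \cdot \dot{\boldsymbol{P}}.</math>
Since the old and new momenta are each independent, the following 2N + 1 equations must hold
<math display="block">\boldsymbol{q} = -\frac{\partial G_4}{\partial \boldsymbol{p}}, \qquad \boldsymbol{Q} = \frac{\partial G_4}{\partial \boldsymbol{P}}, \qquad K = H + \frac{\partial G_4}{\partial t}.</math>
These equations define the transformation as follows: The first set of N equations
<math display="block">\boldsymbol{q} = -\frac{\partial G_4}{\partial \boldsymbol{p}}</math>
define relations between the new generalized momenta P and the old canonical coordinates (q, p). Ideally, one can invert these relations to obtain formulae for each Pk as a function of the old canonical coordinates. Substitution of these formulae for the P coordinates into the second set of N equations
<math display="block">\boldsymbol{Q} = \frac{\partial G_4}{\partial \boldsymbol{P}}</math>
yields analogous formulae for the new generalized coordinates Q in terms of the old canonical coordinates (q, p). We then invert both sets of formulae to obtain the old canonical coordinates (q, p) as functions of the new canonical coordinates (Q, P). Substitution of the inverted formulae into the final equation
<math display="block">K = H + \frac{\partial G_4}{\partial t}</math>
yields a formula for K as a function of the new canonical coordinates (Q, P).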
Restrictions on generating functions
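For example, using a generating function of the second kind, <math>p_i = \frac{\partial G_2}{\partial q_i}</math> and <math>Q_i = \frac{\partial G_2}{\partial P_i}</math>, the first set of equations, consisting of the variables <math>\boldsymbol{p}</math>, <math>\boldsymbol{q}</math> and <math>\boldsymbol{P}</math>, has to be inverted to get <math>\boldsymbol{P}(\boldsymbol{q}, \boldsymbol{p})</math>. The solution exists when the matrix defined by <math>a_{ij} = \frac{\partial p_i(\boldsymbol{q}, \boldsymbol{P})}{\partial P_j} = \frac{\partial^2 G_2}{\partial P_j \partial q_i}</math> is non-singular.[11]
Similar restrictions are placed on the other generating functions: the matrices <math>\left[ \frac{\partial^2 G_1}{\partial Q_j \partial q_i} \right]</math>, <math>\left[ \frac{\partial^2 G_3}{\partial p_j \partial Q_i} \right]</math> and <math>\left[ \frac{\partial^2 G_4}{\partial p_j \partial P_i} \right]</math> must be non-singular.[12]
Limitations of generating functions
Since <math>\left[ \frac{\partial^2 G_2}{\partial P_j \partial q_i} \right]</math> is non-singular, it implies that <math>\left[ \frac{\partial p_i(\boldsymbol{q}, \boldsymbol{P})}{\partial P_j} \right]</math> is also non-singular. Since the matrix <math>\left[ \frac{\partial P_i(\boldsymbol{q}, \boldsymbol{p})}{\partial p_j} \right]</math> is the inverse of <math>\left[ \frac{\partial p_i(\boldsymbol{q}, \boldsymbol{P})}{\partial P_j} \right]</math>, the transformations of type 2 generating functions always have a non-singular <math>\left[ \frac{\partial P_i(\boldsymbol{q}, \boldsymbol{p})}{\partial p_j} \right]</math> matrix.
Similarly, it can be stated that type 1 generating functions always have a non-singular <math>\left[ \frac{\partial Q_i(\boldsymbol{q}, \boldsymbol{p})}{\partial p_j} \right]</math> matrix and type 2 generating functions always have a non-singular <math>\left[ \frac{\partial P_i(\boldsymbol{q}, \boldsymbol{p})}{\partial p_j} \right]</math> matrix. Hence, the canonical transformations resulting from these generating functions are not completely general.[13]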
Canonical transformation conditions
Canonical transformation relations
From: , calculate :
Hence: , if canonical transformation rules are applied.
Similarly:
Hence: , if canonical transformation rules are applied.
The above two relations can be combined in matrix form as: (which will also retain same form for extended canonical transformation) where we have used the result .
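The canonical transformation relations can now be restated to include time dependence:
<math display="block">\left( \frac{\partial Q_m}{\partial p_n} \right)_{\boldsymbol{q}, \boldsymbol{p}, t} = -\left( \frac{\partial q_n}{\partial P_m} \right)_{\boldsymbol{Q}, \boldsymbol{P}, t}, \qquad \left( \frac{\partial Q_m}{\partial q_n} \right)_{\boldsymbol{q}, \boldsymbol{p}, t} = \left( \frac{\partial p_n}{\partial P_m} \right)_{\boldsymbol{Q}, \boldsymbol{P}, t},</math>
<math display="block">\left( \frac{\partial P_m}{\partial p_n} \right)_{\boldsymbol{q}, \boldsymbol{p}, t} = \left( \frac{\partial q_n}{\partial Q_m} \right)_{\boldsymbol{Q}, \boldsymbol{P}, t}, \qquad \left( \frac{\partial P_m}{\partial q_n} \right)_{\boldsymbol{q}, \boldsymbol{p}, t} = -\left( \frac{\partial p_n}{\partial Q_m} \right)_{\boldsymbol{Q}, \boldsymbol{P}, t}.</math>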
Symplectic Condition
From:
Similarly we find:or:Where the last terms of each equation cancel due to condition from canonical transformations. Hence leaving the symplectic relation: . It follows from the above two equations that the symplectic condition implies the equation: , from which the indirect conditions can be recovered. Thus, symplectic conditions and indirect conditions can be said to be equivalent.
Invariance of Poisson and Lagrange Bracket
The invariance of Lagrange Bracket directly follows from symplectic condition since the Lagrange matrix is given as , the matrix components imply that for any arbitrary functions, . The invariance of Poisson brackets also follows from symplectic condition and can be shown to be equivalent to it. The proof is reproduced in exactly the same manner as:
However, the bilinear invariance condition remains valid for restricted canonical transformations only.
We can also observe that since and if Q and P do not explicitly depend on time, can be taken. The analysis of restricted canonical transformations is hence consistent with this generalization.
Extended Canonical Transformation
Canonical transformation relations
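By solving
<math display="block">\lambda \left[ \boldsymbol{p} \cdot \dot{\boldsymbol{q}} - H(\boldsymbol{q}, \boldsymbol{p}, t) \right] = \boldsymbol{P} \cdot \dot{\boldsymbol{Q}} - K(\boldsymbol{Q}, \boldsymbol{P}, t) + \frac{dG}{dt}</math>
with the various forms of generating function, the relation between K and H instead becomes <math>\frac{\partial G}{\partial t} = K - \lambda H</math>, which also applies for the <math>\lambda = 1</math> case.
All results presented below can also be obtained by replacing <math>q \to \sqrt{\lambda}\, q</math>, <math>p \to \sqrt{\lambda}\, p</math> and <math>H \to \lambda H</math> in the known solutions, since these transformations retain the form of Hamilton's equations. Extended canonical transformations are hence said to be the result of a canonical transformation (<math>\lambda = 1</math>) and a trivial canonical transformation (<math>\lambda \neq 1</math>) which has <math>M J M^T = \lambda J</math> (for the given example, <math>M = \sqrt{\lambda}\, I</math>, which satisfies the condition).[14]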
Using same steps previously used in previous generalization, with in the general case, and retaining the equation , we get extended canonical transformation partial differential relations:
Symplectic condition
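Following the same steps as in the derivation of the symplectic condition for canonical transformations, now with <math>K = \lambda H + \frac{\partial G}{\partial t}</math>, the terms involving <math>\frac{\partial G}{\partial t}</math> again cancel. Hence the condition for an extended canonical transformation instead becomes <math>M J M^T = \lambda J</math>, where <math>M = \frac{\partial (\boldsymbol{Q}, \boldsymbol{P})}{\partial (\boldsymbol{q}, \boldsymbol{p})}</math> is the Jacobian matrix of the transformation.[15]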
Poisson and Lagrange Brackets
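The Poisson brackets are changed as follows:
<math display="block">\{u, v\}_\eta = (\nabla_\eta u)^T J (\nabla_\eta v) = (M^T \nabla_\varepsilon u)^T J (M^T \nabla_\varepsilon v) = (\nabla_\varepsilon u)^T M J M^T (\nabla_\varepsilon v) = \lambda (\nabla_\varepsilon u)^T J (\nabla_\varepsilon v) = \lambda \{u, v\}_\varepsilon,</math>
whereas the Lagrange brackets are changed as:
<math display="block">[u, v]_\varepsilon = (\partial_u \varepsilon)^T J (\partial_v \varepsilon) = (M \partial_u \eta)^T J (M \partial_v \eta) = (\partial_u \eta)^T M^T J M (\partial_v \eta) = \lambda (\partial_u \eta)^T J (\partial_v \eta) = \lambda [u, v]_\eta.</math>
Hence, the Poisson bracket scales by the inverse of <math>\lambda</math> whereas the Lagrange bracket scales by a factor of <math>\lambda</math>.[16]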
Infinitesimal canonical transformation
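Consider the canonical transformation that depends on a continuous parameter <math>\alpha</math>, as follows:
<math display="block">\boldsymbol{Q}(\boldsymbol{q}, \boldsymbol{p}, t; \alpha), \quad \boldsymbol{P}(\boldsymbol{q}, \boldsymbol{p}, t; \alpha), \qquad \boldsymbol{Q}(\boldsymbol{q}, \boldsymbol{p}, t; 0) = \boldsymbol{q}, \quad \boldsymbol{P}(\boldsymbol{q}, \boldsymbol{p}, t; 0) = \boldsymbol{p}.</math>
For infinitesimal values of <math>\alpha</math>, the corresponding transformations are called infinitesimal canonical transformations, also known as differential canonical transformations.
Consider the following generating function:
<math display="block">G_2(\boldsymbol{q}, \boldsymbol{P}, t) = \boldsymbol{q} \cdot \boldsymbol{P} + \alpha\, G(\boldsymbol{q}, \boldsymbol{P}, t).</math>
Since for <math>\alpha = 0</math>, <math>G_2 = \boldsymbol{q} \cdot \boldsymbol{P}</math> has the resulting canonical transformation <math>\boldsymbol{Q} = \boldsymbol{q}</math> and <math>\boldsymbol{P} = \boldsymbol{p}</math>, this type of generating function can be used for infinitesimal canonical transformations by restricting <math>\alpha</math> to an infinitesimal value. From the conditions of generators of the second type:
<math display="block">\boldsymbol{p} = \frac{\partial G_2}{\partial \boldsymbol{q}} = \boldsymbol{P} + \alpha \frac{\partial G}{\partial \boldsymbol{q}}(\boldsymbol{q}, \boldsymbol{P}, t), \qquad \boldsymbol{Q} = \frac{\partial G_2}{\partial \boldsymbol{P}} = \boldsymbol{q} + \alpha \frac{\partial G}{\partial \boldsymbol{P}}(\boldsymbol{q}, \boldsymbol{P}, t).</math>
Since <math>\boldsymbol{P} = \boldsymbol{P}(\boldsymbol{q}, \boldsymbol{p}, t; \alpha)</math>, changing the variables of the function <math>G</math> to <math>G(\boldsymbol{q}, \boldsymbol{p}, t)</math> and neglecting terms of higher order in <math>\alpha</math>, we get:[17]
<math display="block">\boldsymbol{p} = \boldsymbol{P} + \alpha \frac{\partial G}{\partial \boldsymbol{q}}(\boldsymbol{q}, \boldsymbol{p}, t), \qquad \boldsymbol{Q} = \boldsymbol{q} + \alpha \frac{\partial G}{\partial \boldsymbol{p}}(\boldsymbol{q}, \boldsymbol{p}, t).</math>
Infinitesimal canonical transformations can also be derived using the matrix form of the symplectic condition.[18]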
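In the passive view of transformations, the coordinate system is changed without the physical system changing, whereas in the active view of transformation, the coordinate system is retained and the physical system is said to undergo transformations. Thus, using the relations from infinitesimal canonical transformations, the change in the system states under the active view of the canonical transformation is said to be:
<math display="block">\delta \boldsymbol{q} = \alpha \frac{\partial G}{\partial \boldsymbol{p}}(\boldsymbol{q}, \boldsymbol{p}, t), \qquad \delta \boldsymbol{p} = -\alpha \frac{\partial G}{\partial \boldsymbol{q}}(\boldsymbol{q}, \boldsymbol{p}, t),</math>
or as <math>\delta \eta = \alpha J \nabla_\eta G</math> in matrix form, with <math>\eta = (\boldsymbol{q}, \boldsymbol{p})</math>.
For any function <math>u(\eta)</math>, it changes under the active view of the transformation according to:
<math display="block">\delta u = u(\eta + \delta \eta) - u(\eta) = (\nabla_\eta u)^T \delta \eta = \alpha (\nabla_\eta u)^T J (\nabla_\eta G) = \alpha \{u, G\}.</math>
Considering the change of Hamiltonians in the active view, i.e., for a fixed point,
<math display="block">K(\boldsymbol{Q} = \boldsymbol{q}_0, \boldsymbol{P} = \boldsymbol{p}_0, t) - H(\boldsymbol{q} = \boldsymbol{q}_0, \boldsymbol{p} = \boldsymbol{p}_0, t) = \left( H(\boldsymbol{q}_0', \boldsymbol{p}_0', t) + \frac{\partial G_2}{\partial t} \right) - H(\boldsymbol{q}_0, \boldsymbol{p}_0, t) = -\delta H + \alpha \frac{\partial G}{\partial t} = \alpha \left( \{G, H\} + \frac{\partial G}{\partial t} \right) = \alpha \frac{dG}{dt},</math>
where <math>(\boldsymbol{q}_0', \boldsymbol{p}_0')</math> are mapped to the point <math>(\boldsymbol{Q} = \boldsymbol{q}_0, \boldsymbol{P} = \boldsymbol{p}_0)</math> by the infinitesimal canonical transformation, and a similar change of variables for <math>G(\boldsymbol{q}, \boldsymbol{P}, t)</math> to <math>G(\boldsymbol{q}, \boldsymbol{p}, t)</math> is considered up to first order in <math>\alpha</math>. Hence, if the Hamiltonian is invariant under infinitesimal canonical transformations, its generator is a constant of motion.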
Examples of ICT
Time evolution
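Taking <math>G(\boldsymbol{q}, \boldsymbol{p}, t) = H(\boldsymbol{q}, \boldsymbol{p}, t)</math> and <math>\alpha = dt</math>, then <math>\delta \eta = (J \nabla_\eta H)\, dt = \dot{\eta}\, dt = d\eta</math>. Thus the continuous application of such a transformation maps the coordinates <math>\eta(\tau)</math> to <math>\eta(\tau + t)</math>.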
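Time evolution as a sequence of small canonical transformations can be sketched numerically. The example below assumes a test Hamiltonian H = p²/(2m) + kq²/2 and uses the type 2 generating function G₂ = qP + dt·H(q, P) at finite (not infinitesimal) dt, so each step is exactly canonical while only approximating the true flow to first order in dt. The function names and parameter values are illustrative.

```python
import math

def canonical_step(q, p, dt, m=1.0, k=1.0):
    """One step of the map generated by G2 = q*P + dt*H(q, P).

    p = dG2/dq = P + dt*k*q   =>   P = p - dt*k*q
    Q = dG2/dP = q + dt*P/m
    """
    P = p - dt * k * q
    Q = q + dt * P / m
    return Q, P

def evolve(q, p, t_total, n_steps):
    """Compose n_steps canonical steps to advance by t_total."""
    dt = t_total / n_steps
    for _ in range(n_steps):
        q, p = canonical_step(q, p, dt)
    return q, p

# The step is linear in (q, p), so its Jacobian columns are the images
# of the basis vectors; the determinant should be 1 for any dt.
col_q = canonical_step(1.0, 0.0, 0.1)
col_p = canonical_step(0.0, 1.0, 0.1)
step_det = col_q[0] * col_p[1] - col_p[0] * col_q[1]

# Quarter period of the unit oscillator: exact result is (0, -1).
q_end, p_end = evolve(1.0, 0.0, math.pi / 2, 100_000)
print(step_det, q_end, p_end)
```

Using the generating function at finite dt rather than the first-order formulas keeps every step area-preserving, which is the idea behind symplectic integrators.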
Translation
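Taking <math>G(\boldsymbol{q}, \boldsymbol{p}, t) = p_k</math>, we get <math>\delta p_i = 0</math> and <math>\delta q_i = \alpha \delta_{ik}</math>. Hence, the canonical momentum generates a shift in the corresponding generalized coordinate, and if the Hamiltonian is invariant under this translation, the momentum is a constant of motion.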
Rotation
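Consider an orthogonal system for an N-particle system:
<math display="block">\boldsymbol{q} = (x_1, y_1, z_1, \ldots, x_n, y_n, z_n), \qquad \boldsymbol{p} = (p_{1x}, p_{1y}, p_{1z}, \ldots, p_{nx}, p_{ny}, p_{nz}).</math>
Choosing the generator to be <math>G = L_z = \sum_{i=1}^{n} (x_i p_{iy} - y_i p_{ix})</math> and the infinitesimal value of <math>\alpha = \delta\phi</math>, the change in the coordinates is given for x by:
<math display="block">\delta x_i = \{x_i, G\}\, \delta\phi = \sum_j \{x_i, x_j p_{jy} - y_j p_{jx}\}\, \delta\phi = -\sum_j y_j \{x_i, p_{jx}\}\, \delta\phi = -y_i\, \delta\phi</math>
and similarly for y:
<math display="block">\delta y_i = \{y_i, G\}\, \delta\phi = \sum_j \{y_i, x_j p_{jy} - y_j p_{jx}\}\, \delta\phi = \sum_j x_j \{y_i, p_{jy}\}\, \delta\phi = x_i\, \delta\phi,</math>
whereas the z component of all particles is unchanged: <math>\delta z_i = \{z_i, G\}\, \delta\phi = 0</math>.
These transformations correspond to a rotation about the z axis by angle <math>\delta\phi</math> in its first-order approximation. Hence, repeated application of the infinitesimal canonical transformation generates a rotation of the system of particles about the z axis. If the Hamiltonian is invariant under rotation about the z axis, the generator, the component of angular momentum along the axis of rotation, is an invariant of motion.[18]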
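The claim that repeated infinitesimal transformations build up a finite rotation can be checked with a short sketch. It assumes a single particle in the plane, the generator G = x p_y − y pₓ, and the first-order update δη = δφ J∇G applied many times with δφ = φ/N. The function names are illustrative, and the result is only accurate to first order in δφ.

```python
import math

def rotate_step(x, y, px, py, dphi):
    """First-order transformation generated by G = x*py - y*px.

    delta x  =  dG/dpx * dphi = -y  * dphi
    delta y  =  dG/dpy * dphi =  x  * dphi
    delta px = -dG/dx  * dphi = -py * dphi
    delta py = -dG/dy  * dphi =  px * dphi
    """
    return (x - y * dphi, y + x * dphi, px - py * dphi, py + px * dphi)

def rotate(state, phi, n_steps):
    """Apply n_steps infinitesimal steps of size phi / n_steps."""
    dphi = phi / n_steps
    for _ in range(n_steps):
        state = rotate_step(*state, dphi)
    return state

x, y, px, py = rotate((1.0, 0.0, 0.0, 1.0), math.pi / 2, 100_000)
print(x, y, px, py)
```

With 100,000 steps the result agrees with an exact quarter-turn, which sends (x, y) = (1, 0) to (0, 1) and (pₓ, p_y) = (0, 1) to (−1, 0), to within about 10⁻⁵.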
Motion as canonical transformation
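Motion itself (or, equivalently, a shift in the time origin) is a canonical transformation. If <math>\boldsymbol{Q}(t) \equiv \boldsymbol{q}(t + \tau)</math> and <math>\boldsymbol{P}(t) \equiv \boldsymbol{p}(t + \tau)</math>, then Hamilton's principle is automatically satisfied:
<math display="block">\delta \int_{t_1}^{t_2} \left[ \boldsymbol{P} \cdot \dot{\boldsymbol{Q}} - K(\boldsymbol{Q}, \boldsymbol{P}, t) \right] dt = \delta \int_{t_1 + \tau}^{t_2 + \tau} \left[ \boldsymbol{p} \cdot \dot{\boldsymbol{q}} - H(\boldsymbol{q}, \boldsymbol{p}, t + \tau) \right] dt = 0,</math>
since a valid trajectory <math>(\boldsymbol{q}(t), \boldsymbol{p}(t))</math> should always satisfy Hamilton's principle, regardless of the endpoints.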
Examples
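The translation <math>\boldsymbol{Q}(\boldsymbol{q}, \boldsymbol{p}) = \boldsymbol{q} + \boldsymbol{a}, \ \boldsymbol{P}(\boldsymbol{q}, \boldsymbol{p}) = \boldsymbol{p} + \boldsymbol{b}</math>, where <math>\boldsymbol{a}, \boldsymbol{b}</math> are two constant vectors, is a canonical transformation. Indeed, the Jacobian matrix is the identity, which is symplectic: <math>I^T J I = J</math>.
Set <math>\boldsymbol{x} = (q, p)</math> and <math>\boldsymbol{X} = (Q, P)</math>; the transformation <math>\boldsymbol{X}(\boldsymbol{x}) = R \boldsymbol{x}</math>, where <math>R \in SO(2)</math> is a rotation matrix of order 2, is canonical. Keeping in mind that special orthogonal matrices obey <math>R^T R = I</math>, it is easy to see that the Jacobian is symplectic. However, this example only works in dimension 2: <math>SO(2)</math> is the only special orthogonal group in which every matrix is symplectic. Note that the rotation here acts on <math>(q, p)</math> and not on <math>q</math> and <math>p</math> independently, so these are not the same as a physical rotation of an orthogonal coordinate system.
The transformation <math>(Q(q, p), P(q, p)) = (q + f(p), p)</math>, where <math>f(p)</math> is an arbitrary function of <math>p</math>, is canonical. The Jacobian matrix is indeed given by <math>\frac{\partial \boldsymbol{X}}{\partial \boldsymbol{x}} = \begin{pmatrix} 1 & f'(p) \\ 0 & 1 \end{pmatrix}</math>, which is symplectic.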
Modern mathematical description
In mathematical terms, canonical coordinates are any coordinates on the phase space (cotangent bundle) of the system that allow the canonical one-form to be written as
<math display="block">\sum_i p_i\, dq^i</math>
up to a total differential (exact form). The change of variable between one set of canonical coordinates and another is a canonical transformation. The index of the generalized coordinates q is written here as a superscript (<math>q^i</math>), not as a subscript as done above (<math>q_i</math>). The superscript conveys the contravariant transformation properties of the generalized coordinates, and does not mean that the coordinate is being raised to a power. Further details may be found at the symplectomorphism article.
History
The first major application of the canonical transformation was in 1846, by Charles Delaunay, in the study of the Earth-Moon-Sun system. This work resulted in the publication of a pair of large volumes as Mémoires by the French Academy of Sciences, in 1860 and 1867.