Jump to content

Calculus of variations: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Donludwig (talk | contribs)
added Fermat's principle. ~~~~
Line 54: Line 54:


then <math>f_0</math> has two continuous derivatives, and it satisfies the Euler-Lagrange equation.
then <math>f_0</math> has two continuous derivatives, and it satisfies the Euler-Lagrange equation.

==Fermat's principle==

[[Fermat's principle]] states that light takes a path that minimizes the optical length between its endpoints. If the ''x''-coordinate is chosen as the parameter along the path, and <math>y=f(x)</math> along the path, then the optical length is given by

:<math> A[f] = \int_{x=x_0}^{x_1} n(x,f(x)) \sqrt{1 + f'(x)^2} dx, \,</math>

where the refractive index <math>n(x,y)</math> depends upon the material.
If we try <math> f(x) = f_0 (x) + \epsilon f_1 (x)</math>
then the first variation of ''A'' (the derivative of ''A'' with respect to ε) is

:<math> \delta A[f_0,f_1] = \int_{x=x_0}^{x_1} \left[ \frac{ n(x,f_0) f_0'(x) f_1'(x)}{\sqrt{1 + f_0'(x)^2}} + n_y (x,f_0) f_1 \right] dx \,</math>

After integration by parts of the first term within brackets, we obtain the Euler-Lagrange equation

:<math> -\frac{d}{dx} \left[\frac{ n(x,f_0) f_0'}{\sqrt{1 + f_0'^2}} \right] + n_y (x,f_0) =0. \,</math>

The light rays may be determined by integrating this equation.

===Snell's law===

There is a discontinuity of the refractive index when light enters or leaves a lens. Let

:<math> n(x,y) = n_- \quad \hbox{if} \quad x<0, \,</math>
:<math> n(x,y) = n_+ \quad \hbox{if} \quad x>0,\,</math>

where <math>n_-</math> and <math>n_+</math> are constants. Then the Euler-Lagrange equation holds as before in the region where ''x''<0 or ''x''>0, and in fact the path is a straight line there, since the refractive index is constant. At the ''x''=0, ''f'' must be continuous, but ''f' '' may be discontinuous. After integration by parts in the separate regions and using the Euler-Lagrange equations, the first variation takes the form

:<math> \delta A[f_0,f_1] = f_1(0)\left[ n_-\frac{f_0'(0_-)}{\sqrt{1 + f_0'(0_-)^2}} -n_+\frac{f_0'(0_+)}{\sqrt{1 + f_0'(0_+)^2}} \right].\,</math>

The factor multiplying <math>n_-</math> is the sine of angle of the incident ray with the ''x'' axis, and the factor multiplying <math>n_+</math> is the sine of angle of the refracted ray with the ''x'' axis. This is [[Snell's law]] for refraction. As this calculation demonstrates, Snell's law is equivalent to vanishing of the first variation of the optical path length.

===Fermat's principle in three dimensions===

It is expedient to use vector notation: let <math>X=(x_1,x_2,x_3),</math> let ''t'' be a parameter, let <math>X(t)</math> be the parametric representation of a curve ''C'', and let <math>\dot X(t)</math> be its tangent vector. The optical length of the curve is given by

:<math> A[C] = \int_{t=t_0}^{t_1} n(X) \sqrt{ \dot X \cdot \dot X} dt. \,</math>

Note that this integral is invariant with respect to changes in the parametric representation of ''C''. The Euler-Lagrange equations for a minimizing curve have the symmetric form

:<math> \frac{d}{dt} P = \sqrt{ \dot X \cdot \dot X} \nabla n, \,</math>

where
:<math> P = \frac{n(x) \dot X}{\sqrt{\dot X \cdot \dot X} }.\,</math>

It follows from the definition that ''P'' satisfies


:<math> P \cdot P = n(X)^2. \,</math>

Therefore the integral may also be written as

:<math> A[C] = \int_{t=t_0}^{t_1} P \cdot \dot X \, dt.\,</math>

====Connection with the wave equation====

The [[wave equation]] for an inhomogeneous medium is

:<math> u_{tt} = c^2 \nabla \cdot \nabla u, \,</math>


where ''c'' is the propagation velocity, which generally depends upon ''X''. Wave fronts are characteristic surfaces for this partial differential equation: they satisfy
:<math> \varphi_t^2 = c(X)^2 \nabla \varphi \cdot \nabla \varphi. \,</math>

We may look for solutions in the form
:<math> \varphi(t,X) = t - \psi(X). \,</math>

In that case, ψ satisfies
:<math> \nabla \psi \cdot \nabla \psi = n^2, \,</math>

where <math>n=1/c.</math> According to the theory of [[first order partial differential equation]]s, if <math>P = \nabla \psi, </math> then ''P'' satisfies
:<math> \frac{dP}{ds} = 2 n \nabla n, \,</math>

along a system of curves ('''the rays''') that are given by
:<math> \frac{dX}{ds} = P. \,</math>

These equations are identical to the Euler-Lagrange equations if we make the identification
:<math> \frac{ds}{dt} = \frac{\sqrt{ \dot X \cdot \dot X} }{n}. \,</math>

We conclude that the function ψ is the value of the minimizing integral ''A'' as a function of the upper end point. That is, when a family of minimizing curves is constructed, the values of the optical length satisfy the characteristic equation corresponding the the wave equation. Hence, solving the associated partial differential equation of first order is equivalent to finding families of solutions of the variational problem. This is the essential content of the [[Hamilton-Jacobi theory]], which applies to general variational problems.


==See also==
==See also==

Revision as of 17:49, 11 May 2006

Calculus of variations is a field of mathematics that deals with functions of functions, as opposed to ordinary calculus which deals with functions of numbers. Such functionals can for example be formed as integrals involving an unknown function and its derivatives. The interest is in extremal functions: those making the functional attain a maximum or minimum value. Perhaps the simplest example of such a problem is to find the curve of shortest length connecting two points. If there are no constraints, the solution is obviously a straight line, but if the curve is constrained to lie on a surface in space, then it is less obvious. The solutions of the latter problem are called geodesics. A related problem is posed by Fermat's principle: light follows the path of shortest optical length connecting two points, where the optical length depends upon the material of the medium. A corresponding idea in mechanics is the principle of stationary action. The theory of optimal control is a generalization of the calculus of variations.

The preceding examples have all involved unknown functions of a single variable, which my be identified with a time variable. Other important problems involve functions of several variables. Solutions of boundary value problems for the Laplace equation satisfy the Dirichlet principle: they minimize the potential energy of a membrane. Plateau's problem requires finding a surface of minimal area that spans a given contour in space: the solution or solutions may be found by dipping a wire frame in a solution of soap suds. Although such experiments are relatively easy to perform, their mathematical interpretation is far from simple: there may be more than one locally minimizing surface, and they may have non-trivial topology.

The Euler-Lagrange equation

The maxima and minima of a given function may be located by finding the points where its derivative vanishes. In analogy, solutions of variational problems may be obtained by solving the associated Euler-Lagrange equation. In order to illustrate this process, consider the problem of finding the shortest curve in the plane that connects two points and . The arc length is given by

where , and The function f should have at least one derivative in order to be admitted to the competition. If is a minimizing function and is any function with at least one derivative that vanishes at the endpoints and , then we must have

for any number ε. Therefore, the derivative of with respect to ε (the first variation of A) must vanish at ε=0. Thus

for any choice of the function . We may interpret this condition as the vanishing of all directional derivatives of in the space of differentiable functions. If we assume that has two continuous derivatives, then it follows from integration by parts that

for any choice of with two derivatives that vanishes at the endpoints of the interval. This is a special case of the following situation:

for any with one derivative that vanishes at the endpoints on the interval. If at a point then there is an interval surrounding where H is positive. We may choose to vanish outside that interval, and to be non-negative inside. With this choice, which is a contradiction. There is a similar argument if at a point . We conclude that

It follows from this equation that

and hence the extremals are straight lines.


A similar calculation holds in the general case where

with

and f is required to have two continuous derivatives. In that case, an extremal will satisfy the Euler-Lagrange equation

The Euler-Lagrange equation is a necessary condition for an extremal. but its satisfaction does not guarantee that the solution is in fact an extremal. Sufficient conditions for an extremal are discussed in the references.

du Bois Raymond's theorem

The discussion thus far has assumed that extremal functions possess two continuous derivatives, although the existence of the integral A requires only first derivatives of trial functions. The condition that the first variation vanish at a extremal may be regarded as a weak form of the Euler-Lagrange equation. The theorem of du Bois Raymond asserts that this weak form implies the strong form. If L has continuous first and second derivatives with respect to all of its arguments, and if

then has two continuous derivatives, and it satisfies the Euler-Lagrange equation.

Fermat's principle

Fermat's principle states that light takes a path that minimizes the optical length between its endpoints. If the x-coordinate is chosen as the parameter along the path, and along the path, then the optical length is given by

where the refractive index depends upon the material. If we try then the first variation of A (the derivative of A with respect to ε) is

After integration by parts of the first term within brackets, we obtain the Euler-Lagrange equation

The light rays may be determined by integrating this equation.

Snell's law

There is a discontinuity of the refractive index when light enters or leaves a lens. Let

where and are constants. Then the Euler-Lagrange equation holds as before in the region where x<0 or x>0, and in fact the path is a straight line there, since the refractive index is constant. At the x=0, f must be continuous, but f' may be discontinuous. After integration by parts in the separate regions and using the Euler-Lagrange equations, the first variation takes the form

The factor multiplying is the sine of angle of the incident ray with the x axis, and the factor multiplying is the sine of angle of the refracted ray with the x axis. This is Snell's law for refraction. As this calculation demonstrates, Snell's law is equivalent to vanishing of the first variation of the optical path length.

Fermat's principle in three dimensions

It is expedient to use vector notation: let let t be a parameter, let be the parametric representation of a curve C, and let be its tangent vector. The optical length of the curve is given by

Note that this integral is invariant with respect to changes in the parametric representation of C. The Euler-Lagrange equations for a minimizing curve have the symmetric form

where

It follows from the definition that P satisfies


Therefore the integral may also be written as

Connection with the wave equation

The wave equation for an inhomogeneous medium is


where c is the propagation velocity, which generally depends upon X. Wave fronts are characteristic surfaces for this partial differential equation: they satisfy

We may look for solutions in the form

In that case, ψ satisfies

where According to the theory of first order partial differential equations, if then P satisfies

along a system of curves (the rays) that are given by

These equations are identical to the Euler-Lagrange equations if we make the identification

We conclude that the function ψ is the value of the minimizing integral A as a function of the upper end point. That is, when a family of minimizing curves is constructed, the values of the optical length satisfy the characteristic equation corresponding the the wave equation. Hence, solving the associated partial differential equation of first order is equivalent to finding families of solutions of the variational problem. This is the essential content of the Hamilton-Jacobi theory, which applies to general variational problems.

See also

Reference books

  • Fomin, S.V. and Gelfand, I.M.: Calculus of Variations, Dover Publ., 2000
  • Lebedev, L.P. and Cloud, M.J.: The Calculus of Variations and Functional Analysis with Optimal Control and Applications in Mechanics, World Scientific, 2003, pages 1-98
  • Charles Fox: An Introduction to the Calculus of Variations, Dover Publ., 1987
  • Forsyth, A.R.: Calculus of Variations, Dover, 1960
  • Sagan, Hans: Introduction to the Calculus of Variations, Dover, 1992
  • Weinstock, Robert: Calculus of Variations with Applications to Physics and Engineering, Dover, 1974
  • Clegg, J.C.: Calculus of Variations, Interscience Publishers Inc., 1968
  • Elsgolc, L.E.: Calculus of Variations, Pergamon Press Ltd., 1962