Jump to content

Change of variables (PDE): Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Adding short description: "Technique in partial differential evaluation"
 
(48 intermediate revisions by 36 users not shown)
Line 1: Line 1:
{{Short description|Technique in partial differential evaluation}}
{{seealso|Integration by substitution|Chain rule}}
{{for|change of variables for integration|integration by substitution}}


Often a [[partial differential equation]] can be reduced to a simpler form with a known solution by a suitable '''change of variables'''.
Often a [[partial differential equation]] can be reduced to a simpler form with a known solution by a suitable '''[[change of variables]]'''.


The article discusses change of variable for PDEs below in two ways:
==Technique explained by example==
#by example;
#by giving the theory of the method.


==Explanation by example==
For example the following simplified form of the [[Black–Scholes#Solution of the Black–Scholes PDE|Black–Scholes]] PDE

For example, the following simplified form of the [[Black–Scholes#Solution of the Black–Scholes PDE|Black–Scholes]] PDE


:<math> \frac{\partial V}{\partial t} + \frac{1}{2} S^2\frac{\partial^2 V}{\partial S^2} + S\frac{\partial V}{\partial S} - V = 0. </math>
:<math> \frac{\partial V}{\partial t} + \frac{1}{2} S^2\frac{\partial^2 V}{\partial S^2} + S\frac{\partial V}{\partial S} - V = 0. </math>


is reducible to the [[Heat equation]]
is reducible to the [[heat equation]]


:<math> \frac{\partial u}{\partial \tau} = \frac{\partial^2 u}{\partial x^2}</math>
:<math> \frac{\partial u}{\partial \tau} = \frac{\partial^2 u}{\partial x^2}</math>


by the change of variables:
by the change of variables<ref>[http://www.math.unl.edu/~sdunbar1/Teaching/MathematicalFinance/Lessons/BlackScholes/Solution/solution.shtml Solution of the Black Scholes Equation]</ref>:


:<math> V(S,t) = v(x(S),\tau(t)) </math>
:<math> V(S,t) = v(x(S),\tau(t)) </math>
:<math> x(S) = \ln(S) </math>
:<math> x(S) = \ln(S) </math>
:<math> \tau(t) = \frac{1}{2} \sigma^2 (T - t) </math>
:<math> \tau(t) = \frac{1}{2} (T - t) </math>
:<math> v(x,\tau)=\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math>
:<math> v(x,\tau)=\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math>


Line 24: Line 29:
* Replace <math>V(S,t)</math> by <math> v(x(S),\tau(t)) </math> and apply the [[chain rule]] to get
* Replace <math>V(S,t)</math> by <math> v(x(S),\tau(t)) </math> and apply the [[chain rule]] to get


:<math>\frac{1}{2}(-2v(s,\tau)+2 \frac{\partial\tau}{\partial t} \frac{\partial v}{\partial \tau} +S\left(\left(2 \frac{\partial x}{\partial S} + S\frac{\partial^2 x}{\partial S^2}\right)
::<math>\frac{1}{2}\left(-2v(x(S),\tau)+2 \frac{\partial\tau}{\partial t} \frac{\partial v}{\partial \tau} +S\left(\left(2 \frac{\partial x}{\partial S} + S\frac{\partial^2 x}{\partial S^2}\right)
\frac{\partial v}{\partial x} +
\frac{\partial v}{\partial x} +
S \left(\frac{\partial x}{\partial S}\right)^2 \frac{\partial^2 v}{\partial x^2}\right)=0
S \left(\frac{\partial x}{\partial S}\right)^2 \frac{\partial^2 v}{\partial x^2}\right)\right)=0.
</math>
</math>


* Replace <math>x(S)</math> and <math>\tau(t)</math> by <math>\ln(S) </math> and <math>\frac{1}{2}(T-t)</math> to get
* Replace <math>x(S)</math> and <math>\tau(t)</math> by <math>\ln(S) </math> and <math>\frac{1}{2}(T-t)</math> to get


:<math>\frac{1}{2}(
::<math>\frac{1}{2}\left(
-2v(\ln(S),\frac{1}{2}(T-t))
-2v(\ln(S),\frac{1}{2}(T-t))
-\frac{\partial v(\ln(S),\frac{1}{2}(T-t))}{\partial\tau}
-\frac{\partial v(\ln(S),\frac{1}{2}(T-t))}{\partial\tau}
+\frac{\partial v(\ln(S),\frac{1}{2}(T-t))}{\partial x}
+\frac{\partial v(\ln(S),\frac{1}{2}(T-t))}{\partial x}
+\frac{\partial^2 v(\ln(S),\frac{1}{2}(T-t))}{\partial x}
+\frac{\partial^2 v(\ln(S),\frac{1}{2}(T-t))}{\partial x^2}\right)=0.
</math>
</math>


* Replace <math>\ln(S) </math> and <math>\frac{1}{2}(T-t)</math> by <math>x(S)</math> and <math>\tau(t)</math> and divide both sides by <math>\frac{1}{2}</math> to get
* Replace <math>\ln(S) </math> and <math>\frac{1}{2}(T-t)</math> by <math>x(S)</math> and <math>\tau(t)</math> and divide both sides by <math>\frac{1}{2}</math> to get


:<math>-2 v-\frac{\partial v}{\partial\tau}+\frac{\partial v}{\partial x}+ \frac{\partial^2 v}{\partial x^2}=0</math>
::<math>-2 v-\frac{\partial v}{\partial\tau}+\frac{\partial v}{\partial x}+ \frac{\partial^2 v}{\partial x^2}=0.</math>


* Replace <math>v(x,\tau)</math> by <math>\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math> and divide through by <math>-\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math> to yield the heat equation.
* Replace <math>v(x,\tau)</math> by <math>\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math> and divide through by <math>-\exp(-(1/2)x-(9/4)\tau)</math> to yield the heat equation.


Advice on the application of change of variable to PDEs is given by mathematician [[J. Michael Steele]]<ref>[[J. Michael Steele]], ''Stochastic Calculus and Financial Applications'', Springer, New York, 2001</ref>:
Advice on the application of change of variable to PDEs is given by mathematician [[J. Michael Steele]]:<ref>[[J. Michael Steele]], ''Stochastic Calculus and Financial Applications'', Springer, New York, 2001</ref>
{{quotation|"There is nothing particularly difficult about changing variables and transforming one equation to another, but there is an element of tedium and complexity that slows us down. There is no universal remedy for this molasses effect, but the calculations do seem to go more quickly if one follows a well-defined plan. If we know that <math>V(S,t)</math> satisfies an equation (like the Black-Scholes equation) we are guaranteed that we can make good use of the equation in the derivation of the equation for a new function <math>v(x,t)</math> defined in terms of the old if we write the old ''V'' as a function of the new ''v'' and write the new ''t'' and ''x'' as functions of the old ''t'' and ''S''. This order of things puts everything in the direct line of fire of the chain rule; the partial derivatives <math>\frac{\partial V}{\partial t}</math>, <math>\frac{\partial V}{\partial S}</math> and <math>\frac{\partial^2 V}{\partial S^2}</math>are easy to compute and at the end, the original equation stands ready for immediate use."}}
{{quotation|"There is nothing particularly difficult about changing variables and transforming one equation to another, but there is an element of tedium and complexity that slows us down. There is no universal remedy for this molasses effect, but the calculations do seem to go more quickly if one follows a well-defined plan. If we know that <math>V(S,t)</math> satisfies an equation (like the Black&ndash;Scholes equation) we are guaranteed that we can make good use of the equation in the derivation of the equation for a new function <math>v(x,t)</math> defined in terms of the old if we write the old ''V'' as a function of the new ''v'' and write the new <math>\tau</math> and ''x'' as functions of the old ''t'' and ''S''. This order of things puts everything in the direct line of fire of the chain rule; the partial derivatives <math>\frac{\partial V}{\partial t}</math>, <math>\frac{\partial V}{\partial S}</math> and <math>\frac{\partial^2 V}{\partial S^2}</math>are easy to compute and at the end, the original equation stands ready for immediate use."}}


==Technique in general==
==Technique in general==
Line 69: Line 74:
:<math>t=f(a(x,t),b(x,t))</math>
:<math>t=f(a(x,t),b(x,t))</math>


In other words, it is helpful for there to be a [[bijection]] between the old set of variables and the new one, or else one has to
Suppose <math>\mathcal{L}</math> is a [[differential operator]] such that
* Restrict the domain of applicability of the correspondence to a subject of the real plane which is sufficient for a solution of the practical problem at hand (where again it needs to be a bijection), and
* Enumerate the (zero or more finite list) of exceptions (poles) where the otherwise-bijection fails (and say why these exceptions don't restrict the applicability of the solution of the reduced equation to the original equation)

If a bijection does not exist then the solution to the reduced-form equation will not in general be a solution of the original equation.

We are discussing change of variable for PDEs. A PDE can be expressed as a [[differential operator]] applied to a function. Suppose <math>\mathcal{L}</math> is a differential operator such that


:<math>\mathcal{L}u(x,t)=0</math>
:<math>\mathcal{L}u(x,t)=0</math>


Then it is also the case that
Then it is also the case that


:<math>\mathcal{L}v(x_1,x_2)=0</math>
:<math>\mathcal{L}v(x_1,x_2)=0</math>
Line 81: Line 92:
:<math>v(x_1,x_2)=u(e(x_1,x_2),f(x_1,x_2))</math>
:<math>v(x_1,x_2)=u(e(x_1,x_2),f(x_1,x_2))</math>


and we operate as follows to go from <math>\mathcal{L}u(x,t)=0</math> to <math>\mathcal{L}v(x_1,x_2)=0</math>:
and we operate as follows to go from <math>\mathcal{L}u(x,t)=0</math> to <math>\mathcal{L}v(x_1,x_2)=0:</math>
* Apply the [[chain rule]] to <math>\mathcal{L} v(x_1(x,t),x_2(x,t))=0</math> and expand out giving equation <math>e_1</math>.
* Apply the [[chain rule]] to <math>\mathcal{L} v(x_1(x,t),x_2(x,t))=0</math> and expand out giving equation <math>e_1</math>.
* Substitute <math>a(x,t)</math> for <math>x_1(x,t)</math> and <math>b(x,t)</math> for <math>x_2(x,t)</math> in <math>e_1</math> and expand out giving equation <math>e_2</math>.
* Substitute <math>a(x,t)</math> for <math>x_1(x,t)</math> and <math>b(x,t)</math> for <math>x_2(x,t)</math> in <math>e_1</math> and expand out giving equation <math>e_2</math>.
* Replace occurrences of <math>x</math> by <math>e(x_1,x_2)</math> and <math>t</math> by <math>f(x_1,x_2)</math> to yield <math>\mathcal{L}v(x_1,x_2)=0</math>, which will be free of <math>x</math> and <math>t</math>.
* Replace occurrences of <math>x</math> by <math>e(x_1,x_2)</math> and <math>t</math> by <math>f(x_1,x_2)</math> to yield <math>\mathcal{L}v(x_1,x_2)=0</math>, which will be free of <math>x</math> and <math>t</math>.

In the context of PDEs, Weizhang Huang and Robert D. Russell define and explain the different possible time-dependent transformations in details.<ref>{{cite book |last1=Huang |first1=Weizhang |last2=Russell |first2=Russell |title=[[Adaptive moving mesh methods]] |publisher=Springer New York |publication-date=2011 |page=141}}</ref>

==Action-angle coordinates==

Often, theory can establish the existence of a change of variables, although the formula itself cannot be explicitly stated. For an integrable Hamiltonian system of dimension <math> n </math>, with <math> \dot{x}_i = \partial H/\partial p_j </math> and <math> \dot{p}_j = - \partial H/\partial x_j </math>, there exist <math> n </math> integrals <math> I_i
</math>. There exists a change of variables from the coordinates <math> \{ x_1, \dots, x_n, p_1, \dots, p_n \} </math> to a set of variables <math> \{ I_1, \dots I_n, \varphi_1, \dots, \varphi_n \} </math>, in which the equations of motion become <math> \dot{I}_i = 0 </math>, <math> \dot{\varphi}_i = \omega_i(I_1, \dots, I_n) </math>, where the functions <math> \omega_1, \dots, \omega_n </math> are unknown, but depend only on <math> I_1, \dots, I_n </math>. The variables <math> I_1, \dots, I_n </math> are the action coordinates, the variables <math> \varphi_1, \dots, \varphi_n </math> are the angle coordinates. The motion of the system can thus be visualized as rotation on torii. As a particular example, consider the simple harmonic oscillator, with <math> \dot{x} = 2p </math> and <math> \dot{p} = - 2x </math>, with Hamiltonian <math> H(x,p) = x^2 + p^2 </math>. This system can be rewritten as <math> \dot{I} = 0 </math>, <math> \dot{\varphi} = 1 </math>, where <math> I </math> and <math> \varphi </math> are the canonical polar coordinates: <math> I = p^2 + q^2 </math> and <math> \tan(\varphi) = p/x </math>. See [[Vladimir Arnold|V. I. Arnold]], `Mathematical Methods of Classical Mechanics', for more details.<ref>[[Vladimir Arnold|V. I. Arnold]], ''Mathematical Methods of Classical Mechanics'', Graduate Texts in Mathematics, v. 60, Springer-Verlag, New York, 1989</ref>


==References==
==References==

Latest revision as of 10:43, 25 August 2023

Often a partial differential equation can be reduced to a simpler form with a known solution by a suitable change of variables.

The article discusses change of variable for PDEs below in two ways:

  1. by example;
  2. by giving the theory of the method.

Explanation by example

[edit]

For example, the following simplified form of the Black–Scholes PDE

is reducible to the heat equation

by the change of variables:

in these steps:

  • Replace by and apply the chain rule to get
  • Replace and by and to get
  • Replace and by and and divide both sides by to get
  • Replace by and divide through by to yield the heat equation.

Advice on the application of change of variable to PDEs is given by mathematician J. Michael Steele:[1]

"There is nothing particularly difficult about changing variables and transforming one equation to another, but there is an element of tedium and complexity that slows us down. There is no universal remedy for this molasses effect, but the calculations do seem to go more quickly if one follows a well-defined plan. If we know that satisfies an equation (like the Black–Scholes equation) we are guaranteed that we can make good use of the equation in the derivation of the equation for a new function defined in terms of the old if we write the old V as a function of the new v and write the new and x as functions of the old t and S. This order of things puts everything in the direct line of fire of the chain rule; the partial derivatives , and are easy to compute and at the end, the original equation stands ready for immediate use."

Technique in general

[edit]

Suppose that we have a function and a change of variables such that there exist functions such that

and functions such that

and furthermore such that

and

In other words, it is helpful for there to be a bijection between the old set of variables and the new one, or else one has to

  • Restrict the domain of applicability of the correspondence to a subject of the real plane which is sufficient for a solution of the practical problem at hand (where again it needs to be a bijection), and
  • Enumerate the (zero or more finite list) of exceptions (poles) where the otherwise-bijection fails (and say why these exceptions don't restrict the applicability of the solution of the reduced equation to the original equation)

If a bijection does not exist then the solution to the reduced-form equation will not in general be a solution of the original equation.

We are discussing change of variable for PDEs. A PDE can be expressed as a differential operator applied to a function. Suppose is a differential operator such that

Then it is also the case that

where

and we operate as follows to go from to

  • Apply the chain rule to and expand out giving equation .
  • Substitute for and for in and expand out giving equation .
  • Replace occurrences of by and by to yield , which will be free of and .

In the context of PDEs, Weizhang Huang and Robert D. Russell define and explain the different possible time-dependent transformations in details.[2]

Action-angle coordinates

[edit]

Often, theory can establish the existence of a change of variables, although the formula itself cannot be explicitly stated. For an integrable Hamiltonian system of dimension , with and , there exist integrals . There exists a change of variables from the coordinates to a set of variables , in which the equations of motion become , , where the functions are unknown, but depend only on . The variables are the action coordinates, the variables are the angle coordinates. The motion of the system can thus be visualized as rotation on torii. As a particular example, consider the simple harmonic oscillator, with and , with Hamiltonian . This system can be rewritten as , , where and are the canonical polar coordinates: and . See V. I. Arnold, `Mathematical Methods of Classical Mechanics', for more details.[3]

References

[edit]
  1. ^ J. Michael Steele, Stochastic Calculus and Financial Applications, Springer, New York, 2001
  2. ^ Huang, Weizhang; Russell, Russell (2011). Adaptive moving mesh methods. Springer New York. p. 141.
  3. ^ V. I. Arnold, Mathematical Methods of Classical Mechanics, Graduate Texts in Mathematics, v. 60, Springer-Verlag, New York, 1989