Change of variables (PDE): Difference between revisions

Content deleted Content added

Inline

Latest revision as of 10:43, 25 August 2023

Often a partial differential equation can be reduced to a simpler form with a known solution by a suitable change of variables.

The article discusses change of variable for PDEs below in two ways:

by example;
by giving the theory of the method.

Explanation by example

For example, the following simplified form of the Black–Scholes PDE

{\frac {\partial V}{\partial t}}+{\frac {1}{2}}S^{2}{\frac {\partial ^{2}V}{\partial S^{2}}}+S{\frac {\partial V}{\partial S}}-V=0.

is reducible to the heat equation

{\frac {\partial u}{\partial \tau }}={\frac {\partial ^{2}u}{\partial x^{2}}}

by the change of variables:

V(S,t)=v(x(S),\tau (t))

x(S)=\ln(S)

\tau (t)={\frac {1}{2}}(T-t)

v(x,\tau )=\exp(-(1/2)x-(9/4)\tau )u(x,\tau )

in these steps:

Replace $V(S,t)$ by $v(x(S),\tau (t))$ and apply the chain rule to get

{\frac {1}{2}}\left(-2v(x(S),\tau )+2{\frac {\partial \tau }{\partial t}}{\frac {\partial v}{\partial \tau }}+S\left(\left(2{\frac {\partial x}{\partial S}}+S{\frac {\partial ^{2}x}{\partial S^{2}}}\right){\frac {\partial v}{\partial x}}+S\left({\frac {\partial x}{\partial S}}\right)^{2}{\frac {\partial ^{2}v}{\partial x^{2}}}\right)\right)=0.

Replace $x(S)$ and $\tau (t)$ by $\ln(S)$ and ${\frac {1}{2}}(T-t)$ to get

{\frac {1}{2}}\left(-2v(\ln(S),{\frac {1}{2}}(T-t))-{\frac {\partial v(\ln(S),{\frac {1}{2}}(T-t))}{\partial \tau }}+{\frac {\partial v(\ln(S),{\frac {1}{2}}(T-t))}{\partial x}}+{\frac {\partial ^{2}v(\ln(S),{\frac {1}{2}}(T-t))}{\partial x^{2}}}\right)=0.

Replace $\ln(S)$ and ${\frac {1}{2}}(T-t)$ by $x(S)$ and $\tau (t)$ and divide both sides by ${\frac {1}{2}}$ to get

-2v-{\frac {\partial v}{\partial \tau }}+{\frac {\partial v}{\partial x}}+{\frac {\partial ^{2}v}{\partial x^{2}}}=0.

Replace $v(x,\tau )$ by $\exp(-(1/2)x-(9/4)\tau )u(x,\tau )$ and divide through by $-\exp(-(1/2)x-(9/4)\tau )$ to yield the heat equation.

Advice on the application of change of variable to PDEs is given by mathematician J. Michael Steele:^[1]

"There is nothing particularly difficult about changing variables and transforming one equation to another, but there is an element of tedium and complexity that slows us down. There is no universal remedy for this molasses effect, but the calculations do seem to go more quickly if one follows a well-defined plan. If we know that $V(S,t)$ satisfies an equation (like the Black–Scholes equation) we are guaranteed that we can make good use of the equation in the derivation of the equation for a new function $v(x,t)$ defined in terms of the old if we write the old V as a function of the new v and write the new $\tau$ and x as functions of the old t and S. This order of things puts everything in the direct line of fire of the chain rule; the partial derivatives ${\frac {\partial V}{\partial t}}$ , ${\frac {\partial V}{\partial S}}$ and ${\frac {\partial ^{2}V}{\partial S^{2}}}$ are easy to compute and at the end, the original equation stands ready for immediate use."

Technique in general

Suppose that we have a function $u(x,t)$ and a change of variables $x_{1},x_{2}$ such that there exist functions $a(x,t),b(x,t)$ such that

x_{1}=a(x,t)

x_{2}=b(x,t)

and functions $e(x_{1},x_{2}),f(x_{1},x_{2})$ such that

x=e(x_{1},x_{2})

t=f(x_{1},x_{2})

and furthermore such that

x_{1}=a(e(x_{1},x_{2}),f(x_{1},x_{2}))

x_{2}=b(e(x_{1},x_{2}),f(x_{1},x_{2}))

and

x=e(a(x,t),b(x,t))

t=f(a(x,t),b(x,t))

In other words, it is helpful for there to be a bijection between the old set of variables and the new one, or else one has to

Restrict the domain of applicability of the correspondence to a subject of the real plane which is sufficient for a solution of the practical problem at hand (where again it needs to be a bijection), and
Enumerate the (zero or more finite list) of exceptions (poles) where the otherwise-bijection fails (and say why these exceptions don't restrict the applicability of the solution of the reduced equation to the original equation)

If a bijection does not exist then the solution to the reduced-form equation will not in general be a solution of the original equation.

We are discussing change of variable for PDEs. A PDE can be expressed as a differential operator applied to a function. Suppose ${\mathcal {L}}$ is a differential operator such that

{\mathcal {L}}u(x,t)=0

Then it is also the case that

{\mathcal {L}}v(x_{1},x_{2})=0

where

v(x_{1},x_{2})=u(e(x_{1},x_{2}),f(x_{1},x_{2}))

and we operate as follows to go from ${\mathcal {L}}u(x,t)=0$ to ${\mathcal {L}}v(x_{1},x_{2})=0:$

Apply the chain rule to ${\mathcal {L}}v(x_{1}(x,t),x_{2}(x,t))=0$ and expand out giving equation $e_{1}$ .
Substitute $a(x,t)$ for $x_{1}(x,t)$ and $b(x,t)$ for $x_{2}(x,t)$ in $e_{1}$ and expand out giving equation $e_{2}$ .
Replace occurrences of $x$ by $e(x_{1},x_{2})$ and $t$ by $f(x_{1},x_{2})$ to yield ${\mathcal {L}}v(x_{1},x_{2})=0$ , which will be free of $x$ and $t$ .

In the context of PDEs, Weizhang Huang and Robert D. Russell define and explain the different possible time-dependent transformations in details.^[2]

Action-angle coordinates

Often, theory can establish the existence of a change of variables, although the formula itself cannot be explicitly stated. For an integrable Hamiltonian system of dimension $n$ , with ${\dot {x}}_{i}=\partial H/\partial p_{j}$ and ${\dot {p}}_{j}=-\partial H/\partial x_{j}$ , there exist $n$ integrals $I_{i}$ . There exists a change of variables from the coordinates $\{x_{1},\dots ,x_{n},p_{1},\dots ,p_{n}\}$ to a set of variables $\{I_{1},\dots I_{n},\varphi _{1},\dots ,\varphi _{n}\}$ , in which the equations of motion become ${\dot {I}}_{i}=0$ , ${\dot {\varphi }}_{i}=\omega _{i}(I_{1},\dots ,I_{n})$ , where the functions $\omega _{1},\dots ,\omega _{n}$ are unknown, but depend only on $I_{1},\dots ,I_{n}$ . The variables $I_{1},\dots ,I_{n}$ are the action coordinates, the variables $\varphi _{1},\dots ,\varphi _{n}$ are the angle coordinates. The motion of the system can thus be visualized as rotation on torii. As a particular example, consider the simple harmonic oscillator, with ${\dot {x}}=2p$ and ${\dot {p}}=-2x$ , with Hamiltonian $H(x,p)=x^{2}+p^{2}$ . This system can be rewritten as ${\dot {I}}=0$ , ${\dot {\varphi }}=1$ , where $I$ and $\varphi$ are the canonical polar coordinates: $I=p^{2}+q^{2}$ and $\tan(\varphi )=p/x$ . See V. I. Arnold, `Mathematical Methods of Classical Mechanics', for more details.^[3]

References

^ J. Michael Steele, Stochastic Calculus and Financial Applications, Springer, New York, 2001
^ Huang, Weizhang; Russell, Russell (2011). Adaptive moving mesh methods. Springer New York. p. 141.
^ V. I. Arnold, Mathematical Methods of Classical Mechanics, Graduate Texts in Mathematics, v. 60, Springer-Verlag, New York, 1989

[1] J. Michael Steele, Stochastic Calculus and Financial Applications, Springer, New York, 2001

[2] Huang, Weizhang; Russell, Russell (2011). Adaptive moving mesh methods. Springer New York. p. 141.

[3] V. I. Arnold, Mathematical Methods of Classical Mechanics, Graduate Texts in Mathematics, v. 60, Springer-Verlag, New York, 1989

[1]

[2]

[3]

@@ Line 1: / Line 1: @@
+{{Short description|Technique in partial differential evaluation}}
-{{seealso|Integration by substitution|Chain rule}}
+{{for|change of variables for integration|integration by substitution}}
-Often a [[partial differential equation]] can be reduced to a simpler form with a known solution by a suitable '''change of variables'''.
+Often a [[partial differential equation]] can be reduced to a simpler form with a known solution by a suitable '''[[change of variables]]'''.
+The article discusses change of variable for PDEs below in two ways:
-==Technique explained by example==
+#by example;
+#by giving the theory of the method.
+==Explanation by example==
-For example the following simplified form of the [[Black–Scholes#Solution of the Black–Scholes PDE|Black–Scholes]] PDE
+For example, the following simplified form of the [[Black–Scholes#Solution of the Black–Scholes PDE|Black–Scholes]] PDE
 :<math> \frac{\partial V}{\partial t} + \frac{1}{2} S^2\frac{\partial^2 V}{\partial S^2} + S\frac{\partial V}{\partial S} - V = 0. </math>
-is reducible to the [[Heat equation]]
+is reducible to the [[heat equation]]
 :<math> \frac{\partial u}{\partial \tau} = \frac{\partial^2 u}{\partial x^2}</math>
+by the change of variables:
-by the change of variables<ref>[http://www.math.unl.edu/~sdunbar1/Teaching/MathematicalFinance/Lessons/BlackScholes/Solution/solution.shtml Solution of the Black Scholes Equation]</ref>:
 :<math> V(S,t) = v(x(S),\tau(t)) </math>
 :<math> x(S) = \ln(S) </math>
-:<math> \tau(t) = \frac{1}{2} \sigma^2 (T - t)  </math>
+:<math> \tau(t) = \frac{1}{2} (T - t)  </math>
 :<math> v(x,\tau)=\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math>
@@ Line 24: / Line 29: @@
 * Replace <math>V(S,t)</math> by <math> v(x(S),\tau(t)) </math> and apply the [[chain rule]] to get
-:<math>\frac{1}{2}(-2v(s,\tau)+2 \frac{\partial\tau}{\partial t} \frac{\partial v}{\partial \tau} +S\left(\left(2 \frac{\partial x}{\partial S} + S\frac{\partial^2 x}{\partial S^2}\right)
+::<math>\frac{1}{2}\left(-2v(x(S),\tau)+2 \frac{\partial\tau}{\partial t} \frac{\partial v}{\partial \tau} +S\left(\left(2 \frac{\partial x}{\partial S} + S\frac{\partial^2 x}{\partial S^2}\right)
 \frac{\partial v}{\partial x} +
-S \left(\frac{\partial x}{\partial S}\right)^2 \frac{\partial^2 v}{\partial x^2}\right)=0
+S \left(\frac{\partial x}{\partial S}\right)^2 \frac{\partial^2 v}{\partial x^2}\right)\right)=0.
 </math>
 * Replace <math>x(S)</math> and <math>\tau(t)</math> by <math>\ln(S) </math> and <math>\frac{1}{2}(T-t)</math> to get
-:<math>\frac{1}{2}(
+::<math>\frac{1}{2}\left(
   -2v(\ln(S),\frac{1}{2}(T-t))
   -\frac{\partial v(\ln(S),\frac{1}{2}(T-t))}{\partial\tau}
   +\frac{\partial v(\ln(S),\frac{1}{2}(T-t))}{\partial x}
-  +\frac{\partial^2 v(\ln(S),\frac{1}{2}(T-t))}{\partial x}
+  +\frac{\partial^2 v(\ln(S),\frac{1}{2}(T-t))}{\partial x^2}\right)=0.
 </math>
 * Replace <math>\ln(S) </math> and <math>\frac{1}{2}(T-t)</math> by <math>x(S)</math> and <math>\tau(t)</math> and divide both sides by <math>\frac{1}{2}</math> to get
-:<math>-2 v-\frac{\partial v}{\partial\tau}+\frac{\partial v}{\partial x}+ \frac{\partial^2 v}{\partial x^2}=0</math>
+::<math>-2 v-\frac{\partial v}{\partial\tau}+\frac{\partial v}{\partial x}+ \frac{\partial^2 v}{\partial x^2}=0.</math>
-* Replace <math>v(x,\tau)</math> by <math>\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math> and divide through by <math>-\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math> to yield the heat equation.
+* Replace <math>v(x,\tau)</math> by <math>\exp(-(1/2)x-(9/4)\tau) u(x,\tau) </math> and divide through by <math>-\exp(-(1/2)x-(9/4)\tau)</math> to yield the heat equation.
-Advice on the application of change of variable to PDEs is given by mathematician [[J. Michael Steele]]<ref>[[J. Michael Steele]], ''Stochastic Calculus and Financial Applications'', Springer, New York, 2001</ref>:
+Advice on the application of change of variable to PDEs is given by mathematician [[J. Michael Steele]]:<ref>[[J. Michael Steele]], ''Stochastic Calculus and Financial Applications'', Springer, New York, 2001</ref>
-{{quotation|"There is nothing particularly difficult about changing variables and transforming one equation to another, but there is an element of tedium and complexity that slows us down. There is no universal remedy for this molasses effect, but the calculations do seem to go more quickly if one follows a well-defined plan. If we know that <math>V(S,t)</math> satisfies an equation (like the Black-Scholes equation) we are guaranteed that we can make good use of the equation in the derivation of the equation for a new function <math>v(x,t)</math> defined in terms of the old if we write the old ''V'' as a function of the new ''v'' and write the new ''t'' and ''x'' as functions of the old ''t'' and ''S''. This order of things puts everything in the direct line of fire of the chain rule; the partial derivatives <math>\frac{\partial V}{\partial t}</math>, <math>\frac{\partial V}{\partial S}</math> and <math>\frac{\partial^2 V}{\partial S^2}</math>are easy to compute and at the end, the original equation stands ready for immediate use."}}
+{{quotation|"There is nothing particularly difficult about changing variables and transforming one equation to another, but there is an element of tedium and complexity that slows us down. There is no universal remedy for this molasses effect, but the calculations do seem to go more quickly if one follows a well-defined plan. If we know that <math>V(S,t)</math> satisfies an equation (like the Black&ndash;Scholes equation) we are guaranteed that we can make good use of the equation in the derivation of the equation for a new function <math>v(x,t)</math> defined in terms of the old if we write the old ''V'' as a function of the new ''v'' and write the new <math>\tau</math> and ''x'' as functions of the old ''t'' and ''S''. This order of things puts everything in the direct line of fire of the chain rule; the partial derivatives <math>\frac{\partial V}{\partial t}</math>, <math>\frac{\partial V}{\partial S}</math> and <math>\frac{\partial^2 V}{\partial S^2}</math>are easy to compute and at the end, the original equation stands ready for immediate use."}}
 ==Technique in general==
@@ Line 69: / Line 74: @@
 :<math>t=f(a(x,t),b(x,t))</math>
+In other words, it is helpful for there to be a [[bijection]] between the old set of variables and the new one, or else one has to
-Suppose <math>\mathcal{L}</math> is a [[differential operator]] such that
+* Restrict the domain of applicability of the correspondence to a subject of the real plane which is sufficient for a solution of the practical problem at hand (where again it needs to be a bijection), and
+* Enumerate the (zero or more finite list) of exceptions (poles) where the otherwise-bijection fails (and say why these exceptions don't restrict the applicability of the solution of the reduced equation to the original equation)
+If a bijection does not exist then the solution to the reduced-form equation will not in general be a solution of the original equation.
+We are discussing change of variable for PDEs.  A PDE can be expressed as a [[differential operator]] applied to a function.  Suppose <math>\mathcal{L}</math> is a differential operator such that
 :<math>\mathcal{L}u(x,t)=0</math>
 Then it is also the case that
 :<math>\mathcal{L}v(x_1,x_2)=0</math>
@@ Line 81: / Line 92: @@
 :<math>v(x_1,x_2)=u(e(x_1,x_2),f(x_1,x_2))</math>
-and we operate as follows to go from <math>\mathcal{L}u(x,t)=0</math> to <math>\mathcal{L}v(x_1,x_2)=0</math>:
+and we operate as follows to go from <math>\mathcal{L}u(x,t)=0</math> to <math>\mathcal{L}v(x_1,x_2)=0:</math>
 * Apply the [[chain rule]] to <math>\mathcal{L} v(x_1(x,t),x_2(x,t))=0</math> and expand out giving equation <math>e_1</math>.
 * Substitute <math>a(x,t)</math> for <math>x_1(x,t)</math> and <math>b(x,t)</math> for <math>x_2(x,t)</math> in <math>e_1</math> and expand out giving equation <math>e_2</math>.
 * Replace occurrences of <math>x</math> by <math>e(x_1,x_2)</math> and <math>t</math> by <math>f(x_1,x_2)</math> to yield <math>\mathcal{L}v(x_1,x_2)=0</math>, which will be free of  <math>x</math> and <math>t</math>.
+In the context of PDEs, Weizhang Huang and Robert D. Russell define and explain the different possible time-dependent transformations in details.<ref>{{cite book |last1=Huang |first1=Weizhang |last2=Russell |first2=Russell |title=[[Adaptive moving mesh methods]] |publisher=Springer New York |publication-date=2011 |page=141}}</ref>
+==Action-angle coordinates==
+Often, theory can establish the existence of a change of variables, although the formula itself cannot be explicitly stated.  For an integrable Hamiltonian system of dimension <math> n </math>, with <math> \dot{x}_i = \partial H/\partial p_j </math> and <math> \dot{p}_j = - \partial H/\partial x_j </math>, there exist <math> n </math> integrals <math> I_i
+</math>.  There exists a change of variables from the coordinates <math> \{ x_1, \dots, x_n, p_1, \dots, p_n \} </math> to a set of variables <math> \{ I_1, \dots I_n, \varphi_1, \dots, \varphi_n \} </math>, in which the equations of motion become <math> \dot{I}_i = 0 </math>, <math> \dot{\varphi}_i = \omega_i(I_1, \dots, I_n) </math>, where the functions <math> \omega_1, \dots, \omega_n </math> are unknown, but depend only on <math> I_1, \dots, I_n </math>.  The variables <math> I_1, \dots, I_n </math> are the action coordinates, the variables <math> \varphi_1, \dots, \varphi_n </math> are the angle coordinates.  The motion of the system can thus be visualized as rotation on torii.  As a particular example, consider the simple harmonic oscillator, with <math> \dot{x} = 2p </math> and <math> \dot{p} = - 2x </math>, with Hamiltonian <math> H(x,p) = x^2 + p^2 </math>.  This system can be rewritten as <math> \dot{I} = 0 </math>, <math> \dot{\varphi} = 1 </math>, where <math> I </math> and <math> \varphi </math> are the canonical polar coordinates: <math> I = p^2 + q^2 </math> and <math> \tan(\varphi) = p/x </math>.  See [[Vladimir Arnold|V. I. Arnold]], `Mathematical Methods of Classical Mechanics', for more details.<ref>[[Vladimir Arnold|V. I. Arnold]], ''Mathematical Methods of Classical Mechanics'', Graduate Texts in Mathematics, v. 60, Springer-Verlag, New York, 1989</ref>
 ==References==