Jump to content

Taylor's theorem: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
 
(744 intermediate revisions by more than 100 users not shown)
Line 1: Line 1:
{{short description|Approximation of a function by a truncated power series}}
[[Image:Taylorspolynomialexbig.svg|thumb|right|300px|The exponential function <math>y=e^x</math> (continuous red line) and the corresponding Taylor polynomial of degree four (dashed green line) around the origin.]]
[[File:Taylorspolynomialexbig.svg|thumb|right|300px|The exponential function <math display="inline">y=e^x</math> (red) and the corresponding Taylor polynomial of degree four (dashed green) around the origin.]]
{{Calculus |Differential}}


In [[calculus]], '''Taylor's theorem''' gives an approximation of a <math display="inline">k</math>-times [[differentiable function]] around a given point by a [[polynomial]] of degree <math display="inline">k</math>, called the <math display="inline">k</math>-th-order '''Taylor polynomial'''. For a [[smooth function]], the Taylor polynomial is the truncation at the order ''<math display="inline">k</math>'' of the [[Taylor series]] of the function. The first-order Taylor polynomial is the [[linear approximation]] of the function, and the second-order Taylor polynomial is often referred to as the '''quadratic approximation'''.<ref>(2013). [http://www.math.ubc.ca/~sujatha/2013/103/week10-12/Linearapp.pdf"Linear and quadratic approximation"] Retrieved December 6, 2018</ref> There are several versions of Taylor's theorem, some giving explicit estimates of the approximation error of the function by its Taylor polynomial.
{{Calculus|cTopic=Differentiation}}


Taylor's theorem is named after the mathematician [[Brook Taylor]], who stated a version of it in 1715,<ref>{{cite book|language=la|last=Taylor |first=Brook |title=Methodus Incrementorum Directa et Inversa |url=https://archive.org/details/UFIE003454_TO0324_PNI-2529_000000|trans-title=Direct and Reverse Methods of Incrementation |location=London |date=1715 |at=p. 21–23 (Prop. VII, Thm. 3, Cor. 2)}} Translated into English in {{cite book|first=D. J. |last=Struik|title=A Source Book in Mathematics 1200–1800 |location=Cambridge, Massachusetts |publisher=Harvard University Press |date=1969 |pages= 329–332}}</ref> although an earlier version of the result was already mentioned in [[1671 in science|1671]] by [[James Gregory (astronomer and mathematician)|James Gregory]].<ref>{{harvnb|Kline|1972|pp=442, 464}}.</ref>
In [[calculus]], '''Taylor's theorem''' gives a sequence of approximations of a [[differentiable]] [[function (mathematics)|function]] around a given point by [[polynomial]]s (the '''Taylor polynomials''' of that function) whose [[coefficients]] depend only on the derivatives of the function at that point. The theorem also gives precise estimates on the size of the error in the approximation. The [[theorem]] is named after the mathematician [[Brook Taylor]], who stated it in 1712, though the result was first discovered 41 years earlier in 1671 by [[James Gregory (astronomer and mathematician)|James Gregory]].


Taylor's theorem is taught in introductory-level calculus courses and is one of the central elementary tools in [[mathematical analysis]]. It gives simple arithmetic formulas to accurately compute values of many [[transcendental function]]s such as the [[exponential function]] and [[trigonometric function]]s.
==Taylor's theorem in one variable==
It is the starting point of the study of [[analytic function]]s, and is fundamental in various areas of mathematics, as well as in [[numerical analysis]] and [[mathematical physics]]. Taylor's theorem also generalizes to [[multivariate function|multivariate]] and [[vector valued function|vector valued]] functions. It provided the mathematical basis for some landmark early computing machines: [[Charles Babbage]]'s [[Difference Engine]] calculated sines, cosines, logarithms, and other transcendental functions by numerically integrating the first 7 terms of their Taylor series.
===Motivation===
Taylor's theorem asserts that any sufficiently [[smooth function]] can locally be approximated by polynomials. A simple example of application of Taylor's theorem is the approximation of the [[exponential function]] ''e''<sup>''x''</sup> near ''x'' = 0:


== Motivation ==
:<math> \textrm{e}^x \approx 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^n}{n!}.</math>
[[File:E^x with linear approximation.png|thumb|right|Graph of <math display="inline">f(x)=e^x</math> (blue) with its [[linear approximation]] <math display="inline">P_1(x)=1+x</math> (red) at <math display="inline">a=0</math>.]]
If a real-valued [[function (mathematics)|function]] <math display="inline">f(x)</math> is [[Derivative|differentiable]] at the point <math display="inline">x=a</math>, then it has a [[linear approximation]] near this point. This means that there exists a function ''h''<sub>1</sub>(''x'') such that


<math display="block"> f(x) = f(a) + f'(a)(x - a) + h_1(x)(x - a), \quad \lim_{x \to a} h_1(x) = 0.</math>
The approximation is called the ''n''-th order Taylor approximation to ''e''<sup>''x''</sup> because it approximates the value of the exponential function by a polynomial of degree ''n''. This approximation only holds for ''x'' close to zero, and as ''x'' moves further away from zero, the approximation becomes worse. The quality of the approximation is controlled by the '''remainder term''':


Here
<math>R_n(x) = \textrm{e}^x - \left(1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^n}{n!}\right).</math>


<math display="block">P_1(x) = f(a) + f'(a)(x - a)</math>
More generally, Taylor's theorem applies to any sufficiently differentiable function &fnof;, giving an approximation, for ''x'' near a point ''a'', of the form


is the linear approximation of <math display="inline">f(x)</math> for ''x'' near the point ''a'', whose graph <math display="inline">y=P_1(x)</math> is the [[tangent line]] to the graph <math display="inline">y=f(x)</math> at {{nowrap|1=''x'' = ''a''}}. The error in the approximation is:
:<math>f(x)\approx f(a) + f'(a)(x-a) +\frac{f''(a)}{2!}(x-a)^2 + \cdots + \frac{f^{(n)}(a)}{n!}(x-a)^n.</math>
<math display="block">R_1(x) = f(x) - P_1(x) = h_1(x)(x - a).</math>


As ''x'' tends to&nbsp;''a,'' this error goes to zero much faster than <math>f'(a)(x{-}a)</math>, making <math>f(x)\approx P_1(x)</math> a useful approximation.
The remainder term is just the difference of the function and its approximating polynomial


[[File:E^x with quadratic approximation corrected.png|thumb|right|Graph of <math display="inline">f(x)=e^x</math> (blue) with its quadratic approximation <math>P_2(x) = 1 +x + \dfrac{x^2}{2}</math> (red) at <math display="inline">a=0</math>. Note the improvement in the approximation.]]
:<math>R_n(x) = f(x) - \left(f(a) + f'(a)(x-a) +\frac{f''(a)}{2!}(x-a)^2 + \cdots + \frac{f^{(n)}(a)}{n!}(x-a)^n\right).</math>
For a better approximation to <math display="inline">f(x)</math>, we can fit a [[quadratic polynomial]] instead of a linear function:


<math display="block">P_2(x) = f(a) + f'(a)(x - a) + \frac{f''(a)}{2}(x - a)^2.</math>
Although an explicit formula for the remainder term is seldom of any use, Taylor's theorem also provides several ways in which to estimate the value of the remainder. In other words, for ''x'' near enough to ''a'', the remainder ought to be "small"; Taylor's theorem gives information on precisely how small it actually is.


Instead of just matching one derivative of <math display="inline">f(x)</math> at <math display="inline">x=a</math>, this polynomial has the same first and second derivatives, as is evident upon differentiation.
===Statement===
The precise statement of the theorem is as follows: If ''n'' &ge; 0 is an [[integer]] and ƒ is a function which is ''n'' times continuously differentiable on the [[interval (mathematics)|closed interval]] [''a'', ''x''] and {{nowrap|''n'' + 1}} times differentiable on the [[interval (mathematics)|open interval]] (''a'', ''x''), then


Taylor's theorem ensures that the ''quadratic approximation'' is, in a sufficiently small neighborhood of <math display="inline">x=a</math>, more accurate than the linear approximation. Specifically,
:<math> f(x) = f(a) + \frac{f'(a)}{1!}(x - a) + \frac{f^{(2)}(a)}{2!}(x - a)^2 + \cdots + \frac{f^{(n)}(a)}{n!}(x - a)^n + R_n(x). </math>


<math display="block">f(x) = P_2(x) + h_2(x)(x - a)^2, \quad \lim_{x \to a} h_2(x) = 0.</math>
Here, ''n''! denotes the [[factorial]] of ''n'', and ''R<sub>n</sub>''(''x'') is a remainder term, denoting the difference between the Taylor polynomial of degree ''n'' and the original function. The remainder term ''R<sub>n</sub>''(''x'') depends on ''x'' and is small if ''x'' is close enough to ''a''. Several expressions are available for it.


Here the error in the approximation is
The '''[[Joseph Louis Lagrange|Lagrange]] form'''<ref>Klein (1998) 20.3; Apostol (1967) 7.7.</ref> of the remainder term states that there exists a number ξ between ''a'' and ''x'' such that


<math display="block">R_2(x) = f(x) - P_2(x) = h_2(x)(x - a)^2,</math>
:<math>
R_n(x) = \frac{f^{(n+1)}(\xi)}{(n+1)!} (x-a)^{n+1}.
</math>


which, given the limiting behavior of <math>h_2</math>, goes to zero faster than <math>(x - a)^2</math> as ''x'' tends to&nbsp;''a''.
This exposes Taylor's theorem as a generalization of the [[mean value theorem]]. In fact, the mean value theorem is used to prove Taylor's theorem with the Lagrange remainder term.


[[File:Tayloranimation.gif|thumb|360px|right|Approximation of <math display="inline">f(x)= \dfrac{1}{1+x^2}</math> (blue) by its Taylor polynomials <math display="inline">P_k</math> of order <math display="inline">k=1,\ldots,16</math> centered at <math display="inline">x=0</math> (red) and <math display="inline">x=1</math> (green). The approximations do not improve at all outside <math>(-1,1)</math> and <math display="inline">(1-\sqrt{2}, 1+\sqrt{2})</math>, respectively.]] Similarly, we might get still better approximations to ''f'' if we use [[polynomial]]s of higher degree, since then we can match even more derivatives with ''f'' at the selected base point.
The '''[[Augustin Louis Cauchy|Cauchy]] form'''<ref>Apostol (1967) 7.7.</ref> of the remainder term states that there exists a number ξ between ''a'' and ''x'' such that


In general, the error in approximating a function by a polynomial of degree ''k'' will go to zero much faster than <math>(x-a)^k</math> as ''x'' tends to&nbsp;''a''. However, there are functions, even infinitely differentiable ones, for which increasing the degree of the approximating polynomial does not increase the accuracy of approximation: we say such a function fails to be [[Analytic function|analytic]] at ''x = a'': it is not (locally) determined by its derivatives at this point.
:<math>
R_n(x) = \frac{f^{(n+1)}(\xi)}{n!}(x-\xi)^n(x-a).
</math>


Taylor's theorem is of asymptotic nature: it only tells us that the error <math display="inline">R_k</math> in an [[approximation]] by a <math display="inline">k</math>-th order Taylor polynomial ''P<sub>k</sub>'' tends to zero faster than any nonzero <math display="inline">k</math>-th degree [[polynomial]] as <math display="inline">x \to a</math>. It does not tell us how large the error is in any concrete [[neighborhood (mathematics)|neighborhood]] of the center of expansion, but for this purpose there are explicit formulas for the remainder term (given below) which are valid under some additional regularity assumptions on ''f''. These enhanced versions of Taylor's theorem typically lead to [[uniform convergence|uniform estimates]] for the approximation error in a small neighborhood of the center of expansion, but the estimates do not necessarily hold for neighborhoods which are too large, even if the function ''f'' is [[analytic function|analytic]]. In that situation one may have to select several Taylor polynomials with different centers of expansion to have reliable Taylor-approximations of the original function (see animation on the right.)
More generally, if ''G''(''t'') is a continuous function on [''a'',''x''] which is differentiable with non-vanishing derivative on (''a'',''x''), then there exists a number &xi; between ''a'' and ''x'' such that


There are several ways we might use the remainder term:
:<math>
R_n(x) = \frac{f^{(n+1)}(\xi)}{n!}(x-\xi)^n\cdot\frac{G(x)-G(a)}{G'(\xi)}.
</math>


# Estimate the error for a polynomial ''P<sub>k</sub>''(''x'') of degree ''k'' estimating <math display="inline">f(x)</math> on a given interval (''a'' – ''r'', ''a'' + ''r''). (Given the interval and degree, we find the error.)
This exposes Taylor's theorem as a generalization of the [[mean value theorem#Cauchy's mean value theorem|Cauchy mean value theorem]].
# Find the smallest degree ''k'' for which the polynomial ''P<sub>k</sub>''(''x'') approximates <math display="inline">f(x)</math> to within a given error tolerance on a given interval (''a'' − ''r'', ''a'' + ''r'') . (Given the interval and error tolerance, we find the degree.)
# Find the largest interval (''a'' − ''r'', ''a'' + ''r'') on which ''P<sub>k</sub>''(''x'') approximates <math display="inline">f(x)</math> to within a given error tolerance. (Given the degree and error tolerance, we find the interval.)


{{clear}}
The above forms are restricted to the case of functions taking real values. However, the '''[[integral]] form'''<ref>Apostol (1967) 7.5.</ref> of the remainder term applies as well when the function takes
complex values. It is:


== Taylor's theorem in one real variable ==
:<math>
R_n(x) = \int_a^x \frac{f^{(n+1)} (t)}{n!} (x - t)^n \, dt,
</math>


=== Statement of the theorem ===
provided, as is often the case, ƒ<sup>(n)</sub> is [[absolutely continuous]] on {{nowrap|[''a'', ''x'']}}. This shows the theorem to be a generalization of the [[fundamental theorem of calculus]].


The precise statement of the most basic version of Taylor's theorem is as follows:
In general, a function does not need to be equal to its [[Taylor series]], since it is possible that the Taylor series does not [[convergent series|converge]], or that it converges to a ''different'' function. However, for many functions ƒ(''x''), one can show that the remainder term ''R<sub>n</sub>'' approaches zero as ''n'' approaches &infin;. Those functions can be expressed as a Taylor series in a [[neighbourhood (mathematics)|neighbourhood]] of the point ''a'' and are called [[analytic function|analytic]].


{{math theorem
Taylor's theorem (with the integral formulation of the remainder term) is also valid if the function ƒ has [[complex number|complex]] values or [[vector space|vector]] values. Furthermore, there is a version of Taylor's theorem for functions in several variables. For complex functions analytic in a region containing a circle ''C'' surrounding ''a'' and its interior, there is a [[contour integral]] expression for the remainder
| name = Taylor's theorem<ref>{{ citation|first1=Angelo|last1=Genocchi|first2= Giuseppe|last2=Peano|title=Calcolo differenziale e principii di calcolo integrale|location=(N. 67, pp.&nbsp;XVII–XIX)|publisher=[[Fratelli Bocca |Fratelli Bocca ed.]]|year=1884}}</ref><ref>{{Citation | last1=Spivak | first1=Michael | author1-link=Michael Spivak | title=Calculus | publisher=Publish or Perish | location=Houston, TX | edition=3rd | isbn=978-0-914098-89-8 | year=1994| page=383}}</ref><ref>{{springer|title=Taylor formula|id=p/t092300}}</ref>
:<math> R_n(x) = \frac{(x-a)^{n+1}}{2 \pi i}\int_C \frac{f(z)}{(z-a)^{n+1}(z-x)}dz</math>
| math_statement = Let ''k''&nbsp;≥&nbsp;1 be an [[integer]] and let the [[Function (mathematics)|function]] {{nowrap|''f'' : '''R''' → '''R'''}} be ''k'' times [[Differentiable function|differentiable]] at the point {{nowrap|''a'' ∈ '''R'''}}. Then there exists a function {{nowrap|''h<sub>k</sub>'' : '''R''' → '''R'''}} such that
valid inside of ''C''.


<math display="block"> f(x) = \sum_{i=0}^k \frac{f^{(i)}(a)}{i!}(x-a)^i + h_k(x)(x-a)^k,</math>
Example:
and
Approximate the number <math> e </math> within <math> 10^{-5}</math> of accuracy.
<math display="block">\lim_{x\to a} h_k(x) = 0.</math>
This is called the '''[[Peano]] form of the remainder'''.
}}


The polynomial appearing in Taylor's theorem is the '''<math display="inline">\boldsymbol{k}</math>-th order Taylor polynomial'''
We consider <math> f(x)=e^x </math> and let <math> x \in \left[ 0,1\right] </math>. We have <math> f^\prime (x)=f^{(n)}=e^x \forall n\in \mathbb{N}</math> and therefore <math> f^{ (n)} (0)=1</math>.
The Taylor polynomial for <math>f(x)=e^x </math> is given by:
<math> P_n(x)=1+x+\frac{x^2}{2!}+...+\frac{x^n}{n!}</math>. The reminder is <math> R_n(x)=\frac{1}{(n+1)!}e^c</math> for some <math> 0<c<1</math>.
Now we have <math> e^c<3</math> and we seek a value of <math>n</math> such that <math>\frac{3}{(n+1)!}<10^{-5}</math>.
By some calculation, the value of <math>n</math> to get the desired accuracy is <math>n=8 </math>. So, <math> e\approx P_{8}(1)=1+1+\frac{1}{2!}+...+\frac{1}{8!}=2.71828</math>.


<math display="block">P_k(x) = f(a) + f'(a)(x-a) + \frac{f''(a)}{2!}(x-a)^2 + \cdots + \frac{f^{(k)}(a)}{k!}(x-a)^k </math>
Reference:
Bartle,R.G. and D.R. Sherbert, Introduction to Real Analysis, Third Edition, John Wiley & Sons, New York,2000.


of the function ''f'' at the point ''a''. The Taylor polynomial is the unique "asymptotic best fit" polynomial in the sense that if there exists a function {{nowrap|''h<sub>k</sub>'' : '''R''' → '''R'''}} and a <math display="inline">k</math>-th order polynomial ''p'' such that
=== Estimates of the remainder ===
Another common version of Taylor's theorem holds on an interval {{nowrap|(''a'' − ''r'', ''a'' + ''r'')}} where the variable ''x'' is assumed to take its values. This formulation of the theorem has the advantage that it is often possible to explicitly control the size of the remainder terms, and thus arrive at an approximation of a function valid in a whole interval with precise bounds on the quality of the approximation.


<math display="block"> f(x) = p(x) + h_k(x)(x-a)^k, \quad \lim_{x\to a} h_k(x) = 0 ,</math>
A precise version of Taylor's theorem in this form is as follows. Suppose &fnof; is a function which is ''n'' times continuously differentiable on the closed interval {{nowrap|[''a'' − ''r'', ''a'' + ''r'']}} and {{nowrap|''n'' + 1}} times differentiable on the open interval {{nowrap|(''a'' − ''r'', ''a'' + ''r'').}} If there exists a positive real constant ''M<sub>n</sub>'' such that |&fnof;<sup>(''n''+1)</sup>(''x'')| &le; ''M<sub>n</sub>'' for all ''x'' &isin; {{nowrap|(''a'' − ''r'', ''a'' + ''r''),}} then


then ''p''&nbsp;=&nbsp;''P<sub>k</sub>''. Taylor's theorem describes the asymptotic behavior of the '''remainder term'''
:<math> f(x) = f(a) + \frac{f'(a)}{1!}(x - a) + \frac{f^{(2)}(a)}{2!}(x - a)^2 + \cdots + \frac{f^{(n)}(a)}{n!}(x - a)^n + R_n(x),</math>


<math display="block"> R_k(x) = f(x) - P_k(x),</math>
where the remainder function ''R<sub>n</sub>'' satisfies the inequality (known as Cauchy's estimate):


which is the [[approximation error]] when approximating ''f'' with its Taylor polynomial. Using the [[little-o notation]], the statement in Taylor's theorem reads as
:<math> |R_n(x)| \le M_n \frac{r^{n+1}}{(n+1)!}</math>


<math display="block">R_k(x) = o(|x-a|^{k}), \quad x\to a.</math>
for all ''x'' &isin; {{nowrap|(''a'' − ''r'', ''a'' + ''r'').}} This is called a [[uniform convergence|uniform estimate]] of the error in the Taylor polynomial centered at ''a'', because it holds uniformly for all ''x'' in the interval.


=== Explicit formulas for the remainder ===
If &fnof; is [[smooth function|infinitely differentiable]] on {{nowrap|[''a'' − ''r'', ''a'' + ''r''],}} then positive constants ''M<sub>n</sub>'' exist for each ''n'' = 1, 2, 3, … such that | &fnof;<sup>(''n''+1)</sup>(''x'')| &le; ''M<sub>n</sub>'' for all ''x'' &isin; {{nowrap|(''a'' − ''r'', ''a'' + ''r'').}} If, in addition, it is possible to select these constants so that


Under stronger regularity assumptions on ''f'' there are several precise formulas for the remainder term ''R<sub>k</sub>'' of the Taylor polynomial, the most common ones being the following.
:<math> M_n\frac{r^{n+1}}{(n+1)!} \rightarrow 0</math> as <math>n \rightarrow \infin ,\!</math>


{{math theorem
then &fnof; is an [[analytic function]] on {{nowrap|(''a'' − ''r'', ''a'' + ''r'').}} In particular, the remainder term in the Taylor approximation ''R<sub>n</sub>''(''x'') tends to zero [[uniform convergence|uniformly]] as ''n''&rarr;∞. In other words, an analytic function is the uniform limit of its Taylor polynomials on an interval.
| name = Mean-value forms of the remainder
| math_statement = Let {{nowrap|''f'' : '''R''' → '''R'''}} be ''k''&nbsp;+&nbsp;1 times [[Differentiable function|differentiable]] on the [[open interval]] with ''f''{{i sup|(''k'')}} [[continuous function|continuous]] on the [[closed interval]] between <math display=inline>a</math> and <math display=inline>x</math>.<ref>The hypothesis of ''f''{{i sup|(''k'')}} being [[continuous function|continuous]] on the [[closed interval|''closed'' interval]] between <math display=inline>a</math> and <math display=inline>x</math> is ''not'' redundant. Although ''f'' being ''k''&nbsp;+&nbsp;1 times [[Differentiable function|differentiable]] on the [[open interval]] between <math display=inline>a</math> and <math display=inline>x</math> does imply that ''f''{{i sup|(''k'')}} is [[continuous function|continuous]] on the [[open interval|''open'' interval]] between <math display=inline>a</math> and <math display=inline>x</math>, it does ''not'' imply that ''f''{{i sup|(''k'')}} is [[continuous function|continuous]] on the [[closed interval|''closed'' interval]] between <math display=inline>a</math> and <math display=inline>x</math>, i.e. it does not imply that ''f''{{i sup|(''k'')}} is [[continuous function|continuous]] at the ''endpoints'' of that interval. Consider, for example, the [[Function (mathematics)|function]] {{nowrap|''f'' : [0,1] → '''R'''}} defined to equal <math> \sin(1/x)</math> on <math>(0,1]</math> and with <math>f(0)=0</math>. This is not [[continuous function|continuous]] at ''0'', but is [[continuous function|continuous]] on <math>(0,1)</math>. Moreover, one can show that this [[Function (mathematics)|function]] has an [[antiderivative]]. Therefore that [[antiderivative]] is [[Differentiable function|differentiable]] on <math>(0,1)</math>, its [[derivative]] (the function ''f'') is [[continuous function|continuous]] on the [[open interval|''open'' interval]] <math>(0,1)</math>, but its [[derivative]] ''f'' is ''not'' [[continuous function|continuous]] on the [[closed interval|''closed'' interval]] <math>[0,1]</math>. So the theorem would not apply in this case.</ref> Then


<math display="block"> R_k(x) = \frac{f^{(k+1)}(\xi_L)}{(k+1)!} (x-a)^{k+1} </math>
==Taylor's theorem for several variables==
Taylor's theorem can be generalized to several variables as follows. Let ''B'' be a [[ball (mathematics)|ball]] in '''R'''<sup>''N''</sup> centered at a point ''a'', and ƒ be a real-valued function defined on the [[topological closure|closure]] <math>\bar{B}</math> having ''n''+1 continuous partial derivatives at every point. Taylor's theorem asserts that for any <math>x\in B</math>,


for some real number <math display="inline">\xi_L</math> between <math display=inline>a</math> and <math display=inline>x</math>. This is the '''[[Joseph Louis Lagrange|Lagrange]] form'''<ref>{{harvnb|Kline|1998|loc=§20.3}}; {{harvnb|Apostol|1967|loc=§7.7}}.</ref> of the remainder.
:<math>f(x)=\sum_{|\alpha|=0}^n\frac{1}{\alpha!}\frac{\partial^\alpha f(a)}{\partial x^\alpha}(x-a)^\alpha+\sum_{|\alpha|=n+1}R_{\alpha}(x)(x-a)^\alpha</math>


Similarly,
where the summation extends over multi-indices &alpha; (this formula uses the [[multi-index notation]]).


<math display="block"> R_k(x) = \frac{f^{(k+1)}(\xi_C)}{k!}(x-\xi_C)^k(x-a) </math>
The remainder terms satisfy the inequality
:<math>|R_{\alpha}(x)|\le\sup_{y\in\bar{B} }\left|\frac{1}{\alpha!}\frac{\partial^\alpha f(y)}{\partial x^\alpha}\right|</math>


for some real number <math display="inline">\xi_C</math> between <math display=inline>a</math> and <math display=inline>x</math>. This is the '''[[Augustin Louis Cauchy|Cauchy]] form'''<ref>{{harvnb|Apostol|1967|loc=§7.7}}.</ref> of the remainder.
for all &alpha; with |&alpha;|&nbsp;=&nbsp;{{nowrap|''n'' + 1}}. As was the case with one variable, the remainder terms can be described explicitly. See the proof for details.


Both can be thought of as specific cases of the following result: Consider <math>p>0</math>
==Proof: Taylor's theorem in one variable==


<math display="block"> R_k(x) = \frac{f^{(k+1)}(\xi_S)}{k!}(x-\xi_S)^{k+1-p}\frac{(x-a)^p}{p} </math>
=== Integral version ===
for some real number <math display="inline">\xi_S</math> between <math display=inline>a</math> and <math display=inline>x</math>. This is the '''[[Oskar Schlömilch|Schlömilch]] form''' of the remainder (sometimes called the '''Schlömilch-[[Édouard Roche|Roche]]'''). The choice <math display="inline">p=k+1</math> is the Lagrange form, whilst the choice <math display="inline">p=1</math> is the Cauchy form.
We first prove Taylor's theorem with the integral remainder term.<ref>Note that this proof requires ƒ<sup>(''n'')</sup> to be [[absolutely continuous]] on {{nowrap|[''a'', ''x'']}} so that the [[fundamental theorem of calculus]] holds. Except at the end when the mean value theorem is invoked, differentiability of ƒ<sup>(''n'')</sup> need not be assumed since absolute continuity implies differentiability [[almost everywhere]] as well as the validity of the fundamental theorem of calculus, provided the integrals involved are understood as [[Lebesgue integral]]s. Consequently, the integral form of the remainder holds with this particular weakening of the assumptions on ƒ.</ref>
}}


These refinements of Taylor's theorem are usually proved using the [[mean value theorem]], whence the name. Additionally, notice that this is precisely the [[mean value theorem]] when <math display="inline">k=0</math>. Also other similar expressions can be found. For example, if ''G''(''t'') is continuous on the closed interval and differentiable with a non-vanishing derivative on the open interval between <math display="inline">a</math> and <math display="inline">x</math>, then
The [[fundamental theorem of calculus]] states that


:<math>\int_a^x \, f'(t) \, dt=f(x)-f(a),</math>
<math display="block"> R_k(x) = \frac{f^{(k+1)}(\xi)}{k!}(x-\xi)^k \frac{G(x)-G(a)}{G'(\xi)} </math>


for some number <math display="inline">\xi</math> between <math display="inline">a</math> and <math display="inline">x</math>. This version covers the Lagrange and Cauchy forms of the remainder as special cases, and is proved below using [[mean value theorem#Cauchy's mean value theorem|Cauchy's mean value theorem]]. The Lagrange form is obtained by taking <math>G(t)=(x-t)^{k+1}</math> and the Cauchy form is obtained by taking <math>G(t)=t-a</math>.
which can be rearranged to:


{{anchor|Integral form of the remainder}}The statement for the integral form of the remainder is more advanced than the previous ones, and requires understanding of [[Lebesgue integral|Lebesgue integration theory]] for the full generality. However, it holds also in the sense of [[Riemann integral]] provided the (''k''&nbsp;+&nbsp;1)th derivative of ''f'' is continuous on the closed interval [''a'',''x''].
:<math>f(x)=f(a)+ \int_a^x \, f'(t) \, dt.</math>


{{math theorem|name=Integral form of the remainder<ref>{{harvnb|Apostol|1967|loc=§7.5}}.</ref> |math_statement=Let <math display=inline>f^{(k)}</math> be [[absolutely continuous]] on the [[closed interval]] between <math display=inline>a</math> and <math display=inline>x</math>. Then
Now we can see that an application of [[Integration by parts]] yields:


<math display="block"> R_k(x) = \int_a^x \frac{f^{(k+1)} (t)}{k!} (x - t)^k \, dt. </math>
:<math> \begin{align}
f(x) &= f(a)+xf'(x)-af'(a)-\int_a^x \, tf''(t) \, dt \\
&= f(a)+\int_a^x \, xf''(t) \,dt+xf'(a)-af'(a)-\int_a^x \, tf''(t) \, dt \\
&= f(a)+(x-a)f'(a)+\int_a^x \, (x-t)f''(t) \, dt.
\end{align} </math>


}}
The first equation is arrived at by letting <math>u=f'(t)\,</math> and {{nowrap|1=''dv'' = ''dt'';}} the second equation by noting that


Due to the [[absolutely continuous|absolute continuity]] of ''f''{{i sup|(''k'')}} on the [[closed interval]] between <math display=inline>a</math> and <math display=inline>x</math>, its derivative ''f''{{i sup|(''k''+1)}} exists as an ''L''{{i sup|1}}-function, and the result can be [[#Derivation for the integral form of the remainder|proven]] by a formal calculation using the [[fundamental theorem of calculus]] and [[integration by parts]].
:<math>\int_a^x \, xf''(t) \,dt = xf'(x)-xf'(a);</math>


=== Estimates for the remainder ===
the third just factors out some common terms.


It is often useful in practice to be able to estimate the remainder term appearing in the Taylor approximation, rather than having an exact formula for it. Suppose that ''f'' is {{nowrap|(''k'' + 1)}}-times continuously differentiable in an interval ''I'' containing ''a''. Suppose that there are real constants ''q'' and ''Q'' such that
Another application yields:


<math display="block">q\le f^{(k+1)}(x)\le Q</math>
:<math>f(x)=f(a)+(x-a)f'(a)+ \frac 1 2 (x-a)^2f''(a) + \frac 1 2 \int_a^x \, (x-t)^2f'''(t) \, dt. </math>


throughout ''I''. Then the remainder term satisfies the inequality<ref>{{harvnb|Apostol|1967|loc=§7.6}}</ref>
By repeating this process, we may derive Taylor's theorem for higher values {{nowrap|of ''n''.}}


<math display="block">q\frac{(x-a)^{k+1}}{(k+1)!}\le R_k(x)\le Q\frac{(x-a)^{k+1}}{(k+1)!},</math>
This can be formalized by applying the technique of [[mathematical induction|induction]]. So, suppose that Taylor's theorem holds for a {{nowrap|particular ''n'',}} that is, suppose that


if {{nowrap|''x'' > ''a''}}, and a similar estimate if {{nowrap|''x'' < ''a''}}. This is a simple consequence of the Lagrange form of the remainder. In particular, if
:<math>
f(x) = f(a)
+ \frac{f'(a)}{1!}(x - a)
+ \cdots
+ \frac{f^{(n)}(a)}{n!}(x - a)^n
+ \int_a^x \frac{f^{(n+1)} (t)}{n!} (x - t)^n \, dt. \qquad(*)
</math>


<math display="block">|f^{(k+1)}(x)|\le M</math>
We can rewrite the integral using [[integration by parts]]. An antiderivative of {{nowrap|(''x'' − ''t'')<sup>''n''</sup>}} as a function {{nowrap|of ''t''}} is given by {{nowrap|−(''x''−''t'')<sup>''n''+1</sup> / (''n'' + 1),}} so


on an interval {{nowrap|1=''I'' = (''a'' − ''r'',''a'' + ''r'')}} with some <math>r > 0</math> , then
:<math> \int_a^x \frac{f^{(n+1)} (t)}{n!} (x - t)^n \, dt </math>


::<math> {} = - \left[ \frac{f^{(n+1)} (t)}{(n+1)n!} (x - t)^{n+1} \right]_a^x + \int_a^x \frac{f^{(n+2)} (t)}{(n+1)n!} (x - t)^{n+1} \, dt </math>
<math display="block">|R_k(x)|\le M\frac{|x-a|^{k+1}}{(k+1)!}\le M\frac{r^{k+1}}{(k+1)!}</math>


for all {{nowrap|''x''∈(''a'' − ''r'',''a'' + ''r'').}} The second inequality is called a [[uniform convergence|uniform estimate]], because it holds uniformly for all ''x'' on the interval {{nowrap|(''a'' − ''r'',''a'' + ''r'').}}
::<math> {} = \frac{f^{(n+1)} (a)}{(n+1)!} (x - a)^{n+1} + \int_a^x \frac{f^{(n+2)} (t)}{(n+1)!} (x - t)^{n+1} \, dt. </math>
Substituting this {{nowrap|in (*)}} proves Taylor's theorem for {{nowrap|''n'' + 1,}} and hence for all nonnegative {{nowrap|integers ''n''.}}


=== Example ===
The remainder term in the Lagrange form can be derived by the [[mean value theorem]] for integrals in the following way:


[[File:Expanimation.gif|thumb|400px|right|Approximation of <math display="inline">e^x</math> (blue) by its Taylor polynomials <math>P_k</math> of order <math display="inline">k=1,\ldots,7</math> centered at <math display="inline">x=0</math> (red).]] Suppose that we wish to find the approximate value of the function <math display="inline">f(x)=e^x</math> on the interval <math display="inline">[-1,1]</math> while ensuring that the error in the approximation is no more than 10<sup>−5</sup>. In this example we pretend that we only know the following properties of the exponential function:
:<math>
R_n = \int_a^x \frac{f^{(n+1)} (t)}{n!} (x - t)^n \, dt =f^{(n+1)}(\xi) \int_a^x \frac{(x - t)^n }{n!} \, dt,
</math>


{{NumBlk|:|<math>e^0=1, \qquad \frac{d}{dx} e^x = e^x, \qquad e^x>0, \qquad x\in\R.</math>|{{EquationRef|★}}}}
where <math>\xi</math> is some number from the interval [''a'', ''x'']. The last integral can be evaluated immediately, which leads to


From these properties it follows that <math display="inline">f^{(k)}(x)=e^x</math> for all <math display="inline">k</math>, and in particular, <math display="inline">f^{(k)}(0)=1</math>. Hence the ''<math display="inline">k</math>''-th order Taylor polynomial of <math display="inline">f</math> at <math display="inline">0</math> and its remainder term in the Lagrange form are given by
:<math>
R_n = \frac{f^{(n+1)}(\xi)}{(n+1)!} (x-a)^{n+1}.
</math>


<math display="block"> P_k(x) = 1+x+\frac{x^2}{2!}+\cdots+\frac{x^k}{k!}, \qquad R_k(x)=\frac{e^\xi}{(k+1)!}x^{k+1},</math>
More generally, for any function ''G''(''t''), the mean value theorem asserts the existence of <math>\xi</math> in the interval [''a'', ''x''] satisfying


where <math display="inline">\xi</math> is some number between 0 and ''x''. Since ''e''<sup>''x''</sup> is increasing by ({{EquationNote|★}}), we can simply use <math display="inline">e^x \leq 1</math> for <math display="inline">x \in [-1,0]</math> to estimate the remainder on the subinterval <math>[-1,0]</math>. To obtain an upper bound for the remainder on <math>[0,1]</math>, we use the property <math display="inline">e^\xi <e^x</math> for <math display="inline">0<\xi<x</math> to estimate
:<math>
R_n = \int_a^x \frac{f^{(n+1)} (t)}{n!} (x - t)^n \frac{G'(t)}{G'(t)}\, dt =\frac{f^{(n+1)}(\xi)}{n!} (x-\xi)^n \frac{1}{G'(\xi)} \int_a^x G'(t) \, dt
</math>


::<math> = \frac{f^{(n+1)}(\xi)}{n!} (x-\xi)^n \cdot \frac{G(x)-G(a)}{G'(\xi)}. </math>
<math display="block"> e^x = 1 + x + \frac{e^\xi}{2}x^2 < 1 + x + \frac{e^x}{2}x^2, \qquad 0 < x\leq 1 </math>


using the second order Taylor expansion. Then we solve for ''e<sup>x</sup>'' to deduce that
=== Mean value theorem ===
An alternative proof, which holds under milder technical assumptions on the function ƒ, can be supplied using the [[mean value theorem#Cauchy's mean value theorem|Cauchy mean value theorem]]. Let ''G'' be a real-valued function continuous on {{nowrap|[''a'', ''x'']}} and differentiable with non-vanishing derivative on {{nowrap|(''a'', ''x'').}} Let


<math display="block"> e^x \leq \frac{1+x}{1-\frac{x^2}{2}} = 2\frac{1+x}{2-x^2} \leq 4, \qquad 0 \leq x\leq 1 </math>
:<math>

F(t) = f(t) + \frac{f'(t)}{1!}(x-t) + \cdots + \frac{f^{(n)}(t)}{n!}(x-t)^n.
simply by maximizing the [[numerator]] and minimizing the [[denominator]]. Combining these estimates for ''e<sup>x</sup>'' we see that

<math display="block"> |R_k(x)| \leq \frac{4|x|^{k+1}}{(k+1)!} \leq \frac{4}{(k+1)!}, \qquad -1\leq x \leq 1, </math>

so the required precision is certainly reached, when

<math display="block"> \frac{4}{(k+1)!} < 10^{-5} \quad \Longleftrightarrow \quad 4\cdot 10^5 < (k+1)! \quad \Longleftrightarrow \quad k \geq 9.
</math>
</math>


(See [[factorial]] or compute by hand the values <math display="inline">9! =362880</math> and <math display="inline">10! =3628800</math>.) As a conclusion, Taylor's theorem leads to the approximation
By Cauchy's mean value theorem,


<math display="block"> e^x = 1+x+\frac{x^2}{2!} + \cdots + \frac{x^9}{9!} + R_9(x), \qquad |R_9(x)| < 10^{-5}, \qquad -1\leq x \leq 1. </math>
:<math>
\frac{F'(\xi)}{G'(\xi)} = \frac{F(x) - F(a)}{G(x) - G(a)}
</math> {{nowrap| }} {{nowrap| }} {{nowrap| }} (1)


For instance, this approximation provides a [[decimal representation|decimal expression]] <math>e \approx 2.71828</math>, correct up to five decimal places.
for some {{nowrap|&xi; &isin; (''a'', ''x'')}}. Note that the numerator {{nowrap|1=''F''(''x'') − ''F''(''a'') = ''R<sub>n</sub>''}} is the remainder of the Taylor polynomial for ƒ(''x''). On the other hand, computing ''F''&prime;(''t''),


== Relationship to analyticity ==
:<math>

F'(t) = f'(t) - f'(t) + \frac{f''(t)}{1!}(x-t) - \frac{f''(t)}{1!}(x-t) + \cdots + \frac{f^{(n+1)}(t)}{n!}(x-t)^n = \frac{f^{(n+1)}(t)}{n!}(x-t)^n.
=== Taylor expansions of real analytic functions ===

Let ''I'' ⊂ '''R''' be an [[open interval]]. By definition, a function ''f'' : ''I'' → '''R''' is [[analytic function|real analytic]] if it is locally defined by a convergent [[power series]]. This means that for every ''a''&nbsp;∈&nbsp;''I'' there exists some ''r''&nbsp;>&nbsp;0 and a sequence of coefficients ''c<sub>k</sub>''&nbsp;∈&nbsp;'''R''' such that {{nowrap|(''a'' − ''r'', ''a'' + ''r'') ⊂ ''I''}} and

<math display="block"> f(x) = \sum_{k=0}^\infty c_k(x-a)^k = c_0 + c_1(x-a) + c_2(x-a)^2 + \cdots, \qquad |x-a|<r. </math>

In general, the [[power series#Radius of convergence|radius of convergence]] of a power series can be computed from the [[Cauchy–Hadamard theorem|Cauchy–Hadamard formula]]

<math display="block"> \frac{1}{R} = \limsup_{k\to\infty}|c_k|^\frac{1}{k}. </math>

This result is based on comparison with a [[geometric series]], and the same method shows that if the power series based on ''a'' converges for some ''b'' ∈ '''R''', it must converge [[uniform convergence|uniformly]] on the [[closed interval]] <math display="inline">[a-r_b,a+r_b]</math>, where <math display="inline">r_b=\left\vert b-a \right\vert</math>. Here only the convergence of the power series is considered, and it might well be that {{nowrap|(''a'' − ''R'',''a'' + ''R'')}} extends beyond the domain ''I'' of ''f''.

The Taylor polynomials of the real analytic function ''f'' at ''a'' are simply the finite truncations

<math display="block"> P_k(x) = \sum_{j=0}^k c_j(x-a)^j, \qquad c_j = \frac{f^{(j)}(a)}{j!}</math>

of its locally defining power series, and the corresponding remainder terms are locally given by the analytic functions

<math display="block"> R_k(x) = \sum_{j=k+1}^\infty c_j(x-a)^j = (x-a)^k h_k(x), \qquad |x-a|<r. </math>

Here the functions

<math display="block">\begin{align}
& h_k:(a-r,a+r)\to \R \\[1ex]
& h_k(x) = (x-a)\sum_{j=0}^\infty c_{k+1+j} \left(x - a\right)^j
\end{align}</math>

are also analytic, since their defining power series have the same radius of convergence as the original series. Assuming that {{nowrap|[''a'' − ''r'', ''a'' + ''r'']}} ⊂ ''I'' and ''r''&nbsp;<&nbsp;''R'', all these series converge uniformly on {{nowrap|(''a'' − ''r'', ''a'' + ''r'')}}. Naturally, in the case of analytic functions one can estimate the remainder term <math display="inline">R_k(x)</math> by the tail of the sequence of the derivatives ''f′''(''a'') at the center of the expansion, but using [[complex analysis]] also another possibility arises, which is described [[Taylor's theorem#Relationship to analyticity##Taylor's theorem in complex analysis|below]].

=== Taylor's theorem and convergence of Taylor series ===

The Taylor series of ''f'' will converge in some interval in which all its derivatives are bounded and do not grow too fast as ''k'' goes to infinity. (However, even if the Taylor series converges, it might not converge to ''f'', as explained below; ''f'' is then said to be non-[[analytic function|analytic]].)

One might think of the Taylor series

<math display="block"> f(x) \approx \sum_{k=0}^\infty c_k(x-a)^k = c_0 + c_1(x-a) + c_2(x-a)^2 + \cdots </math>

of an infinitely many times differentiable function ''f'' : '''R''' → '''R''' as its "infinite order Taylor polynomial" at ''a''. Now the [[Taylor's theorem#Estimates for the remainder|estimates for the remainder]] imply that if, for any ''r'', the derivatives of ''f'' are known to be bounded over (''a''&nbsp;−&nbsp;''r'', ''a''&nbsp;+&nbsp;''r''), then for any order ''k'' and for any ''r''&nbsp;>&nbsp;0 there exists a constant {{nowrap|''M<sub>k,r</sub>'' > 0}} such that

{{NumBlk|:|<math> |R_k(x)| \leq M_{k,r} \frac{|x-a|^{k+1}}{(k+1)!} </math>|{{EquationRef|★★}}}}

for every ''x''&nbsp;∈&nbsp;(''a''&nbsp;−&nbsp;''r'',''a''&nbsp;+&nbsp;''r''). Sometimes the constants {{nowrap|''M<sub>k,r</sub>''}} can be chosen in such way that {{nowrap|''M<sub>k,r</sub>''}} is bounded above, for fixed ''r'' and all ''k''. Then the Taylor series of ''f'' [[uniform convergence|converges uniformly]] to some analytic function

<math display="block">\begin{align}
& T_f:(a-r,a+r)\to\R \\
& T_f(x) = \sum_{k=0}^\infty \frac{f^{(k)}(a)}{k!} \left(x-a\right)^k
\end{align}</math>

(One also gets convergence even if {{nowrap|''M<sub>k,r</sub>''}} is not bounded above as long as it grows slowly enough.)

The limit function {{nowrap|''T<sub>f</sub>''}} is by definition always analytic, but it is not necessarily equal to the original function ''f'', even if ''f'' is infinitely differentiable. In this case, we say ''f'' is a [[non-analytic smooth function]], for example a [[flat function]]:

<math display="block">\begin{align}
& f:\R \to \R \\
& f(x) = \begin{cases}
e^{-\frac{1}{x^2}} & x>0 \\
0 & x \leq 0 .
\end{cases}
\end{align}</math>

Using the [[chain rule]] repeatedly by [[mathematical induction]], one shows that for any order&nbsp;''k'',

<math display="block"> f^{(k)}(x) = \begin{cases}
\frac{p_k(x)}{x^{3k}}\cdot e^{-\frac{1}{x^2}} & x>0 \\
0 & x \leq 0
\end{cases}</math>

for some polynomial ''p<sub>k</sub>'' of degree 2(''k'' − 1). The function <math>e^{-\frac{1}{x^2}}</math> tends to zero faster than any polynomial as <math display="inline">x \to 0</math>, so ''f'' is infinitely many times differentiable and {{nowrap|1=''f''{{i sup|(''k'')}}(0) = 0}} for every positive integer ''k''. The above results all hold in this case:

* The Taylor series of ''f'' converges uniformly to the zero function ''T<sub>f</sub>''(''x'')&nbsp;=&nbsp;0, which is analytic with all coefficients equal to zero.
* The function ''f'' is unequal to this Taylor series, and hence non-analytic.
* For any order ''k''&nbsp;∈&nbsp;'''N''' and radius ''r''&nbsp;>&nbsp;0 there exists ''M<sub>k,r</sub>''&nbsp;>&nbsp;0 satisfying the remainder bound&nbsp;({{EquationNote|★★}}) above.
However, as ''k'' increases for fixed ''r'', the value of ''M<sub>k,r</sub>'' grows more quickly than ''r<sup>k</sup>'', and the error does not go to zero''.''

=== Taylor's theorem in complex analysis ===

Taylor's theorem generalizes to functions ''f'' : '''C''' → '''C''' which are [[complex differentiable]] in an open subset ''U''&nbsp;⊂&nbsp;'''C''' of the [[complex plane]]. However, its usefulness is dwarfed by other general theorems in [[complex analysis]]. Namely, stronger versions of related results can be deduced for [[complex differentiable]] functions ''f''&nbsp;:&nbsp;''U''&nbsp;→&nbsp;'''C''' using [[Cauchy's integral formula]] as follows.

Let ''r''&nbsp;>&nbsp;0 such that the [[closed disk]] ''B''(''z'',&nbsp;''r'')&nbsp;∪&nbsp;''S''(''z'',&nbsp;''r'') is contained in ''U''. Then Cauchy's integral formula with a positive parametrization {{nowrap|1=''γ''(''t'') = ''z'' + ''re<sup>it</sup>''}} of the circle ''S''(''z'', ''r'') with <math>t \in [0,2 \pi]</math> gives

<math display="block">f(z) = \frac{1}{2\pi i}\int_\gamma \frac{f(w)}{w-z}\,dw, \quad f'(z) = \frac{1}{2\pi i}\int_\gamma \frac{f(w)}{(w-z)^2} \, dw, \quad \ldots, \quad f^{(k)}(z) = \frac{k!}{2\pi i}\int_\gamma \frac{f(w)}{(w-z)^{k+1}} \, dw.</math>

Here all the integrands are continuous on the [[circle]] ''S''(''z'',&nbsp;''r''), which justifies differentiation under the integral sign. In particular, if ''f'' is once [[complex differentiable]] on the open set ''U'', then it is actually infinitely many times [[complex differentiable]] on ''U''. One also obtains [[Cauchy's estimate]]<ref>{{harvnb|Rudin|1987|loc=§10.26}}</ref>

<math display="block"> |f^{(k)}(z)| \leq \frac{k!}{2\pi}\int_\gamma \frac{M_r}{|w-z|^{k+1}} \, dw = \frac{k!M_r}{r^k}, \quad M_r = \max_{|w-c|=r}|f(w)| </math>

for any ''z''&nbsp;∈&nbsp;''U'' and ''r''&nbsp;>&nbsp;0 such that ''B''(''z'',&nbsp;''r'')&nbsp;∪&nbsp;''S''(''c'',&nbsp;''r'')&nbsp;⊂&nbsp;''U''. The estimate implies that the [[complex number|complex]] [[Taylor series]]

<math display="block"> T_f(z) = \sum_{k=0}^\infty \frac{f^{(k)}(c)}{k!}(z-c)^k </math>

of ''f'' converges uniformly on any [[open disk]] <math display="inline">B(c,r) \subset U</math> with <math display="inline">S(c,r) \subset U</math> into some function ''T<sub>f</sub>''. Furthermore, using the [[contour integral]] formulas for the derivatives ''f''{{i sup|(''k'')}}(''c''),

<math display="block">\begin{align}
T_f(z) &= \sum_{k=0}^\infty \frac{(z-c)^k}{2\pi i}\int_\gamma \frac{f(w)}{(w-c)^{k+1}} \, dw \\
&= \frac{1}{2\pi i} \int_\gamma \frac{f(w)}{w-c} \sum_{k=0}^\infty \left(\frac{z-c}{w-c}\right)^k \, dw \\
&= \frac{1}{2\pi i} \int_\gamma \frac{f(w)}{w-c}\left( \frac{1}{1-\frac{z-c}{w-c}} \right) \, dw \\
&= \frac{1}{2\pi i} \int_\gamma \frac{f(w)}{w-z} \, dw \\
&= f(z),
\end{align}</math>

so any [[complex derivative|complex differentiable]] function ''f'' in an open set ''U''&nbsp;⊂&nbsp;'''C''' is in fact [[complex analytic]]. All that is said for real analytic functions [[Taylor's theorem#Relationship to analyticity##Taylor expansions of analytic functions|here]] holds also for complex analytic functions with the open interval ''I'' replaced by an open subset ''U''&nbsp;∈&nbsp;'''C''' and ''a''-centered intervals (''a''&nbsp;−&nbsp;''r'',&nbsp;''a''&nbsp;+&nbsp;''r'') replaced by ''c''-centered disks ''B''(''c'',&nbsp;''r''). In particular, the Taylor expansion holds in the form

<math display="block"> f(z) = P_k(z) + R_k(z), \quad P_k(z) = \sum_{j=0}^k \frac{f^{(j)}(c)}{j!}(z-c)^j, </math>

where the remainder term ''R<sub>k</sub>'' is complex analytic. Methods of complex analysis provide some powerful results regarding Taylor expansions. For example, using Cauchy's integral formula for any positively oriented [[Jordan curve]] <math display="inline">\gamma</math> which parametrizes the boundary <math display="inline">\partial W \subset U</math> of a region <math display="inline">W \subset U</math>, one obtains expressions for the derivatives {{nowrap|''f''{{i sup|(''j'')}}(''c'')}} as above, and modifying slightly the computation for {{nowrap|1=''T<sub>f</sub>''(''z'') = ''f''(''z'')}}, one arrives at the exact formula

<math display="block"> R_k(z) = \sum_{j=k+1}^\infty \frac{(z-c)^j}{2\pi i} \int_\gamma \frac{f(w)}{(w-c)^{j+1}} \, dw = \frac{(z-c)^{k+1}}{2\pi i} \int_\gamma \frac{f(w) \, dw}{(w-c)^{k+1}(w-z)} , \qquad z\in W. </math>

The important feature here is that the quality of the approximation by a Taylor polynomial on the region <math display="inline">W \subset U</math> is dominated by the values of the function ''f'' itself on the boundary <math display="inline">\partial W \subset U</math>. Similarly, applying Cauchy's estimates to the series expression for the remainder, one obtains the uniform estimates

<math display="block"> |R_k(z)|
\leq \sum_{j=k+1}^\infty \frac{M_r |z-c|^j}{r^j}
= \frac{M_r}{r^{k+1}} \frac{|z-c|^{k+1}}{1-\frac{|z-c|}{r}}
\leq \frac{M_r \beta^{k+1}}{1-\beta}, \qquad \frac{|z-c|}{r} \leq \beta < 1. </math>

=== Example ===

[[File:Function with two poles.png|thumb|right|Complex plot of <math display="inline">f(z)=\frac{1}{1+z^2}</math>. Modulus is shown by elevation and argument by coloring: cyan&nbsp;=&nbsp;<math display="inline">0</math>, blue&nbsp;=&nbsp;<math display="inline">\frac{\pi}{3}</math>, violet&nbsp;=&nbsp;<math display="inline">\frac{2\pi}{3}</math>, red&nbsp;=&nbsp;<math>\pi</math>, yellow&nbsp;=&nbsp;<math display="inline">\frac{4\pi}{3}</math>, green&nbsp;=&nbsp;<math display="inline">\frac{5\pi}{3}</math>.]]
The function

<math display="block">\begin{align}
& f : \R \to \R \\
& f(x) = \frac{1}{1+x^2}
\end{align}</math>

is [[analytic function|real analytic]], that is, locally determined by its Taylor series. This function was plotted [[Taylor's theorem#Motivation|above]] to illustrate the fact that some elementary functions cannot be approximated by Taylor polynomials in neighborhoods of the center of expansion which are too large. This kind of behavior is easily understood in the framework of complex analysis. Namely, the function ''f'' extends into a [[meromorphic function]]

<math display="block">\begin{align}
& f:\Complex \cup \{\infty\} \to \Complex \cup \{\infty\} \\
& f(z) = \frac{1}{1+z^2}
\end{align}</math>

on the compactified complex plane. It has simple poles at <math display="inline">z=i</math> and <math display="inline">z=-i</math>, and it is analytic elsewhere. Now its Taylor series centered at ''z''<sub>0</sub> converges on any disc ''B''(''z''<sub>0</sub>, ''r'') with ''r'' < |''z''&nbsp;−&nbsp;''z''<sub>0</sub>|, where the same Taylor series converges at ''z''&nbsp;∈&nbsp;'''C'''. Therefore, Taylor series of ''f'' centered at 0 converges on ''B''(0, 1) and it does not converge for any ''z'' ∈ '''C''' with |''z''|&nbsp;>&nbsp;1 due to the poles at ''i'' and −''i''. For the same reason the Taylor series of ''f'' centered at 1 converges on <math display="inline">B(1, \sqrt{2})</math> and does not converge for any ''z''&nbsp;∈&nbsp;'''C''' with <math display="inline">\left\vert z-1 \right\vert>\sqrt{2}</math>.

== Generalizations of Taylor's theorem ==

=== Higher-order differentiability ===

A function {{math|''f'': '''R'''<sup>''n''</sup> → '''R'''}} is [[derivative|differentiable]] at {{math|'''''a''''' ∈ '''R'''<sup>''n''</sup>}} [[if and only if]] there exists a [[linear functional]] {{math|''L'' : '''R'''<sup>''n''</sup> → '''R'''}} and a function {{math|''h'' : '''R'''<sup>''n''</sup> → '''R'''}} such that

<math display="block"> f(\boldsymbol{x}) = f(\boldsymbol{a}) + L(\boldsymbol{x}-\boldsymbol{a}) + h(\boldsymbol{x})\lVert\boldsymbol{x}-\boldsymbol{a}\rVert,
\qquad \lim_{\boldsymbol{x}\to\boldsymbol{a}} h(\boldsymbol{x})=0. </math>

If this is the case, then <math display="inline">L = df(\boldsymbol{a})</math> is the (uniquely defined) [[differential of a function|differential]] of {{math|''f''}} at the point {{math|'''''a'''''}}. Furthermore, then the [[partial derivatives]] of {{math|''f''}} exist at {{math|'''''a'''''}} and the differential of {{math|''f''}} at {{math|'''''a'''''}} is given by

<math display="block"> df( \boldsymbol{a} )( \boldsymbol{v} ) = \frac{\partial f}{\partial x_1}(\boldsymbol{a}) v_1 + \cdots + \frac{\partial f}{\partial x_n}(\boldsymbol{a}) v_n. </math>

Introduce the [[multi-index notation]]

<math display="block"> |\alpha| = \alpha_1+\cdots+\alpha_n, \quad \alpha!=\alpha_1!\cdots\alpha_n!, \quad \boldsymbol{x}^\alpha=x_1^{\alpha_1}\cdots x_n^{\alpha_n} </math>

for {{math|''α'' ∈ '''N'''<sup>''n''</sup>}} and {{math|'''''x''''' ∈ '''R'''<sup>''n''</sup>}}. If all the <math display="inline">k</math>-th order [[partial derivatives]] of {{math|''f'' : '''R'''<sup>''n''</sup> → '''R'''}} are continuous at {{math|'''''a''''' ∈ '''R'''<sup>''n''</sup>}}, then by [[symmetry of second derivatives|Clairaut's theorem]], one can change the order of mixed derivatives at {{math|'''''a'''''}}, so the short-hand notation

<math display="block"> D^\alpha f = \frac{\partial^{|\alpha|}f}{\partial\boldsymbol x^\alpha} = \frac{\partial^{\alpha_1 + \ldots + \alpha_n}f}{\partial x_1^{\alpha_1}\cdots \partial x_n^{\alpha_n}}</math>

for the higher order [[partial derivatives]] is justified in this situation. The same is true if all the ({{math|''k'' − 1}})-th order partial derivatives of {{math|''f''}} exist in some neighborhood of {{math|'''''a'''''}} and are differentiable at {{math|'''''a'''''}}.<ref>This follows from iterated application of the theorem that if the partial derivatives of a function {{math|''f''}} exist in a neighborhood of {{math|'''''a'''''}} and are continuous at {{math|'''''a'''''}}, then the function is differentiable at {{math|'''''a'''''}}. See, for instance, {{harvnb|Apostol|1974|loc=Theorem 12.11}}.</ref> Then we say that {{math|''f''}} is {{math|''k''}} '''times differentiable at the point&nbsp;{{math|''a''}}'''.

=== Taylor's theorem for multivariate functions ===
Using notations of the preceding section, one has the following theorem.
{{math theorem|name=Multivariate version of Taylor's theorem<ref>Königsberger Analysis 2, p. 64 ff.</ref>|math_statement= Let {{math|''f'' : '''R'''<sup>''n''</sup> → '''R'''}} be a {{math|''k''}}-times [[continuously differentiable]] function at the point {{math|'''''a''''' ∈ '''R'''<sup>''n''</sup>}}. Then there exist functions {{math|''h''<sub>''α''</sub> : '''R'''<sup>''n''</sup> → '''R'''}}, where <math>|\alpha|=k,</math> such that

<math display="block">\begin{align}
& f(\boldsymbol{x}) = \sum_{|\alpha|\leq k} \frac{D^\alpha f(\boldsymbol{a})}{\alpha!} (\boldsymbol{x}-\boldsymbol{a})^\alpha + \sum_{|\alpha|=k} h_\alpha(\boldsymbol{x})(\boldsymbol{x}-\boldsymbol{a})^\alpha, \\
& \mbox{and}\quad \lim_{\boldsymbol{x}\to \boldsymbol{a}}h_\alpha(\boldsymbol{x})=0.
\end{align}</math>}}

If the function {{math|''f'' : '''R'''<sup>''n''</sup> → '''R'''}} is {{math|''k'' + 1}} times [[continuously differentiable]] in a [[closed ball]] <math>B = \{ \mathbf{y} \in \R^n : \left\|\mathbf{a}-\mathbf{y}\right\| \leq r\}</math> for some <math>r > 0</math>, then one can derive an exact formula for the remainder in terms of {{nowrap|({{math|''k''+1}})-th}} order [[partial derivatives]] of ''f'' in this neighborhood.<ref>{{cite web | title = Higher-Order Derivatives and Taylor's Formula in Several Variables | last = Folland | first = G. B. | url = https://sites.math.washington.edu/~folland/Math425/taylor2.pdf | website = Department of Mathematics {{!}} University of Washington | access-date = 2024-02-21 }}</ref> Namely,

<math display="block"> \begin{align}
& f( \boldsymbol{x} ) = \sum_{|\alpha|\leq k} \frac{D^\alpha f(\boldsymbol{a})}{\alpha!} (\boldsymbol{x}-\boldsymbol{a})^\alpha + \sum_{|\beta|=k+1} R_\beta(\boldsymbol{x})(\boldsymbol{x}-\boldsymbol{a})^\beta, \\
& R_\beta( \boldsymbol{x} ) = \frac{|\beta|}{\beta!} \int_0^1 (1-t)^{|\beta|-1}D^\beta f \big(\boldsymbol{a}+t( \boldsymbol{x}-\boldsymbol{a} )\big) \, dt.
\end{align}
</math>
</math>


In this case, due to the [[continuous function|continuity]] of ({{math|''k''+1}})-th order [[partial derivative]]s in the [[compact set]] {{math|''B''}}, one immediately obtains the uniform estimates
Putting these two facts together and rearranging the terms of (1) yields

<math display="block"> \left|R_\beta(\boldsymbol{x})\right| \leq \frac{1}{\beta!} \max_{|\alpha|=|\beta|} \max_{\boldsymbol{y}\in B} |D^\alpha f(\boldsymbol{y})|, \qquad \boldsymbol{x}\in B. </math>

=== Example in two dimensions ===

For example, the third-order Taylor polynomial of a smooth function <math>f:\mathbb R^2\to\mathbb R</math> is, denoting <math>\boldsymbol{x}-\boldsymbol{a}=\boldsymbol{v}</math>,

<math display="block"> \begin{align}
P_3(\boldsymbol{x}) = f ( \boldsymbol{a} ) + {} &\frac{\partial f}{\partial x_1}( \boldsymbol{a} ) v_1 + \frac{\partial f}{\partial x_2}( \boldsymbol{a} ) v_2 + \frac{\partial^2 f}{\partial x_1^2}( \boldsymbol{a} ) \frac {v_1^2}{2!} + \frac{\partial^2 f}{\partial x_1 \partial x_2}( \boldsymbol{a} ) v_1 v_2 + \frac{\partial^2 f}{\partial x_2^2}( \boldsymbol{a} ) \frac{v_2^2}{2!} \\
& + \frac{\partial^3 f}{\partial x_1^3}( \boldsymbol{a} ) \frac{v_1^3}{3!} + \frac{\partial^3 f}{\partial x_1^2 \partial x_2}( \boldsymbol{a} ) \frac{v_1^2 v_2}{2!} + \frac{\partial^3 f}{\partial x_1 \partial x_2^2}( \boldsymbol{a} ) \frac{v_1 v_2^2}{2!} + \frac{\partial^3 f}{\partial x_2^3}( \boldsymbol{a} ) \frac{v_2^3}{3!}
\end{align}</math>

== Proofs ==

=== Proof for Taylor's theorem in one real variable ===

Let<ref>{{harvnb|Stromberg|1981}}</ref>


<math display="block"> h_k(x) = \begin{cases}
:<math>
R_n = \frac{f^{(n+1)}(\xi)}{n!}(x-\xi)^n\cdot\frac{G(x)-G(a)}{G'(\xi)},
\frac{f(x) - P(x)}{(x-a)^k} & x\not=a\\
0&x=a
\end{cases}
</math>
</math>


where, as in the statement of Taylor's theorem,
which was to be shown.


<math display="block"> P(x) = f(a) + f'(a)(x-a) + \frac{f''(a)}{2!}(x-a)^2 + \cdots + \frac{f^{(k)}(a)}{k!}(x-a)^k.</math>
Note that the Lagrange form of the remainder comes from taking ''G''(''t'')&nbsp;= {{nowrap|(''x'' − ''t'')<sup>''n''+1</sup>,}} the given Cauchy form of the remainder comes from taking ''G''(''t'')&nbsp;= {{nowrap|(''t'' − ''a''),}} and the integral form of the remainder comes from taking


It is sufficient to show that
:<math>

G(t) = \int_a^t \frac{f^{(n+1)}(s)}{n!} (x-s)^n ds.
<math display="block"> \lim_{x\to a} h_k(x) =0. </math>

The proof here is based on repeated application of [[L'Hôpital's rule]]. Note that, for each <math display="inline">j=0,1,...,k-1</math>, <math>f^{(j)}(a)=P^{(j)}(a)</math>. Hence each of the first <math display="inline">k-1</math> derivatives of the numerator in <math>h_k(x)</math> vanishes at <math>x=a</math>, and the same is true of the denominator. Also, since the condition that the function <math display="inline">f</math> be <math display="inline">k</math> times differentiable at a point requires differentiability up to order <math display="inline">k-1</math> in a neighborhood of said point (this is true, because differentiability requires a function to be defined in a whole neighborhood of a point), the numerator and its <math display="inline">k-2</math> derivatives are differentiable in a neighborhood of <math display="inline">a</math>. Clearly, the denominator also satisfies said condition, and additionally, doesn't vanish unless <math display="inline">x=a</math>, therefore all conditions necessary for L'Hôpital's rule are fulfilled, and its use is justified. So

<math display="block">\begin{align}
\lim_{x\to a} \frac{f(x) - P(x)}{(x-a)^k}
&= \lim_{x\to a} \frac{\frac{d}{dx}(f(x) - P(x))}{\frac{d}{dx}(x-a)^k} \\[1ex]
&= \cdots \\[1ex]
&= \lim_{x\to a} \frac{\frac{d^{k-1}}{dx^{k-1}}(f(x) - P(x))}{\frac{d^{k-1}}{dx^{k-1}}(x-a)^k}\\[1ex]
&= \frac{1}{k!}\lim_{x\to a} \frac{f^{(k-1)}(x) - P^{(k-1)}(x)}{x-a}\\[1ex]
&=\frac{1}{k!}(f^{(k)}(a) - P^{(k)}(a))
= 0
\end{align}</math>

where the second-to-last equality follows by the definition of the derivative at <math display="inline"> x=a</math>.

=== Alternate proof for Taylor's theorem in one real variable ===

Let <math>f(x)</math> be any real-valued continuous function to be approximated by the Taylor polynomial.

Step 1: Let <math display="inline">F</math> and <math display="inline">G</math> be functions. Set <math display="inline">F</math> and <math display="inline">G</math> to be

<math display="block">\begin{align}
F(x) = f(x) - \sum^{n-1}_{k=0} \frac{f^{(k)}(a)}{k!}(x-a)^{k}
\end{align}</math>

<math display="block">\begin{align}
G(x) = (x-a)^{n}
\end{align}</math>

Step 2: Properties of <math display="inline">F</math> and <math display="inline">G</math>:

<math display="block">\begin{align}
F(a) & = f(a) - f(a) - f'(a)(a - a) - ... - \frac{f^{(n-1)}(a)}{(n-1)!}(a-a)^{n-1} = 0 \\
G(a) & = (a-a)^n = 0
\end{align}</math>

Similarly,

<math display="block">\begin{align}
F'(a) = f'(a) - f'(a) - \frac{f''(a)}{(2-1)!}(a-a)^{(2-1)} - ... - \frac{f^{(n-1)}(a)}{(n-2)!}(a-a)^{n-2} = 0
\end{align}</math>

<math display="block">\begin{align}
G'(a) &= n(a-a)^{n-1} = 0\\
&\qquad \vdots\\
G^{(n-1)}(a) &= F^{(n-1)}(a) = 0
\end{align}</math>

Step 3: Use Cauchy Mean Value Theorem

Let <math>f_{1}</math> and <math>g_{1}</math> be continuous functions on <math>[a, b]</math>. Since <math>a < x < b</math> so we can work with the interval <math>[a, x]</math>. Let <math>f_{1}</math> and <math>g_{1}</math> be differentiable on <math>(a, x)</math>. Assume <math>g_{1}'(x) \neq 0</math> for all <math>x \in (a, b)</math>.
Then there exists <math>c_{1} \in (a, x)</math> such that

<math display="block">\begin{align}
\frac{f_{1}(x) - f_{1}(a)}{g_{1}(x) - g_{1}(a)} = \frac{f_{1}'(c_{1})}{g_{1}'(c_{1})}
\end{align}</math>

Note: <math>G'(x) \neq 0</math> in <math>(a, b)</math> and <math>F(a), G(a) = 0</math> so

<math display="block">\begin{align}
\frac{F(x)}{G(x)} = \frac{F(x) - F(a)}{G(x) - G(a)} = \frac{F'(c_{1})}{G'(c_{1})}
\end{align}</math>

for some <math>c_{1} \in (a, x)</math>.

This can also be performed for <math>(a, c_{1})</math>:

<math display="block">\begin{align}
\frac{F'(c_{1})}{G'(c_{1})} = \frac{F'(c_{1}) - F'(a)}{G'(c_{1}) - G'(a)} = \frac{F''(c_{2})}{G''(c_{2})}
\end{align}</math>

for some <math>c_{2} \in (a, c_{1})</math>.
This can be continued to <math>c_{n}</math>.

This gives a partition in <math>(a, b)</math>:

<math display="block">a < c_{n} < c_{n-1} < \dots < c_{1} < x</math>

with

<math display="block">\frac{F(x)}{G(x)} = \frac{F'(c_{1})}{G'(c_{1})} = \dots = \frac{F^{(n)}(c_{n})}{G^{(n)}(c_{n})} .</math>

Set <math>c = c_{n}</math>:

<math display="block">\frac{F(x)}{G(x)} = \frac{F^{(n)}(c)}{G^{(n)}(c)}</math>

Step 4: Substitute back

<math display="block">\begin{align}
\frac{F(x)}{G(x)} = \frac{f(x) - \sum^{n-1}_{k=0} \frac{f^{(k)}(a)}{k!}(x-a)^{k}}{(x-a)^{n}} = \frac{F^{(n)}(c)}{G^{(n)}(c)}
\end{align}</math>

By the Power Rule, repeated derivatives of <math>(x - a)^{n}</math>, <math>G^{(n)}(c) = n(n-1)...1</math>, so:

<math display="block">\frac{F^{(n)}(c)}{G^{(n)}(c)} = \frac{f^{(n)}(c)}{n(n-1)\cdots1} = \frac{f^{(n)}(c)}{n!}.</math>

This leads to:

<math display="block">\begin{align}
f(x) - \sum^{n-1}_{k=0} \frac{f^{(k)}(a)}{k!}(x-a)^{k} = \frac{f^{(n)}(c)}{n!}(x-a)^{n}
\end{align}.</math>

By rearranging, we get:

<math display="block">\begin{align}
f(x) = \sum^{n-1}_{k=0} \frac{f^{(k)}(a)}{k!}(x-a)^{k} + \frac{f^{(n)}(c)}{n!}(x-a)^{n}
\end{align},</math>

or because <math>c_{n} = a</math> eventually:

<math display="block">f(x) = \sum^{n}_{k=0} \frac{f^{(k)}(a)}{k!}(x-a)^{k}.</math>

=== Derivation for the mean value forms of the remainder ===

Let ''G'' be any real-valued function, continuous on the closed interval between <math display=inline>a</math> and <math display=inline>x</math> and differentiable with a non-vanishing derivative on the open interval between <math display=inline>a</math> and <math display=inline>x</math>, and define

<math display="block"> F(t) = f(t) + f'(t)(x-t) + \frac{f''(t)}{2!}(x-t)^2 + \cdots + \frac{f^{(k)}(t)}{k!}(x-t)^k.
</math>
</math>


For <math> t \in [a,x] </math>. Then, by [[mean value theorem#Cauchy's mean value theorem|Cauchy's mean value theorem]],
==Proof: several variables==
Let ''x'' = (''x''<sub>1</sub>,...,''x<sub>N</sub>'') lie in the ball ''B'' with center ''a''. Parametrize the line segment between ''a'' and ''x'' by ''u''(''t'') = {{nowrap|''a'' + ''t''(''x'' − ''a'').}} We apply the one-variable version of Taylor's theorem to the function ƒ(''u''(''t'')):


:<math>f(x)=f(u(1))=f(a)+\sum_{k=1}^n\left.\frac{1}{k!}\frac{d^k}{dt^k}\right|_{t=0}f(u(t))\ +\ \int_0^1 \frac{(1-t)^n }{n!} \frac{d^{n+1}}{dt^{n+1}} f(u(t))\, dt.</math>
{{NumBlk|:|<math> \frac{F'(\xi)}{G'(\xi)} = \frac{F(x) - F(a)}{G(x) - G(a)}</math>|{{EquationRef|★★★}}}}


for some <math display="inline">\xi</math> on the open interval between <math display=inline>a</math> and <math display=inline>x</math>. Note that here the numerator <math display="inline">F(x)-F(a)=R_k(x)</math> is exactly the remainder of the Taylor polynomial for <math display="inline">y=f(x)</math>. Compute
By the [[chain rule]] for several variables,


<math display="block">\begin{align}
:<math>\frac{d^k}{dt^k}f(u(t)) = \frac{d^k}{dt^k} f(a+t(x-a)) = \sum_{|\alpha|=k} \left(\begin{matrix} k \\ \alpha\end{matrix} \right) (D^\alpha f) (a+t(x-a))\cdot (x-a)^\alpha</math>
F'(t) = {} & f'(t) + \big(f''(t)(x-t) - f'(t)\big) + \left(\frac{f^{(3)}(t)}{2!}(x-t)^2 - \frac{f^{(2)}(t)}{1!}(x-t)\right) + \cdots \\
& \cdots + \left( \frac{f^{(k+1)}(t)}{k!}(x-t)^k - \frac{f^{(k)}(t)}{(k-1)!}(x-t)^{k-1}\right)
= \frac{f^{(k+1)}(t)}{k!}(x-t)^k,
\end{align}</math>


plug it into ({{EquationNote|★★★}}) and rearrange terms to find that
where <math>\left(\begin{matrix}k \\ \alpha\end{matrix}\right)</math> is the [[multinomial coefficient]] for the [[multi-index]] &alpha;. Since <math>\frac{1}{k!}\left(\begin{matrix}k\\ \alpha\end{matrix}\right)=\frac{1}{\alpha!}</math>, we get


<math display="block"> R_k(x) = \frac{f^{(k+1)}(\xi)}{k!}(x-\xi)^k \frac{G(x)-G(a)}{G'(\xi)}.</math>
:<math>f(x)= f(a)+\sum_{|\alpha|=1}^n\frac{1}{\alpha!} (D^\alpha f) (a)(x-a)^\alpha+\sum_{|\alpha|=n+1}\frac{n+1}{\alpha!}
(x-a)^\alpha \int_0^1 (1-t)^n (D^\alpha f)(a+t(x-a))\,dt.</math>


This is the form of the remainder term mentioned after the actual statement of Taylor's theorem with remainder in the mean value form.
The remainder term is given by
The Lagrange form of the remainder is found by choosing <math> G(t) = (x-t)^{k+1} </math> and the Cauchy form by choosing <math> G(t) = t-a</math>.


'''Remark.''' Using this method one can also recover the integral form of the remainder by choosing
:<math>\sum_{|\alpha|=n+1}\frac{n+1}{\alpha!} (x-a)^\alpha \int_0^1 (1-t)^n (D^\alpha f)(a+t(x-a))\,dt.</math>


<math display="block"> G(t) = \int_a^t \frac{f^{(k+1)}(s)}{k!} (x-s)^k \, ds,</math>
The terms of this summation are explicit forms for the ''R''<sub>&alpha;</sub> in the statement of the theorem. These are easily seen to satisfy the required estimate.


but the requirements for ''f'' needed for the use of mean value theorem are too strong, if one aims to prove the claim in the case that ''f''{{i sup|(''k'')}} is only [[absolutely continuous]]. However, if one uses [[Riemann integral]] instead of [[Lebesgue integral]], the assumptions cannot be weakened.
==See also==
* [[Taylor series]]
* [[Linear approximation]]
* [[Power series]]
* [[Laurent series]] – an extension of Taylor series for functions with singularities.
* [[Padé approximant]] – approximation of a function by a rational function
* [[Newton series]] - a discrete version of Taylor series
* '''G-Taylor''' - A generalized Taylor's theorem, which approximates a function in powers of another function. Several well-known theorems in Calculus are its special cases such as the Taylor’s formula, the Mean Value Theorem, Cauchy’s Mean Value Theorem, and Newton-Raphson’s method. <ref>Hamed Shah-Hosseini, Generalizing Taylor’s Theorem to Approximate a Function in Powers of Another Function, International Review of Automatic Control (Theory and Applications),July 2010.[http://www.praiseworthyprize.com/ireaco_latest.html#Generalizing_Taylor%E2%80%99s_Theorem_to_Approximate_a_Function_in_Powers_of_Another_Function] </ref>


=== Derivation for the integral form of the remainder ===
== Notes ==

Due to the [[absolutely continuous|absolute continuity]] of <math>f^{(k)}</math> on the [[closed interval]] between <math display=inline>a</math> and <math display=inline>x</math>, its derivative <math>f^{(k+1)}</math> exists as an <math>L^1</math>-function, and we can use the [[fundamental theorem of calculus]] and [[integration by parts]]. This same proof applies for the [[Riemann integral]] assuming that <math>f^{(k)}</math> is [[continuous function|continuous]] on the closed interval and [[Differentiable function|differentiable]] on the [[open interval]] between <math display=inline>a</math> and <math display=inline>x</math>, and this leads to the same result than using the mean value theorem.

The [[fundamental theorem of calculus]] states that

<math display="block"> f(x)=f(a)+ \int_a^x \, f'(t) \, dt.</math>

Now we can [[Integration by parts|integrate by parts]] and use the fundamental theorem of calculus again to see that

<math display="block"> \begin{align}
f(x) &= f(a)+\Big(xf'(x)-af'(a)\Big)-\int_a^x tf''(t) \, dt \\
&= f(a) + x\left(f'(a) + \int_a^x f''(t) \,dt \right) -af'(a)-\int_a^x tf''(t) \, dt \\
&= f(a)+(x-a)f'(a)+\int_a^x \, (x-t)f''(t) \, dt,
\end{align} </math>

which is exactly Taylor's theorem with remainder in the integral form in the case <math>k=1</math>. The general statement is proved using [[mathematical induction|induction]]. Suppose that
{{NumBlk|:|<math> f(x) = f(a) + \frac{f'(a)}{1!}(x - a) + \cdots + \frac{f^{(k)}(a)}{k!}(x - a)^k + \int_a^x \frac{f^{(k+1)} (t)}{k!} (x - t)^k \, dt. </math>|{{EquationRef|eq1}}}}

Integrating the remainder term by parts we arrive at

<math display="block">\begin{align}
\int_a^x \frac{f^{(k+1)} (t)}{k!} (x - t)^k \, dt = & - \left[ \frac{f^{(k+1)} (t)}{(k+1)k!} (x - t)^{k+1} \right]_a^x + \int_a^x \frac{f^{(k+2)} (t)}{(k+1)k!} (x - t)^{k+1} \, dt \\
= & \ \frac{f^{(k+1)} (a)}{(k+1)!} (x - a)^{k+1} + \int_a^x \frac{f^{(k+2)} (t)}{(k+1)!} (x - t)^{k+1} \, dt.
\end{align}</math>

Substituting this into the formula {{nowrap|in ({{EquationNote|eq1}})}} shows that if it holds for the value <math>k</math>, it must also hold for the value <math>k+1</math>. Therefore, since it holds for <math>k=1</math>, it must hold for every positive integer <math>k</math>.

=== Derivation for the remainder of multivariate Taylor polynomials ===

We prove the special case, where <math>f:\mathbb R^n\to\mathbb R</math> has continuous partial derivatives up to the order <math>k+1</math> in some closed ball <math>B</math> with center <math>\boldsymbol{a}</math>. The strategy of the proof is to apply the one-variable case of Taylor's theorem to the restriction of <math>f</math> to the line segment adjoining <math>\boldsymbol{x}</math> and <math>\boldsymbol{a}</math>.<ref>{{harvnb|Hörmander|1976|pp=12–13}}</ref> Parametrize the line segment between <math>\boldsymbol{a}</math> and <math>\boldsymbol{x}</math> by <math>\boldsymbol{u}(t)=\boldsymbol{a}+t(\boldsymbol{x}-\boldsymbol{a})</math> We apply the one-variable version of Taylor's theorem to the function <math>g(t) = f(\boldsymbol{u}(t))</math>:

<math display="block"> f(\boldsymbol{x})=g(1)=g(0)+\sum_{j=1}^k\frac{1}{j!}g^{(j)}(0)\ +\ \int_0^1 \frac{(1-t)^k }{k!} g^{(k+1)}(t)\, dt.</math>

Applying the [[chain rule]] for several variables gives

<math display="block">\begin{align}
g^{(j)}(t)&=\frac{d^j}{dt^j}f(\boldsymbol{u}(t))\\
&= \frac{d^j}{dt^j} f(\boldsymbol{a}+t(\boldsymbol{x}-\boldsymbol{a}))\\
&= \sum_{|\alpha| =j} \left(\begin{matrix} j\\ \alpha\end{matrix} \right) (D^\alpha f) (\boldsymbol{a}+t(\boldsymbol{x}-\boldsymbol{a})) (\boldsymbol{x}-\boldsymbol{a})^\alpha
\end{align}</math>

where <math>\tbinom j \alpha</math> is the [[multinomial coefficient]]. Since <math>\tfrac{1}{j!}\tbinom j \alpha=\tfrac{1}{\alpha!}</math>, we get:

<math display="block"> f(\boldsymbol{x})= f(\boldsymbol{a}) + \sum_{1 \leq |\alpha| \leq k}\frac{1}{\alpha!} (D^\alpha f) (\boldsymbol{a})(\boldsymbol{x}-\boldsymbol{a})^\alpha+\sum_{|\alpha|=k+1}\frac{k+1}{\alpha!}
(\boldsymbol{x}-\boldsymbol{a})^\alpha \int_0^1 (1-t)^k (D^\alpha f)(\boldsymbol{a}+t(\boldsymbol{x}-\boldsymbol{a}))\,dt.</math>

== See also ==
{{Portal|Mathematics}}
* {{annotated link|Hadamard's lemma}}
* {{annotated link|Laurent series}}
* {{annotated link|Padé approximant}}
* {{annotated link|Newton series}}
* {{annotated link|Approximation theory}}
* {{annotated link|Function approximation}}

== Footnotes ==
{{Reflist}}
{{Reflist}}


== References ==
== References ==
*{{citation
*{{cite book|title = Calculus|authorlink=Tom Apostol|first = Tom|last = Apostol|publisher = Jon Wiley & Sons, Inc.|year = 1967|isbn = 0-471-00005-1}}
| title=Calculus
*{{cite book|title = Calculus: An Intuitive and Physical Approach|first = Morris | last = Klein| publisher = Dover | year = 1998 | isbn = 0-486-40453-6}}
| author-link=Tom Apostol

| first=Tom
==External links==
| last=Apostol
| publisher=Wiley
| year=1967
| isbn=0-471-00005-1
| url-access=registration
| url=https://archive.org/details/calculus01apos
}}.
*{{citation
| title=Mathematical analysis
| first=Tom
| last=Apostol
| publisher=Addison–Wesley
| year=1974
}}.
*{{citation
| first1=Robert G.
| last1=Bartle
| first2=Donald R.
| last2=Sherbert
| title=Introduction to Real Analysis
| edition=4th
| publisher=Wiley
| year=2011
| isbn=978-0-471-43331-6
}}.
*{{citation
| first=L.
| last=Hörmander
| author-link=Lars Hörmander
| title=Linear Partial Differential Operators, Volume 1
| publisher=Springer
| year=1976
| isbn=978-3-540-00662-6
}}.
*{{citation
| title = Mathematical thought from ancient to modern times, Volume 2
| author-link=Morris Kline
| first=Morris
| last=Kline
| publisher=Oxford University Press
| year=1972
}}.
*{{citation
| title=Calculus: An Intuitive and Physical Approach
| first=Morris
| last=Kline
| publisher=Dover
| year=1998
| isbn=0-486-40453-6
}}.
*{{citation
| last=Pedrick
| first=George
| year=1994
| title=A First Course in Analysis
| publisher=Springer
| isbn=0-387-94108-8
| url-access=registration
| url=https://archive.org/details/firstcourseinana0000pedr
}}.
*{{citation
| last=Stromberg
| first=Karl
| title=Introduction to classical real analysis
| publisher=Wadsworth
| year=1981
| isbn=978-0-534-98012-2
}}.
*{{citation
| last=Rudin
| first=Walter
| title=Real and complex analysis
| edition=3rd
| publisher=McGraw-Hill
| year=1987
| isbn=0-07-054234-1
}}.
*{{citation
| last=Tao
| first=Terence
| title=Analysis, Volume I
| edition=3rd
| publisher=Hindustan Book Agency
| year=2014
| isbn=978-93-80250-64-9
}}.


== External links ==
* [http://www.cut-the-knot.org/Curriculum/Calculus/TaylorSeries.shtml Taylor Series Approximation to Cosine] at [[cut-the-knot]]
* [http://www.cut-the-knot.org/Curriculum/Calculus/TaylorSeries.shtml Taylor Series Approximation to Cosine] at [[cut-the-knot]]
* [http://cinderella.de/files/HTMLDemos/2C02_Taylor.html Trigonometric Taylor Expansion] interactive demonstrative applet
* [http://cinderella.de/files/HTMLDemos/2C02_Taylor.html Trigonometric Taylor Expansion] interactive demonstrative applet
* [http://numericalmethods.eng.usf.edu/topics/taylor_series.html Taylor Series Revisited] at [http://numericalmethods.eng.usf.edu Holistic Numerical Methods Institute]
* [http://numericalmethods.eng.usf.edu/topics/taylor_series.html Taylor Series Revisited] at [http://numericalmethods.eng.usf.edu Holistic Numerical Methods Institute]


{{Calculus topics}}
[[Category:Articles containing proofs]]
{{authority control}}
[[Category:Calculus]]
[[Category:Mathematical theorems]]
[[Category:Mathematical series]]


[[Category:Articles containing proofs]]
[[ar:مبرهنة تايلور]]
[[ca:Teorema de Taylor]]
[[Category:Theorems in calculus]]
[[Category:Theorems in real analysis]]
[[cs:Taylorův polynom]]
[[Category:Approximations]]
[[de:Taylor-Formel]]
[[es:Teorema de Taylor]]
[[fr:Théorème de Taylor]]
[[id:Teorema Taylor]]
[[it:Teorema di Taylor]]
[[nl:Stelling van Taylor]]
[[pl:Wzór Taylora]]
[[pt:Teorema de Taylor]]
[[ja:テイラーの定理]]
[[sr:Тејлорова формула]]
[[zh:泰勒公式]]

Latest revision as of 21:51, 1 December 2024

The exponential function (red) and the corresponding Taylor polynomial of degree four (dashed green) around the origin.

In calculus, Taylor's theorem gives an approximation of a -times differentiable function around a given point by a polynomial of degree , called the -th-order Taylor polynomial. For a smooth function, the Taylor polynomial is the truncation at the order of the Taylor series of the function. The first-order Taylor polynomial is the linear approximation of the function, and the second-order Taylor polynomial is often referred to as the quadratic approximation.[1] There are several versions of Taylor's theorem, some giving explicit estimates of the approximation error of the function by its Taylor polynomial.

Taylor's theorem is named after the mathematician Brook Taylor, who stated a version of it in 1715,[2] although an earlier version of the result was already mentioned in 1671 by James Gregory.[3]

Taylor's theorem is taught in introductory-level calculus courses and is one of the central elementary tools in mathematical analysis. It gives simple arithmetic formulas to accurately compute values of many transcendental functions such as the exponential function and trigonometric functions. It is the starting point of the study of analytic functions, and is fundamental in various areas of mathematics, as well as in numerical analysis and mathematical physics. Taylor's theorem also generalizes to multivariate and vector valued functions. It provided the mathematical basis for some landmark early computing machines: Charles Babbage's Difference Engine calculated sines, cosines, logarithms, and other transcendental functions by numerically integrating the first 7 terms of their Taylor series.

Motivation

[edit]
Graph of (blue) with its linear approximation (red) at .

If a real-valued function is differentiable at the point , then it has a linear approximation near this point. This means that there exists a function h1(x) such that

Here

is the linear approximation of for x near the point a, whose graph is the tangent line to the graph at x = a. The error in the approximation is:

As x tends to a, this error goes to zero much faster than , making a useful approximation.

Graph of (blue) with its quadratic approximation (red) at . Note the improvement in the approximation.

For a better approximation to , we can fit a quadratic polynomial instead of a linear function:

Instead of just matching one derivative of at , this polynomial has the same first and second derivatives, as is evident upon differentiation.

Taylor's theorem ensures that the quadratic approximation is, in a sufficiently small neighborhood of , more accurate than the linear approximation. Specifically,

Here the error in the approximation is

which, given the limiting behavior of , goes to zero faster than as x tends to a.

Approximation of (blue) by its Taylor polynomials of order centered at (red) and (green). The approximations do not improve at all outside and , respectively.

Similarly, we might get still better approximations to f if we use polynomials of higher degree, since then we can match even more derivatives with f at the selected base point.

In general, the error in approximating a function by a polynomial of degree k will go to zero much faster than as x tends to a. However, there are functions, even infinitely differentiable ones, for which increasing the degree of the approximating polynomial does not increase the accuracy of approximation: we say such a function fails to be analytic at x = a: it is not (locally) determined by its derivatives at this point.

Taylor's theorem is of asymptotic nature: it only tells us that the error in an approximation by a -th order Taylor polynomial Pk tends to zero faster than any nonzero -th degree polynomial as . It does not tell us how large the error is in any concrete neighborhood of the center of expansion, but for this purpose there are explicit formulas for the remainder term (given below) which are valid under some additional regularity assumptions on f. These enhanced versions of Taylor's theorem typically lead to uniform estimates for the approximation error in a small neighborhood of the center of expansion, but the estimates do not necessarily hold for neighborhoods which are too large, even if the function f is analytic. In that situation one may have to select several Taylor polynomials with different centers of expansion to have reliable Taylor-approximations of the original function (see animation on the right.)

There are several ways we might use the remainder term:

  1. Estimate the error for a polynomial Pk(x) of degree k estimating on a given interval (ar, a + r). (Given the interval and degree, we find the error.)
  2. Find the smallest degree k for which the polynomial Pk(x) approximates to within a given error tolerance on a given interval (ar, a + r) . (Given the interval and error tolerance, we find the degree.)
  3. Find the largest interval (ar, a + r) on which Pk(x) approximates to within a given error tolerance. (Given the degree and error tolerance, we find the interval.)

Taylor's theorem in one real variable

[edit]

Statement of the theorem

[edit]

The precise statement of the most basic version of Taylor's theorem is as follows:

Taylor's theorem[4][5][6] — Let k ≥ 1 be an integer and let the function f : RR be k times differentiable at the point aR. Then there exists a function hk : RR such that

and This is called the Peano form of the remainder.

The polynomial appearing in Taylor's theorem is the -th order Taylor polynomial

of the function f at the point a. The Taylor polynomial is the unique "asymptotic best fit" polynomial in the sense that if there exists a function hk : RR and a -th order polynomial p such that

then p = Pk. Taylor's theorem describes the asymptotic behavior of the remainder term

which is the approximation error when approximating f with its Taylor polynomial. Using the little-o notation, the statement in Taylor's theorem reads as

Explicit formulas for the remainder

[edit]

Under stronger regularity assumptions on f there are several precise formulas for the remainder term Rk of the Taylor polynomial, the most common ones being the following.

Mean-value forms of the remainder — Let f : RR be k + 1 times differentiable on the open interval with f(k) continuous on the closed interval between and .[7] Then

for some real number between and . This is the Lagrange form[8] of the remainder.

Similarly,

for some real number between and . This is the Cauchy form[9] of the remainder.

Both can be thought of as specific cases of the following result: Consider

for some real number between and . This is the Schlömilch form of the remainder (sometimes called the Schlömilch-Roche). The choice is the Lagrange form, whilst the choice is the Cauchy form.

These refinements of Taylor's theorem are usually proved using the mean value theorem, whence the name. Additionally, notice that this is precisely the mean value theorem when . Also other similar expressions can be found. For example, if G(t) is continuous on the closed interval and differentiable with a non-vanishing derivative on the open interval between and , then

for some number between and . This version covers the Lagrange and Cauchy forms of the remainder as special cases, and is proved below using Cauchy's mean value theorem. The Lagrange form is obtained by taking and the Cauchy form is obtained by taking .

The statement for the integral form of the remainder is more advanced than the previous ones, and requires understanding of Lebesgue integration theory for the full generality. However, it holds also in the sense of Riemann integral provided the (k + 1)th derivative of f is continuous on the closed interval [a,x].

Integral form of the remainder[10] — Let be absolutely continuous on the closed interval between and . Then

Due to the absolute continuity of f(k) on the closed interval between and , its derivative f(k+1) exists as an L1-function, and the result can be proven by a formal calculation using the fundamental theorem of calculus and integration by parts.

Estimates for the remainder

[edit]

It is often useful in practice to be able to estimate the remainder term appearing in the Taylor approximation, rather than having an exact formula for it. Suppose that f is (k + 1)-times continuously differentiable in an interval I containing a. Suppose that there are real constants q and Q such that

throughout I. Then the remainder term satisfies the inequality[11]

if x > a, and a similar estimate if x < a. This is a simple consequence of the Lagrange form of the remainder. In particular, if

on an interval I = (ar,a + r) with some , then

for all x∈(ar,a + r). The second inequality is called a uniform estimate, because it holds uniformly for all x on the interval (ar,a + r).

Example

[edit]
Approximation of (blue) by its Taylor polynomials of order centered at (red).

Suppose that we wish to find the approximate value of the function on the interval while ensuring that the error in the approximation is no more than 10−5. In this example we pretend that we only know the following properties of the exponential function:

From these properties it follows that for all , and in particular, . Hence the -th order Taylor polynomial of at and its remainder term in the Lagrange form are given by

where is some number between 0 and x. Since ex is increasing by (), we can simply use for to estimate the remainder on the subinterval . To obtain an upper bound for the remainder on , we use the property for to estimate

using the second order Taylor expansion. Then we solve for ex to deduce that

simply by maximizing the numerator and minimizing the denominator. Combining these estimates for ex we see that

so the required precision is certainly reached, when

(See factorial or compute by hand the values and .) As a conclusion, Taylor's theorem leads to the approximation

For instance, this approximation provides a decimal expression , correct up to five decimal places.

Relationship to analyticity

[edit]

Taylor expansions of real analytic functions

[edit]

Let IR be an open interval. By definition, a function f : IR is real analytic if it is locally defined by a convergent power series. This means that for every a ∈ I there exists some r > 0 and a sequence of coefficients ck ∈ R such that (ar, a + r) ⊂ I and

In general, the radius of convergence of a power series can be computed from the Cauchy–Hadamard formula

This result is based on comparison with a geometric series, and the same method shows that if the power series based on a converges for some bR, it must converge uniformly on the closed interval , where . Here only the convergence of the power series is considered, and it might well be that (aR,a + R) extends beyond the domain I of f.

The Taylor polynomials of the real analytic function f at a are simply the finite truncations

of its locally defining power series, and the corresponding remainder terms are locally given by the analytic functions

Here the functions

are also analytic, since their defining power series have the same radius of convergence as the original series. Assuming that [ar, a + r]I and r < R, all these series converge uniformly on (ar, a + r). Naturally, in the case of analytic functions one can estimate the remainder term by the tail of the sequence of the derivatives f′(a) at the center of the expansion, but using complex analysis also another possibility arises, which is described below.

Taylor's theorem and convergence of Taylor series

[edit]

The Taylor series of f will converge in some interval in which all its derivatives are bounded and do not grow too fast as k goes to infinity. (However, even if the Taylor series converges, it might not converge to f, as explained below; f is then said to be non-analytic.)

One might think of the Taylor series

of an infinitely many times differentiable function f : RR as its "infinite order Taylor polynomial" at a. Now the estimates for the remainder imply that if, for any r, the derivatives of f are known to be bounded over (a − r, a + r), then for any order k and for any r > 0 there exists a constant Mk,r > 0 such that

for every x ∈ (a − r,a + r). Sometimes the constants Mk,r can be chosen in such way that Mk,r is bounded above, for fixed r and all k. Then the Taylor series of f converges uniformly to some analytic function

(One also gets convergence even if Mk,r is not bounded above as long as it grows slowly enough.)

The limit function Tf is by definition always analytic, but it is not necessarily equal to the original function f, even if f is infinitely differentiable. In this case, we say f is a non-analytic smooth function, for example a flat function:

Using the chain rule repeatedly by mathematical induction, one shows that for any order k,

for some polynomial pk of degree 2(k − 1). The function tends to zero faster than any polynomial as , so f is infinitely many times differentiable and f(k)(0) = 0 for every positive integer k. The above results all hold in this case:

  • The Taylor series of f converges uniformly to the zero function Tf(x) = 0, which is analytic with all coefficients equal to zero.
  • The function f is unequal to this Taylor series, and hence non-analytic.
  • For any order k ∈ N and radius r > 0 there exists Mk,r > 0 satisfying the remainder bound (★★) above.

However, as k increases for fixed r, the value of Mk,r grows more quickly than rk, and the error does not go to zero.

Taylor's theorem in complex analysis

[edit]

Taylor's theorem generalizes to functions f : CC which are complex differentiable in an open subset U ⊂ C of the complex plane. However, its usefulness is dwarfed by other general theorems in complex analysis. Namely, stronger versions of related results can be deduced for complex differentiable functions f : U → C using Cauchy's integral formula as follows.

Let r > 0 such that the closed disk B(zr) ∪ S(zr) is contained in U. Then Cauchy's integral formula with a positive parametrization γ(t) = z + reit of the circle S(z, r) with gives

Here all the integrands are continuous on the circle S(zr), which justifies differentiation under the integral sign. In particular, if f is once complex differentiable on the open set U, then it is actually infinitely many times complex differentiable on U. One also obtains Cauchy's estimate[12]

for any z ∈ U and r > 0 such that B(zr) ∪ S(cr) ⊂ U. The estimate implies that the complex Taylor series

of f converges uniformly on any open disk with into some function Tf. Furthermore, using the contour integral formulas for the derivatives f(k)(c),

so any complex differentiable function f in an open set U ⊂ C is in fact complex analytic. All that is said for real analytic functions here holds also for complex analytic functions with the open interval I replaced by an open subset U ∈ C and a-centered intervals (a − ra + r) replaced by c-centered disks B(cr). In particular, the Taylor expansion holds in the form

where the remainder term Rk is complex analytic. Methods of complex analysis provide some powerful results regarding Taylor expansions. For example, using Cauchy's integral formula for any positively oriented Jordan curve which parametrizes the boundary of a region , one obtains expressions for the derivatives f(j)(c) as above, and modifying slightly the computation for Tf(z) = f(z), one arrives at the exact formula

The important feature here is that the quality of the approximation by a Taylor polynomial on the region is dominated by the values of the function f itself on the boundary . Similarly, applying Cauchy's estimates to the series expression for the remainder, one obtains the uniform estimates

Example

[edit]
Complex plot of . Modulus is shown by elevation and argument by coloring: cyan = , blue = , violet = , red = , yellow = , green = .

The function

is real analytic, that is, locally determined by its Taylor series. This function was plotted above to illustrate the fact that some elementary functions cannot be approximated by Taylor polynomials in neighborhoods of the center of expansion which are too large. This kind of behavior is easily understood in the framework of complex analysis. Namely, the function f extends into a meromorphic function

on the compactified complex plane. It has simple poles at and , and it is analytic elsewhere. Now its Taylor series centered at z0 converges on any disc B(z0, r) with r < |z − z0|, where the same Taylor series converges at z ∈ C. Therefore, Taylor series of f centered at 0 converges on B(0, 1) and it does not converge for any zC with |z| > 1 due to the poles at i and −i. For the same reason the Taylor series of f centered at 1 converges on and does not converge for any z ∈ C with .

Generalizations of Taylor's theorem

[edit]

Higher-order differentiability

[edit]

A function f: RnR is differentiable at aRn if and only if there exists a linear functional L : RnR and a function h : RnR such that

If this is the case, then is the (uniquely defined) differential of f at the point a. Furthermore, then the partial derivatives of f exist at a and the differential of f at a is given by

Introduce the multi-index notation

for αNn and xRn. If all the -th order partial derivatives of f : RnR are continuous at aRn, then by Clairaut's theorem, one can change the order of mixed derivatives at a, so the short-hand notation

for the higher order partial derivatives is justified in this situation. The same is true if all the (k − 1)-th order partial derivatives of f exist in some neighborhood of a and are differentiable at a.[13] Then we say that f is k times differentiable at the point a.

Taylor's theorem for multivariate functions

[edit]

Using notations of the preceding section, one has the following theorem.

Multivariate version of Taylor's theorem[14] — Let f : RnR be a k-times continuously differentiable function at the point aRn. Then there exist functions hα : RnR, where such that

If the function f : RnR is k + 1 times continuously differentiable in a closed ball for some , then one can derive an exact formula for the remainder in terms of (k+1)-th order partial derivatives of f in this neighborhood.[15] Namely,

In this case, due to the continuity of (k+1)-th order partial derivatives in the compact set B, one immediately obtains the uniform estimates

Example in two dimensions

[edit]

For example, the third-order Taylor polynomial of a smooth function is, denoting ,

Proofs

[edit]

Proof for Taylor's theorem in one real variable

[edit]

Let[16]

where, as in the statement of Taylor's theorem,

It is sufficient to show that

The proof here is based on repeated application of L'Hôpital's rule. Note that, for each , . Hence each of the first derivatives of the numerator in vanishes at , and the same is true of the denominator. Also, since the condition that the function be times differentiable at a point requires differentiability up to order in a neighborhood of said point (this is true, because differentiability requires a function to be defined in a whole neighborhood of a point), the numerator and its derivatives are differentiable in a neighborhood of . Clearly, the denominator also satisfies said condition, and additionally, doesn't vanish unless , therefore all conditions necessary for L'Hôpital's rule are fulfilled, and its use is justified. So

where the second-to-last equality follows by the definition of the derivative at .

Alternate proof for Taylor's theorem in one real variable

[edit]

Let be any real-valued continuous function to be approximated by the Taylor polynomial.

Step 1: Let and be functions. Set and to be

Step 2: Properties of and :

Similarly,

Step 3: Use Cauchy Mean Value Theorem

Let and be continuous functions on . Since so we can work with the interval . Let and be differentiable on . Assume for all . Then there exists such that

Note: in and so

for some .

This can also be performed for :

for some . This can be continued to .

This gives a partition in :

with

Set :

Step 4: Substitute back

By the Power Rule, repeated derivatives of , , so:

This leads to:

By rearranging, we get:

or because eventually:

Derivation for the mean value forms of the remainder

[edit]

Let G be any real-valued function, continuous on the closed interval between and and differentiable with a non-vanishing derivative on the open interval between and , and define

For . Then, by Cauchy's mean value theorem,

for some on the open interval between and . Note that here the numerator is exactly the remainder of the Taylor polynomial for . Compute

plug it into (★★★) and rearrange terms to find that

This is the form of the remainder term mentioned after the actual statement of Taylor's theorem with remainder in the mean value form. The Lagrange form of the remainder is found by choosing and the Cauchy form by choosing .

Remark. Using this method one can also recover the integral form of the remainder by choosing

but the requirements for f needed for the use of mean value theorem are too strong, if one aims to prove the claim in the case that f(k) is only absolutely continuous. However, if one uses Riemann integral instead of Lebesgue integral, the assumptions cannot be weakened.

Derivation for the integral form of the remainder

[edit]

Due to the absolute continuity of on the closed interval between and , its derivative exists as an -function, and we can use the fundamental theorem of calculus and integration by parts. This same proof applies for the Riemann integral assuming that is continuous on the closed interval and differentiable on the open interval between and , and this leads to the same result than using the mean value theorem.

The fundamental theorem of calculus states that

Now we can integrate by parts and use the fundamental theorem of calculus again to see that

which is exactly Taylor's theorem with remainder in the integral form in the case . The general statement is proved using induction. Suppose that

Integrating the remainder term by parts we arrive at

Substituting this into the formula in (eq1) shows that if it holds for the value , it must also hold for the value . Therefore, since it holds for , it must hold for every positive integer .

Derivation for the remainder of multivariate Taylor polynomials

[edit]

We prove the special case, where has continuous partial derivatives up to the order in some closed ball with center . The strategy of the proof is to apply the one-variable case of Taylor's theorem to the restriction of to the line segment adjoining and .[17] Parametrize the line segment between and by We apply the one-variable version of Taylor's theorem to the function :

Applying the chain rule for several variables gives

where is the multinomial coefficient. Since , we get:

See also

[edit]

Footnotes

[edit]
  1. ^ (2013). "Linear and quadratic approximation" Retrieved December 6, 2018
  2. ^ Taylor, Brook (1715). Methodus Incrementorum Directa et Inversa [Direct and Reverse Methods of Incrementation] (in Latin). London. p. 21–23 (Prop. VII, Thm. 3, Cor. 2). Translated into English in Struik, D. J. (1969). A Source Book in Mathematics 1200–1800. Cambridge, Massachusetts: Harvard University Press. pp. 329–332.
  3. ^ Kline 1972, pp. 442, 464.
  4. ^ Genocchi, Angelo; Peano, Giuseppe (1884), Calcolo differenziale e principii di calcolo integrale, (N. 67, pp. XVII–XIX): Fratelli Bocca ed.{{citation}}: CS1 maint: location (link)
  5. ^ Spivak, Michael (1994), Calculus (3rd ed.), Houston, TX: Publish or Perish, p. 383, ISBN 978-0-914098-89-8
  6. ^ "Taylor formula", Encyclopedia of Mathematics, EMS Press, 2001 [1994]
  7. ^ The hypothesis of f(k) being continuous on the closed interval between and is not redundant. Although f being k + 1 times differentiable on the open interval between and does imply that f(k) is continuous on the open interval between and , it does not imply that f(k) is continuous on the closed interval between and , i.e. it does not imply that f(k) is continuous at the endpoints of that interval. Consider, for example, the function f : [0,1] → R defined to equal on and with . This is not continuous at 0, but is continuous on . Moreover, one can show that this function has an antiderivative. Therefore that antiderivative is differentiable on , its derivative (the function f) is continuous on the open interval , but its derivative f is not continuous on the closed interval . So the theorem would not apply in this case.
  8. ^ Kline 1998, §20.3; Apostol 1967, §7.7.
  9. ^ Apostol 1967, §7.7.
  10. ^ Apostol 1967, §7.5.
  11. ^ Apostol 1967, §7.6
  12. ^ Rudin 1987, §10.26
  13. ^ This follows from iterated application of the theorem that if the partial derivatives of a function f exist in a neighborhood of a and are continuous at a, then the function is differentiable at a. See, for instance, Apostol 1974, Theorem 12.11.
  14. ^ Königsberger Analysis 2, p. 64 ff.
  15. ^ Folland, G. B. "Higher-Order Derivatives and Taylor's Formula in Several Variables" (PDF). Department of Mathematics | University of Washington. Retrieved 2024-02-21.
  16. ^ Stromberg 1981
  17. ^ Hörmander 1976, pp. 12–13

References

[edit]
[edit]