Jump to content

Lp space: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Das-g (talk | contribs)
When p = 0: Donoho's "norm" counts non-zeros, not zeros
 
(545 intermediate revisions by more than 100 users not shown)
Line 1: Line 1:
{{DISPLAYTITLE:L<sup>''p''</sup> space}}
{{DISPLAYTITLE:''L''<sup>''p''</sup> space}}
{{Short description|Function spaces generalizing finite-dimensional p norm spaces}}
In [[mathematics]], the '''L<sup>''p''</sup> spaces''' are [[function space]]s defined using natural generalizations of ''p''-[[norm (mathematics)|norms]] for finite-dimensional [[vector space]]s. They are sometimes called '''Lebesgue spaces''', named after [[Henri Lebesgue]] {{harv|Dunford|Schwartz|1958|loc=III.3}}, although according to {{harvtxt|Bourbaki|1987}} they were first introduced by {{harvtxt|Riesz|1910}}. They form an important class of examples of [[Banach space]]s in [[functional analysis]], and of [[topological vector space]]s. Lebesgue spaces have applications in physics, statistics, finance, engineering, and other disciplines.
In [[mathematics]], the '''{{math|''L''<sup>''p''</sup>}} spaces''' are [[function space]]s defined using a natural generalization of the [[Norm (mathematics)#p-norm|{{math|''p''}}-norm]] for finite-dimensional [[vector space]]s. They are sometimes called '''Lebesgue spaces''', named after [[Henri Lebesgue]] {{harv|Dunford|Schwartz|1958|loc=III.3}}, although according to the [[Nicolas Bourbaki|Bourbaki]] group {{harv|Bourbaki|1987}} they were first introduced by [[Frigyes Riesz]] {{harv|Riesz|1910}}.


{{math|''L''<sup>''p''</sup>}} spaces form an important class of [[Banach space]]s in [[functional analysis]], and of [[topological vector space]]s. Because of their key role in the mathematical analysis of measure and probability spaces, Lebesgue spaces are used also in the theoretical discussion of problems in physics, statistics, economics, finance, engineering, and other disciplines.
==Motivation==
==Preliminaries==
[[Image:Vector norms.svg|frame|right|Illustrations of [[unit circle]]s in different ''p''-norms (every vector from the origin to the unit circle has a length of one, the length being calculated with length-formula of the corresponding p).]]
===The {{math|''p''}}-norm in finite dimensions===
[[Image:Superellipse rounded diamond.svg|thumb|right|Unit circle ([[superellipse]]) in ''p''&nbsp;=&nbsp;3/2 norm]]
[[Image:Vector-p-Norms qtl1.svg|thumb|right|Illustrations of [[unit circle]]s (see also [[superellipse]]) in <math>\Reals^2</math> based on different <math>p</math>-norms (every vector from the origin to the unit circle has a length of one, the length being calculated with length-formula of the corresponding <math>p</math>).]]


Consider the [[real number|real]] [[vector space]] '''R'''<sup>''n''</sup>. The sum of two [[vector space|vectors]] in '''R'''<sup>''n''</sup> is given by
The Euclidean length of a vector <math>x = (x_1, x_2, \dots, x_n)</math> in the <math>n</math>-dimensional [[real number|real]] [[vector space]] <math>\Reals^n</math> is given by the [[Euclidean norm]]:
:<math>\ (x_1, x_2, \dots, x_n) + (y_1, y_2, \dots, y_n) = (x_1+y_1, x_2+y_2, \dots, x_n+y_n),</math>
<math display="block">\|x\|_2 = \left({x_1}^2 + {x_2}^2 + \dotsb + {x_n}^2\right)^{1/2}.</math>
and the scalar action is given by
:<math>\ \lambda(x_1, x_2, \dots, x_n)=(\lambda x_1, \lambda x_2, \dots, \lambda x_n).</math>


The Euclidean distance between two points <math>x</math> and <math>y</math> is the length <math>\|x - y\|_2</math> of the straight line between the two points. In many situations, the Euclidean distance is appropriate for capturing the actual distances in a given space. In contrast, consider taxi drivers in a grid street plan who should measure distance not in terms of the length of the straight line to their destination, but in terms of the [[taxicab geometry|rectilinear distance]], which takes into account that streets are either orthogonal or parallel to each other. The class of <math>p</math>-norms generalizes these two examples and has an abundance of applications in many parts of [[mathematics]], [[physics]], and [[computer science]].
The length of a vector ''x''&nbsp;= (''x''<sub>1</sub>, ''x''<sub>2</sub>, …, ''x''<sub>''n''</sub>) is usually given by the [[Euclidean norm]]
:<math>\ \|x\|_2=\left(x_1^2+x_2^2+\cdots+x_n^2\right)^{1/2}</math>
but this is by no means the only way of defining length. If ''p'' is a [[real number]], ''p'' ≥ 1, define the ''L''<sup>''p''</sup> norm of ''x'' by
:<math>\ \|x\|_p=\left(|x_1|^p+|x_2|^p+\cdots+|x_n|^p\right)^{1/p}</math>
(so the ''L''<sup>2</sup> norm is the familiar Euclidean norm, while the distance in the ''L''<sup>1</sup> norm is known as the ''[[Manhattan distance]]'').


For a [[real number]] <math>p \geq 1,</math> the '''<math>p</math>-norm''' or '''<math>L^p</math>-norm''' of <math>x</math> is defined by
One also extends this to ''p''&nbsp;= ∞ via
:<math>\ \|x\|_\infty=\max \left\{|x_1|, |x_2|, \ldots, |x_n|\right\}</math>
<math display="block">\|x\|_p = \left(|x_1|^p + |x_2|^p + \dotsb + |x_n|^p\right)^{1/p}.</math>
The absolute value bars can be dropped when <math>p</math> is a rational number with an even numerator in its reduced form, and <math>x</math> is drawn from the set of real numbers, or one of its subsets.
which is in fact the limit of the ''p'' norms for finite ''p''.
The ''L''<sup>∞</sup> norm is also known as the ''[[Chebyshev distance|uniform norm]]''.


The Euclidean norm from above falls into this class and is the <math>2</math>-norm, and the <math>1</math>-norm is the norm that corresponds to the [[taxicab geometry|rectilinear distance]].
It turns out that for all ''p''&nbsp;≥ 1 this definition indeed satisfies the properties of a "length function" (or [[norm (mathematics)|norm]]), which are that:
* only the zero vector has zero length,
* the length of the vector is positive homogeneous with respect to multiplication by a scalar, and
* the length of the sum of two vectors is no larger than the sum of lengths of the vectors ([[triangle inequality]]).


The '''<math>L^\infty</math>-norm''' or [[Chebyshev distance|maximum norm]] (or uniform norm) is the limit of the <math>L^p</math>-norms for <math>p \to \infty</math>, given by:
For any ''p'' ≥ 1, '''R'''<sup>''n''</sup> together with the ''L''<sup>''p''</sup> norm (or simply ''p''-norm) becomes a [[Banach space]].
<math display="block">\|x\|_\infty = \max \left\{|x_1|, |x_2|, \dotsc, |x_n|\right\}</math>


For all <math>p \geq 1,</math> the <math>p</math>-norms and maximum norm satisfy the properties of a "length function" (or [[norm (mathematics)|norm]]), that is:
In '''R''', the L<sup>''p''</sup> norm for ''p'' ≥ 1 reduces to the [[absolute difference]].
*only the zero vector has zero length,
*the length of the vector is positive homogeneous with respect to multiplication by a scalar ([[Euler's homogeneous function theorem|positive homogeneity]]), and
*the length of the sum of two vectors is no larger than the sum of lengths of the vectors ([[triangle inequality]]).
Abstractly speaking, this means that <math>\Reals^n</math> together with the <math>p</math>-norm is a [[normed vector space]]. Moreover, it turns out that this space is [[Complete_metric_space|complete]], thus making it a [[Banach space]].


=== When 0 < ''p'' < 1 ===
====Relations between {{math|''p''}}-norms====
In '''R'''<sup>''n''</sup> for ''n''&nbsp;> 1, the formula
:<math>\ \|x\|_p=\left(|x_1|^p+|x_2|^p+\cdots+|x_n|^p\right)^{1/p}</math>
defines an absolutely [[homogeneous function]] for 0&nbsp;< ''p''&nbsp;< 1; however, the resulting function does not define an F-norm, because it is not subadditive.
In '''R'''<sup>''n''</sup> for ''n''&nbsp;> 1, the formula for 0&nbsp;< ''p''&nbsp;< 1
:<math>\ \|x\|_p=\left(|x_1|^p+|x_2|^p+\cdots+|x_n|^p\right)</math>
defines a subadditive function, which does define an F-norm. This F-norm is not homogeneous.


The grid distance or rectilinear distance (sometimes called the "[[Manhattan distance]]") between two points is never shorter than the length of the line segment between them (the Euclidean or "as the crow flies" distance). Formally, this means that the Euclidean norm of any vector is bounded by its 1-norm:
However, the function
<math display="block">\|x\|_2 \leq \|x\|_1 .</math>


This fact generalizes to <math>p</math>-norms in that the <math>p</math>-norm <math>\|x\|_p</math> of any given vector <math>x</math> does not grow with <math>p</math>:
:<math>d_p(x,y) = \sum_{i=1}^n |x_i-y_i|^p</math>
{{block indent | em = 1.5 | text = <math>\|x\|_{p+a} \leq \|x\|_p</math> for any vector <math>x</math> and real numbers <math>p \geq 1</math> and <math>a \geq 0.</math> (In fact this remains true for <math>0 < p < 1</math> and <math>a \geq 0</math> .)}}


For the opposite direction, the following relation between the <math>1</math>-norm and the <math>2</math>-norm is known:
defines a [[metric space|metric]]. The metric space ('''R'''<sup>''n''</sup>, ''d''<sub>''p''</sub>) is denoted by ℓ<sub>''n''</sub><sup>''p''</sup>.
<math display="block">\|x\|_1 \leq \sqrt{n} \|x\|_2 ~.</math>


This inequality depends on the dimension <math>n</math> of the underlying vector space and follows directly from the [[Cauchy–Schwarz inequality]].
Although the ''p''-unit ball ''B''<sub>''n''</sub><sup>''p''</sup> around the origin in this metric is "concave", the topology defined on '''R'''<sup>''n''</sup> by the metric ''d''<sub>''p''</sup> is the usual vector space topology of '''R'''<sup>''n''</sup>, hence ℓ<sub>''n''</sub><sup>''p''</sup> is a [[locally convex]] topological vector space. Beyond this qualitative statement, a quantitative way to measure the lack of convexity of ℓ<sub>''n''</sub><sup>''p''</sup> is to denote by ''C''<sub>''p''</sub>(''n'') the smallest constant ''C'' such that the multiple ''C''&nbsp;''B''<sub>''n''</sub><sup>''p''</sup> of the ''p''-unit ball contains the convex hull of ''B''<sub>''n''</sub><sup>''p''</sup>, equal to ''B''<sub>''n''</sub><sup>1</sup>. The fact that ''C''<sub>''p''</sub>(''n'')&nbsp;= ''n''<sup>1/''p''&nbsp;&ndash;&nbsp;1</sup> tends to infinity with ''n'' (for fixed ''p''&nbsp;< 1) reflects the fact that the infinite-dimensional sequence space ℓ<sup>''p''</sup> defined below, is no longer locally convex.


In general, for vectors in <math>\Complex^n</math> where <math>0 < r < p:</math>
===When ''p'' = 0===
<math display="block">\|x\|_p \leq \|x\|_r \leq n^{\frac{1}{r} - \frac{1}{p}} \|x\|_p ~.</math>
There is one l0 norm and another function called the l0 "norm" (with scare quotation marks).


This is a consequence of [[Hölder's inequality]].
The mathematical definition of the l0 norm was established by [[Banach]]'s ''[[Theory of Linear Operations]]''. The [[F-space|space]] of sequences has a complete metric topology provided by the [[F-space|F–norm]] <math>(x_n) \mapsto \sum_n{2^{-n} |x_n|/(1+ |x_n| )}</math>, which is discussed by Stefan Rolewicz in ''Metric Linear Spaces''.<ref name="RolewiczControl"> {{Citation | title=Functional analysis and control theory: Linear systems|last=Rolewicz |first=Stefan|year=1987| isbn=9027721866| publisher=D. Reidel Publishing Co.; PWN—Polish Scientific Publishers|oclc=13064804|edition=Translated from the Polish by Ewa Bednarczuk|series=Mathematics and its Applications (East European Series)|location=Dordrecht; Warsaw|volume=29|pages=xvi+524|id=| | mr=920371}} </ref> The l0-normed space is studied in functional analysis, probability theory, and harmonic analysis.


====When {{math|0 < ''p'' < 1}}====
Another function was called the l0 "norm" by David Donoho, whose quotation marks warn that this function is not a proper norm. Some later authors [[abuse of terminology|abuse terminology]] by omitting the quotation marks, alas. Donoho suggested the terminology ''p''-'''"'''norm'''"''' ''locally'', by taking the limit of the lp norm, ''on bounded sets'', as ''p'' approaches zero
:<math>\ (x_n) \mapsto \lim_{p\downarrow 0}\left(|x_1|^p+|x_2|^p+\cdots+|x_n|^p\right) </math>
which is the number of non-zero entries of the vector ''x''. Defining 0<sup>0</sup>=0, Donoho's zero "norm" of ''x'' is equal to <math>|x_1|^0+|x_2|^0+\cdots+|x_n|^0</math>. This is not a [[norm (mathematics)|norm]], because it is not continuous with respect to scalar-vector multiplication (as the scalar approaches zero); it is not a proper norm (B-norm, with "B" for [[Banach]]) because it is not homogeneous. Despite these defects as a mathematical norm, Donoho's non-zero counting "norm" (with quotation marks) has uses in [[scientific computing]], [[information theory]], and [[statistics]]---notably in [[compressed sensing]] in [[signal processing]] and computational [[harmonic analysis]].
[[Image:Astroid.svg|thumb|right|[[Astroid]], unit circle in ''p''&nbsp;=&nbsp;2/3 metric]]


[[Image:Astroid.svg|thumb|right|[[Astroid]], unit circle in <math>p = \tfrac{2}{3}</math> metric]]
==<!-- hidden span to make ToC display properly --><span style="display:none;">l<sup>p</sup></span><math>\ell^p</math> spaces==
In <math>\Reals^n</math> for <math>n > 1,</math> the formula
:{{Details|Sequence space}}
<math display="block">\|x\|_p = \left(|x_1|^p + |x_2| ^p + \cdots + |x_n|^p\right)^{1/p}</math>
The above ''p''-norm can be extended to vectors having an infinite number of components, yielding the space <math>\ell^p</math>. This contains as special cases:
defines an absolutely [[homogeneous function]] for <math>0 < p < 1;</math> however, the resulting function does not define a norm, because it is not [[subadditivity|subadditive]]. On the other hand, the formula
* <math>\ell^1</math>, the space of sequences whose series is [[Absolute convergence|absolutely convergent]],
<math display="block">|x_1|^p + |x_2|^p + \dotsb + |x_n|^p</math>
* <math>\ell^2</math>, the space of '''square-summable''' sequences, which is a [[Hilbert space]], and
defines a subadditive function at the cost of losing absolute homogeneity. It does define an [[F-space|F-norm]], though, which is homogeneous of degree <math>p.</math>
* <math>\ell^\infty</math>, the space of [[bounded sequence]]s.


Hence, the function
The space of sequences has a natural vector space structure by applying addition and scalar multiplication coordinate by coordinate.
<math display="block">d_p(x, y) = \sum_{i=1}^n |x_i - y_i|^p</math>
Explicitly, for <math>\ x=(x_1, x_2, \dots, x_n, x_{n+1},\dots)</math> an infinite [[sequence]] of real (or [[complex number|complex]]) numbers, define the vector sum to be
defines a [[metric space|metric]]. The [[metric space]] <math>(\Reals^n, d_p)</math> is denoted by <math>\ell_n^p.</math>
:<math>\ (x_1, x_2, \dots, x_n, x_{n+1},\dots)+(y_1, y_2, \dots, y_n, y_{n+1},\dots)=(x_1+y_1, x_2+y_2, \dots, x_n+y_n, x_{n+1}+y_{n+1},\dots),</math>
while the scalar action is given by
:<math>\ \lambda(x_1, x_2, \dots, x_n, x_{n+1},\dots) = (\lambda x_1, \lambda x_2, \dots, \lambda x_n, \lambda x_{n+1},\dots).</math>


Although the <math>p</math>-unit ball <math>B_n^p</math> around the origin in this metric is "concave", the topology defined on <math>\Reals^n</math> by the metric <math>B_p</math> is the usual vector space topology of <math>\Reals^n,</math> hence <math>\ell_n^p</math> is a [[locally convex]] topological vector space. Beyond this qualitative statement, a quantitative way to measure the lack of convexity of <math>\ell_n^p</math> is to denote by <math>C_p(n)</math> the smallest constant <math>C</math> such that the scalar multiple <math>C \, B_n^p</math> of the <math>p</math>-unit ball contains the convex hull of <math>B_n^p,</math> which is equal to <math>B_n^1.</math> The fact that for fixed <math>p < 1</math> we have
Define the ''p''-norm
<math display="block">C_p(n) = n^{\tfrac{1}{p} - 1} \to \infty, \quad \text{as } n \to \infty</math>
shows that the infinite-dimensional sequence space <math>\ell^p</math> defined below, is no longer locally convex.{{citation needed|date=November 2015}}


====When {{math|1=''p'' = 0}}====
:<math>\ \|x\|_p=\left(|x_1|^p+|x_2|^p+\cdots+|x_n|^p+|x_{n+1}|^p+\cdots\right)^{1/p}.</math>


There is one <math>\ell_0</math> norm and another function called the <math>\ell_0</math> "norm" (with quotation marks).
Here, a complication arises, namely that the [[series (mathematics)|series]] on the right is not always convergent, so for example, the sequence made up of only ones, (1,&nbsp;1,&nbsp;1,&nbsp;...), will have an infinite ''p''-norm (length) for every finite ''p''&nbsp;≥&nbsp;1. The space ℓ<sup>''p''</sup> is then defined as the set of all infinite sequences of real (or complex) numbers such that the ''p''-norm is finite.


The mathematical definition of the <math>\ell_0</math> norm was established by [[Stefan Banach|Banach]]'s ''[[Theory of Linear Operations]]''. The [[F-space|space]] of sequences has a complete metric topology provided by the [[F-space|F-norm]] on the [[Metric_space#Product_metric_spaces|product metric]]:{{Citation needed|date=December 2024}}
One can check that as ''p'' increases, the set ℓ<sup>''p''</sup> grows larger. For example, the sequence
<math display="block">(x_n) \mapsto \|x\|:=d(0,x)=\sum_n 2^{-n} \frac{|x_n|}{1 +|x_n|}.</math>
The <math>\ell_0</math>-normed space is studied in functional analysis, probability theory, and harmonic analysis.


Another function was called the <math>\ell_0</math> "norm" by [[David Donoho]]—whose quotation marks warn that this function is not a proper norm—is the number of non-zero entries of the vector <math>x.</math>{{Citation needed|date=September 2022}} Many authors [[abuse of terminology|abuse terminology]] by omitting the quotation marks. Defining [[zero to the power of zero|<math>0^0 = 0,</math>]] the zero "norm" of <math>x</math> is equal to
:<math>\ \left(1, \frac{1}{2}, \dots, \frac{1}{n}, \frac{1}{n+1},\dots\right)</math>
<math display="block">|x_1|^0 + |x_2|^0 + \cdots + |x_n|^0 .</math>


[[File:Lp space animation.gif|alt=An animated gif of p-norms 0.1 through 2 with a step of 0.05.|thumb|An animated gif of p-norms 0.1 through 2 with a step of 0.05.]]
is not in ℓ<sup>1</sup>, but it is in ℓ<sup>''p''</sup> for ''p''&nbsp;> 1, as the series
This is not a [[norm (mathematics)|norm]] because it is not [[Homogeneous function|homogeneous]]. For example, scaling the vector <math>x</math> by a positive constant does not change the "norm". Despite these defects as a mathematical norm, the non-zero counting "norm" has uses in [[scientific computing]], [[information theory]], and [[statistics]]–notably in [[compressed sensing]] in [[signal processing]] and computational [[harmonic analysis]]. Despite not being a norm, the associated metric, known as [[Hamming distance]], is a valid distance, since homogeneity is not required for distances.
:<math>\ 1^p+\frac{1}{2^p} + \cdots + \frac{1}{n^p} + \frac{1}{(n+1)^p}+\cdots</math>
diverges for ''p''&nbsp;= 1 (the [[harmonic series (mathematics)|harmonic series]]), but is convergent for ''p''&nbsp;> 1.


=={{math|''ℓ''{{i sup|''p''}}}} spaces and sequence spaces==
One also defines the ∞-norm as
{{Details|Sequence space}}
:<math>\ \|x\|_\infty=\sup(|x_1|, |x_2|, \dots, |x_n|,|x_{n+1}|, \dots)</math>
The <math>p</math>-norm can be extended to vectors that have an infinite number of components ([[sequence]]s), which yields the space <math>\ell^p.</math> This contains as special cases:
and the corresponding space ℓ<sup>∞</sup> of all bounded sequences. It turns out that {{Citation needed|date=January 2011}}
* <math>\ell^1,</math> the space of sequences whose series are [[absolute convergence|absolutely convergent]],
:<math>\ \|x\|_\infty=\lim_{p\to\infty}\|x\|_p</math>
* <math>\ell^2,</math> the space of '''square-summable''' sequences, which is a [[Hilbert space]], and
if the right-hand side is finite, or the left-hand side is infinite.
* <math>\ell^\infty,</math> the space of [[bounded sequence]]s.
Thus, we will consider ℓ<sup>''p''</sup> spaces for 1&nbsp;≤ ''p''&nbsp;≤ ∞.


The space of sequences has a natural vector space structure by applying scalar addition and multiplication. Explicitly, the vector sum and the scalar action for infinite [[sequence]]s of real (or [[complex number|complex]]) numbers are given by:
The ''p''-norm thus defined on ℓ<sup>''p''</sup> is indeed a norm, and ℓ<sup>''p''</sup> together with this norm is a [[Banach space]]. The fully general ''L''<sup>''p''</sup> space is obtained &mdash; as seen below &mdash; by considering vectors, not only with finitely or countably-infinitely many components, but with "''arbitrarily many components''"; in other words, [[function (mathematics)|functions]]. An [[integral]] instead of a sum is used to define the ''p''-norm.
<math display="block">\begin{align}
& (x_1, x_2, \ldots, x_n, x_{n+1},\ldots)+(y_1, y_2, \ldots, y_n, y_{n+1},\ldots) \\
= {} & (x_1+y_1, x_2+y_2, \ldots, x_n+y_n, x_{n+1}+y_{n+1},\ldots), \\[6pt]
& \lambda \cdot \left (x_1, x_2, \ldots, x_n, x_{n+1},\ldots \right) \\
= {} & (\lambda x_1, \lambda x_2, \ldots, \lambda x_n, \lambda x_{n+1},\ldots).
\end{align}</math>


Define the <math>p</math>-norm:
== ''L<sup>p</sup>'' spaces ==
<math display="block">\|x\|_p = \left(|x_1|^p + |x_2|^p + \cdots +|x_n|^p + |x_{n+1}|^p + \cdots\right)^{1/p}</math>
Let 1&nbsp;≤ ''p''&nbsp;< ∞ and (''S'', ''Σ'', ''μ'') be a [[measure space]]. Consider the set of all [[measurable function]]s from ''S'' to '''C''' (or '''R''') whose [[absolute value]] raised to the ''p''-th power has finite integral, or equivalently, that


Here, a complication arises, namely that the [[series (mathematics)|series]] on the right is not always convergent, so for example, the sequence made up of only ones, <math>(1, 1, 1, \ldots),</math> will have an infinite <math>p</math>-norm for <math>1 \leq p < \infty.</math> The space <math>\ell^p</math> is then defined as the set of all infinite sequences of real (or complex) numbers such that the <math>p</math>-norm is finite.
:<math>\|f\|_p := \left({\int_S |f|^p\;\mathrm{d}\mu}\right)^{1/p}<\infty. </math>


One can check that as <math>p</math> increases, the set <math>\ell^p</math> grows larger. For example, the sequence
The set of such functions forms a vector space, with the following natural operations:
<math display="block">\left(1, \frac{1}{2}, \ldots, \frac{1}{n}, \frac{1}{n+1}, \ldots\right)</math>
is not in <math>\ell^1,</math> but it is in <math>\ell^p</math> for <math>p > 1,</math> as the series
<math display="block">1^p + \frac{1}{2^p} + \cdots + \frac{1}{n^p} + \frac{1}{(n+1)^p} + \cdots,</math>
diverges for <math>p = 1</math> (the [[harmonic series (mathematics)|harmonic series]]), but is convergent for <math>p > 1.</math>


One also defines the <math>\infty</math>-norm using the [[supremum]]:
:<math>(f+g)(x)=f(x)+g(x), \ \ \ \text{and} \ \ \ (\lambda f)(x) = \lambda f(x) \,</math>
<math display="block">\|x\|_\infty = \sup(|x_1|, |x_2|, \dotsc, |x_n|,|x_{n+1}|, \ldots)</math>
for every scalar ''λ''.
and the corresponding space <math>\ell^\infty</math> of all bounded sequences. It turns out that<ref>{{Citation| last1=Maddox | first1=I. J. | title=Elements of Functional Analysis | publisher=CUP | location=Cambridge | edition=2nd | year=1988}}, page 16</ref>
<math display="block">\|x\|_\infty = \lim_{p \to \infty} \|x\|_p</math>
if the right-hand side is finite, or the left-hand side is infinite. Thus, we will consider <math>\ell^p</math> spaces for <math>1 \leq p \leq \infty.</math>


The <math>p</math>-norm thus defined on <math>\ell^p</math> is indeed a norm, and <math>\ell^p</math> together with this norm is a [[Banach space]].
:<math></math>


===General ℓ<sup>''p''</sup>-space===
That the sum of two ''p''<sup>th</sup> power integrable functions is again ''p''<sup>th</sup> power integrable follows from the inequality |''f''&nbsp;+ ''g''|<sup>''p''</sup>&nbsp;≤ 2<sup>''p''</sup> (|''f''|<sup>''p''</sup>&nbsp;+ |''g''|<sup>''p''</sup>). In fact, more is true. [[Minkowski inequality|Minkowski's inequality]] says the [[triangle inequality]] holds for ||&nbsp;.&nbsp;||<sub>''p''</sub>. Thus the set of ''p''<sup>th</sup> power integrable functions, together with the function ||&thinsp;.&thinsp;||<sub>''p''</sub>, is a [[seminorm]]ed vector space, which is denoted by <math>\mathcal{L}^p(S, \mu)</math>.


In complete analogy to the preceding definition one can define the space <math>\ell^p(I)</math> over a general [[index set]] <math>I</math> (and <math>1 \leq p < \infty</math>) as
This can be made into a normed vector space in a standard way; one simply takes the [[Quotient Space|quotient space]] with respect to the [[Kernel (mathematics)|kernel]] of ||&nbsp;·&nbsp;||<sub>''p''</sub>. Since for any measurable function ''f'', we have that ||''f''||<sub>''p''</sub>&nbsp;= 0 if and only if ''f''&nbsp;= 0 [[almost everywhere]], the kernel of ||&nbsp;.&nbsp;||<sub>''p''</sub> does not depend upon ''p'',
:<math>N := \mathrm{ker}(\|\cdot\|_p) = \{f : f = 0 \ \mu\text{-almost everywhere} \}.</math>
<math display="block">\ell^p(I) = \left\{(x_i)_{i\in I} \in \mathbb{K}^I : \sum_{i \in I} |x_i|^p < +\infty\right\},</math>
where convergence on the right means that only countably many summands are nonzero (see also [[Unconditional convergence]]).
In the quotient space, two functions ''f'' and ''g'' are identified if ''f''&nbsp;= ''g'' almost everywhere. The resulting normed vector space is, by definition,
With the norm
<math display="block">\|x\|_p = \left(\sum_{i\in I} |x_i|^p\right)^{1/p}</math>
the space <math>\ell^p(I)</math> becomes a Banach space.
In the case where <math>I</math> is finite with <math> n</math> elements, this construction yields <math>\Reals^n</math> with the <math>p</math>-norm defined above.
If <math>I</math> is countably infinite, this is exactly the sequence space <math>\ell^p</math> defined above.
For uncountable sets <math>I</math> this is a non-[[Separable space|separable]] Banach space which can be seen as the [[Locally convex topological vector space|locally convex]] [[direct limit]] of <math>\ell^p</math>-sequence spaces.<ref>Rafael Dahmen, Gábor Lukács: ''Long colimits of topological groups I: Continuous maps and homeomorphisms.'' in: ''Topology and its Applications'' Nr. 270, 2020. Example 2.14 </ref>


For <math>p = 2,</math> the <math>\|\,\cdot\,\|_2</math>-norm is even induced by a canonical [[inner product]] <math>\langle \,\cdot,\,\cdot\rangle,</math> called the ''{{visible anchor|Euclidean inner product}}'', which means that <math>\|\mathbf{x}\|_2 = \sqrt{\langle\mathbf{x}, \mathbf{x}\rangle}</math> holds for all vectors <math>\mathbf{x}.</math> This inner product can expressed in terms of the norm by using the [[polarization identity]].
:<math>L^p(S, \mu) := \mathcal{L}^p(S, \mu) / N .</math>
On <math>\ell^2,</math> it can be defined by
<math display="block">\langle \left(x_i\right)_{i}, \left(y_n\right)_{i} \rangle_{\ell^2} ~=~ \sum_i x_i \overline{y_i}.</math>
Now consider the case <math>p = \infty.</math> Define{{refn|group=note|The condition <math>\sup\operatorname{range} |x| < + \infty.</math> is not equivalent to <math>\sup\operatorname{range} |x|</math> being finite, unless <math>X \neq \varnothing.</math>}}
<math display="block">\ell^\infty(I)=\{x\in \mathbb K^I : \sup\operatorname{range}|x|<+\infty\},</math>
where for all <math>x</math><ref>{{cite book|last1=Garling|first1=D. J. H.|title=Inequalities: A Journey into Linear Analysis|date=2007|publisher=Cambridge University Press|isbn=978-0-521-87624-7|page=54}}</ref>{{refn|group=note|If <math>X = \varnothing</math> then <math>\sup\operatorname{range} |x| = - \infty.</math>}}
<math display="block">\|x\|_\infty\equiv\inf\{C \in \Reals_{\geq 0}:|x_i| \leq C\text{ for all } i \in I\} = \begin{cases}\sup\operatorname{range}|x|&\text{if } X\neq\varnothing,\\0&\text{if } X=\varnothing.\end{cases}</math>


The index set <math>I</math> can be turned into a [[measure space]] by giving it the [[Σ-algebra#Simple set-based examples|discrete σ-algebra]] and the [[counting measure]]. Then the space <math>\ell^p(I)</math> is just a special case of the more general <math>L^p</math>-space (defined below).
For ''p''&nbsp;= ∞, the space ''L''<sup>∞</sup>(''S'', ''μ'') is defined as follows. We start with the set of all measurable functions from ''S'' to '''C''' (or '''R''') which are '''essentially bounded''', i.e. bounded up to a set of measure zero. Again two such functions are identified if they are equal almost everywhere. Denote this set by ''L''<sup>∞</sup>(''S'', ''μ''). For ''f'' in ''L''<sup>∞</sup>(''S'', ''μ''), its [[essential supremum]] serves as an appropriate norm:


==''L<sup>p</sup>'' spaces and Lebesgue integrals==
:<math>\|f\|_\infty := \inf \{ C\ge 0 : |f(x)| \le C \mbox{ for almost every } x\}.</math>


An <math>L^p</math> space may be defined as a space of measurable functions for which the <math>p</math>-th power of the [[absolute value]] is [[Lebesgue integrable]], where functions which agree almost everywhere are identified. More generally, let <math>(S, \Sigma, \mu)</math> be a [[measure space]] and <math>1 \leq p \leq \infty.</math><ref group=note>The definitions of <math>\|\cdot\|_p,</math> <math>\mathcal{L}^p(S,\, \mu),</math> and <math>L^p(S,\, \mu)</math> can be extended to all <math>0 < p \leq \infty</math> (rather than just <math>1 \leq p \leq \infty</math>), but it is only when <math>1 \leq p \leq \infty</math> that <math>\|\cdot\|_p</math> is guaranteed to be a norm (although <math>\|\cdot\|_p</math> is a [[quasi-seminorm]] for all <math>0 < p \leq \infty,</math>).</ref>
As before, we have
When <math>p \neq \infty</math>, consider the set <math>\mathcal{L}^p(S,\, \mu)</math> of all [[measurable function]]s <math>f</math> from <math>S</math> to <math>\Complex</math> or <math>\Reals</math> whose [[absolute value]] raised to the <math>p</math>-th power has a finite integral, or in symbols:{{sfn|Rudin|1987|p=65}}
<math display="block">\|f\|_p ~\stackrel{\scriptscriptstyle\text{def}}{=}~ \left(\int_S |f|^p\;\mathrm{d}\mu\right)^{1/p} < \infty.</math>


To define the set for <math>p = \infty,</math> recall that two functions <math>f</math> and <math>g</math> defined on <math>S</math> are said to be {{em|equal [[almost everywhere]]}}, written {{em|<math>f = g</math> a.e.}}, if the set <math>\{s \in S : f(s) \neq g(s)\}</math> is measurable and has measure zero.
:<math>\|f\|_\infty=\lim_{p\to\infty}\|f\|_p</math>
Similarly, a measurable function <math>f</math> (and its [[absolute value]]) is {{em|bounded}} (or {{em|dominated}}) {{em|almost everywhere}} by a real number <math>C,</math> written {{em|<math>|f| \leq C</math> a.e.}}, if the (necessarily) measurable set <math>\{s \in S : |f(s)| > C\}</math> has measure zero.
The space <math>\mathcal{L}^\infty(S,\mu)</math> is the set of all measurable functions <math>f</math> that are bounded almost everywhere (by some real <math>C</math>) and <math>\|f\|_\infty</math> is defined as the [[infimum]] of these bounds:
<math display="block">\|f\|_\infty ~\stackrel{\scriptscriptstyle\text{def}}{=}~ \inf \{C \in \Reals_{\geq 0} : |f(s)| \leq C \text{ for almost every } s\}.</math>
When <math>\mu(S) \neq 0</math> then this is the same as the [[essential supremum]] of the absolute value of <math>f</math>:{{refn|group=note|If <math>\mu(S) = 0</math> then <math>\operatorname{ess}\sup|f| = -\infty.</math>}}
<math display="block">\|f\|_\infty ~=~ \begin{cases}\operatorname{ess}\sup|f| & \text{if } \mu(S) > 0,\\ 0 & \text{if } \mu(S) = 0.\end{cases}</math>


For example, if <math>f</math> is a measurable function that is equal to <math>0</math> almost everywhere<ref group=note name=Non0Value0Example>For example, if a non-empty measurable set <math>N \neq \varnothing</math> of measure <math>\mu(N) = 0</math> exists then its [[indicator function]] <math>\mathbf{1}_N</math> satisfies <math>\|\mathbf{1}_N\|_p = 0</math> although <math>\mathbf{1}_N \neq 0.</math></ref> then <math>\|f\|_p = 0</math> for every <math>p</math> and thus <math>f \in \mathcal{L}^p(S,\, \mu)</math> for all <math>p.</math>
if ''f'' ∈ ''L''<sup>∞</sup>(''S'', ''μ'') ∩ ''L''<sup>''q''</sup>(''S'', ''μ'') for some ''q''&nbsp;< ∞.


For every positive <math>p,</math> the value under <math>\|\,\cdot\,\|_p</math> of a measurable function <math>f</math> and its absolute value <math>|f| : S \to [0, \infty]</math> are always the same (that is, <math>\|f\|_p = \||f|\|_p</math> for all <math>p</math>) and so a measurable function belongs to <math>\mathcal{L}^p(S,\, \mu)</math> if and only if its absolute value does. Because of this, many formulas involving <math>p</math>-norms are stated only for non-negative real-valued functions. Consider for example the identity <math>\|f\|_p^r = \|f^r\|_{p/r},</math> which holds whenever <math>f \geq 0</math> is measurable, <math>r > 0</math> is real, and <math>0 < p \leq \infty</math> (here <math>\infty / r \;\stackrel{\scriptscriptstyle\text{def}}{=}\; \infty</math> when <math>p = \infty</math>). The non-negativity requirement <math>f \geq 0</math> can be removed by substituting <math>|f|</math> in for <math>f,</math> which gives <math>\|\,|f|\,\|_p^r = \|\,|f|^r\,\|_{p/r}.</math>
For 1&nbsp;≤ ''p''&nbsp;≤ ∞, ''L''<sup>''p''</sup>(''S'', ''μ'') is a [[Banach space]]. The fact that ''L''<sup>''p''</sup> is ''complete'' is often referred to as ''[[Riesz-Fischer theorem]]''. Completeness can be checked using the convergence theorems for Lebesgue integrals.
Note in particular that when <math>p = r</math> is finite then the formula <math>\|f\|_p^p = \||f|^p\|_1</math> relates the <math>p</math>-norm to the <math>1</math>-norm.


'''Seminormed space of <math>p</math>-th power integrable functions'''
When the underlying measure space ''S'' is understood, ''L''<sup>''p''</sup>(''S'', ''μ'') is often abbreviated ''L''<sup>''p''</sup>(''μ''), or just ''L''<sup>''p''</sup>. The above definitions generalize to [[Bochner space]]s.


Each set of functions <math>\mathcal{L}^p(S,\, \mu)</math> forms a [[vector space]] when addition and scalar multiplication are defined pointwise.<ref group=note>Explicitly, the vector space operations are defined by:
=== Special cases ===
<math display="block">\begin{align}
(f+g)(x) &= f(x)+g(x), \\
(s f)(x) &= s f(x)
\end{align}</math>
for all <math>f, g \in \mathcal{L}^p(S,\, \mu)</math> and all scalars <math>s.</math> These operations make <math>\mathcal{L}^p(S,\, \mu)</math> into a vector space because if <math>s</math> is any scalar and <math>f, g \in \mathcal{L}^p(S,\, \mu)</math> then both <math>s f</math> and <math>f + g</math> also belong to <math>\mathcal{L}^p(S,\, \mu).</math></ref>
That the sum of two <math>p</math>-th power integrable functions <math>f</math> and <math>g</math> is again <math>p</math>-th power integrable follows from <math display=inline>\|f + g\|_p^p \leq 2^{p-1} \left(\|f\|_p^p + \|g\|_p^p\right),</math><ref group=proof name=UpperBoundForNormOfSum>When <math>1 \leq p < \infty,</math> the inequality <math>\|f + g\|_p^p \leq 2^{p-1} \left(\|f\|_p^p + \|g\|_p^p\right)</math> can be deduced from the fact that the function <math>F : [0, \infty) \to \Reals</math> defined by <math>F(t) = t^p</math> is [[Convex function|convex]], which by definition means that <math>F(t x + (1 - t) y) \leq t F(x) + (1 - t) F(y)</math> for all <math>0 \leq t \leq 1</math> and all <math>x, y</math> in the domain of <math>F.</math> Substituting <math>|f|, |g|,</math> and <math>\tfrac{1}{2}</math> in for <math>x, y,</math> and <math>t</math> gives <math>\left(\tfrac{1}{2}|f| + \tfrac{1}{2}|g|\right)^p \leq \tfrac{1}{2} |f|^p + \tfrac{1}{2} |g|^p,</math> which proves that <math>(|f| + |g|)^p \leq 2^{p-1} (|f|^p + |g|^p).</math> The triangle inequality <math>|f + g| \leq |f| + |g|</math> now implies <math>|f + g|^p \leq 2^{p-1} (|f|^p + |g|^p).</math> The desired inequality follows by integrating both sides. <math>\blacksquare</math></ref>
although it is also a consequence of ''[[Minkowski inequality|Minkowski's inequality]]''
<math display="block">\|f + g\|_p \leq \|f\|_p + \|g\|_p</math>
which establishes that <math>\|\cdot\|_p</math> satisfies the [[triangle inequality]] for <math>1 \leq p \leq \infty</math> (the triangle inequality does not hold for <math>0 < p < 1</math>).
That <math>\mathcal{L}^p(S,\, \mu)</math> is closed under scalar multiplication is due to <math>\|\cdot\|_p</math> being [[Absolute homogeneity|absolutely homogeneous]], which means that <math>\|s f\|_p = |s| \|f\|_p</math> for every scalar <math>s</math> and every function <math>f.</math>


[[Absolute homogeneity]], the [[triangle inequality]], and non-negativity are the defining properties of a [[seminorm]].
When ''p'' = 2; like the ℓ<sup>2</sup> space, the space ''L''<sup>2</sup> is the only [[Hilbert space]] of this class. In the complex case, the inner product on ''L''<sup>2</sup> is defined by
Thus <math>\|\cdot\|_p</math> is a seminorm and the set <math>\mathcal{L}^p(S,\, \mu)</math> of <math>p</math>-th power integrable functions together with the function <math>\|\cdot\|_p</math> defines a [[seminormed vector space]]. In general, the [[seminorm]] <math>\|\cdot\|_p</math> is not a [[Norm (mathematics)|norm]] because there might exist measurable functions <math>f</math> that satisfy <math>\|f\|_p = 0</math> but are not {{em|identically}} equal to <math>0</math><ref group=note name=Non0Value0Example /> (<math>\|\cdot\|_p</math> is a norm if and only if no such <math>f</math> exists).
:<math> \langle f, g \rangle = \int_S f(x) \overline{g(x)} \, \mathrm{d}\mu(x).</math>
The additional inner product structure allows for a richer theory, with applications to, for instance, [[Fourier series]] and [[quantum mechanics]]. Functions in ''L''<sup>2</sup> are sometimes called '''[[quadratically integrable function]]s''', '''square-integrable functions''' or '''square-summable functions''', but sometimes these terms are reserved for functions that are square-integrable in some other sense, such as in the sense of a [[Riemann integral]] {{harv|Titchmarsh|1976}}.


'''Zero sets of <math>p</math>-seminorms'''
If we use complex-valued functions, the space ''L''<sup>∞</sup> is a [[commutative]] [[C*-algebra]] with pointwise multiplication and conjugation. For many measure spaces, including all sigma-finite ones, it is in fact a commutative [[von Neumann algebra]]. An element of ''L''<sup>∞</sup> defines a [[bounded operator]] on any ''L''<sup>''p''</sup> space by [[multiplication operator|multiplication]].


{{anchor|kernel}}
The ℓ<sup>''p''</sup> spaces (1&nbsp;≤ ''p''&nbsp;≤ ∞) are a special case of ''L<sup>p</sup>'' spaces, when ''S'' is the set '''N''' of positive [[integer]]s, and the measure ''μ'' is the [[counting measure]] on '''N'''. More generally, if one considers any set ''S'' with the counting measure, the resulting ''L<sup>&nbsp;p</sup>'' space is denoted ℓ<sup>''p''</sup>(''S''). For example, the space ℓ<sup>''p''</sup>('''Z''') is the space of all sequences indexed by the integers, and when defining the ''p''-norm on such a space, one sums over all the integers. The space ℓ<sup>''p''</sup>(''n''), where ''n'' is the set with ''n'' elements, is '''R'''<sup>''n''</sup> with its ''p''-norm as defined above. As any Hilbert space, every space ''L''<sup>2</sup> is linearly isometric to a suitable ℓ<sup>2</sup>(''I''), where the cardinality of the set ''I'' is the cardinality of an arbitrary Hilbertian basis for this particular ''L''<sup>2</sup>.
If <math>f</math> is measurable and equals <math>0</math> a.e. then <math>\|f\|_p = 0</math> for all positive <math>p \leq \infty.</math>
On the other hand, if <math>f</math> is a measurable function for which there exists some <math>0 < p \leq \infty</math> such that <math>\|f\|_p = 0</math> then <math>f = 0</math> almost everywhere. When <math>p</math> is finite then this follows from the <math>p = 1</math> case and the formula <math>\|f\|_p^p = \||f|^p\|_1</math> mentioned above. <!--(this formula itself follows from <math>\|f\|_p^r = \|f^r\|_{p/r},</math> which holds whenever <math>f \geq 0</math> is measurable, <math>r > 0</math> is real, and <math>0 < p \leq \infty</math> (where <math>\infty / r \;\stackrel{\scriptscriptstyle\text{def}}{=}\; \infty</math> when <math>p = \infty</math>)). -->


Thus if <math>p \leq \infty</math> is positive and <math>f</math> is any measurable function, then <math>\|f\|_p = 0</math> if and only if <math>f = 0</math> [[almost everywhere]]. Since the right hand side (<math>f = 0</math> a.e.) does not mention <math>p,</math> it follows that all <math>\|\cdot\|_p</math> have the same [[zero set]] (it does not depend on <math>p</math>). So denote this common set by
==Properties of ''L''<sup>''p''</sup> spaces==
<math display="block">\mathcal{N} \;\stackrel{\scriptscriptstyle\text{def}}{=}\; \{f : f = 0 \ \mu\text{-almost everywhere} \} = \{f \in \mathcal{L}^p(S,\, \mu) : \|f\|_p = 0\} \qquad \forall \ p.</math>
===Dual spaces===
This set is a vector subspace of <math>\mathcal{L}^p(S,\, \mu)</math> for every positive <math>p \leq \infty.</math>
The [[dual space]] (the space of all continuous linear functionals) of ''L''<sup>''p''</sup>(''μ'') for 1&nbsp;< ''p''&nbsp;< ∞ has a natural isomorphism with ''L''<sup>''q''</sup>(''μ''), where ''q''&thinsp; is such that 1/''p''&nbsp;+ 1/''q''&nbsp;=&nbsp;1, which associates ''g''&nbsp;∈ ''L''<sup>''q''</sup>(''μ'') with the functional ''κ''<sub>''p''</sub>(''g'')&nbsp;∈ ''L''<sup>''p''</sup>(''μ'')<sup>∗</sup> defined by


'''Quotient vector space'''
:<math>\kappa_p(g) : f \in L^p(\mu) \mapsto \int f g \, \mathrm{d}\mu. \, </math>


Like every [[seminorm]], the seminorm <math>\|\cdot\|_p</math> induces a [[Norm (mathematics)|norm]] (defined shortly) on the canonical [[Quotient space (linear algebra)|quotient vector space]] of <math>\mathcal{L}^p(S,\, \mu)</math> by its vector subspace
The fact that ''κ''<sub>''p''</sub>(''g'') is well defined and continuous follows from [[Hölder's inequality]]. The mapping ''κ''<sub>''p''</sub> is a linear mapping from ''L''<sup>''q''</sup>(''μ'') into ''L''<sup>''p''</sup>(''μ'')<sup>∗</sup>, which is an [[isometry]] by the [[Hölder's inequality#Extremal equality|extremal case]] of Hölder's inequality. It is also possible to show (for example with the [[Radon–Nikodym theorem]], see<ref>{{Citation | last1=Rudin | first1=Walter | author1-link=Walter Rudin | title=Real and Complex Analysis | publisher=Tata McGraw-Hill | location=New Delhi | edition=2rd | year=1980}}, Theorem 6.16</ref>) that any ''G''&nbsp;∈ ''L''<sup>''p''</sup>(''μ'')<sup>∗</sup> can be expressed this way: i.e., that ''κ''<sub>''p''</sub> is ''onto''. Since ''κ''<sub>''p''</sub> is onto and isometric, it is an [[isomorphism]] of [[Banach space]]s. With this (isometric) isomorphism in mind, it is usual to say simply that ''L''<sup>''q''</sup>&nbsp; "''is''"&nbsp; the dual of&nbsp;''L''<sup>''p''</sup>.
<math display="inline">\mathcal{N} = \{f \in \mathcal{L}^p(S,\, \mu) : \|f\|_p = 0\}.</math>
This normed quotient space is called {{em|Lebesgue space}} and it is the subject of this article. We begin by defining the quotient vector space.


Given any <math>f \in \mathcal{L}^p(S,\, \mu),</math> the [[coset]] <math>f + \mathcal{N} \;\stackrel{\scriptscriptstyle\text{def}}{=}\; \{f + h : h \in \mathcal{N}\}</math> consists of all measurable functions <math>g</math> that are equal to <math>f</math> [[almost everywhere]].
When 1&nbsp;< ''p''&nbsp;< ∞, the space ''L''<sup>''p''</sup>(''μ'') is [[reflexive space|reflexive]]. Let ''κ''<sub>''p''</sub> be the above map and let ''κ''<sub>''q''</sub> be the corresponding linear isometry from ''L''<sup>''p''</sup>(''μ'') onto ''L''<sup>''q''</sup>(''μ'')<sup>∗</sup>. The map
The set of all cosets, typically denoted by
<math display="block">\mathcal{L}^p(S, \mu) / \mathcal{N} ~~\stackrel{\scriptscriptstyle\text{def}}{=}~~ \{f + \mathcal{N} : f \in \mathcal{L}^p(S, \mu)\},</math>
forms a vector space with origin <math>0 + \mathcal{N} = \mathcal{N}</math> when vector addition and scalar multiplication are defined by <math>(f + \mathcal{N}) + (g + \mathcal{N}) \;\stackrel{\scriptscriptstyle\text{def}}{=}\; (f + g) + \mathcal{N}</math> and <math>s (f + \mathcal{N}) \;\stackrel{\scriptscriptstyle\text{def}}{=}\; (s f) + \mathcal{N}.</math>
This particular quotient vector space will be denoted by <math display="block">L^p(S,\, \mu) ~\stackrel{\scriptscriptstyle\text{def}}{=}~ \mathcal{L}^p(S, \mu) / \mathcal{N}.</math>
Two cosets are equal <math>f + \mathcal{N} = g + \mathcal{N}</math> if and only if <math>g \in f + \mathcal{N}</math> (or equivalently, <math>f - g \in \mathcal{N}</math>), which happens if and only if <math>f = g</math> almost everywhere; if this is the case then <math>f</math> and <math>g</math> are identified in the quotient space. Hence, strictly speaking <math>L^p(S,\, \mu) </math> consists of [[equivalence class]]es of functions.{{sfn|Stein|Shakarchi|2012|p=2}}<ref>{{MathWorld |title=L^2-Space |id=L2-Space }}</ref>


Given any <math>f \in \mathcal{L}^p(S,\, \mu),</math> the value of the seminorm <math>\|\cdot\|_p</math> on the [[coset]] <math>f + \mathcal{N} = \{f + h : h \in \mathcal{N}\}</math> is constant and equal to <math>\|f\|_p</math>, that is:
:<math>j_p : L^p(\mu) \overset{\kappa_q}{\to} L^q(\mu)^* \overset{\,\,(\kappa_p^{-1})^*}{\longrightarrow} L^p(\mu)^{**}</math>
<math display=block>\|f + \mathcal{N}\|_p \;\stackrel{\scriptscriptstyle\text{def}}{=}\; \|f\|_p.</math>
The map <math>f + \mathcal{N} \mapsto \|f + \mathcal{N}\|_p</math> is a [[Norm (mathematics)|norm]] on <math>L^p(S, \mu)</math> called the {{em|{{visible anchor|p-norm|text=<math>p</math>-norm}}}}.
The value <math>\|f + \mathcal{N}\|_p</math> of a coset <math>f + \mathcal{N}</math> is independent of the particular function <math>f</math> that was chosen to represent the coset, meaning that if <math>\mathcal{C} \in L^p(S, \mu)</math> is any coset then <math>\|\mathcal{C}\|_p = \|f\|_p</math> for every <math>f \in \mathcal{C}</math> (since <math>\mathcal{C} = f + \mathcal{N}</math> for every <math>f \in \mathcal{C}</math>).


'''The Lebesgue <math>L^p</math> space'''
from ''L''<sup>''p''</sup>(''μ'') to ''L''<sup>''p''</sup>(''μ'')<sup>∗∗</sup>, obtained by composing ''κ''<sub>''q''</sub> with the [[Dual space#Transpose of a continuous linear map|transpose]] (or adjoint) of the inverse of ''κ''<sub>''p''</sub>, coincides with the [[Reflexive space#Definitions|canonical embedding]] ''J''&thinsp; of ''L''<sup>''p''</sup>(''μ'') into its bidual. Moreover, the map ''j''<sub>''p''</sub> is onto, as composition of two onto isometries, and this proves reflexivity.


The [[normed vector space]] <math>\left(L^p(S, \mu), \|\cdot\|_p\right)</math> is called {{em|<math>L^p</math> space}} or the {{em|Lebesgue space}} of <math>p</math>-th power integrable functions and it is a [[Banach space]] for every <math>1 \leq p \leq \infty</math> (meaning that it is a [[complete metric space]], a result that is sometimes called the [[Riesz–Fischer theorem#Completeness of Lp, 0 < p ≤ ∞|Riesz–Fischer theorem]]).
If the measure ''μ'' on ''S'' is [[sigma-finite]], then the dual of ''L''<sup>1</sup>(''μ'') is isometrically isomorphic to ''L''<sup>∞</sup>(''μ'') (more precisely, the map ''κ''<sub>1</sub> corresponding to ''p''&nbsp;= 1 is an isometry from ''L''<sup>∞</sup>(''μ'') onto ''L''<sup>1</sup>(''μ'')<sup>∗</sup>).
When the underlying measure space <math>S</math> is understood then <math>L^p(S, \mu)</math> is often abbreviated <math>L^p(\mu),</math> or even just <math>L^p.</math>
Depending on the author, the subscript notation <math>L_p</math> might denote either <math>L^p(S, \mu)</math> or <math>L^{1/p}(S, \mu).</math>


If the seminorm <math>\|\cdot\|_p</math> on <math>\mathcal{L}^p(S,\, \mu)</math> happens to be a norm (which happens if and only if <math>\mathcal{N} = \{0\}</math>) then the normed space <math>\left(\mathcal{L}^p(S,\, \mu), \|\cdot\|_p\right)</math> will be [[Linear map|linearly]] [[isometrically isomorphic]] to the normed quotient space <math>\left(L^p(S, \mu), \|\cdot\|_p\right)</math> via the canonical map <math>g \in \mathcal{L}^p(S,\, \mu) \mapsto \{g\}</math> (since <math>g + \mathcal{N} = \{g\}</math>); in other words, they will be, [[up to]] a [[linear isometry]], the same normed space and so they may both be called "<math>L^p</math> space".
The dual of ''L''<sup>∞</sup> is subtler. Elements of (''L''<sup>∞</sup>(''μ''))<sup>∗</sup> can be identified with bounded signed ''finitely'' additive measures on ''S'' that are [[absolutely continuous]] with respect to ''μ''. See [[ba space]] for more details. If we assume the axiom of choice, this space is much bigger than ''L''<sup>1</sup>(''μ'') except in some trivial cases. However, there are relatively consistent extensions of Zermelo-Fraenkel set theory in which the dual of ''ℓ''<sup>∞</sup> is ''ℓ''<sup>1</sup>. This is a result of Shelah, discussed in Eric Schechter's book Handbook of Analysis and its Foundations.


The above definitions generalize to [[Bochner space]]s.
===Embeddings===


In general, this process cannot be reversed: there is no consistent way to define a "canonical" representative of each coset of <math>\mathcal{N}</math> in <math>L^p.</math> For <math>L^\infty,</math> however, there is a [[Lifting theory|theory of lifts]] enabling such recovery.
Colloquially, if 1&nbsp;≤ ''p''&nbsp;< ''q''&nbsp;≤ ∞, ''L<sup>p</sup>''(''S'',&nbsp;''μ'') contains functions that are more locally singular, while elements of ''L<sup>q</sup>''(''S'',&nbsp;''μ'') can be more spread out. Consider the Lebesgue measure on the half line (0, ∞). A continuous function in ''L''<sup>1</sup> might blow up near 0 but must decay sufficiently fast toward infinity. On the other hand, continuous functions in ''L''<sup>∞</sup> need not decay at all but no blow-up is allowed. The precise technical result is the following:


===Special cases===
#Let 1&nbsp;≤ ''p''&nbsp;< ''q''&nbsp;≤ ∞. ''L<sup>q</sup>''(''S'',&nbsp;''μ'') is contained in ''L<sup>p</sup>''(''S'',&nbsp;μ) iff ''S'' does not contain sets of arbitrarily large measure, and
For <math>1 \leq p \leq \infty</math> the <math>\ell^p</math> spaces are a special case of <math>L^p</math> spaces; when <math>S</math> are the [[natural number]]s <math>\mathbb{N}</math> and <math>\mu</math> is the [[counting measure]]. More generally, if one considers any set <math>S</math> with the counting measure, the resulting <math>L^p</math> space is denoted <math>\ell^p(S).</math> For example, <math>\ell^p(\mathbb{Z})</math> is the space of all sequences indexed by the integers, and when defining the <math>p</math>-norm on such a space, one sums over all the integers. The space <math>\ell^p(n),</math> where <math>n</math> is the set with <math>n</math> elements, is <math>\Reals^n</math> with its <math>p</math>-norm as defined above.
#Let 1&nbsp;≤ ''p''&nbsp;< ''q''&nbsp;≤ ∞. ''L<sup>p</sup>''(''S'',&nbsp;''μ'') is contained in ''L<sup>q</sup>''(''S'',&nbsp;''μ'') iff ''S'' does not contain sets of arbitrarily small non-zero measure.


Similar to <math>\ell^2</math> spaces, <math>L^2</math> is the only [[Hilbert space]] among <math>L^p</math> spaces. In the complex case, the inner product on <math>L^2</math> is defined by
In particular, if the domain ''S'' has finite measure, the bound (a consequence of [[Jensen's inequality]])
<math display="block">\langle f, g \rangle = \int_S f(x) \overline{g(x)} \, \mathrm{d}\mu(x).</math>
Functions in <math>L^2</math> are sometimes called '''[[square-integrable function]]s''', '''quadratically integrable functions''' or '''square-summable functions''', but sometimes these terms are reserved for functions that are square-integrable in some other sense, such as in the sense of a [[Riemann integral]] {{harv|Titchmarsh|1976}}.


As any Hilbert space, every space <math>L^2</math> is linearly isometric to a suitable <math>\ell^2(I),</math> where the cardinality of the set <math>I</math> is the cardinality of an arbitrary basis for this particular <math>L^2.</math>
:<math>\ \|f\|_p \le \mu(S)^{(1/p)-(1/q)} \|f\|_q </math>


If we use complex-valued functions, the space <math>L^\infty</math> is a [[commutative]] [[C*-algebra]] with pointwise multiplication and conjugation. For many measure spaces, including all sigma-finite ones, it is in fact a commutative [[von Neumann algebra]]. An element of <math>L^\infty</math> defines a [[bounded operator]] on any <math>L^p</math> space by [[multiplication operator|multiplication]].
means the space ''L''<sup>''q''</sup> is continuously embedded in ''L''<sup>''p''</sup>. That is to say, the identity operator is a bounded linear map from ''L''<sup>''q''</sup> to ''L''<sup>''p''</sup>. The constant appearing in the above inequality is optimal, in the sense that the [[operator norm]] of the identity ''I''&nbsp;:&nbsp;''L<sup>q</sup>''(''S'',&nbsp;''μ'')&nbsp;&rarr;&nbsp;''L<sup>p</sup>''(''S'',&nbsp;''μ'') is precisely
:<math>\|I\|_{q,p} = \mu(S)^{(1/p)-(1/q)},</math>
the case of equality being achieved exactly when ''f''&nbsp;=&nbsp;1 a.e.[μ].


=== Dense subspaces ===
===When {{math|(0 < ''p'' < 1)}}===


If <math>0 < p < 1,</math> then <math>L^p(\mu)</math> can be defined as above, that is:
It is assumed that 1&nbsp;≤ ''p''&nbsp;< ∞ throughout this section.<br />
<math display="block">N_p(f) = \int_S |f|^p\, d\mu < \infty.</math>
Let (''S'',&nbsp;''Σ'',&nbsp;''μ'') be a measure space. An ''integrable simple function'' ''f''&thinsp; on ''S''&thinsp; is one of the form
In this case, however, the <math>p</math>-norm <math>\|f\|_p = N_p(f)^{1/p}</math> does not satisfy the triangle inequality and defines only a [[quasi-norm]]. The inequality <math>(a + b)^p \leq a^p + b^p,</math> valid for <math>a, b \geq 0,</math> implies that
:<math>f = \sum_{j=1}^n a_j \mathbf{1}_{A_j},</math>
<math display="block">N_p(f + g) \leq N_p(f) + N_p(g)</math>
where ''a<sub>j</sub>'' is scalar and ''A<sub>j</sub>''&nbsp;∈ ''Σ''&thinsp; has finite measure, for ''j''&nbsp;= 1,...,''n''. By construction of the [[Lebesgue integration|integral]], the vector space of integrable simple functions is dense in ''L''<sup>''p''</sup>(''S'',&nbsp;''Σ'',&nbsp;''μ'').
and so the function
<math display="block">d_p(f ,g) = N_p(f - g) = \|f - g\|_p^p</math>
is a metric on <math>L^p(\mu).</math> The resulting metric space is [[Complete metric space|complete]].{{sfn|Rudin|1991|p=37}}


In this setting <math>L^p</math> satisfies a ''reverse Minkowski inequality'', that is for <math>u, v \in L^p</math>
More can be said when ''S''&thinsp; is a [[Metrization theorem|metrizable]] [[topological space]] and ''Σ''&thinsp; its [[Borel algebra|Borel ''σ''&ndash;algebra]], ''i.e.'', the smallest ''σ''&ndash;algebra of subsets of ''S''&thinsp; containing the [[open set]]s.
<math display="block">\Big\||u| + |v|\Big\|_p \geq \|u\|_p + \|v\|_p</math>


This result may be used to prove [[Clarkson's inequalities]], which are in turn used to establish the [[uniformly convex space|uniform convexity]] of the spaces <math>L^p</math> for <math>1 < p < \infty</math> {{harv|Adams|Fournier|2003}}.
Suppose that ''V''&nbsp;⊂ ''S''&thinsp; is an open set with ''μ''(''V'')&nbsp;< ∞. It can be proved that for every Borel set ''A''&nbsp;∈ ''Σ''&thinsp; contained in ''V'', and for every ''ε''&nbsp;> 0, there exist a closed set ''F''&thinsp; and an open set ''U''&thinsp; such that
:<math>F \subset A \subset U \subset V \ \ \text{and} \ \ \mu(U) - \mu(F) = \mu(U \setminus F) < \varepsilon.</math>
It follows that there exists ''φ'' continuous on ''S''&thinsp; such that
:<math>0 \le \varphi \le \mathbf{1}_V \ \text{and} \ \int_S |\mathbf{1}_A - \varphi| \, \mathrm{d}\mu < \varepsilon.</math>


The space <math>L^p</math> for <math>0 < p < 1</math> is an [[F-space]]: it admits a complete translation-invariant metric with respect to which the vector space operations are continuous. It is the prototypical example of an [[F-space]] that, for most reasonable measure spaces, is not [[Locally convex topological vector space|locally convex]]: in <math>\ell^p</math> or <math>L^p([0, 1]),</math> every open convex set containing the <math>0</math> function is unbounded for the <math>p</math>-quasi-norm; therefore, the <math>0</math> vector does not possess a fundamental system of convex neighborhoods. Specifically, this is true if the measure space <math>S</math> contains an infinite family of disjoint measurable sets of finite positive measure.
If ''S''&thinsp; can be covered by an increasing sequence (''V<sub>n</sub>'') of open sets that have finite measure, then the space of ''p''&ndash;integrable continuous functions is dense in ''L''<sup>''p''</sup>(''S'',&nbsp;''Σ'',&nbsp;''μ''). More precisely, one can use bounded continuous functions that vanish outside one of the open sets ''V<sub>n</sub>''.


The only nonempty convex open set in <math>L^p([0, 1])</math> is the entire space. Consequently, there are no nonzero continuous linear functionals on <math>L^p([0, 1]);</math> the [[continuous dual space]] is the zero space. In the case of the [[counting measure]] on the natural numbers (i.e. <math>L^p(\mu) = \ell^p</math>), the bounded linear functionals on <math>\ell^p</math> are exactly those that are bounded on <math>\ell^1</math>, i.e., those given by sequences in <math>\ell^\infty.</math> Although <math>\ell^p</math> does contain non-trivial convex open sets, it fails to have enough of them to give a base for the topology.
This applies in particular when ''S''&nbsp;= '''R'''<sup>''d''</sup> and when ''μ'' is the Lebesgue measure. The space of continuous and compactly supported functions is dense in ''L''<sup>''p''</sup>('''R'''<sup>''d''</sup>). Similarly, the space of integrable ''step functions''&thinsp; is dense in ''L''<sup>''p''</sup>('''R'''<sup>''d''</sup>); this space is the linear span of indicator functions of bounded intervals when ''d''&nbsp;= 1, of bounded rectangles when ''d''&nbsp;= 2 and more generally of products of bounded intervals.<br />
Several properties of general functions in ''L''<sup>''p''</sup>('''R'''<sup>''d''</sup>) are first proved for continuous and compactly supported functions (sometimes for step functions), then extended by density to all functions. For example, it is proved this way that translations are continuous on ''L''<sup>''p''</sup>('''R'''<sup>''d''</sup>), in the following sense: for every ''f''&nbsp;∈ ''L''<sup>''p''</sup>('''R'''<sup>''d''</sup>),
:<math>\|\tau_t f - f \|_p \rightarrow 0</math>
when ''t''&nbsp;∈ '''R'''<sup>''d''</sup> tends to 0, where <math>\tau_t f</math> is the translated function defined by <math>(\tau_t f)(x) = f(x - t).</math>


Having no linear functionals is highly undesirable for the purposes of doing analysis. In case of the Lebesgue measure on <math>\Reals^n,</math> rather than work with <math>L^p</math> for <math>0 < p < 1,</math> it is common to work with the [[Hardy space]] {{math|''H{{i sup|p}}''}} whenever possible, as this has quite a few linear functionals: enough to distinguish points from one another. However, the [[Hahn–Banach theorem]] still fails in {{math|''H{{i sup|p}}''}} for <math>p < 1</math> {{harv|Duren|1970|loc=§7.5}}.
==Applications==
''L<sup>p</sup>'' spaces are widely used in mathematics and applications.


==Properties==
===Hausdorff–Young inequality===


===Hölder's inequality===
The [[Fourier transform]] for the real line (resp. for periodic functions, cf. [[Fourier series]]) maps ''L<sup>p</sup>''('''R''') to ''L<sup>q</sup>''('''R''') (resp. ''L<sup>p</sup>''('''T''') to ℓ<sup>''q''</sup>), where 1&nbsp;≤ ''p''&nbsp;≤ 2 and 1/''p''&nbsp;+ 1/''q''&nbsp;= 1. This is a consequence of the [[Riesz-Thorin theorem|Riesz-Thorin interpolation theorem]], and is made precise with the [[Hausdorff–Young inequality]].


Suppose <math>p, q, r \in [1, \infty]</math> satisfy <math>\tfrac{1}{p} + \tfrac{1}{q} = \tfrac{1}{r}</math>. If <math>f \in L^p(S, \mu)</math> and <math>g \in L^q(S, \mu)</math> then <math>f g \in L^r(S, \mu)</math> and{{sfn|Bahouri|Chemin|Danchin|2011|pp=1–4}}
By contrast, if ''p''&nbsp;> 2, the Fourier transform does not map into ''L<sup>q</sup>''.
<math display=block>\|f g\|_r ~\leq~ \|f\|_p \, \|g\|_q.</math>


This inequality, called [[Hölder's inequality]], is in some sense optimal since if <math>r = 1</math> and <math>f</math> is a measurable function such that
===Hilbert spaces===
<math display=block>\sup_{\|g\|_q \leq 1} \, \int_S |f g| \, \mathrm{d} \mu ~<~ \infty</math>
where the [[supremum]] is taken over the closed unit ball of <math>L^q(S, \mu),</math> then <math>f \in L^p(S, \mu)</math> and
<math display=block>\|f\|_p ~=~ \sup_{\|g\|_q \leq 1} \, \int_S f g \, \mathrm{d} \mu.</math>


===Atomic decomposition===
[[Hilbert space]]s are central to many applications, from [[quantum mechanics]] to [[stochastic calculus]]. The spaces ''L''<sup>2</sup> and ℓ<sup>2</sup> are both Hilbert spaces. In fact, by choosing a Hilbert basis, one sees that all Hilbert spaces are isometric to ℓ<sup>2</sup>(''E''), where ''E'' is a set with an appropriate cardinality.

If <math>1 \leq p < \infty</math> then every non-negative <math>f \in L^p(\mu)</math> has an {{em|atomic decomposition}},{{sfn|Bahouri|Chemin|Danchin|2011|pp=7–8}} meaning that there exist a sequence <math>(r_n)_{n \in \Z}</math> of non-negative real numbers and a sequence of non-negative functions <math>(f_n)_{n \in \Z},</math> called {{em|the atoms}}, whose supports <math>\left(\operatorname{supp} f_n\right)_{n \in \Z}</math> are [[Disjoint sets|pairwise disjoint sets]] of measure <math>\mu\left(\operatorname{supp} f_n\right) \leq 2^{n+1},</math> such that
<math display=block>f ~=~ \sum_{n \in \Z} r_n \, f_n \, ,</math>
and for every integer <math>n \in \Z,</math>
<math display=block>\|f_n\|_\infty ~\leq~ 2^{-\tfrac{n}{p}} \, ,</math>
and
<math display=block>\tfrac{1}{2} \|f\|_p^p ~\leq~ \sum_{n \in \Z} r_n^p ~\leq~ 2 \|f\|^p_p \, ,</math>
and where moreover, the sequence of functions <math>(r_n f_n)_{n \in \Z}</math> depends only on <math>f</math> (it is independent of <math>p</math>).
These inequalities guarantee that <math>\|f_n\|_p^p \leq 2</math> for all integers <math>n</math> while the supports of <math>(f_n)_{n \in \Z}</math> being pairwise disjoint implies
<math display=block>\|f\|_p^p ~=~ \sum_{n \in \Z} r_n^p \, \|f_n\|^p_p \, .</math>

===Dual spaces===

The [[Continuous dual|dual space]] of <math>L^p(\mu)</math> for <math>1 < p < \infty</math> has a natural isomorphism with <math>L^q(\mu),</math> where <math>q</math> is such that <math>\tfrac{1}{p} + \tfrac{1}{q} = 1</math>. This isomorphism associates <math>g \in L^q(\mu)</math> with the functional <math>\kappa_p(g) \in L^p(\mu)^*</math> defined by
<math display="block">f \mapsto \kappa_p(g)(f) = \int f g \, \mathrm{d}\mu</math>
for every <math>f \in L^p(\mu).</math>

<math>\kappa_p : L^q(\mu) \to L^p(\mu)^*</math> is a well defined continuous linear mapping which is an [[isometry]] by the [[Hölder's inequality#Extremal equality|extremal case]] of Hölder's inequality. If <math>(S,\Sigma,\mu)</math> is a [[Measure_space#Important_classes_of_measure_spaces|<math>\sigma</math>-finite measure space]] one can use the [[Radon–Nikodym theorem]] to show that any <math>G \in L^p(\mu)^*</math> can be expressed this way, i.e., <math>\kappa_p</math> is an [[Isometry#Definition|isometric isomorphism]] of [[Banach space]]s.{{sfn|Rudin|1987|loc=Theorem 6.16}} Hence, it is usual to say simply that <math>L^q(\mu)</math> is the [[continuous dual space]] of <math>L^p(\mu).</math>

For <math>1 < p < \infty,</math> the space <math>L^p(\mu)</math> is [[reflexive space|reflexive]]. Let <math>\kappa_p</math> be as above and let <math>\kappa_q : L^p(\mu) \to L^q(\mu)^*</math> be the corresponding linear isometry. Consider the map from <math>L^p(\mu)</math> to <math>L^p(\mu)^{**},</math> obtained by composing <math>\kappa_q</math> with the [[dual space#Transpose of a continuous linear map|transpose]] (or adjoint) of the inverse of <math>\kappa_p:</math>

<math display="block">j_p : L^p(\mu) \mathrel{\overset{\kappa_q}{\longrightarrow}} L^q(\mu)^* \mathrel{\overset{\left(\kappa_p^{-1}\right)^*}{\longrightarrow}} L^p(\mu)^{**}</math>

This map coincides with the [[Reflexive space#Definitions|canonical embedding]] <math>J</math> of <math>L^p(\mu)</math> into its bidual. Moreover, the map <math>j_p</math> is onto, as composition of two onto isometries, and this proves reflexivity.

If the measure <math>\mu</math> on <math>S</math> is [[sigma-finite]], then the dual of <math>L^1(\mu)</math> is isometrically isomorphic to <math>L^\infty(\mu)</math> (more precisely, the map <math>\kappa_1</math> corresponding to <math>p = 1</math> is an isometry from <math>L^\infty(\mu)</math> onto <math>L^1(\mu)^*.</math>

The dual of <math>L^\infty(\mu)</math> is subtler. Elements of <math>L^\infty(\mu)^*</math> can be identified with bounded signed ''finitely'' additive measures on <math>S</math> that are [[absolutely continuous]] with respect to <math>\mu.</math> See [[ba space]] for more details. If we assume the axiom of choice, this space is much bigger than <math>L^1(\mu)</math> except in some trivial cases. However, [[Saharon Shelah]] proved that there are relatively consistent extensions of [[Zermelo–Fraenkel set theory]] (ZF + [[Axiom of dependent choice|DC]] + "Every subset of the real numbers has the [[Baire property]]") in which the dual of <math>\ell^\infty</math> is <math>\ell^1.</math><ref>{{Citation|title=Handbook of Analysis and its Foundations|last=Schechter |first=Eric|year=1997| publisher=Academic Press Inc.|location=London}} See Sections 14.77 and 27.44–47</ref>

===Embeddings===

Colloquially, if <math>1 \leq p < q \leq \infty,</math> then <math>L^p(S, \mu)</math> contains functions that are more locally singular, while elements of <math>L^q(S, \mu)</math> can be more spread out. Consider the [[Lebesgue measure]] on the half line <math>(0, \infty).</math> A continuous function in <math>L^1</math> might blow up near <math>0</math> but must decay sufficiently fast toward infinity. On the other hand, continuous functions in <math>L^\infty</math> need not decay at all but no blow-up is allowed. More formally, suppose that <math>0 < p < q \leq \infty </math>, then:<ref name="VillaniEmbeddings">{{Citation|title=Another note on the inclusion {{math|''L<sup>p</sup>''(''μ'') ⊂ ''L<sup>q</sup>''(''μ'')}}|last=Villani|first=Alfonso|year=1985|journal=Amer. Math. Monthly|volume=92|number=7|pages=485–487|doi=10.2307/2322503|mr=801221|jstor=2322503}}</ref>

#<math>L^q(S, \mu) \subseteq L^p(S, \mu)</math> if and only if <math>S</math> does not contain sets of finite but arbitrarily large measure (e.g. any [[finite measure]]).
#<math>L^p(S, \mu) \subseteq L^q(S, \mu)</math> if and only if <math>S</math> does not contain sets of non-zero but arbitrarily small measure (e.g. the [[counting measure]]).

Neither condition holds for the Lebesgue measure on the real line while both conditions holds for the [[counting measure]] on any finite set. As a consequence of the [[closed graph theorem]], the embedding is continuous, i.e., the [[identity operator]] is a bounded linear map from <math>L^q</math> to <math>L^p</math> in the first case and <math>L^p</math> to <math>L^q</math> in the second. Indeed, if the domain <math>S</math> has finite measure, one can make the following explicit calculation using [[Hölder's inequality]]
<math display="block">\ \|\mathbf{1}f^p\|_1 \leq \|\mathbf{1}\|_{q/(q-p)} \|f^p\|_{q/p}</math>
leading to
<math display="block">\ \|f\|_p \leq \mu(S)^{1/p - 1/q} \|f\|_q .</math>

The constant appearing in the above inequality is optimal, in the sense that the [[operator norm]] of the identity <math>I : L^q(S, \mu) \to L^p(S, \mu)</math> is precisely
<math display="block">\|I\|_{q,p} = \mu(S)^{1/p - 1/q}</math>
the case of equality being achieved exactly when <math>f = 1</math> <math>\mu</math>-almost-everywhere.

===Dense subspaces===

Let <math>1 \leq p < \infty</math> and <math>(S, \Sigma, \mu)</math> be a measure space and consider an integrable [[simple function]] <math>f</math> on <math>S</math> given by
<math display="block">f = \sum_{j=1}^n a_j \mathbf{1}_{A_j},</math>
where <math>a_j</math> are scalars, <math>A_j \in \Sigma</math> has finite measure and <math>{\mathbf 1}_{A_j}</math> is the [[indicator function]] of the set <math>A_j,</math> for <math>j = 1, \dots, n.</math> By construction of the [[Lebesgue integration|integral]], the vector space of integrable simple functions is [[dense_set|dense]] in <math>L^p(S, \Sigma, \mu).</math>

More can be said when <math>S</math> is a [[Normal space|normal]] [[topological space]] and <math>\Sigma</math> its [[Borel algebra|Borel {{sigma}}&ndash;algebra]].

Suppose <math>V \subseteq S</math> is an open set with <math>\mu(V) < \infty.</math> Then for every Borel set <math>A \in \Sigma</math> contained in <math>V</math> there exist a closed set <math>F</math> and an open set <math>U</math> such that
<math display="block">F \subseteq A \subseteq U \subseteq V \quad \text{and} \quad \mu(U \setminus F)= \mu(U) - \mu(F) < \varepsilon,</math>
for every <math>\varepsilon > 0</math>. Subsequently, there exists a [[Urysohn function]] <math>0 \leq \varphi \leq 1</math> on <math>S</math> that is <math>1</math> on <math>F</math> and <math>0</math> on <math>S \setminus U,</math> with
<math display="block">\int_S |\mathbf{1}_A - \varphi| \, \mathrm{d}\mu < \varepsilon \, .</math>

If <math>S</math> can be covered by an increasing sequence <math>(V_n)</math> of open sets that have finite measure, then the space of <math>p</math>&ndash;integrable continuous functions is dense in <math>L^p(S, \Sigma, \mu).</math> More precisely, one can use bounded continuous functions that vanish outside one of the open sets <math>V_n.</math>

This applies in particular when <math>S = \Reals^d</math> and when <math>\mu</math> is the Lebesgue measure. For example, the space of continuous and compactly supported functions as well as the space of integrable [[step function]]s are dense in <math>L^p(\Reals^d)</math>.

===Closed subspaces===

Suppose <math>0 < p < \infty</math>. If <math>(S,\Sigma,\mu)</math> is a [[probability space]] and <math>V \subset L^\infty(\mu)</math> is a closed subspace of <math>L^p(\mu)</math> then <math>V</math> is finite-dimensional.{{sfn|Rudin|1991|pp=117–119}}
It is crucial that the vector space <math>V</math> be a subset of <math>L^\infty</math> since it is possible to construct an infinite-dimensional closed vector subspace of <math>L^1\left(\mathbb{T}, \tfrac{1}{2\pi}\lambda\right)</math> which lies in <math>L^4</math>; taking the [[Lebesgue measure]] <math>\lambda</math> on the [[circle group]] <math>\mathbb{T}</math> divided by <math>2\pi</math> as the probability measure.

==Applications==


===Statistics===
===Statistics===


In [[statistics]], measures of [[central tendency]] and [[statistical dispersion]], such as the [[mean]], [[median]], and [[standard deviation]], are defined in terms of ''L''<sup>''p''</sup> metrics, and measures of central tendency can be characterized as [[Average#Solutions to variational problems|solutions to variational problems]].
In statistics, measures of [[central tendency]] and [[statistical dispersion]], such as the [[mean]], [[median]], and [[standard deviation]], can be defined in terms of <math>L^p</math> metrics, and measures of central tendency can be characterized as [[Central tendency#Solutions to variational problems|solutions to variational problems]].


In [[penalized regression]], "L1 penalty" and "L2 penalty" refer to penalizing either the [[Taxicab geometry|<math>L^1</math> norm]] of a solution's vector of parameter values (i.e. the sum of its absolute values), or its squared <math>L^2</math> norm (its [[Euclidean norm|Euclidean length]]). Techniques which use an L1 penalty, like [[LASSO]], encourage sparse solutions (where the many parameters are zero).<ref>{{cite book |last=Hastie |first=T. J. |authorlink=Trevor Hastie |last2=Tibshirani |first2=R. |author2link=Robert Tibshirani |last3=Wainwright |first3=M. J. |year=2015 |title=Statistical Learning with Sparsity: The Lasso and Generalizations |location= |publisher=CRC Press |isbn=978-1-4987-1216-3 }}</ref> [[Elastic net regularization]] uses a penalty term that is a combination of the <math>L^1</math> norm and the squared <math>L^2</math> norm of the parameter vector.
== ''L''<sup>''p''</sup> for 0 < ''p'' < 1 ==
Let (''S'', ''Σ'', ''μ'') be a measure space. If 0&nbsp;< ''p''&nbsp;< 1, then ''L<sup>p</sup>''(''μ'') can be defined as above: it is the vector space of those measurable functions ''f'' such that
:<math>N_p(f) = \int_S |f|^p\, d\mu < \infty.</math>
As before, we may introduce the ''p''-norm ||&nbsp;''f''&nbsp;||<sub>''p''</sub>&nbsp;= ''N''<sub>''p''</sup>(''f'')<sup>1/''p''</sup>,
but ||&nbsp;·&nbsp;||<sub>''p''</sub> does not satisfy the triangle inequality in this case, and defines only a [[quasi-norm]].
The inequality (''a''&nbsp;+ ''b'')<sup>''p''</sup>&nbsp;≤ ''a''<sup>''p''</sup>&nbsp;+ ''b''<sup>''p''</sup>, valid for ''a''&nbsp;≥ 0 and ''b''&nbsp;≥ 0 implies that {{harv|Rudin|1991|loc=§1.47}}
:<math>N_p(f+g)\le N_p(f) + N_p(g),</math>
and so the function
:<math>d_p(f,g) = N_p(f-g) = \|f - g\|_p^p</math>
is a metric on ''L''<sup>''p''</sup>(''μ''). The resulting metric space is [[complete space|complete]]; the verification is similar to the familiar case when ''p''&nbsp;≥&nbsp;1.


===Hausdorff–Young inequality===
In this setting ''L''<sup>''p''</sup> satisfies a ''reverse Minkowski inequality'', that is for ''u'' and ''v'' in ''L<sup>p</sup>''
:<math>\|\,|u|+|v|\,\|_p\geq \|u\|_p+\|v\|_p</math>.


The [[Fourier transform]] for the real line (or, for [[periodic functions]], see [[Fourier series]]), maps <math>L^p(\Reals)</math> to <math>L^q(\Reals)</math> (or <math>L^p(\mathbf{T})</math> to <math>\ell^q</math>) respectively, where <math>1 \leq p \leq 2</math> and <math>\tfrac{1}{p} + \tfrac{1}{q} = 1.</math> This is a consequence of the [[Riesz–Thorin_theorem#Hausdorff–Young_inequality|Riesz–Thorin interpolation theorem]], and is made precise with the [[Hausdorff–Young inequality]].
This result may be used to prove Clarkson's inequalities, which are in turn used to establish the [[Uniformly convex space|uniform convexity]] of the spaces ''L''<sup>''p''</sup>
for 1&nbsp;&lt; ''p''&nbsp;&lt; ∞ {{harv|Adams|Fournier|2003}}.


By contrast, if <math>p > 2,</math> the Fourier transform does not map into <math>L^q.</math>
The space ''L''<sup>''p''</sup> for 0&nbsp;< ''p''&nbsp;< 1 is an [[F-space]]: it admits a complete translation-invariant metric with respect to which the vector space operations are continuous. It is also [[locally bounded]], much like the case ''p''&nbsp;≥ 1. It is the prototypical example of an [[F-space]] that, for most reasonable measure spaces, is not [[locally convex]]: in ℓ<sup>''p''</sup> or
''L''<sup>''p''</sup>([0,&nbsp;1]), every open convex set containing the 0 function is unbounded for the ''p''-quasi-norm; therefore, the 0 vector does not possess a fundamental system of convex neighborhoods. Specifically, this is true if the measure space ''S'' contains an infinite family of disjoint measurable sets of finite positive measure.


===Hilbert spaces===
The only nonempty convex open set in ''L''<sup>''p''</sup>([0,&nbsp;1]) is the entire space {{harv|Rudin|1991|loc=§1.47}}. As a particular consequence, there are no nonzero linear functionals on ''L''<sup>''p''</sup>([0,&nbsp;1]): the dual space is the zero space. In the case of the [[counting measure]] on the natural numbers (producing the sequence space ''L''<sup>''p''</sup>(''μ'')&nbsp;= ℓ<sup>''p''</sup>), the bounded linear functionals on ℓ<sup>''p''</sup> are exactly those that are bounded on ℓ<sup>1</sup>, namely those given by sequences in ℓ<sup>∞</sup>. Although ℓ<sup>''p''</sup> does contain non-trivial convex open sets, it fails to have enough of them to give a base for the topology.
[[Hilbert space]]s are central to many applications, from [[quantum mechanics]] to [[stochastic calculus]]. The spaces <math>L^2</math> and <math>\ell^2</math> are both Hilbert spaces. In fact, by choosing a Hilbert basis <math>E,</math> i.e., a maximal orthonormal subset of <math>L^2</math> or any Hilbert space, one sees that every Hilbert space is isometrically isomorphic to <math>\ell^2(E)</math> (same <math>E</math> as above), i.e., a Hilbert space of type <math>\ell^2.</math>


==Generalizations and extensions==
The situation of having no linear functionals is highly undesirable for the purposes of doing analysis. In the case of the Lebesgue measure on '''R'''<sup>''n''</sup>, rather than work with ''L''<sup>''p''</sup> for 0&nbsp;< ''p''&nbsp;< 1, it is common to work with the [[Hardy space]] ''H''<sup>''p''</sup> whenever possible, as this has quite a few linear functionals: enough to distinguish points from one another. However, the [[Hahn–Banach theorem]] still fails in ''H''<sup>''p''</sup> for ''p''&nbsp;<&nbsp;1 {{harv|Duren|1970|loc=§7.5}}.


=== ''L''<sup>0</sup>, the space of measurable functions ===
===Weak {{math|''L<sup>p</sup>''}}===


Let <math>(S, \Sigma, \mu)</math> be a measure space, and <math>f</math> a [[measurable function]] with real or complex values on <math>S.</math> The [[cumulative distribution function|distribution function]] of <math>f</math> is defined for <math>t \geq 0</math> by
The vector space of (equivalence classes of) measurable functions on (''S'', ''Σ'', ''μ'') is denoted ''L''<sup>0</sup>(''S'', ''Σ'', ''μ'') {{harv|Kalton|Peck|Roberts|1984}}. By definition, it contains all the ''L''<sup>''p''</sup>, and is equipped with the topology of [[Convergence in measure|''convergence in measure'']]. When ''μ'' is a probability measure (i.e., ''μ''(''S'')&nbsp;= 1), this mode of convergence is named [[Convergence of random variables#Convergence in probability|''convergence in probability'']].
<math display="block">\lambda_f(t) = \mu\{x \in S : |f(x)| > t\}.</math>
The description is easier when ''μ'' is finite.


If <math>f</math> is in <math>L^p(S, \mu)</math> for some <math>p</math> with <math>1 \leq p < \infty,</math> then by [[Markov's inequality]],
If ''μ'' is a finite measure on (''S'',&nbsp;''Σ''), the 0 function admits for the convergence in measure the following fundamental system of neighborhoods
<math display="block">\lambda_f(t) \leq \frac{\|f\|_p^p}{t^p}</math>
:<math>V_\varepsilon = \Bigl\{ f : \mu \bigl(\{x : |f(x)| > \varepsilon \} \bigr) < \varepsilon \Bigr\}, \ \ \varepsilon > 0.</math>
The topology can be defined by any metric ''d''&thinsp; of the form
:<math>d(f, g) = \int_S \varphi \bigl( |f(x) - g(x)| \bigr) \, \mathrm{d}\mu(x)</math>
where ''φ''&thinsp; is bounded continuous concave and non-decreasing on [0, ∞), with ''φ''(0)&nbsp;= 0 and ''φ''(''t'')&nbsp;> 0 when ''t''&nbsp;> 0 (for example, ''φ''(''t'')&nbsp;= min(''t'', 1)). Such a metric is called [[Paul Pierre Lévy|''Lévy'']]-''metric for'' ''L''<sup>0</sup>. Under this metric the space ''L''<sup>0</sup> is complete (it is again an F-space). The space ''L''<sup>0</sup> is in general not locally bounded, and not locally convex.


A function <math>f</math> is said to be in the space '''weak <math>L^p(S, \mu)</math>''', or <math>L^{p,w}(S, \mu),</math> if there is a constant <math>C > 0</math> such that, for all <math>t > 0,</math>
For the infinite Lebesgue measure ''λ'' on '''R'''<sup>''n''</sup>, the definition of the fundamental system of neighborhoods could be modified as follows
<math display="block">\lambda_f(t) \leq \frac{C^p}{t^p}</math>
:<math>W_\varepsilon = \Bigl\{ f : \lambda \bigl(\{ x : |f(x)| > \varepsilon \ \text{and} \ |x| < 1 / \varepsilon\} \bigr) < \varepsilon \Bigr\}.</math>
The resulting space ''L''<sup>0</sup>('''R'''<sup>''n''</sup>, ''λ'') coincides as topological vector space with ''L''<sup>0</sup>('''R'''<sup>''n''</sup>, ''g''(''x'')&thinsp;d''λ''(x)), for any positive ''λ''&ndash;integrable density ''g''.


The best constant <math>C</math> for this inequality is the <math>L^{p,w}</math>-norm of <math>f,</math> and is denoted by
==Weak ''L<sup>p</sup>''==
<math display="block">\|f\|_{p,w} = \sup_{t > 0} ~ t \lambda_f^{1/p}(t).</math>
Let (''S'', ''Σ'', ''μ'') be a measure space, and ''f'' a [[measurable function]] with real or complex values on ''S''. The [[cumulative distribution function|distribution function]] of ''f'' is defined for ''t''&nbsp;> 0 by


The weak <math>L^p</math> coincide with the [[Lorentz space]]s <math>L^{p,\infty},</math> so this notation is also used to denote them.
:<math>\lambda_f(t) = \mu\left\{x\in S\mid |f(x)| > t\right\}.</math>


The <math>L^{p,w}</math>-norm is not a true norm, since the [[triangle inequality]] fails to hold. Nevertheless, for <math>f</math> in <math>L^p(S, \mu),</math>
If ''f'' is in ''L''<sup>''p''</sup>(''S'', ''μ'') for some ''p'' with 1&nbsp;≤ ''p''&nbsp;< ∞, then by [[Markov's inequality]],
<math display="block">\|f\|_{p,w} \leq \|f\|_p</math>
and in particular <math>L^p(S, \mu) \subset L^{p,w}(S, \mu).</math>


In fact, one has
:<math>\lambda_f(t)\le \frac{\|f\|_p^p}{t^p}.</math>
<math display="block">\|f\|^p_{L^p} = \int |f(x)|^p d\mu(x) \geq \int_{\{|f(x)| > t \}} t^p + \int_{\{|f(x)| \leq t \}} |f|^p \geq t^p \mu(\{|f| > t \}),</math>
and raising to power <math>1/p</math> and taking the supremum in <math>t</math> one has
<math display="block">\|f\|_{L^p} \geq \sup_{t > 0} t \; \mu(\{|f| > t \})^{1/p} = \|f\|_{L^{p,w}}.</math>


Under the convention that two functions are equal if they are equal <math>\mu</math> almost everywhere, then the spaces <math>L^{p,w}</math> are complete {{harv|Grafakos|2004}}.
A function ''f'' is said to be in the space '''weak ''L<sup>p</sup>''(''S'', ''μ'')''', or ''L<sup>p,w</sup>''(''S'', ''μ''), if there is a constant ''C''&nbsp;> 0 such that, for all ''t''&nbsp;> 0,


:<math>\lambda_f(t) \le \frac{C^p}{t^p}.</math>
For any <math>0 < r < p</math> the expression
<math display="block">\|| f |\|_{L^{p,\infty}} = \sup_{0<\mu(E)<\infty} \mu(E)^{-1/r + 1/p} \left(\int_E |f|^r\, d\mu\right)^{1/r}</math>
is comparable to the <math>L^{p,w}</math>-norm. Further in the case <math>p > 1,</math> this expression defines a norm if <math>r = 1.</math> Hence for <math>p > 1</math> the weak <math>L^p</math> spaces are [[Banach space]]s {{harv|Grafakos|2004}}.


A major result that uses the <math>L^{p,w}</math>-spaces is the [[Marcinkiewicz interpolation|Marcinkiewicz interpolation theorem]], which has broad applications to [[harmonic analysis]] and the study of [[singular integrals]].
The best constant ''C'' for this inequality is the ''L<sup>p,w</sup>''-norm of ''f'', and is denoted by
:<math>\|f\|_{p,w} = \inf\left\{C \mid \lambda_f(t) \le \frac{C^p}{t^p}\quad\forall t>0 \right\}.</math>


===Weighted {{math|''L<sup>p</sup>''}} spaces===
The weak ''L''<sup>''p''</sup> coincide with the [[Lorentz space]]s ''L''<sup>''p'',∞</sup>, so this notation is also used to denote them.


As before, consider a [[measure space]] <math>(S, \Sigma, \mu).</math> Let <math>w : S \to [a, \infty), a > 0</math> be a measurable function. The <math>w</math>-'''weighted <math>L^p</math> space''' is defined as <math>L^p(S, w \, \mathrm{d} \mu),</math> where <math>w \, \mathrm{d} \mu</math> means the measure <math>\nu</math> defined by
The ''L<sup>p,w</sup>''-norm is not a true norm, since the [[triangle inequality]] fails to hold. Nevertheless, for ''f'' in ''L''<sup>p</sup>(''S'', ''μ''),
:<math>\|f\|_{p,w}\le \|f\|_p,</math>
<math display="block">\nu(A) \equiv \int_A w(x) \, \mathrm{d} \mu (x), \qquad A \in \Sigma,</math>
and in particular ''L<sup>p</sup>''(''S'', ''μ'')&nbsp;⊂ ''L<sup>p,w</sup>''(''S'', ''μ''). Under the convention that two functions are equal if they are equal ''μ'' almost everywhere, then the spaces ''L''<sup>p,w</sup> are complete {{harv|Grafakos|2004}}.


or, in terms of the [[Radon–Nikodym theorem|Radon–Nikodym derivative]], <math>w = \tfrac{\mathrm{d} \nu}{\mathrm{d} \mu}</math> the [[Norm (mathematics)|norm]] for <math>L^p(S, w \, \mathrm{d} \mu)</math> is explicitly
For any 0&nbsp;&lt; ''r''&nbsp;&lt; ''p'' the expression
:<math>||| f |||_{L^{p,\infty}}=\sup_{0<\mu(E)<\infty} \mu(E)^{-\frac{1}{r}+\frac{1}{p}}\left(\int_E |f|^r\,d\mu\right)^{\frac{1}{r}}</math>
<math display="block">\|u\|_{L^p(S, w \, \mathrm{d} \mu)} \equiv \left(\int_S w(x) |u(x)|^p \, \mathrm{d} \mu(x)\right)^{1/p}</math>
is comparable to the ''L<sup>p,w</sup>''-norm. Further in the case ''p''&nbsp;> 1, this expression defines a norm if ''r''&nbsp;= 1. Hence for ''p''&nbsp;&gt; 1 the weak ''L''<sup>''p''</sup> spaces are [[Banach space]]s {{harv|Grafakos|2004}}.


As <math>L^p</math>-spaces, the weighted spaces have nothing special, since <math>L^p(S, w \, \mathrm{d} \mu)</math> is equal to <math>L^p(S, \mathrm{d} \nu).</math> But they are the natural framework for several results in harmonic analysis {{harv|Grafakos|2004}}<!--Please check this reference. Appears in Grafakos "Modern Fourier analysis", Chapter 9.-->; they appear for example in the [[Muckenhoupt weights|Muckenhoupt theorem]]: for <math>1 < p < \infty,</math> the classical [[Hilbert transform]] is defined on <math>L^p(\mathbf{T}, \lambda)</math> where <math>\mathbf{T}</math> denotes the [[unit circle]] and <math>\lambda</math> the Lebesgue measure; the (nonlinear) [[Hardy–Littlewood maximal operator]] is bounded on <math>L^p(\Reals^n, \lambda).</math> Muckenhoupt's theorem describes weights <math>w</math> such that the Hilbert transform remains bounded on <math>L^p(\mathbf{T}, w \, \mathrm{d} \lambda)</math> and the maximal operator on <math>L^p(\Reals^n, w \, \mathrm{d} \lambda).</math>
A major result that uses the ''L<sup>p,w</sup>''-spaces is the [[Marcinkiewicz interpolation|Marcinkiewicz interpolation theorem]], which has broad applications to [[harmonic analysis]] and the study of [[singular integrals]].


==Weighted ''L<sup>p</sup>'' spaces==
==={{math|''L<sup>p</sup>''}} spaces on manifolds===


One may also define spaces <math>L^p(M)</math> on a manifold, called the '''intrinsic <math>L^p</math> spaces''' of the manifold, using [[Density on a manifold|densities]].
As before, consider a [[measure space]] (''S'', ''Σ'', ''μ''). Let <math>w : S \to [0, + \infty)</math> be a measurable function. The ''w''-'''weighted ''L<sup>p</sup>'' space''' is defined as ''L<sup>p</sup>''(''S'', ''w''&thinsp;d''μ''), where ''w''&thinsp;d''μ'' means the measure ''ν'' defined by


===Vector-valued {{math|''L<sup>p</sup>''}} spaces===
:<math>\ \nu (A) := \int_{A} w(x) \, \mathrm{d} \mu (x), \ \ \ A \in \Sigma, </math>


Given a measure space <math>(\Omega, \Sigma, \mu)</math> and a [[Locally convex topological vector space|locally convex space]] <math>E</math> (here assumed to be [[Complete topological vector space|complete]]), it is possible to define spaces of <math>p</math>-integrable <math>E</math>-valued functions on <math>\Omega</math> in a number of ways. One way is to define the spaces of [[Bochner integral|Bochner integrable]] and [[Pettis integral|Pettis integrable]] functions, and then endow them with [[Locally convex topological vector space|locally convex]] [[Vector topology|TVS-topologies]] that are (each in their own way) a natural generalization of the usual <math>L^p</math> topology. Another way involves [[topological tensor product]]s of <math>L^p(\Omega, \Sigma, \mu)</math> with <math>E.</math> Element of the vector space <math>L^p(\Omega, \Sigma, \mu) \otimes E</math> are finite sums of simple tensors <math>f_1 \otimes e_1 + \cdots + f_n \otimes e_n,</math> where each simple tensor <math>f \times e</math> may be identified with the function <math>\Omega \to E</math> that sends <math>x \mapsto e f(x).</math> This [[tensor product]] <math>L^p(\Omega, \Sigma, \mu) \otimes E</math> is then endowed with a locally convex topology that turns it into a [[topological tensor product]], the most common of which are the [[projective tensor product]], denoted by <math>L^p(\Omega, \Sigma, \mu) \otimes_\pi E,</math> and the [[injective tensor product]], denoted by <math>L^p(\Omega, \Sigma, \mu) \otimes_\varepsilon E.</math> In general, neither of these space are complete so their [[Complete topological vector space|completions]] are constructed, which are respectively denoted by <math>L^p(\Omega, \Sigma, \mu) \widehat{\otimes}_\pi E</math> and <math>L^p(\Omega, \Sigma, \mu) \widehat{\otimes}_\varepsilon E</math> (this is analogous to how the space of scalar-valued [[simple function]]s on <math>\Omega,</math> when seminormed by any <math>\|\cdot\|_p,</math> is not complete so a completion is constructed which, after being quotiented by <math>\ker \|\cdot\|_p,</math> is isometrically isomorphic to the Banach space <math>L^p(\Omega, \mu)</math>). [[Alexander Grothendieck]] showed that when <math>E</math> is a [[nuclear space]] (a concept he introduced), then these two constructions are, respectively, canonically TVS-isomorphic with the spaces of Bochner and Pettis integral functions mentioned earlier; in short, they are indistinguishable.
or, in terms of the [[Radon–Nikodym theorem|Radon–Nikodym derivative]],


==={{math|''L''<sup>0</sup>}} space of measurable functions===
:<math>\ w = \frac{\mathrm{d} \nu}{\mathrm{d} \mu}.</math>


The vector space of ([[equivalence class]]es of) measurable functions on <math>(S, \Sigma, \mu)</math> is denoted <math>L^0(S, \Sigma, \mu)</math> {{harv|Kalton|Peck|Roberts|1984}}. By definition, it contains all the <math>L^p,</math> and is equipped with the topology of ''[[convergence in measure]]''. When <math>\mu</math> is a probability measure (i.e., <math>\mu(S) = 1</math>), this mode of convergence is named ''[[convergence in probability]]''. The space <math>L^0</math> is always a [[topological abelian group]] but is only a [[topological vector space]] if <math>\mu(S)<\infty.</math> This is because scalar multiplication is continuous if and only if <math>\mu(S)<\infty.</math> If <math>(S,\Sigma,\mu)</math> is <math>\sigma</math>-finite then the [[weaker topology]] of [[local convergence in measure]] is an [[F-space]], i.e. a [[Complete topological vector space|completely]] [[metrizable topological vector space]]. Moreover, this topology is isometric to global convergence in measure <math>(S,\Sigma,\nu)</math> for a suitable choice of [[probability measure]] <math>\nu.</math>
The [[norm (mathematics)|norm]] for ''L<sup>p</sup>''(''S'', ''w''&thinsp;d''μ'') is explicitly


The description is easier when <math>\mu</math> is finite. If <math>\mu</math> is a [[finite measure]] on <math>(S, \Sigma),</math> the <math>0</math> function admits for the convergence in measure the following [[fundamental system of neighborhoods]]
:<math>\ \| u \|_{L^{p} (S, w \, \mathrm{d} \mu)} := \left( \int_{S} w(x) | u(x) |^{p} \, \mathrm{d} \mu (x) \right)^{1/p}.</math>
<math display="block">V_\varepsilon = \Bigl\{f : \mu \bigl(\{x : |f(x)| > \varepsilon\} \bigr) < \varepsilon \Bigr\}, \qquad \varepsilon > 0.</math>


The topology can be defined by any metric <math>d</math> of the form
As ''L''<sup>''p''</sub>-spaces, the weighted spaces have nothing special, since ''L<sup>p</sup>''(''S'', ''w''&thinsp;d''μ'') is equal to ''L''<sup>''p''</sup>(''S'',&nbsp;d''ν''). But they are the natural framework for several results in harmonic analysis {{harv|Grafakos|2004}}<!--Please check this reference. Appears in Grafakos "Modern Fourier analysis", Chapter 9.-->; they appear for example in the [[Muckenhoupt weights|Muckenhoupt theorem]]: for 1&nbsp;< ''p''&nbsp;< ∞, the classical [[Hilbert transform]] is defined on ''L''<sup>''p''</sub>('''T''',&nbsp;''λ'') where '''T''' denotes the unit circle and ''λ'' the Lebesgue measure; the (nonlinear) [[Hardy–Littlewood maximal operator]] is bounded on ''L''<sup>''p''</sub>('''R'''<sup>''n''</sup>,&nbsp;''λ''). Muckenhoupt's theorem describes weights ''w'' such that the Hilbert transform remains bounded on ''L<sup>p</sup>''('''T''', ''w''&thinsp;d''λ'') and the maximal operator on ''L<sup>p</sup>''('''R'''<sup>''n''</sup>, ''w''&thinsp;d''λ'').
<math display="block">d(f, g) = \int_S \varphi \bigl(|f(x) - g(x)|\bigr)\, \mathrm{d}\mu(x)</math>
where <math>\varphi</math> is bounded continuous concave and non-decreasing on <math>[0, \infty),</math> with <math>\varphi(0) = 0</math> and <math>\varphi(t) > 0</math> when <math>t > 0</math> (for example, <math>\varphi(t) = \min(t, 1).</math> Such a metric is called [[Paul Lévy (mathematician)|Lévy]]-metric for <math>L^0.</math> Under this metric the space <math>L^0</math> is complete. However, as mentioned above, scalar multiplication is continuous with respect to this metric only if <math>\mu(S)<\infty</math>. To see this, consider the Lebesgue measurable function <math>f:\mathbb R\rightarrow \mathbb R</math> defined by <math>f(x)=x</math>. Then clearly <math>\lim_{c\rightarrow 0}d(cf,0)=\infty</math>. The space <math>L^0</math> is in general not locally bounded, and not locally convex.


For the infinite Lebesgue measure <math>\lambda</math> on <math>\Reals^n,</math> the definition of the fundamental system of neighborhoods could be modified as follows
==''L<sup>p</sup>'' spaces on manifolds==
<math display="block">W_\varepsilon = \left\{f : \lambda \left(\left\{x : |f(x)| > \varepsilon \text{ and } |x| < \tfrac{1}{\varepsilon}\right\}\right) < \varepsilon\right\}</math>
One may also define spaces <math>L^p(M)</math> on a manifold, called the '''intrinsic ''L<sup>p</sup>'' spaces''' of the manifold, using [[Density on a manifold|densities]].

The resulting space <math>L^0(\Reals^n, \lambda)</math>, with the topology of local convergence in measure, is isomorphic to the space <math>L^0(\Reals^n, g \, \lambda),</math> for any positive <math>\lambda</math>&ndash;integrable density <math>g.</math>


==See also==
==See also==
{{Div col|colwidth=30em}}
* [[Birnbaum–Orlicz space]]
* {{annotated link|Absolutely integrable function}}
* [[Hardy space]]
* {{annotated link|Bochner space}}
* [[Hölder mean]]
* [[Hölder space]]
* {{annotated link|Orlicz space}}
* {{annotated link|Hardy space}}
* [[Root mean square]]
* {{annotated link|Riesz–Thorin theorem}}
* [[Locally integrable function]] (<math>\scriptstyle L^1_{\text{loc}}</math>)
* {{annotated link|Hölder mean}}
* [[Pontryagin_duality#Haar_measure|<math>L^p(G)</math> spaces over a locally compact group <math>G</math>]]
* {{annotated link|Hölder space}}
* {{annotated link|Root mean square}}
* {{annotated link|Least absolute deviations}}
* {{annotated link|Locally integrable function}} <math>\left( L^1_{\text{loc}}\right)</math>
* {{annotated link|Pontryagin duality#Haar measure|<math> L^p(G)</math> spaces over a locally compact group <math>G</math>}}
* {{annotated link|Least-squares spectral analysis}}
* {{annotated link|List of Banach spaces}}
* {{annotated link|Minkowski distance}}
* {{annotated link|L-infinity}}
* {{annotated link|Lp sum|''L<sup>p</sup>'' sum}}
* {{annotated link|Cm sum|''C<sup>m</sup>'' space}}

{{div col end}}


==Notes==
==Notes==

<references/>
{{reflist}}
{{reflist|group=note}}

{{reflist|group=proof}}


==References==
==References==
{{sfn whitelist|CITEREFBahouriCheminDanchin2011}}
* {{citation|last1=Adams|first1=Robert A.|last2=Fournier|first2=John F.|title=Sobolev Spaces|edition=Second|publisher=Academic Press|year=2003|isbn=978-0120441433}}.
* {{citation|first=Nicolas|last=Bourbaki|authorlink=Nicolas Bourbaki|title=Topological vector spaces|series=Elements of mathematics|publisher= Springer-Verlag|publication-place=Berlin|year=1987|isbn=978-3540136279}}.
* {{citation | last1=Adams | first1=Robert A. | last2=Fournier | first2=John F. | title=Sobolev Spaces | edition=Second | publisher=Academic Press | year=2003 | isbn=978-0-12-044143-3}}.
* {{Bahouri Chemin Danchin Fourier Analysis and Nonlinear Partial Differential Equations 2011}} <!--{{sfn|Bahouri|Chemin|Danchin|2011|p=}}-->
* {{citation | last=DiBenedetto|first=Emmanuele|title=Real analysis|publisher=Birkhäuser|year=2002|isbn=3-7643-4231-5}}.
* {{citation|last1=Dunford|first1=Nelson|last2=Schwartz|first2=Jacob T.|title=Linear operators, volume I|publisher=Wiley-Interscience|year=1958}}.
* {{citation | last=Bourbaki | first=Nicolas | author-link=Nicolas Bourbaki | title=Topological vector spaces|series=Elements of mathematics|publisher= Springer-Verlag | location=Berlin | year=1987 | isbn=978-3-540-13627-9}}.
* {{citation | last=DiBenedetto | first=Emmanuele | title=Real analysis | publisher=Birkhäuser | year=2002 | isbn=3-7643-4231-5}}.
*{{citation
* {{citation | last1=Dunford | first1=Nelson | last2=Schwartz | first2=Jacob T. | title=Linear operators, volume I | publisher=Wiley-Interscience | year=1958}}.
|last= Duren|first=P.|title=Theory of H<sup>p</sup>-Spaces|year=1970|publisher= Academic Press|publication-place= New York}}
* {{citation|title=Classical and Modern Fourier Analysis | last=Grafakos | first=Loukas | publisher=Pearson Education, Inc. | pages=253&ndash;257 | year=2004 | isbn=0-13-035399-X}}.
* {{citation | last= Duren | first=P. | title=Theory of H<sup>p</sup>-Spaces | year=1970 | publisher= Academic Press | location= New York}}
* {{citation|last1=Hewitt|first1=Edwin|last2=Stromberg|first2=Karl|title=Real and abstract analysis|publisher=Springer-Verlag|year=1965}}.
* {{citation | last=Grafakos | first=Loukas | authorlink=Loukas Grafakos | title=Classical and Modern Fourier Analysis | publisher=Pearson Education, Inc. | pages=253&ndash;257 | year=2004 | isbn=0-13-035399-X}}.
* {{citation | last1=Hewitt | first1=Edwin | last2=Stromberg | first2=Karl | title=Real and abstract analysis | publisher=Springer-Verlag | year=1965}}.
* {{citation
* {{citation | last1=Kalton | first1=Nigel J. | author-link=Nigel Kalton | last2=Peck | first2=N. Tenney | last3=Roberts | first3=James W. | title = An F-space sampler | series = London Mathematical Society Lecture Note Series | volume=89 | publisher = Cambridge University Press| location = Cambridge | year = 1984 | isbn = 0-521-27585-7 | mr=808777 | doi=10.1017/CBO9780511662447}}
|last1=Kalton|first1=Nigel J.
* {{citation | last=Riesz | first=Frigyes | author-link=Frigyes Riesz | title=Untersuchungen über Systeme integrierbarer Funktionen | journal=Mathematische Annalen | volume=69 | year=1910 | pages=449–497 | doi=10.1007/BF01457637 | issue=4| s2cid=120242933 | url=https://zenodo.org/record/2456593 }}
|last2=Peck|first2=N. Tenney
* {{Rudin Walter Functional Analysis|edition=2}} <!--{{sfn|Rudin|1991|p=}}-->
|last3=Roberts|first3=James W.
* {{Citation | last1=Rudin | first1=Walter | author1-link=Walter Rudin | title=Real and complex analysis | publisher=[[McGraw-Hill]] | location=New York | edition=3rd | isbn=978-0-07-054234-1 | mr=924157 | year=1987}}
| title = An F-space sampler
* {{cite book | last=Stein | first=Elias M. | last2=Shakarchi | first2=Rami | title=Functional Analysis: Introduction to Further Topics in Analysis | publisher=Princeton University Press | date=2012 | isbn=978-1-4008-4055-7 | doi=10.1515/9781400840557}}
| series = London Mathematical Society Lecture Note Series, 89
* {{citation | last=Titchmarsh | first=EC | author-link=Edward Charles Titchmarsh | title=The theory of functions | publisher=Oxford University Press | year=1976 | isbn=978-0-19-853349-8}}
| publisher = Cambridge University Press| publication-place = Cambridge
| year = 1984| pages = xii+240| isbn = 0-521-27585-7}} {{MathSciNet|id=0808777}}
* {{citation
|last=Riesz|first=Frigyes|authorlink=Frigyes Riesz
|title=Untersuchungen über Systeme integrierbarer Funktionen|journal=Mathematische Annalen|volume=69|year=1910|pages=449–497
|doi=10.1007/BF01457637
|issue=4}}
* {{Citation | last1=Rudin | first1=Walter | author1-link=Walter Rudin | title=Functional Analysis | publisher=McGraw-Hill Science/Engineering/Math | isbn=978-0-07-054236-5 | year=1991}}
* {{Citation | last1=Rudin | first1=Walter | author1-link=Walter Rudin | title=Real and complex analysis | publisher=[[McGraw-Hill]] | location=New York | edition=3rd | isbn=978-0-07-054234-1 | id={{MathSciNet | id = 924157}} | year=1987}}
* {{citation|first=EC|last=Titchmarsh|authorlink=Edward Charles Titchmarsh|title=The theory of functions|publisher=Oxford University Press|year=1976|isbn=9780198533498}}


==External links==
==External links==

* {{planetmath reference|id=6270|title=Proof that ''L''<sup>''p''</sup> spaces are complete }}
* {{springer|title=Lebesgue space|id=p/l057910}}
* [http://planetmath.org/ProofThatLpSpacesAreComplete Proof that ''L''<sup>''p''</sup> spaces are complete]

{{Lp spaces}}
{{Measure theory}}
{{Banach spaces}}
{{Functional analysis}}


{{DEFAULTSORT:Lp Space}}
{{DEFAULTSORT:Lp Space}}

[[Category:Normed spaces]]
[[Category:Banach spaces]]
[[Category:Banach spaces]]
[[Category:Function spaces]]
[[Category:Mathematical series]]
[[Category:Mathematical series]]
[[Category:Measure theory]]

[[ca:Espai Lp]]
[[Category:Normed spaces]]
[[cs:Lp prostor]]
[[Category:Lp spaces]]
[[da:Lp (matematik)]]
[[de:Lp-Raum]]
[[es:Espacios Lp]]
[[fr:Espace Lp]]
[[it:Spazio Lp]]
[[he:מרחב Lp]]
[[lt:Lebego erdvė]]
[[nl:Lp-ruimte]]
[[pms:Spassi Lp]]
[[pl:Przestrzeń Lp]]
[[pt:Espaço Lp]]
[[ru:Lp (пространство)]]
[[fi:Lp-avaruus]]
[[sv:Lp-rum]]
[[uk:Простір Lp]]
[[zh:Lp空间]]

Latest revision as of 05:49, 12 December 2024

In mathematics, the Lp spaces are function spaces defined using a natural generalization of the p-norm for finite-dimensional vector spaces. They are sometimes called Lebesgue spaces, named after Henri Lebesgue (Dunford & Schwartz 1958, III.3), although according to the Bourbaki group (Bourbaki 1987) they were first introduced by Frigyes Riesz (Riesz 1910).

Lp spaces form an important class of Banach spaces in functional analysis, and of topological vector spaces. Because of their key role in the mathematical analysis of measure and probability spaces, Lebesgue spaces are used also in the theoretical discussion of problems in physics, statistics, economics, finance, engineering, and other disciplines.

Preliminaries

[edit]

The p-norm in finite dimensions

[edit]
Illustrations of unit circles (see also superellipse) in based on different -norms (every vector from the origin to the unit circle has a length of one, the length being calculated with length-formula of the corresponding ).

The Euclidean length of a vector in the -dimensional real vector space is given by the Euclidean norm:

The Euclidean distance between two points and is the length of the straight line between the two points. In many situations, the Euclidean distance is appropriate for capturing the actual distances in a given space. In contrast, consider taxi drivers in a grid street plan who should measure distance not in terms of the length of the straight line to their destination, but in terms of the rectilinear distance, which takes into account that streets are either orthogonal or parallel to each other. The class of -norms generalizes these two examples and has an abundance of applications in many parts of mathematics, physics, and computer science.

For a real number the -norm or -norm of is defined by The absolute value bars can be dropped when is a rational number with an even numerator in its reduced form, and is drawn from the set of real numbers, or one of its subsets.

The Euclidean norm from above falls into this class and is the -norm, and the -norm is the norm that corresponds to the rectilinear distance.

The -norm or maximum norm (or uniform norm) is the limit of the -norms for , given by:

For all the -norms and maximum norm satisfy the properties of a "length function" (or norm), that is:

  • only the zero vector has zero length,
  • the length of the vector is positive homogeneous with respect to multiplication by a scalar (positive homogeneity), and
  • the length of the sum of two vectors is no larger than the sum of lengths of the vectors (triangle inequality).

Abstractly speaking, this means that together with the -norm is a normed vector space. Moreover, it turns out that this space is complete, thus making it a Banach space.

Relations between p-norms

[edit]

The grid distance or rectilinear distance (sometimes called the "Manhattan distance") between two points is never shorter than the length of the line segment between them (the Euclidean or "as the crow flies" distance). Formally, this means that the Euclidean norm of any vector is bounded by its 1-norm:

This fact generalizes to -norms in that the -norm of any given vector does not grow with :

for any vector and real numbers and (In fact this remains true for and .)

For the opposite direction, the following relation between the -norm and the -norm is known:

This inequality depends on the dimension of the underlying vector space and follows directly from the Cauchy–Schwarz inequality.

In general, for vectors in where

This is a consequence of Hölder's inequality.

When 0 < p < 1

[edit]
Astroid, unit circle in metric

In for the formula defines an absolutely homogeneous function for however, the resulting function does not define a norm, because it is not subadditive. On the other hand, the formula defines a subadditive function at the cost of losing absolute homogeneity. It does define an F-norm, though, which is homogeneous of degree

Hence, the function defines a metric. The metric space is denoted by

Although the -unit ball around the origin in this metric is "concave", the topology defined on by the metric is the usual vector space topology of hence is a locally convex topological vector space. Beyond this qualitative statement, a quantitative way to measure the lack of convexity of is to denote by the smallest constant such that the scalar multiple of the -unit ball contains the convex hull of which is equal to The fact that for fixed we have shows that the infinite-dimensional sequence space defined below, is no longer locally convex.[citation needed]

When p = 0

[edit]

There is one norm and another function called the "norm" (with quotation marks).

The mathematical definition of the norm was established by Banach's Theory of Linear Operations. The space of sequences has a complete metric topology provided by the F-norm on the product metric:[citation needed] The -normed space is studied in functional analysis, probability theory, and harmonic analysis.

Another function was called the "norm" by David Donoho—whose quotation marks warn that this function is not a proper norm—is the number of non-zero entries of the vector [citation needed] Many authors abuse terminology by omitting the quotation marks. Defining the zero "norm" of is equal to

An animated gif of p-norms 0.1 through 2 with a step of 0.05.
An animated gif of p-norms 0.1 through 2 with a step of 0.05.

This is not a norm because it is not homogeneous. For example, scaling the vector by a positive constant does not change the "norm". Despite these defects as a mathematical norm, the non-zero counting "norm" has uses in scientific computing, information theory, and statistics–notably in compressed sensing in signal processing and computational harmonic analysis. Despite not being a norm, the associated metric, known as Hamming distance, is a valid distance, since homogeneity is not required for distances.

p spaces and sequence spaces

[edit]

The -norm can be extended to vectors that have an infinite number of components (sequences), which yields the space This contains as special cases:

  • the space of sequences whose series are absolutely convergent,
  • the space of square-summable sequences, which is a Hilbert space, and
  • the space of bounded sequences.

The space of sequences has a natural vector space structure by applying scalar addition and multiplication. Explicitly, the vector sum and the scalar action for infinite sequences of real (or complex) numbers are given by:

Define the -norm:

Here, a complication arises, namely that the series on the right is not always convergent, so for example, the sequence made up of only ones, will have an infinite -norm for The space is then defined as the set of all infinite sequences of real (or complex) numbers such that the -norm is finite.

One can check that as increases, the set grows larger. For example, the sequence is not in but it is in for as the series diverges for (the harmonic series), but is convergent for

One also defines the -norm using the supremum: and the corresponding space of all bounded sequences. It turns out that[1] if the right-hand side is finite, or the left-hand side is infinite. Thus, we will consider spaces for

The -norm thus defined on is indeed a norm, and together with this norm is a Banach space.

General ℓp-space

[edit]

In complete analogy to the preceding definition one can define the space over a general index set (and ) as where convergence on the right means that only countably many summands are nonzero (see also Unconditional convergence). With the norm the space becomes a Banach space. In the case where is finite with elements, this construction yields with the -norm defined above. If is countably infinite, this is exactly the sequence space defined above. For uncountable sets this is a non-separable Banach space which can be seen as the locally convex direct limit of -sequence spaces.[2]

For the -norm is even induced by a canonical inner product called the Euclidean inner product, which means that holds for all vectors This inner product can expressed in terms of the norm by using the polarization identity. On it can be defined by Now consider the case Define[note 1] where for all [3][note 2]

The index set can be turned into a measure space by giving it the discrete σ-algebra and the counting measure. Then the space is just a special case of the more general -space (defined below).

Lp spaces and Lebesgue integrals

[edit]

An space may be defined as a space of measurable functions for which the -th power of the absolute value is Lebesgue integrable, where functions which agree almost everywhere are identified. More generally, let be a measure space and [note 3] When , consider the set of all measurable functions from to or whose absolute value raised to the -th power has a finite integral, or in symbols:[4]

To define the set for recall that two functions and defined on are said to be equal almost everywhere, written a.e., if the set is measurable and has measure zero. Similarly, a measurable function (and its absolute value) is bounded (or dominated) almost everywhere by a real number written a.e., if the (necessarily) measurable set has measure zero. The space is the set of all measurable functions that are bounded almost everywhere (by some real ) and is defined as the infimum of these bounds: When then this is the same as the essential supremum of the absolute value of :[note 4]

For example, if is a measurable function that is equal to almost everywhere[note 5] then for every and thus for all

For every positive the value under of a measurable function and its absolute value are always the same (that is, for all ) and so a measurable function belongs to if and only if its absolute value does. Because of this, many formulas involving -norms are stated only for non-negative real-valued functions. Consider for example the identity which holds whenever is measurable, is real, and (here when ). The non-negativity requirement can be removed by substituting in for which gives Note in particular that when is finite then the formula relates the -norm to the -norm.

Seminormed space of -th power integrable functions

Each set of functions forms a vector space when addition and scalar multiplication are defined pointwise.[note 6] That the sum of two -th power integrable functions and is again -th power integrable follows from [proof 1] although it is also a consequence of Minkowski's inequality which establishes that satisfies the triangle inequality for (the triangle inequality does not hold for ). That is closed under scalar multiplication is due to being absolutely homogeneous, which means that for every scalar and every function

Absolute homogeneity, the triangle inequality, and non-negativity are the defining properties of a seminorm. Thus is a seminorm and the set of -th power integrable functions together with the function defines a seminormed vector space. In general, the seminorm is not a norm because there might exist measurable functions that satisfy but are not identically equal to [note 5] ( is a norm if and only if no such exists).

Zero sets of -seminorms

If is measurable and equals a.e. then for all positive On the other hand, if is a measurable function for which there exists some such that then almost everywhere. When is finite then this follows from the case and the formula mentioned above.

Thus if is positive and is any measurable function, then if and only if almost everywhere. Since the right hand side ( a.e.) does not mention it follows that all have the same zero set (it does not depend on ). So denote this common set by This set is a vector subspace of for every positive

Quotient vector space

Like every seminorm, the seminorm induces a norm (defined shortly) on the canonical quotient vector space of by its vector subspace This normed quotient space is called Lebesgue space and it is the subject of this article. We begin by defining the quotient vector space.

Given any the coset consists of all measurable functions that are equal to almost everywhere. The set of all cosets, typically denoted by forms a vector space with origin when vector addition and scalar multiplication are defined by and This particular quotient vector space will be denoted by Two cosets are equal if and only if (or equivalently, ), which happens if and only if almost everywhere; if this is the case then and are identified in the quotient space. Hence, strictly speaking consists of equivalence classes of functions.[5][6]

Given any the value of the seminorm on the coset is constant and equal to , that is: The map is a norm on called the -norm. The value of a coset is independent of the particular function that was chosen to represent the coset, meaning that if is any coset then for every (since for every ).

The Lebesgue space

The normed vector space is called space or the Lebesgue space of -th power integrable functions and it is a Banach space for every (meaning that it is a complete metric space, a result that is sometimes called the Riesz–Fischer theorem). When the underlying measure space is understood then is often abbreviated or even just Depending on the author, the subscript notation might denote either or

If the seminorm on happens to be a norm (which happens if and only if ) then the normed space will be linearly isometrically isomorphic to the normed quotient space via the canonical map (since ); in other words, they will be, up to a linear isometry, the same normed space and so they may both be called " space".

The above definitions generalize to Bochner spaces.

In general, this process cannot be reversed: there is no consistent way to define a "canonical" representative of each coset of in For however, there is a theory of lifts enabling such recovery.

Special cases

[edit]

For the spaces are a special case of spaces; when are the natural numbers and is the counting measure. More generally, if one considers any set with the counting measure, the resulting space is denoted For example, is the space of all sequences indexed by the integers, and when defining the -norm on such a space, one sums over all the integers. The space where is the set with elements, is with its -norm as defined above.

Similar to spaces, is the only Hilbert space among spaces. In the complex case, the inner product on is defined by Functions in are sometimes called square-integrable functions, quadratically integrable functions or square-summable functions, but sometimes these terms are reserved for functions that are square-integrable in some other sense, such as in the sense of a Riemann integral (Titchmarsh 1976).

As any Hilbert space, every space is linearly isometric to a suitable where the cardinality of the set is the cardinality of an arbitrary basis for this particular

If we use complex-valued functions, the space is a commutative C*-algebra with pointwise multiplication and conjugation. For many measure spaces, including all sigma-finite ones, it is in fact a commutative von Neumann algebra. An element of defines a bounded operator on any space by multiplication.

When (0 < p < 1)

[edit]

If then can be defined as above, that is: In this case, however, the -norm does not satisfy the triangle inequality and defines only a quasi-norm. The inequality valid for implies that and so the function is a metric on The resulting metric space is complete.[7]

In this setting satisfies a reverse Minkowski inequality, that is for

This result may be used to prove Clarkson's inequalities, which are in turn used to establish the uniform convexity of the spaces for (Adams & Fournier 2003).

The space for is an F-space: it admits a complete translation-invariant metric with respect to which the vector space operations are continuous. It is the prototypical example of an F-space that, for most reasonable measure spaces, is not locally convex: in or every open convex set containing the function is unbounded for the -quasi-norm; therefore, the vector does not possess a fundamental system of convex neighborhoods. Specifically, this is true if the measure space contains an infinite family of disjoint measurable sets of finite positive measure.

The only nonempty convex open set in is the entire space. Consequently, there are no nonzero continuous linear functionals on the continuous dual space is the zero space. In the case of the counting measure on the natural numbers (i.e. ), the bounded linear functionals on are exactly those that are bounded on , i.e., those given by sequences in Although does contain non-trivial convex open sets, it fails to have enough of them to give a base for the topology.

Having no linear functionals is highly undesirable for the purposes of doing analysis. In case of the Lebesgue measure on rather than work with for it is common to work with the Hardy space Hp whenever possible, as this has quite a few linear functionals: enough to distinguish points from one another. However, the Hahn–Banach theorem still fails in Hp for (Duren 1970, §7.5).

Properties

[edit]

Hölder's inequality

[edit]

Suppose satisfy . If and then and[8]

This inequality, called Hölder's inequality, is in some sense optimal since if and is a measurable function such that where the supremum is taken over the closed unit ball of then and

Atomic decomposition

[edit]

If then every non-negative has an atomic decomposition,[9] meaning that there exist a sequence of non-negative real numbers and a sequence of non-negative functions called the atoms, whose supports are pairwise disjoint sets of measure such that and for every integer and and where moreover, the sequence of functions depends only on (it is independent of ). These inequalities guarantee that for all integers while the supports of being pairwise disjoint implies

Dual spaces

[edit]

The dual space of for has a natural isomorphism with where is such that . This isomorphism associates with the functional defined by for every

is a well defined continuous linear mapping which is an isometry by the extremal case of Hölder's inequality. If is a -finite measure space one can use the Radon–Nikodym theorem to show that any can be expressed this way, i.e., is an isometric isomorphism of Banach spaces.[10] Hence, it is usual to say simply that is the continuous dual space of

For the space is reflexive. Let be as above and let be the corresponding linear isometry. Consider the map from to obtained by composing with the transpose (or adjoint) of the inverse of

This map coincides with the canonical embedding of into its bidual. Moreover, the map is onto, as composition of two onto isometries, and this proves reflexivity.

If the measure on is sigma-finite, then the dual of is isometrically isomorphic to (more precisely, the map corresponding to is an isometry from onto

The dual of is subtler. Elements of can be identified with bounded signed finitely additive measures on that are absolutely continuous with respect to See ba space for more details. If we assume the axiom of choice, this space is much bigger than except in some trivial cases. However, Saharon Shelah proved that there are relatively consistent extensions of Zermelo–Fraenkel set theory (ZF + DC + "Every subset of the real numbers has the Baire property") in which the dual of is [11]

Embeddings

[edit]

Colloquially, if then contains functions that are more locally singular, while elements of can be more spread out. Consider the Lebesgue measure on the half line A continuous function in might blow up near but must decay sufficiently fast toward infinity. On the other hand, continuous functions in need not decay at all but no blow-up is allowed. More formally, suppose that , then:[12]

  1. if and only if does not contain sets of finite but arbitrarily large measure (e.g. any finite measure).
  2. if and only if does not contain sets of non-zero but arbitrarily small measure (e.g. the counting measure).

Neither condition holds for the Lebesgue measure on the real line while both conditions holds for the counting measure on any finite set. As a consequence of the closed graph theorem, the embedding is continuous, i.e., the identity operator is a bounded linear map from to in the first case and to in the second. Indeed, if the domain has finite measure, one can make the following explicit calculation using Hölder's inequality leading to

The constant appearing in the above inequality is optimal, in the sense that the operator norm of the identity is precisely the case of equality being achieved exactly when -almost-everywhere.

Dense subspaces

[edit]

Let and be a measure space and consider an integrable simple function on given by where are scalars, has finite measure and is the indicator function of the set for By construction of the integral, the vector space of integrable simple functions is dense in

More can be said when is a normal topological space and its Borel 𝜎–algebra.

Suppose is an open set with Then for every Borel set contained in there exist a closed set and an open set such that for every . Subsequently, there exists a Urysohn function on that is on and on with

If can be covered by an increasing sequence of open sets that have finite measure, then the space of –integrable continuous functions is dense in More precisely, one can use bounded continuous functions that vanish outside one of the open sets

This applies in particular when and when is the Lebesgue measure. For example, the space of continuous and compactly supported functions as well as the space of integrable step functions are dense in .

Closed subspaces

[edit]

Suppose . If is a probability space and is a closed subspace of then is finite-dimensional.[13] It is crucial that the vector space be a subset of since it is possible to construct an infinite-dimensional closed vector subspace of which lies in ; taking the Lebesgue measure on the circle group divided by as the probability measure.

Applications

[edit]

Statistics

[edit]

In statistics, measures of central tendency and statistical dispersion, such as the mean, median, and standard deviation, can be defined in terms of metrics, and measures of central tendency can be characterized as solutions to variational problems.

In penalized regression, "L1 penalty" and "L2 penalty" refer to penalizing either the norm of a solution's vector of parameter values (i.e. the sum of its absolute values), or its squared norm (its Euclidean length). Techniques which use an L1 penalty, like LASSO, encourage sparse solutions (where the many parameters are zero).[14] Elastic net regularization uses a penalty term that is a combination of the norm and the squared norm of the parameter vector.

Hausdorff–Young inequality

[edit]

The Fourier transform for the real line (or, for periodic functions, see Fourier series), maps to (or to ) respectively, where and This is a consequence of the Riesz–Thorin interpolation theorem, and is made precise with the Hausdorff–Young inequality.

By contrast, if the Fourier transform does not map into

Hilbert spaces

[edit]

Hilbert spaces are central to many applications, from quantum mechanics to stochastic calculus. The spaces and are both Hilbert spaces. In fact, by choosing a Hilbert basis i.e., a maximal orthonormal subset of or any Hilbert space, one sees that every Hilbert space is isometrically isomorphic to (same as above), i.e., a Hilbert space of type

Generalizations and extensions

[edit]

Weak Lp

[edit]

Let be a measure space, and a measurable function with real or complex values on The distribution function of is defined for by

If is in for some with then by Markov's inequality,

A function is said to be in the space weak , or if there is a constant such that, for all

The best constant for this inequality is the -norm of and is denoted by

The weak coincide with the Lorentz spaces so this notation is also used to denote them.

The -norm is not a true norm, since the triangle inequality fails to hold. Nevertheless, for in and in particular

In fact, one has and raising to power and taking the supremum in one has

Under the convention that two functions are equal if they are equal almost everywhere, then the spaces are complete (Grafakos 2004).

For any the expression is comparable to the -norm. Further in the case this expression defines a norm if Hence for the weak spaces are Banach spaces (Grafakos 2004).

A major result that uses the -spaces is the Marcinkiewicz interpolation theorem, which has broad applications to harmonic analysis and the study of singular integrals.

Weighted Lp spaces

[edit]

As before, consider a measure space Let be a measurable function. The -weighted space is defined as where means the measure defined by

or, in terms of the Radon–Nikodym derivative, the norm for is explicitly

As -spaces, the weighted spaces have nothing special, since is equal to But they are the natural framework for several results in harmonic analysis (Grafakos 2004); they appear for example in the Muckenhoupt theorem: for the classical Hilbert transform is defined on where denotes the unit circle and the Lebesgue measure; the (nonlinear) Hardy–Littlewood maximal operator is bounded on Muckenhoupt's theorem describes weights such that the Hilbert transform remains bounded on and the maximal operator on

Lp spaces on manifolds

[edit]

One may also define spaces on a manifold, called the intrinsic spaces of the manifold, using densities.

Vector-valued Lp spaces

[edit]

Given a measure space and a locally convex space (here assumed to be complete), it is possible to define spaces of -integrable -valued functions on in a number of ways. One way is to define the spaces of Bochner integrable and Pettis integrable functions, and then endow them with locally convex TVS-topologies that are (each in their own way) a natural generalization of the usual topology. Another way involves topological tensor products of with Element of the vector space are finite sums of simple tensors where each simple tensor may be identified with the function that sends This tensor product is then endowed with a locally convex topology that turns it into a topological tensor product, the most common of which are the projective tensor product, denoted by and the injective tensor product, denoted by In general, neither of these space are complete so their completions are constructed, which are respectively denoted by and (this is analogous to how the space of scalar-valued simple functions on when seminormed by any is not complete so a completion is constructed which, after being quotiented by is isometrically isomorphic to the Banach space ). Alexander Grothendieck showed that when is a nuclear space (a concept he introduced), then these two constructions are, respectively, canonically TVS-isomorphic with the spaces of Bochner and Pettis integral functions mentioned earlier; in short, they are indistinguishable.

L0 space of measurable functions

[edit]

The vector space of (equivalence classes of) measurable functions on is denoted (Kalton, Peck & Roberts 1984). By definition, it contains all the and is equipped with the topology of convergence in measure. When is a probability measure (i.e., ), this mode of convergence is named convergence in probability. The space is always a topological abelian group but is only a topological vector space if This is because scalar multiplication is continuous if and only if If is -finite then the weaker topology of local convergence in measure is an F-space, i.e. a completely metrizable topological vector space. Moreover, this topology is isometric to global convergence in measure for a suitable choice of probability measure

The description is easier when is finite. If is a finite measure on the function admits for the convergence in measure the following fundamental system of neighborhoods

The topology can be defined by any metric of the form where is bounded continuous concave and non-decreasing on with and when (for example, Such a metric is called Lévy-metric for Under this metric the space is complete. However, as mentioned above, scalar multiplication is continuous with respect to this metric only if . To see this, consider the Lebesgue measurable function defined by . Then clearly . The space is in general not locally bounded, and not locally convex.

For the infinite Lebesgue measure on the definition of the fundamental system of neighborhoods could be modified as follows

The resulting space , with the topology of local convergence in measure, is isomorphic to the space for any positive –integrable density

See also

[edit]

Notes

[edit]
  1. ^ Maddox, I. J. (1988), Elements of Functional Analysis (2nd ed.), Cambridge: CUP, page 16
  2. ^ Rafael Dahmen, Gábor Lukács: Long colimits of topological groups I: Continuous maps and homeomorphisms. in: Topology and its Applications Nr. 270, 2020. Example 2.14
  3. ^ Garling, D. J. H. (2007). Inequalities: A Journey into Linear Analysis. Cambridge University Press. p. 54. ISBN 978-0-521-87624-7.
  4. ^ Rudin 1987, p. 65.
  5. ^ Stein & Shakarchi 2012, p. 2.
  6. ^ Weisstein, Eric W. "L^2-Space". MathWorld.
  7. ^ Rudin 1991, p. 37.
  8. ^ Bahouri, Chemin & Danchin 2011, pp. 1–4.
  9. ^ Bahouri, Chemin & Danchin 2011, pp. 7–8.
  10. ^ Rudin 1987, Theorem 6.16.
  11. ^ Schechter, Eric (1997), Handbook of Analysis and its Foundations, London: Academic Press Inc. See Sections 14.77 and 27.44–47
  12. ^ Villani, Alfonso (1985), "Another note on the inclusion Lp(μ) ⊂ Lq(μ)", Amer. Math. Monthly, 92 (7): 485–487, doi:10.2307/2322503, JSTOR 2322503, MR 0801221
  13. ^ Rudin 1991, pp. 117–119.
  14. ^ Hastie, T. J.; Tibshirani, R.; Wainwright, M. J. (2015). Statistical Learning with Sparsity: The Lasso and Generalizations. CRC Press. ISBN 978-1-4987-1216-3.
  1. ^ The condition is not equivalent to being finite, unless
  2. ^ If then
  3. ^ The definitions of and can be extended to all (rather than just ), but it is only when that is guaranteed to be a norm (although is a quasi-seminorm for all ).
  4. ^ If then
  5. ^ a b For example, if a non-empty measurable set of measure exists then its indicator function satisfies although
  6. ^ Explicitly, the vector space operations are defined by: for all and all scalars These operations make into a vector space because if is any scalar and then both and also belong to
  1. ^ When the inequality can be deduced from the fact that the function defined by is convex, which by definition means that for all and all in the domain of Substituting and in for and gives which proves that The triangle inequality now implies The desired inequality follows by integrating both sides.

References

[edit]
[edit]