Jump to content

Compactness theorem: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Reverting edit(s) by 2409:4081:9B97:965:0:0:C509:9D0D (talk) to rev. 1113918188 by Discospinster: non-constructive (RW 16.1)
Changing short description from "Theorem" to "Theorem in mathematical logic"
 
(18 intermediate revisions by 13 users not shown)
Line 1: Line 1:
{{mvar|}}{{Short description|Theorem}}
{{mvar|}}{{Short description|Theorem in mathematical logic}}
In [[mathematical logic]], the '''compactness theorem''' states that a [[Set (mathematics)|set]] of [[First-order predicate calculus|first-order]] [[Sentence (mathematical logic)|sentences]] has a [[Model (model theory)|model]] if and only if every [[Finite set|finite]] [[subset]] of it has a model. This theorem is an important tool in [[model theory]], as it provides a useful (but generally not effective) method for constructing models of any set of sentences that is finitely [[Consistency|consistent]].
In [[mathematical logic]], the '''compactness theorem''' states that a [[Set (mathematics)|set]] of [[First-order predicate calculus|first-order]] [[Sentence (mathematical logic)|sentences]] has a [[Model (model theory)|model]] if and only if every [[Finite set|finite]] [[subset]] of it has a model. This theorem is an important tool in [[model theory]], as it provides a useful (but generally not [[effective method|effective]]) method for constructing models of any set of sentences that is finitely [[Consistency|consistent]].


The compactness theorem for the [[propositional calculus]] is a consequence of [[Tychonoff's theorem]] (which says that the [[Product topology|product]] of [[compact space]]s is compact) applied to compact [[Stone space]]s,<ref>See Truss (1997).</ref> hence the theorem's name. Likewise, it is analogous to the [[finite intersection property]] characterization of compactness in [[topological space]]s: a collection of [[closed set]]s in a compact space has a [[Empty set|non-empty]] [[Intersection (set theory)|intersection]] if every finite subcollection has a non-empty intersection.
The compactness theorem for the [[propositional calculus]] is a consequence of [[Tychonoff's theorem]] (which says that the [[Product topology|product]] of [[compact space]]s is compact) applied to compact [[Stone space]]s,<ref>See Truss (1997).</ref> hence the theorem's name. Likewise, it is analogous to the [[finite intersection property]] characterization of compactness in [[topological space]]s: a collection of [[closed set]]s in a compact space has a [[Empty set|non-empty]] [[Intersection (set theory)|intersection]] if every finite subcollection has a non-empty intersection.


The compactness theorem is one of the two key properties, along with the downward [[Löwenheim–Skolem theorem]], that is used in [[Lindström's theorem]] to characterize first-order logic. Although, there are some generalizations of the compactness theorem to non-first-order logics, the compactness theorem itself does not hold in them, except for a very limited number of examples.<ref>J. Barwise, S. Feferman, eds., Model-Theoretic Logics (New York: Springer-Verlag, 1985) [https://projecteuclid.org/euclid.pl/1235417263#toc], in particular, Makowsky, J. A. Chapter XVIII: Compactness, Embeddings and Definability. 645--716, see Theorems 4.5.9, 4.6.12 and Proposition 4.6.9. For compact logics for an extended notion of model see Ziegler, M. Chapter XV: Topological Model Theory. 557--577. For logics without the relativization property it is possible to have simultaneously compactness and interpolation, while the problem is still open for logics with relativization. See Xavier Caicedo, A Simple Solution to Friedman's Fourth Problem, J. Symbolic Logic, Volume 51, Issue 3 (1986), 778-784.{{doi|10.2307/2274031}} {{JSTOR|2274031}}</ref>
The compactness theorem is one of the two key properties, along with the downward [[Löwenheim–Skolem theorem]], that is used in [[Lindström's theorem]] to characterize first-order logic. Although there are some generalizations of the compactness theorem to non-first-order logics, the compactness theorem itself does not hold in them, except for a very limited number of examples.<ref>J. Barwise, S. Feferman, eds., Model-Theoretic Logics (New York: Springer-Verlag, 1985) [https://projecteuclid.org/euclid.pl/1235417263#toc], in particular, Makowsky, J. A. Chapter XVIII: Compactness, Embeddings and Definability. 645--716, see Theorems 4.5.9, 4.6.12 and Proposition 4.6.9. For compact logics for an extended notion of model see Ziegler, M. Chapter XV: Topological Model Theory. 557--577. For logics without the relativization property it is possible to have simultaneously compactness and interpolation, while the problem is still open for logics with relativization. See Xavier Caicedo, A Simple Solution to Friedman's Fourth Problem, J. Symbolic Logic, Volume 51, Issue 3 (1986), 778-784.{{doi|10.2307/2274031}} {{JSTOR|2274031}}</ref>


==History==
==History==
Line 15: Line 15:
===Robinson's principle===
===Robinson's principle===


The compactness theorem implies the following result, stated by [[Abraham Robinson]] in his [[1949]] dissertation.
The compactness theorem implies the following result, stated by [[Abraham Robinson]] in his 1949 dissertation.


[[Robinson's principle]]:{{sfn|Marker|2002|pp=40-43}}{{sfn|Gowers|Barrow-Green|Leader|2008|pp=639-643}} If a first-order sentence holds in every [[Field (mathematics)|field]] of [[Characteristic (algebra)|characteristic]] zero, then there exists a constant <math>p</math> such that the sentence holds for every field of characteristic larger than <math>p.</math> This can be seen as follows: suppose <math>\varphi</math> is a sentence that holds in every field of characteristic zero. Then its negation <math>\lnot \varphi,</math> together with the field axioms and the infinite sequence of sentences
[[Robinson's principle]]:{{sfn|Marker|2002|pp=40-43}}{{sfn|Gowers|Barrow-Green|Leader|2008|pp=639-643}} If a first-order sentence holds in every [[Field (mathematics)|field]] of [[Characteristic (algebra)|characteristic]] zero, then there exists a constant <math>p</math> such that the sentence holds for every field of characteristic larger than <math>p.</math> This can be seen as follows: suppose <math>\varphi</math> is a sentence that holds in every field of characteristic zero. Then its negation <math>\lnot \varphi,</math> together with the field axioms and the infinite sequence of sentences
Line 30: Line 30:
===Non-standard analysis===
===Non-standard analysis===


A third application of the compactness theorem is the construction of [[Non-standard analysis|nonstandard models]] of the real numbers, that is, consistent extensions of the theory of the real numbers that contain "infinitesimal" numbers. To see this, let <math>\Sigma</math> be a first-order axiomatization of the theory of the real numbers. Consider the theory obtained by adding a new constant symbol <math>\varepsilon</math> to the language and adjoining to <math>\Sigma</math> the axiom <math>\varepsilon > 0</math> and the axioms <math>\varepsilon < \tfrac{1}{n}</math> for all positive integers <math>n.</math> Clearly, the standard real numbers <math>\R</math> are a model for every finite subset of these axioms, because the real numbers satisfy everything in <math>\Sigma</math> and, by suitable choice of <math>\varepsilon,</math> can be made to satisfy any finite subset of the axioms about <math>\varepsilon.</math> By the compactness theorem, there is a model <math>{}^* \R</math> that satisfies <math>\Sigma</math> and also contains an infinitesimal element <math>\varepsilon.</math> A similar argument, adjoining axioms <math>\omega > 0, \; \omega > 1, \ldots,</math> etc., shows that the existence of infinitely large integers cannot be ruled out by any axiomatization <math>\Sigma</math> of the reals.<ref name="Goldblatt">{{cite book|title=Lectures on the Hyperreals|url=https://archive.org/details/lecturesonhyperr00gold_574|url-access=limited|last=Goldblatt|first=Robert|author-link=Robert Goldblatt|year=1998|publisher=Springer Verlag|location=New York|isbn=0-387-98464-X|pages=[https://archive.org/details/lecturesonhyperr00gold_574/page/n12 10]–11}}</ref>
A third application of the compactness theorem is the construction of [[Non-standard analysis|nonstandard models]] of the real numbers, that is, consistent extensions of the theory of the real numbers that contain "infinitesimal" numbers. To see this, let <math>\Sigma</math> be a first-order axiomatization of the theory of the real numbers. Consider the theory obtained by adding a new constant symbol <math>\varepsilon</math> to the language and adjoining to <math>\Sigma</math> the axiom <math>\varepsilon > 0</math> and the axioms <math>\varepsilon < \tfrac{1}{n}</math> for all positive integers <math>n.</math> Clearly, the standard real numbers <math>\R</math> are a model for every finite subset of these axioms, because the real numbers satisfy everything in <math>\Sigma</math> and, by suitable choice of <math>\varepsilon,</math> can be made to satisfy any finite subset of the axioms about <math>\varepsilon.</math> By the compactness theorem, there is a model <math>{}^* \R</math> that satisfies <math>\Sigma</math> and also contains an infinitesimal element <math>\varepsilon.</math>
A similar argument, this time adjoining the axioms <math>\omega > 0, \; \omega > 1, \ldots,</math> etc., shows that the existence of numbers with infinitely large magnitudes cannot be ruled out by any axiomatization <math>\Sigma</math> of the reals.{{sfn|Goldblatt|1998|pages=[https://archive.org/details/lecturesonhyperr00gold_574/page/n12 10]–11}}

It can be shown that the [[hyperreal number]]s <math>{}^* \R</math> satisfy the [[transfer principle]]:{{sfn|Goldblatt|1998|p=11}} a first-order sentence is true of <math>\R</math> if and only if it is true of <math>{}^* \R.</math>


==Proofs==
==Proofs==


One can prove the compactness theorem using [[Gödel's completeness theorem]], which establishes that a set of sentences is satisfiable if and only if no contradiction can be proven from it. Since proofs are always finite and therefore involve only finitely many of the given sentences, the compactness theorem follows. In fact, the compactness theorem is equivalent to Gödel's completeness theorem, and both are equivalent to the [[Boolean prime ideal theorem]], a weak form of the [[axiom of choice]].<ref>See Hodges (1993).</ref>
One can prove the compactness theorem using [[Gödel's completeness theorem]], which establishes that a set of sentences is satisfiable if and only if no contradiction can be proven from it. Since [[mathematical proof|proof]]s are always finite and therefore involve only finitely many of the given sentences, the compactness theorem follows. In fact, the compactness theorem is equivalent to Gödel's completeness theorem, and both are equivalent to the [[Boolean prime ideal theorem]], a weak form of the [[axiom of choice]].<ref>See Hodges (1993).</ref>


Gödel originally proved the compactness theorem in just this way, but later some "purely semantic" proofs of the compactness theorem were found; that is, proofs that refer to {{em|truth}} but not to {{em|provability}}. One of those proofs relies on [[ultraproduct]]s hinging on the axiom of choice as follows:
Gödel originally proved the compactness theorem in just this way, but later some "purely semantic" proofs of the compactness theorem were found; that is, proofs that refer to {{em|truth}} but not to {{em|provability}}. One of those proofs relies on [[ultraproduct]]s hinging on the axiom of choice as follows:


'''Proof''':
'''Proof''':
Fix a first-order language <math>L,</math> and let <math>\Sigma</math> be a collection of L-sentences such that every finite subcollection of <math>L</math>-sentences, <math>i \subseteq \Sigma</math> of it has a model <math>\mathcal{M}_i.</math> Also let <math display=inline>\prod_{i \subseteq \Sigma}\mathcal{M}_i</math> be the direct product of the structures and <math>I</math> be the collection of finite subsets of <math>\Sigma.</math> For each <math>i \in I,</math> let <math>A_i = \{j \in I : j \supseteq i\}.</math>
Fix a first-order language <math>L,</math> and let <math>\Sigma</math> be a collection of <math>L</math>-sentences such that every finite subcollection of <math>L</math>-sentences, <math>i \subseteq \Sigma</math> of it has a model <math>\mathcal{M}_i.</math> Also let <math display=inline>\prod_{i \subseteq \Sigma}\mathcal{M}_i</math> be the direct product of the structures and <math>I</math> be the collection of finite subsets of <math>\Sigma.</math> For each <math>i \in I,</math> let <math>A_i = \{j \in I : j \supseteq i\}.</math>
The family of all of these sets <math>A_i</math> generates a proper [[Filter (set theory)|filter]], so there is an [[Ultrafilter (set theory)|ultrafilter]] <math>U</math> containing all sets of the form <math>A_i.</math>
The family of all of these sets <math>A_i</math> generates a proper [[Filter (set theory)|filter]], so there is an [[Ultrafilter (set theory)|ultrafilter]] <math>U</math> containing all sets of the form <math>A_i.</math>


Now for any formula <math>\varphi</math> in <math>\Sigma:</math>
Now for any sentence <math>\varphi</math> in <math>\Sigma:</math>
* the set <math>A_{\{\varphi\}}</math> is in <math>U</math>
* the set <math>A_{\{\varphi\}}</math> is in <math>U</math>
* whenever <math>j \in A_{\{\varphi\}},</math> then <math>\varphi \in j,</math> hence <math>\varphi</math> holds in <math>\mathcal M_j</math>
* whenever <math>j \in A_{\{\varphi\}},</math> then <math>\varphi \in j,</math> hence <math>\varphi</math> holds in <math>\mathcal M_j</math>
Line 65: Line 69:
* {{cite journal|last=Dawson|first=John W. junior|title=The compactness of first-order logic: From Gödel to Lindström|journal=History and Philosophy of Logic|year=1993|volume=14|pages=15–37|doi=10.1080/01445349308837208}}
* {{cite journal|last=Dawson|first=John W. junior|title=The compactness of first-order logic: From Gödel to Lindström|journal=History and Philosophy of Logic|year=1993|volume=14|pages=15–37|doi=10.1080/01445349308837208}}
* {{cite book|last=Hodges|first=Wilfrid|author-link=Wilfrid Hodges|publisher=Cambridge University Press|title=Model theory|url=https://archive.org/details/modeltheory0000hodg|url-access=registration|year=1993|isbn=0-521-30442-3}}
* {{cite book|last=Hodges|first=Wilfrid|author-link=Wilfrid Hodges|publisher=Cambridge University Press|title=Model theory|url=https://archive.org/details/modeltheory0000hodg|url-access=registration|year=1993|isbn=0-521-30442-3}}
* {{cite book|last=Goldblatt|first=Robert|title=Lectures on the Hyperreals|url=https://archive.org/details/lecturesonhyperr00gold_574|url-access=limited|author-link=Robert Goldblatt|year=1998|publisher=Springer Verlag|location=New York|isbn=0-387-98464-X}} <!--{{sfn|Goldblatt|1998|p=}}-->
* {{cite book|last1=Gowers|first1=Timothy|last2=Barrow-Green|first2=June|last3=Leader|first3=Imre|title=The Princeton Companion to Mathematics|publisher=Princeton University Press|publication-place=Princeton|year=2008|isbn=978-1-4008-3039-8|oclc=659590835|pages=635–646}} <!--{{sfn|Gowers|Barrow-Green|Leader|2008|p=}}-->
* {{cite book|last1=Gowers|first1=Timothy|last2=Barrow-Green|first2=June|last3=Leader|first3=Imre|title=The Princeton Companion to Mathematics|publisher=Princeton University Press|publication-place=Princeton|year=2008|isbn=978-1-4008-3039-8|oclc=659590835|pages=635–646}} <!--{{sfn|Gowers|Barrow-Green|Leader|2008|p=}}-->
* {{cite book|last=Marker|first=David|title= Model Theory: An Introduction|publisher=Springer|series=[[Graduate Texts in Mathematics]]|volume=217|year=2002|isbn=978-0-387-98760-6|oclc=49326991}} <!--{{sfn|Marker|2002|p=}}-->
* {{cite book|last=Marker|first=David|title= Model Theory: An Introduction|publisher=Springer|series=[[Graduate Texts in Mathematics]]|volume=217|year=2002|isbn=978-0-387-98760-6|oclc=49326991}} <!--{{sfn|Marker|2002|p=}}-->
* {{cite journal|last=Robinson|first=J. A.|title=A Machine-Oriented Logic Based on the Resolution Principle|journal=Journal of the ACM|publisher=Association for Computing Machinery (ACM)|volume=12|issue=1|year=1965|issn=0004-5411|doi=10.1145/321250.321253|pages=23–41|s2cid=14389185}} <!--{{sfn|Robinson|1965|pp=23–41}}-->
* {{cite journal|last=Robinson|first=J. A.|title=A Machine-Oriented Logic Based on the Resolution Principle|journal=Journal of the ACM|publisher=Association for Computing Machinery (ACM)|volume=12|issue=1|year=1965|issn=0004-5411|doi=10.1145/321250.321253|pages=23–41|s2cid=14389185|doi-access=free}} <!--{{sfn|Robinson|1965|pp=23–41}}-->
* {{cite book|last=Truss|first=John K.|author-link=John Truss|title=Foundations of Mathematical Analysis|year=1997|publisher=Oxford University Press|isbn=0-19-853375-6}}
* {{cite book|last=Truss|first=John K.|author-link=John Truss|title=Foundations of Mathematical Analysis|year=1997|publisher=Oxford University Press|isbn=0-19-853375-6}}



Latest revision as of 14:43, 16 November 2024

In mathematical logic, the compactness theorem states that a set of first-order sentences has a model if and only if every finite subset of it has a model. This theorem is an important tool in model theory, as it provides a useful (but generally not effective) method for constructing models of any set of sentences that is finitely consistent.

The compactness theorem for the propositional calculus is a consequence of Tychonoff's theorem (which says that the product of compact spaces is compact) applied to compact Stone spaces,[1] hence the theorem's name. Likewise, it is analogous to the finite intersection property characterization of compactness in topological spaces: a collection of closed sets in a compact space has a non-empty intersection if every finite subcollection has a non-empty intersection.

The compactness theorem is one of the two key properties, along with the downward Löwenheim–Skolem theorem, that is used in Lindström's theorem to characterize first-order logic. Although there are some generalizations of the compactness theorem to non-first-order logics, the compactness theorem itself does not hold in them, except for a very limited number of examples.[2]

History

[edit]

Kurt Gödel proved the countable compactness theorem in 1930. Anatoly Maltsev proved the uncountable case in 1936.[3][4]

Applications

[edit]

The compactness theorem has many applications in model theory; a few typical results are sketched here.

Robinson's principle

[edit]

The compactness theorem implies the following result, stated by Abraham Robinson in his 1949 dissertation.

Robinson's principle:[5][6] If a first-order sentence holds in every field of characteristic zero, then there exists a constant such that the sentence holds for every field of characteristic larger than This can be seen as follows: suppose is a sentence that holds in every field of characteristic zero. Then its negation together with the field axioms and the infinite sequence of sentences is not satisfiable (because there is no field of characteristic 0 in which holds, and the infinite sequence of sentences ensures any model would be a field of characteristic 0). Therefore, there is a finite subset of these sentences that is not satisfiable. must contain because otherwise it would be satisfiable. Because adding more sentences to does not change unsatisfiability, we can assume that contains the field axioms and, for some the first sentences of the form Let contain all the sentences of except Then any field with a characteristic greater than is a model of and together with is not satisfiable. This means that must hold in every model of which means precisely that holds in every field of characteristic greater than This completes the proof.

The Lefschetz principle, one of the first examples of a transfer principle, extends this result. A first-order sentence in the language of rings is true in some (or equivalently, in every) algebraically closed field of characteristic 0 (such as the complex numbers for instance) if and only if there exist infinitely many primes for which is true in some algebraically closed field of characteristic in which case is true in all algebraically closed fields of sufficiently large non-0 characteristic [5] One consequence is the following special case of the Ax–Grothendieck theorem: all injective complex polynomials are surjective[5] (indeed, it can even be shown that its inverse will also be a polynomial).[7] In fact, the surjectivity conclusion remains true for any injective polynomial where is a finite field or the algebraic closure of such a field.[7]

Upward Löwenheim–Skolem theorem

[edit]

A second application of the compactness theorem shows that any theory that has arbitrarily large finite models, or a single infinite model, has models of arbitrary large cardinality (this is the Upward Löwenheim–Skolem theorem). So for instance, there are nonstandard models of Peano arithmetic with uncountably many 'natural numbers'. To achieve this, let be the initial theory and let be any cardinal number. Add to the language of one constant symbol for every element of Then add to a collection of sentences that say that the objects denoted by any two distinct constant symbols from the new collection are distinct (this is a collection of sentences). Since every finite subset of this new theory is satisfiable by a sufficiently large finite model of or by any infinite model, the entire extended theory is satisfiable. But any model of the extended theory has cardinality at least .

Non-standard analysis

[edit]

A third application of the compactness theorem is the construction of nonstandard models of the real numbers, that is, consistent extensions of the theory of the real numbers that contain "infinitesimal" numbers. To see this, let be a first-order axiomatization of the theory of the real numbers. Consider the theory obtained by adding a new constant symbol to the language and adjoining to the axiom and the axioms for all positive integers Clearly, the standard real numbers are a model for every finite subset of these axioms, because the real numbers satisfy everything in and, by suitable choice of can be made to satisfy any finite subset of the axioms about By the compactness theorem, there is a model that satisfies and also contains an infinitesimal element

A similar argument, this time adjoining the axioms etc., shows that the existence of numbers with infinitely large magnitudes cannot be ruled out by any axiomatization of the reals.[8]

It can be shown that the hyperreal numbers satisfy the transfer principle:[9] a first-order sentence is true of if and only if it is true of

Proofs

[edit]

One can prove the compactness theorem using Gödel's completeness theorem, which establishes that a set of sentences is satisfiable if and only if no contradiction can be proven from it. Since proofs are always finite and therefore involve only finitely many of the given sentences, the compactness theorem follows. In fact, the compactness theorem is equivalent to Gödel's completeness theorem, and both are equivalent to the Boolean prime ideal theorem, a weak form of the axiom of choice.[10]

Gödel originally proved the compactness theorem in just this way, but later some "purely semantic" proofs of the compactness theorem were found; that is, proofs that refer to truth but not to provability. One of those proofs relies on ultraproducts hinging on the axiom of choice as follows:

Proof: Fix a first-order language and let be a collection of -sentences such that every finite subcollection of -sentences, of it has a model Also let be the direct product of the structures and be the collection of finite subsets of For each let The family of all of these sets generates a proper filter, so there is an ultrafilter containing all sets of the form

Now for any sentence in

  • the set is in
  • whenever then hence holds in
  • the set of all with the property that holds in is a superset of hence also in

Łoś's theorem now implies that holds in the ultraproduct So this ultraproduct satisfies all formulas in

See also

[edit]

Notes

[edit]
  1. ^ See Truss (1997).
  2. ^ J. Barwise, S. Feferman, eds., Model-Theoretic Logics (New York: Springer-Verlag, 1985) [1], in particular, Makowsky, J. A. Chapter XVIII: Compactness, Embeddings and Definability. 645--716, see Theorems 4.5.9, 4.6.12 and Proposition 4.6.9. For compact logics for an extended notion of model see Ziegler, M. Chapter XV: Topological Model Theory. 557--577. For logics without the relativization property it is possible to have simultaneously compactness and interpolation, while the problem is still open for logics with relativization. See Xavier Caicedo, A Simple Solution to Friedman's Fourth Problem, J. Symbolic Logic, Volume 51, Issue 3 (1986), 778-784.doi:10.2307/2274031 JSTOR 2274031
  3. ^ Vaught, Robert L.: "Alfred Tarski's work in model theory". Journal of Symbolic Logic 51 (1986), no. 4, 869–882
  4. ^ Robinson, A.: Non-standard analysis. North-Holland Publishing Co., Amsterdam 1966. page 48.
  5. ^ a b c Marker 2002, pp. 40–43.
  6. ^ Gowers, Barrow-Green & Leader 2008, pp. 639–643.
  7. ^ a b Terence, Tao (7 March 2009). "Infinite fields, finite fields, and the Ax-Grothendieck theorem".
  8. ^ Goldblatt 1998, pp. 10–11.
  9. ^ Goldblatt 1998, p. 11.
  10. ^ See Hodges (1993).

References

[edit]
[edit]