Jump to content

Lambda calculus: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
clarify a sentence by restricting its scope to untyped lambda calculus
m Explanation and applications: Restore link to first instance of important term while avoiding one adjacent link.
 
(22 intermediate revisions by 16 users not shown)
Line 1: Line 1:
{{Short description|Mathematical-logic system based on functions}}
{{Short description|Mathematical-logic system based on functions}}
'''Lambda calculus''' (also written as '''''λ''-calculus''') is a [[formal system]] in [[mathematical logic]] for expressing [[computability|computation]] based on function [[Abstraction (computer science)|abstraction]] and [[function application|application]] using variable [[Name binding|binding]] and [[Substitution (algebra)|substitution]]. Untyped lambda calculus, the topic of this article, is a universal [[model of computation]] that can be used to simulate any [[Turing machine]] (and vice versa). It was introduced by the mathematician [[Alonzo Church]] in the 1930s as part of his research into the [[foundations of mathematics]]. In 1936 Church found a formulation which was [[#History|logically consistent]], and documented it in 1940.


'''Lambda calculus''' (also written as '''''λ''-calculus''') is a [[formal system]] in [[mathematical logic]] for expressing [[computability|computation]] based on function [[Abstraction (computer science)|abstraction]] and [[function application|application]] using variable [[Name binding|binding]] and [[Substitution (algebra)|substitution]]. Untyped lambda calculus, the topic of this article, is a [[universal machine]], a [[model of computation]] that can be used to simulate any [[Turing machine]] (and vice versa). It was introduced by the mathematician [[Alonzo Church]] in the 1930s as part of his research into the [[foundations of mathematics]]. In 1936, Church found a formulation which was [[#History|logically consistent]], and documented it in 1940.
Lambda calculus consists of constructing [[#Lambda terms|lambda term]]s and performing [[#Reduction|reduction]] operations on them. In the simplest form of lambda calculus, terms are built using only the following rules:{{efn|name=3rules|1= These rules produce expressions such as: <math>(\lambda x.\lambda y.(\lambda z.(\lambda x .z\ x)\ (\lambda y.z\ y))(x\ y))</math>. Parentheses can be dropped if the expression is unambiguous. For some applications, terms for logical and mathematical constants and operations may be included.}}

Lambda calculus consists of constructing [[#Lambda terms|lambda term]]s and performing [[#Reduction|reduction]] operations on them. A term is defined as any valid lambda calculus expression. In the simplest form of lambda calculus, terms are built using only the following rules:{{efn|name=3rules|1= These rules produce expressions such as: <math>(\lambda x.\lambda y.(\lambda z.(\lambda x .z\ x)\ (\lambda y.z\ y))(x\ y))</math>. Parentheses can be dropped if the expression is unambiguous. For some applications, terms for logical and mathematical constants and operations may be included.}}


# <math display="inline">x</math>: A [[#validLambdaVar|variable]] is a character or string representing a parameter.
# <math display="inline">x</math>: A [[#validLambdaVar|variable]] is a character or string representing a parameter.
# <math display="inline">(\lambda x.M)</math>: A [[#lambdaAbstr|lambda abstraction]] is a function definition, taking as input the bound variable <math>x</math> (between the λ and the punctum/dot '''.''') and returning the body <math display="inline">M</math>.
# <math display="inline">(\lambda x.M)</math>: A [[#lambdaAbstr|lambda abstraction]] is a function definition, taking as input the bound variable <math>x</math> (between the λ and the punctum/dot '''.''') and returning the body <math display="inline">M</math>.
# <math display="inline">(M\ N)</math>: An [[#anApplic| application]], applying a function <math display="inline">M</math> to an argument <math display="inline">N</math>. Both <math display="inline">M</math> and <math display="inline">N</math> are lambda terms.
# <math display="inline">(M\ N)</math>: An [[#anApplic|application]], applying a function <math display="inline">M</math> to an argument <math display="inline">N</math>. Both <math display="inline">M</math> and <math display="inline">N</math> are lambda terms.


The reduction operations include:
The reduction operations include:


* <math display="inline">(\lambda x.M[x])\rightarrow(\lambda y.M[y])</math> [[#α-conversion|: α-conversion]], renaming the bound variables in the expression. Used to avoid [[name collision]]s.
* <math display="inline">(\lambda x.M[x])\rightarrow(\lambda y.M[y])</math> : [[#α-conversion|α-conversion]], renaming the bound variables in the expression. Used to avoid [[name collision]]s.
* <math display="inline">((\lambda x.M)\ N)\rightarrow (M[x:=N])</math> [[#β-reduction|: β-reduction]],{{efn|name=beta|1= Barendregt,Barendsen (2000) call this form
* <math display="inline">((\lambda x.M)\ N)\rightarrow (M[x:=N])</math> : [[#β-reduction|β-reduction]],{{efn|name=beta|1= Barendregt, Barendsen (2000) call this form
*''axiom β'': (λx.M[x]) N = M[N] , rewritten as (λx.M) N = M[x := N], "where M[x := N] denotes the substitution of N for every occurrence of x in M".<ref name="BarendregtBarendsen"/>{{rp|7}} Also denoted M[N/x], "the substitution of N for x in M".<ref>{{nlab|id=explicit+substitution|title=explicit substitution}}</ref>}} replacing the bound variables with the argument expression in the body of the abstraction.
*''axiom β'': (λx.M[x]) N = M[N] , rewritten as (λx.M) N = M[x := N], "where M[x := N] denotes the substitution of N for every occurrence of x in M".<ref name="BarendregtBarendsen"/>{{rp|7}} Also denoted M[N/x], "the substitution of N for x in M".<ref>{{nlab|id=explicit+substitution|title=explicit substitution}}</ref>}} replacing the bound variables with the argument expression in the body of the abstraction.


Line 19: Line 20:


== Explanation and applications ==
== Explanation and applications ==
Lambda calculus is [[Turing completeness|Turing complete]], that is, it is a universal [[model of computation]] that can be used to simulate any [[Turing machine]].<ref>{{cite journal|first=Alan M.|last=Turing|author-link=Alan Turing|title=Computability and λ-Definability|jstor=2268280|journal=The Journal of Symbolic Logic|volume=2|issue=4|date=December 1937|pages=153–163|doi=10.2307/2268280|s2cid=2317046 }}</ref> Its namesake, the Greek letter lambda (λ), is used in lambda expressions and lambda terms to denote [[Free variables and bound variables|binding]] a variable in a [[function (mathematics)|function]].
Lambda calculus is [[Turing completeness|Turing complete]], that is, it is a universal [[model of computation]] that can be used to simulate any [[Turing machine]].<ref>{{cite journal|first1=Alan M.|last1=Turing|author1-link=Alan Turing|title=Computability and λ-Definability|jstor=2268280|journal=The Journal of Symbolic Logic|volume=2|issue=4|date=December 1937|pages=153–163|doi=10.2307/2268280|s2cid=2317046}}</ref> Its namesake, the Greek letter lambda (λ), is used in lambda expressions and lambda terms to denote [[Free variables and bound variables|binding]] a variable in a [[Function (mathematics)|function]].


{{anchor|untyped lambda calculus}}{{anchor|untypedLambdaCalculus}}Lambda calculus may be ''untyped'' or ''typed''. In typed lambda calculus, functions can be applied only if they are capable of accepting the given input's "type" of data. Typed lambda calculi are ''weaker'' than the untyped lambda calculus, which is the primary subject of this article, in the sense that ''typed lambda calculi can express less'' than the untyped calculus can. On the other hand, typed lambda calculi allow more things to be proven. For example, in the [[simply typed lambda calculus]] it is a theorem that every evaluation strategy terminates for every simply typed lambda-term, whereas evaluation of untyped lambda-terms need not terminate (see [[#betaReducIsAcomput|below]]). One reason there are many different typed lambda calculi has been the desire to do more (of what the untyped calculus can do) without giving up on being able to prove strong theorems about the calculus.
{{anchor|untyped lambda calculus}}{{anchor|untypedLambdaCalculus}}Lambda calculus may be ''untyped'' or ''typed''. In typed lambda calculus, functions can be applied only if they are capable of accepting the given input's "type" of data. Typed lambda calculi are strictly ''weaker'' than the untyped lambda calculus, which is the primary subject of this article, in the sense that ''typed lambda calculi can express less'' than the untyped calculus can. On the other hand, typed lambda calculi allow more things to be proven. For example, in [[simply typed lambda calculus]], it is a theorem that every evaluation strategy terminates for every simply typed lambda-term, whereas evaluation of untyped lambda-terms need not terminate (see [[#betaReducIsAcomput|below]]). One reason there are many different typed lambda calculi has been the desire to do more (of what the untyped calculus can do) without giving up on being able to prove strong theorems about the calculus.


Lambda calculus has applications in many different areas in [[mathematics]], [[philosophy]],<ref>{{cite web|first=Thierry|last=Coquand|author-link=Thierry Coquand|title=Type Theory|website=The Stanford Encyclopedia of Philosophy|date=8 February 2006|url=http://plato.stanford.edu/archives/sum2013/entries/type-theory/|editor-first=Edward N.|editor-last=Zalta|edition=Summer 2013|access-date=November 17, 2020}}</ref> [[linguistics]],<ref>{{cite book|url=https://books.google.com/books?id=9CdFE9X_FCoC|title=Categorial Investigations: Logical and Linguistic Aspects of the Lambek Calculus|first=Michael|last=Moortgat|publisher=Foris Publications|year=1988|isbn=9789067653879}}</ref><ref>{{Cite book|url=https://books.google.com/books?id=nyFa5ngYThMC|title=Computing Meaning|editor1-first=Harry|editor1-last=Bunt|editor2-first=Reinhard|editor2-last=Muskens|publisher=Springer|year=2008|isbn=978-1-4020-5957-5}}</ref> and [[computer science]].<ref>{{cite book|title=Concepts in Programming Languages|first=John C.|last=Mitchell|author-link=John C. Mitchell|publisher=Cambridge University Press|year=2003|isbn=978-0-521-78098-8|page=57|url=https://books.google.com/books?id=7Uh8XGfJbEIC&pg=PA57}}.</ref>
Lambda calculus has applications in many different areas in [[mathematics]], [[philosophy]],<ref>{{cite web|first1=Thierry|last1=Coquand|author1-link=Thierry Coquand|title=Type Theory|website=The Stanford Encyclopedia of Philosophy|date=8 February 2006|url=http://plato.stanford.edu/archives/sum2013/entries/type-theory/|editor1-first=Edward N.|editor1-last=Zalta|edition=Summer 2013|access-date=November 17, 2020}}</ref> [[linguistics]],<ref>{{cite book|url=https://books.google.com/books?id=9CdFE9X_FCoC|title=Categorial Investigations: Logical and Linguistic Aspects of the Lambek Calculus|first1=Michael|last1=Moortgat|publisher=Foris Publications|year=1988|isbn=9789067653879}}</ref><ref>{{Cite book|url=https://books.google.com/books?id=nyFa5ngYThMC|title=Computing Meaning|editor1-last=Bunt|editor1-first=Harry|editor2-last=Muskens|editor2-first=Reinhard|year=2008|publisher=Springer|isbn=978-1-4020-5957-5}}</ref> and [[computer science]].<ref>{{cite book|title=Concepts in Programming Languages|first1=John C.|last1=Mitchell|author1-link=John C. Mitchell|publisher=Cambridge University Press|year=2003|isbn=978-0-521-78098-8|page=57|url=https://books.google.com/books?id=7Uh8XGfJbEIC&pg=PA57}}.</ref>
<ref>{{cite tech report |last=Chacón Sartori |first=Camilo |date=2023-12-05 |title=Introduction to Lambda Calculus using Racket |url=https://www.researchgate.net/publication/376232150 |archive-url=https://web.archive.org/web/20231207230752/https://www.researchgate.net/publication/376232150_Introduction_to_Lambda_Calculus_using_Racket |archive-date=2023-12-07}}</ref> Lambda calculus has played an important role in the development of the [[Programming language theory|theory of programming languages]]. [[Functional programming language]]s implement lambda calculus. Lambda calculus is also a current research topic in [[category theory]].<ref>{{cite book|title=Basic Category Theory for Computer Scientists|page=53|first=Benjamin C.|last=Pierce|author-link=Benjamin C. Pierce}}</ref>
<ref>{{cite tech report |last1=Chacón Sartori |first1=Camilo |date=2023-12-05 |title=Introduction to Lambda Calculus using Racket |url=https://www.researchgate.net/publication/376232150 |archive-url=https://web.archive.org/web/20231207230752/https://www.researchgate.net/publication/376232150_Introduction_to_Lambda_Calculus_using_Racket |archive-date=2023-12-07}}</ref> Lambda calculus has played an important role in the development of the [[Programming language theory|theory]] of [[programming language]]s. [[Functional programming]] languages implement lambda calculus. Lambda calculus is also a current research topic in [[category theory]].<ref>{{cite book |last1=Pierce |first1=Benjamin C. |author1-link=Benjamin C. Pierce |title=Basic Category Theory for Computer Scientists |page=53}}</ref>


== History ==
== History ==
Lambda calculus was introduced by mathematician [[Alonzo Church]] in the 1930s as part of an investigation into the [[foundations of mathematics]].<ref>{{cite journal|first=Alonzo|last=Church|author-link=Alonzo Church|title=A set of postulates for the foundation of logic|journal=Annals of Mathematics|series=Series 2|volume=33|issue=2|pages=346–366|year=1932|doi=10.2307/1968337|jstor=1968337}}</ref>{{efn|For a full history, see Cardone and Hindley's "History of Lambda-calculus and Combinatory Logic" (2006).}} The original system was shown to be [[Consistency|logically inconsistent]] in 1935 when [[Stephen Kleene]] and [[J. B. Rosser]] developed the [[Kleene–Rosser paradox]].<ref>{{cite journal|last1=Kleene|first1=Stephen C.|author-link1=Stephen Kleene|last2=Rosser|first2=J. B.|author-link2=J. B. Rosser|title=The Inconsistency of Certain Formal Logics|journal=The Annals of Mathematics|date=July 1935|volume=36|issue=3|pages=630|doi=10.2307/1968646|jstor=1968646}}</ref><ref>{{cite journal|last=Church|first=Alonzo|author-link=Alonzo Church|title=Review of Haskell B. Curry, ''The Inconsistency of Certain Formal Logics''|journal=The Journal of Symbolic Logic|date=December 1942|volume=7|issue=4|pages=170–171|doi=10.2307/2268117|jstor=2268117}}</ref>
Lambda calculus was introduced by mathematician [[Alonzo Church]] in the 1930s as part of an investigation into the [[foundations of mathematics]].<ref>{{cite journal|first1=Alonzo|last1=Church|author1-link=Alonzo Church|title=A set of postulates for the foundation of logic|journal=Annals of Mathematics|series=Series 2|volume=33|issue=2|pages=346–366|year=1932|doi=10.2307/1968337|jstor=1968337}}</ref>{{efn|For a full history, see Cardone and Hindley's "History of Lambda-calculus and Combinatory Logic" (2006).}} The original system was shown to be [[Consistency|logically inconsistent]] in 1935 when [[Stephen Kleene]] and [[J. B. Rosser]] developed the [[Kleene–Rosser paradox]].<ref>{{cite journal|last1=Kleene|first1=Stephen C.|author-link1=Stephen Kleene|last2=Rosser|first2=J. B.|author-link2=J. B. Rosser|title=The Inconsistency of Certain Formal Logics|journal=The Annals of Mathematics|date=July 1935|volume=36|issue=3|pages=630|doi=10.2307/1968646|jstor=1968646}}</ref><ref>{{cite journal|last1=Church|first1=Alonzo|author1-link=Alonzo Church|title=Review of Haskell B. Curry, ''The Inconsistency of Certain Formal Logics''|journal=The Journal of Symbolic Logic|date=December 1942|volume=7|issue=4|pages=170–171|doi=10.2307/2268117|jstor=2268117}}</ref>


Subsequently, in 1936 Church isolated and published just the portion relevant to computation, what is now called the untyped lambda calculus.<ref name="Church1936">{{cite journal|first=Alonzo|last=Church|author-link=Alonzo Church|title=An unsolvable problem of elementary number theory|journal=American Journal of Mathematics|volume=58|number=2|year=1936|pages=345–363|doi=10.2307/2371045|jstor=2371045}}</ref> In 1940, he also introduced a computationally weaker, but logically consistent system, known as the [[simply typed lambda calculus]].<ref>{{cite journal|last=Church|author-link=Alonzo Church|first=Alonzo|year=1940|title=A Formulation of the Simple Theory of Types|journal=Journal of Symbolic Logic|volume=5|issue=2|pages=56–68|doi=10.2307/2266170|jstor=2266170|s2cid=15889861 }}</ref>
Subsequently, in 1936 Church isolated and published just the portion relevant to computation, what is now called the untyped lambda calculus.<ref name="Church1936">{{cite journal|first1=Alonzo|last1=Church|author1-link=Alonzo Church|title=An unsolvable problem of elementary number theory|journal=American Journal of Mathematics|volume=58|number=2|year=1936|pages=345–363|doi=10.2307/2371045|jstor=2371045}}</ref> In 1940, he also introduced a computationally weaker, but logically consistent system, known as the [[simply typed lambda calculus]].<ref>{{cite journal|last1=Church|author1-link=Alonzo Church|first1=Alonzo|year=1940|title=A Formulation of the Simple Theory of Types|journal=Journal of Symbolic Logic|volume=5|issue=2|pages=56–68|doi=10.2307/2266170|jstor=2266170|s2cid=15889861}}</ref>


Until the 1960s when its relation to programming languages was clarified, the lambda calculus was only a formalism. Thanks to [[Richard Montague]] and other linguists' applications in the semantics of natural language, the lambda calculus has begun to enjoy a respectable place in both linguistics<ref name='mm-linguistics'>{{cite book|last1=Partee|first1=B. B. H.|last2=ter Meulen|first2=A.|author2-link=Alice ter Meulen|last3=Wall|first3=R. E.|title=Mathematical Methods in Linguistics |url=https://books.google.com/books?id=qV7TUuaYcUIC&pg=PA317 |access-date=29 Dec 2016|year=1990|publisher=Springer|isbn=9789027722454}}</ref> and computer science.<ref>{{cite web|first=Jesse|last=Alma|title=The Lambda Calculus|website=The Stanford Encyclopedia of Philosophy|url=http://plato.stanford.edu/entries/lambda-calculus/|editor-first=Edward N.|editor-last=Zalta|edition=Summer 2013|access-date=November 17, 2020}}</ref>
Until the 1960s when its relation to programming languages was clarified, the lambda calculus was only a formalism. Thanks to [[Richard Montague]] and other linguists' applications in the semantics of natural language, the lambda calculus has begun to enjoy a respectable place in both linguistics<ref name='mm-linguistics'>{{cite book|last1=Partee|first1=B. B. H.|last2=ter Meulen|first2=A.|author2-link=Alice ter Meulen|last3=Wall|first3=R. E.|title=Mathematical Methods in Linguistics |url=https://books.google.com/books?id=qV7TUuaYcUIC&pg=PA317 |access-date=29 Dec 2016|year=1990|publisher=Springer|isbn=9789027722454}}</ref> and computer science.<ref>{{cite web|first1=Jesse|last1=Alama|title=The Lambda Calculus|website=The Stanford Encyclopedia of Philosophy|url=http://plato.stanford.edu/entries/lambda-calculus/|editor1-first=Edward N.|editor1-last=Zalta|edition=Summer 2013|access-date=November 17, 2020}}</ref>


=== Origin of the ''λ'' symbol ===
=== Origin of the ''λ'' symbol ===
Line 87: Line 88:


==== Lambda terms ====
==== Lambda terms ====
The syntax of the lambda calculus defines some expressions as valid lambda calculus expressions and some as invalid, just as some strings of characters are valid [[C (programming language)|C]] programs and some are not. A valid lambda calculus expression is called a "'''lambda term'''".
The syntax of the lambda calculus defines some expressions as valid lambda calculus expressions and some as invalid, just as some strings of characters are valid computer programs and some are not. A valid lambda calculus expression is called a "'''lambda term'''".


The following three rules give an [[inductive definition]] that can be applied to build all syntactically valid lambda terms:{{efn|name=lamTerms|1= The expression e can be: variables x, lambda abstractions, or applications —in BNF, <math>e ::= x \mid \lambda x.e \mid e \, e</math> .— ''from Wikipedia's [[Simply typed lambda calculus#Syntax]] for untyped lambda calculus }}
The following three rules give an [[inductive definition]] that can be applied to build all syntactically valid lambda terms:{{efn|name=lamTerms|1= The expression e can be: variables x, lambda abstractions, or applications —in BNF, <math>e ::= x \mid \lambda x.e \mid e \, e</math> .— ''from Wikipedia's [[Simply typed lambda calculus#Syntax]] for untyped lambda calculus}}
*{{anchor|validLambdaVar }} variable {{mvar|x}} is itself a valid lambda term.
*{{anchor|validLambdaVar}} variable {{mvar|x}} is itself a valid lambda term.
*if {{mvar|t}} is a lambda term, and {{mvar|x}} is a variable, then <math>(\lambda x.t)</math> {{efn| <math>(\lambda x.t)</math> is sometimes written in [[ASCII]] as <math>L x.t</math>}} is a lambda term (called an ''abstraction'');
*if {{mvar|t}} is a lambda term, and {{mvar|x}} is a variable, then <math>(\lambda x.t)</math> {{efn| <math>(\lambda x.t)</math> is sometimes written in [[ASCII]] as <math>L x.t</math>}} is a lambda term (called an ''abstraction'');
*if {{mvar|t}} and {{mvar|s}} are lambda terms, then <math>(t </math>&nbsp; <math>s)</math> is a lambda term (called an ''application'').
*if {{mvar|t}} and {{mvar|s}} are lambda terms, then <math>(t </math>&nbsp; <math>s)</math> is a lambda term (called an ''application'').
Nothing else is a lambda term. That is, a lambda term is valid if and only if it can be obtained by repeated application of these three rules. For convenience, some parentheses can be omitted when writing a lambda term. For example, the outermost parentheses are usually not written. See [[#Notation|§ Notation]], below, for an explicit description of which parentheses are optional. It is also common to extend the syntax presented here with additional operations, which allows making sense of terms such as <math>\lambda x.x^2.</math> The focus of this article is the pure lambda calculus without extensions, but lambda terms extended with arithmetic operations are used for explanatory purposes.
Nothing else is a lambda term. That is, a lambda term is valid if and only if it can be obtained by repeated application of these three rules. For convenience, some parentheses can be omitted when writing a lambda term. For example, the outermost parentheses are usually not written. See [[#Notation|§ Notation]], below, for an explicit description of which parentheses are optional. It is also common to extend the syntax presented here with additional operations, which allows making sense of terms such as <math>\lambda x.x^2.</math> The focus of this article is the pure lambda calculus without extensions, but lambda terms extended with arithmetic operations are used for explanatory purposes.


{{anchor|lambdaAbstr }} An ''abstraction'' <math>\lambda x.t</math> denotes an [[#anonymousForm|§ anonymous function]]{{efn|The lambda term <math>(\lambda x.t)</math> represents the function <math>x \mapsto t</math> written in anonymous form.}} that takes a single input {{mvar|x}} and returns {{mvar|t}}. For example, <math>\lambda x.(x^2+2)</math> is an abstraction representing the function <math>f</math> defined by <math>f(x) = x^2 + 2,</math> using the term <math>x^2+2</math> for {{mvar|t}}. The name <math>f</math> is superfluous when using abstraction. The syntax <math>(\lambda x.t)</math> [[Free variables and bound variables|binds]] the variable {{mvar|x}} in the term {{mvar|t}}. The definition of a function with an abstraction merely "sets up" the function but does not invoke it.
{{anchor|lambdaAbstr}} An ''abstraction'' <math>\lambda x.t</math> denotes an [[#anonymousForm|§ anonymous function]]{{efn|The lambda term <math>(\lambda x.t)</math> represents the function <math>x \mapsto t</math> written in anonymous form.}} that takes a single input {{mvar|x}} and returns {{mvar|t}}. For example, <math>\lambda x.(x^2+2)</math> is an abstraction representing the function <math>f</math> defined by <math>f(x) = x^2 + 2,</math> using the term <math>x^2+2</math> for {{mvar|t}}. The name <math>f</math> is superfluous when using abstraction. The syntax <math>(\lambda x.t)</math> [[Free variables and bound variables|binds]] the variable {{mvar|x}} in the term {{mvar|t}}. The definition of a function with an abstraction merely "sets up" the function but does not invoke it.


{{anchor|anApplic }} An ''application'' <math>t </math>&nbsp; <math>s</math> represents the application of a function {{mvar|t}} to an input {{mvar|s}}, that is, it represents the act of calling function {{mvar|t}} on input {{mvar|s}} to produce <math>t(s)</math>.
{{anchor|anApplic}} An ''application'' <math>t </math>&nbsp; <math>s</math> represents the application of a function {{mvar|t}} to an input {{mvar|s}}, that is, it represents the act of calling function {{mvar|t}} on input {{mvar|s}} to produce <math>t(s)</math>.


A lambda term may refer to a variable that has not been bound, such as the term <math>\lambda x.(x+y)</math> (which represents the function definition <math>f(x) = x + y</math>). In this term, the variable {{mvar|y}} has not been defined and is considered an unknown. The abstraction <math>\lambda x.(x+y)</math> is a syntactically valid term and represents a function that adds its input to the yet-unknown {{mvar|y}}.
A lambda term may refer to a variable that has not been bound, such as the term <math>\lambda x.(x+y)</math> (which represents the function definition <math>f(x) = x + y</math>). In this term, the variable {{mvar|y}} has not been defined and is considered an unknown. The abstraction <math>\lambda x.(x+y)</math> is a syntactically valid term and represents a function that adds its input to the yet-unknown {{mvar|y}}.


Parentheses may be used and might be needed to disambiguate terms. For example,
Parentheses may be used and might be needed to disambiguate terms. For example,
#{{anchor|Example1parenRightIsAbstraction }}<math>\lambda x.((\lambda x.x)x)</math> is of form <math>\lambda x.B</math> and is therefore an abstraction, while
#{{anchor|Example1parenRightIsAbstraction}}<math>\lambda x.((\lambda x.x)x)</math> is of form <math>\lambda x.B</math> and is therefore an abstraction, while
#<math>(\lambda x.(\lambda x.x))</math> <math>x</math> is of form <math>M</math>&nbsp; <math>N</math> and is therefore an application.
#<math>(\lambda x.(\lambda x.x))</math> <math>x</math> is of form <math>M</math>&nbsp; <math>N</math> and is therefore an application.


Line 156: Line 157:
Similarly, <math>( \lambda x . y ) s \to y [ x := s ] = y </math>, which demonstrates that <math> \lambda x . y </math> is a constant function.
Similarly, <math>( \lambda x . y ) s \to y [ x := s ] = y </math>, which demonstrates that <math> \lambda x . y </math> is a constant function.


The lambda calculus may be seen as an idealized version of a functional programming language, like [[Haskell (programming language)|Haskell]] or [[Standard ML]]. Under this view,{{anchor|betaReducIsAcomput}} β-reduction corresponds to a computational step. This step can be repeated by additional β-reductions until there are no more applications left to reduce. In the untyped lambda calculus, as presented here, this reduction process may not terminate. For instance, consider the term <math>\Omega = (\lambda x . xx)( \lambda x . xx )</math>.
The lambda calculus may be seen as an idealized version of a functional programming language, like [[Haskell]] or [[Standard ML]]. Under this view,{{anchor|betaReducIsAcomput}} β-reduction corresponds to a computational step. This step can be repeated by additional β-reductions until there are no more applications left to reduce. In the untyped lambda calculus, as presented here, this reduction process may not terminate. For instance, consider the term <math>\Omega = (\lambda x . xx)( \lambda x . xx )</math>.
Here <math>( \lambda x . xx)( \lambda x . xx) \to ( xx )[ x := \lambda x . xx ] = ( x [ x := \lambda x . xx ] )( x [ x := \lambda x . xx ] ) = ( \lambda x . xx)( \lambda x . xx )</math>.
Here <math>( \lambda x . xx)( \lambda x . xx) \to ( xx )[ x := \lambda x . xx ] = ( x [ x := \lambda x . xx ] )( x [ x := \lambda x . xx ] ) = ( \lambda x . xx)( \lambda x . xx )</math>.
That is, the term reduces to itself in a single β-reduction, and therefore the reduction process will never terminate.
That is, the term reduces to itself in a single β-reduction, and therefore the reduction process will never terminate.
Line 163: Line 164:


== Formal definition ==
== Formal definition ==
{{Main|Lambda calculus definition}}
{{Further|Lambda calculus definition}}


=== Definition ===
=== Definition ===
Line 177: Line 178:
# If {{math|''M'', ''N'' ∈ Λ,}} then {{math|(''M N'') ∈ Λ.}}
# If {{math|''M'', ''N'' ∈ Λ,}} then {{math|(''M N'') ∈ Λ.}}


Instances of rule 2 are known as ''abstractions'' and instances of rule 3 are known as ''applications''.<ref>{{Cite book|url= https://www.elsevier.com/books/the-lambda-calculus/barendregt/978-0-444-87508-2|last=Barendregt|first=Hendrik Pieter|author-link=Henk Barendregt|title=The Lambda Calculus: Its Syntax and Semantics|publisher=North Holland|year=1984|volume=103|series=Studies in Logic and the Foundations of Mathematics|edition=Revised|isbn=0-444-87508-5 }} ([https://ftp.science.ru.nl/CSI/CompMath.Found/ErrataLCalculus.pdf Corrections]).</ref>
Instances of rule 2 are known as ''abstractions'' and instances of rule 3 are known as ''applications''.<ref>{{Cite book|url= https://www.elsevier.com/books/the-lambda-calculus/barendregt/978-0-444-87508-2|last1=Barendregt|first1=Hendrik Pieter|author1-link=Henk Barendregt|title=The Lambda Calculus: Its Syntax and Semantics|publisher=North Holland|year=1984|volume=103|series=Studies in Logic and the Foundations of Mathematics|edition=Revised|isbn=0-444-87508-5}} ([https://ftp.science.ru.nl/CSI/CompMath.Found/ErrataLCalculus.pdf Corrections]).</ref> ''See [[#redex|§ reducible expression]]''

This set of rules may be written in [[Backus–Naur form]] as:
<syntaxhighlight lang="bnf">
<expression> :== <abstraction> | <application> | <variable>
<abstraction> :== λ <variable> . <expression>
<application> :== ( <expression> <expression> )
<variable> :== v1 | v2 | ...
</syntaxhighlight>


=== Notation ===
=== Notation ===
Line 185: Line 194:
* When all variables are single-letter, the space in applications may be omitted: ''MNP'' instead of ''M'' ''N'' ''P''.<ref>{{cite web |title=The Basic Grammar of Lambda Expressions |url=https://softoption.us/node/33 |website=SoftOption |quote=Some other systems use juxtaposition to mean application, so 'ab' means 'a@b'. This is fine except that it requires that variables have length one so that we know that 'ab' is two variables juxtaposed not one variable of length 2. But we want to labels like 'firstVariable' to mean a single variable, so we cannot use this juxtaposition convention.}}</ref>
* When all variables are single-letter, the space in applications may be omitted: ''MNP'' instead of ''M'' ''N'' ''P''.<ref>{{cite web |title=The Basic Grammar of Lambda Expressions |url=https://softoption.us/node/33 |website=SoftOption |quote=Some other systems use juxtaposition to mean application, so 'ab' means 'a@b'. This is fine except that it requires that variables have length one so that we know that 'ab' is two variables juxtaposed not one variable of length 2. But we want to labels like 'firstVariable' to mean a single variable, so we cannot use this juxtaposition convention.}}</ref>
* The body of an abstraction extends [[Regular expression#Lazy matching|as far right as possible]]: λ''x''.''M N'' means λ''x''.(''M N'') and not (λ''x''.''M'') ''N''.
* The body of an abstraction extends [[Regular expression#Lazy matching|as far right as possible]]: λ''x''.''M N'' means λ''x''.(''M N'') and not (λ''x''.''M'') ''N''.
* A sequence of abstractions is contracted: λ''x''.λ''y''.λ''z''.''N'' is abbreviated as λ''xyz''.''N''.<ref name="Selinger">{{Citation|first=Peter|last=Selinger|title=Lecture Notes on the Lambda Calculus|year=2008|page=9|publisher=Department of Mathematics and Statistics, University of Ottawa|url=http://www.mathstat.dal.ca/~selinger/papers/lambdanotes.pdf|bibcode=2008arXiv0804.3434S|volume=0804|arxiv=0804.3434|issue=class: cs.LO}}</ref><ref name="lambda-bound" />
* A sequence of abstractions is contracted: λ''x''.λ''y''.λ''z''.''N'' is abbreviated as λ''xyz''.''N''.<ref name="Selinger">{{Citation|first1=Peter|last1=Selinger|title=Lecture Notes on the Lambda Calculus|year=2008|page=9|publisher=Department of Mathematics and Statistics, University of Ottawa|url=http://www.mathstat.dal.ca/~selinger/papers/lambdanotes.pdf|bibcode=2008arXiv0804.3434S|volume=0804|arxiv=0804.3434|issue=class: cs.LO}}</ref><ref name="lambda-bound" />


=== Free and bound variables ===
=== Free and bound variables ===
Line 192: Line 201:
The set of ''free variables'' of a lambda expression, ''M'', is denoted as FV(''M'') and is defined by recursion on the structure of the terms, as follows:
The set of ''free variables'' of a lambda expression, ''M'', is denoted as FV(''M'') and is defined by recursion on the structure of the terms, as follows:
# FV(''x'') = {''x''}, where ''x'' is a variable.
# FV(''x'') = {''x''}, where ''x'' is a variable.
# {{anchor| FreeMsExXs }} FV(λ''x''.''M'') = FV(''M'') \ {''x''}.{{efn|The set of free variables of M, but with {''x''} removed}}
# {{anchor|FreeMsExXs}} FV(λ''x''.''M'') = FV(''M'') \ {''x''}.{{efn|The set of free variables of M, but with {''x''} removed}}
# {{anchor| FreeMsNs }} {{math|1=FV(''M N'') = FV(''M'') ∪ FV(''N'').}}{{efn|The union of the set of free variables of <math>M</math> and the set of free variables of <math>N</math><ref name="BarendregtBarendsen">{{Citation|last1=Barendregt|first1=Henk|author-link=Henk Barendregt|last2=Barendsen|first2=Erik|title=Introduction to Lambda Calculus|date=March 2000|url=https://ftp.science.ru.nl/CSI/CompMath.Found/lambda.pdf}}</ref>}}
# {{anchor|FreeMsNs}} {{math|1=FV(''M N'') = FV(''M'') ∪ FV(''N'').}}{{efn|The union of the set of free variables of <math>M</math> and the set of free variables of <math>N</math><ref name="BarendregtBarendsen">{{Citation|last1=Barendregt|first1=Henk|author1-link=Henk Barendregt|last2=Barendsen|first2=Erik|title=Introduction to Lambda Calculus|date=March 2000|url=https://ftp.science.ru.nl/CSI/CompMath.Found/lambda.pdf}}</ref>}}


An expression that contains no free variables is said to be ''closed''. Closed lambda expressions are also known as ''combinators'' and are equivalent to terms in [[combinatory logic]].
An expression that contains no free variables is said to be ''closed''. Closed lambda expressions are also known as ''combinators'' and are equivalent to terms in [[combinatory logic]].


== Reduction ==
== Reduction ==
The meaning of lambda expressions is defined by how expressions can be reduced.<ref>{{cite journal|author-link=Ruy de Queiroz|last=de Queiroz|first=Ruy J. G. B.|doi=10.1111/j.1746-8361.1988.tb00919.x|title=A Proof-Theoretic Account of Programming and the Role of Reduction Rules|journal=Dialectica|volume=42|issue=4|pages=265–282|year=1988}}</ref>
The meaning of lambda expressions is defined by how expressions can be reduced.<ref>{{cite journal|last1=de Queiroz|first1=Ruy J. G. B.|author1-link=Ruy de Queiroz|year=1988|doi=10.1111/j.1746-8361.1988.tb00919.x|title=A Proof-Theoretic Account of Programming and the Role of Reduction Rules|journal=Dialectica|volume=42|issue=4|pages=265–282}}</ref>


There are three kinds of reduction:
There are three kinds of reduction:
Line 207: Line 216:
We also speak of the resulting equivalences: two expressions are ''α-equivalent'', if they can be α-converted into the same expression. β-equivalence and η-equivalence are defined similarly.
We also speak of the resulting equivalences: two expressions are ''α-equivalent'', if they can be α-converted into the same expression. β-equivalence and η-equivalence are defined similarly.


The term ''redex'', short for ''reducible expression'', refers to subterms that can be reduced by one of the reduction rules. For example, (λ''x''.''M'') ''N'' is a β-redex in expressing the substitution of ''N'' for ''x'' in ''M''. The expression to which a redex reduces is called its ''reduct''; the reduct of (λ''x''.''M'') ''N'' is ''M''[''x'' := ''N''].{{efn|name=beta}}
{{anchor|redex}}The term ''redex'', short for ''reducible expression'', refers to subterms that can be reduced by one of the reduction rules. For example, (λ''x''.''M'') ''N'' is a β-redex in expressing the substitution of ''N'' for ''x'' in ''M''. The expression to which a redex reduces is called its ''reduct''; the reduct of (λ''x''.''M'') ''N'' is ''M''[''x'' := ''N''].{{efn|name=beta}}


If ''x'' is not free in ''M'', λ''x''.''M x'' is also an η-redex, with a reduct of ''M''.
If ''x'' is not free in ''M'', λ''x''.''M x'' is also an η-redex, with a reduct of ''M''.
Line 234: Line 243:


=== β-reduction ===
=== β-reduction ===
'''β-reduction''' ([[beta]] reduction) captures the idea of function application. <br>
'''β-reduction''' ([[beta]] reduction) captures the idea of function application. β-reduction is defined in terms of substitution: the β-reduction of (λ''x''.''M'') ''N'' is ''M''[''x'' := ''N''].{{efn|name= beta}}
β-reduction is defined in terms of substitution: the β-reduction of (λ''x''.''M'') ''N'' is ''M''[''x'' := ''N''].{{efn|name= beta}}


For example, assuming some encoding of 2, 7, ×, we have the following β-reduction: (λ''n''.''n'' × 2) 7 → 7 × 2.
For example, assuming some encoding of 2, 7, ×, we have the following β-reduction: (λ''n''.''n'' × 2) 7 → 7 × 2.
Line 247: Line 255:


== Normal forms and confluence ==
== Normal forms and confluence ==
{{Main|Normalization property (abstract rewriting)}}
{{Further|Normalization property (abstract rewriting)}}
For the untyped lambda calculus, β-reduction as a [[rewrite system|rewriting rule]] is neither [[strongly normalising]] nor [[weakly normalising]].
For the untyped lambda calculus, β-reduction as a [[rewrite system|rewriting rule]] is neither [[strongly normalising]] nor [[weakly normalising]].


Line 255: Line 263:


== Encoding datatypes ==
== Encoding datatypes ==
{{Main|Church encoding|Mogensen–Scott encoding}}
{{Further|Church encoding|Mogensen–Scott encoding}}
The basic lambda calculus may be used to model [[arithmetic]], booleans, data structures, and recursion, as illustrated in the following sub-sections ''[[#Arithmetic in lambda calculus|i]]'', ''[[#Logic and predicates|ii]]'', ''[[#Pairs|iii]]'', and ''[[Mogensen–Scott encoding|§ iv]]''.
The basic lambda calculus may be used to model [[arithmetic]], Booleans, data structures, and recursion, as illustrated in the following sub-sections ''[[#Arithmetic in lambda calculus|i]]'', ''[[#Logic and predicates|ii]]'', ''[[#Pairs|iii]]'', and ''[[Mogensen–Scott encoding|§ iv]]''.


=== Arithmetic in lambda calculus ===
=== Arithmetic in lambda calculus ===
Line 301: Line 309:


=== Logic and predicates ===
=== Logic and predicates ===
By convention, the following two definitions (known as Church booleans) are used for the boolean values {{Mono|TRUE}} and {{Mono|FALSE}}:
By convention, the following two definitions (known as Church Booleans) are used for the Boolean values {{Mono|TRUE}} and {{Mono|FALSE}}:
: {{Mono|1=TRUE := λ''x''.λ''y''.''x''}}
: {{Mono|1=TRUE := λ''x''.λ''y''.''x''}}
: {{Mono|1=FALSE := λ''x''.λ''y''.''y''}}
: {{Mono|1=FALSE := λ''x''.λ''y''.''y''}}
Line 316: Line 324:
and we see that {{Mono|AND TRUE FALSE}} is equivalent to {{Mono|FALSE}}.
and we see that {{Mono|AND TRUE FALSE}} is equivalent to {{Mono|FALSE}}.


A ''predicate'' is a function that returns a boolean value. The most fundamental predicate is {{Mono|ISZERO}}, which returns {{Mono|TRUE}} if its argument is the Church numeral {{Mono|0}}, but {{Mono|FALSE}} if its argument were any other Church numeral:
A ''predicate'' is a function that returns a Boolean value. The most fundamental predicate is {{Mono|ISZERO}}, which returns {{Mono|TRUE}} if its argument is the Church numeral {{Mono|0}}, but {{Mono|FALSE}} if its argument were any other Church numeral:
: {{Mono|1=ISZERO := λ''n''.''n'' (λ''x''.FALSE) TRUE}}
: {{Mono|1=ISZERO := λ''n''.''n'' (λ''x''.FALSE) TRUE}}
The following predicate tests whether the first argument is less-than-or-equal-to the second:
The following predicate tests whether the first argument is less-than-or-equal-to the second:
Line 323: Line 331:


The availability of predicates and the above definition of {{Mono|TRUE}} and {{Mono|FALSE}} make it convenient to write "if-then-else" expressions in lambda calculus. For example, the predecessor function can be defined as:
The availability of predicates and the above definition of {{Mono|TRUE}} and {{Mono|FALSE}} make it convenient to write "if-then-else" expressions in lambda calculus. For example, the predecessor function can be defined as:
: {{Mono|1=PRED := λ''n''.''n'' (λ''g''.λ''k''.ISZERO (''g'' 1) ''k'' (PLUS (''g'' ''k'') 1)) (λ''v''.0) 0 }}
: {{Mono|1=PRED := λ''n''.''n'' (λ''g''.λ''k''.ISZERO (''g'' 1) ''k'' (PLUS (''g'' ''k'') 1)) (λ''v''.0) 0}}
which can be verified by showing inductively that {{Mono|''n'' (λ''g''.λ''k''.ISZERO (''g'' 1) ''k'' (PLUS (''g'' ''k'') 1)) (λ''v''.0)}} is the add {{Mono|''n''}} − 1 function for {{Mono|''n''}} > 0.
which can be verified by showing inductively that {{Mono|''n'' (λ''g''.λ''k''.ISZERO (''g'' 1) ''k'' (PLUS (''g'' ''k'') 1)) (λ''v''.0)}} is the add {{Mono|''n''}} − 1 function for {{Mono|''n''}} > 0.


Line 332: Line 340:
: {{Mono|1=FIRST := λ''p''.''p'' TRUE}}
: {{Mono|1=FIRST := λ''p''.''p'' TRUE}}
: {{Mono|1=SECOND := λ''p''.''p'' FALSE}}
: {{Mono|1=SECOND := λ''p''.''p'' FALSE}}
: {{Mono|1=NIL := λ''x''.TRUE }}
: {{Mono|1=NIL := λ''x''.TRUE}}
: {{Mono|1=NULL := λ''p''.''p'' (λ''x''.λ''y''.FALSE)}}
: {{Mono|1=NULL := λ''p''.''p'' (λ''x''.λ''y''.FALSE)}}


Line 349: Line 357:
: {{Mono|(λ''f''.}}''M''{{Mono|)}} ''N''
: {{Mono|(λ''f''.}}''M''{{Mono|)}} ''N''
Authors often introduce [[syntactic sugar]], such as {{Mono|let}},{{efn|{{Mono|(λ''f''.}}''M''{{Mono|)}} ''N'' can be pronounced "let f be N in M".}} to permit writing the above in the more intuitive order
Authors often introduce [[syntactic sugar]], such as {{Mono|let}},{{efn|{{Mono|(λ''f''.}}''M''{{Mono|)}} ''N'' can be pronounced "let f be N in M".}} to permit writing the above in the more intuitive order
: {{Mono|1=let ''f'' = }}''N''{{Mono| in }}''M''
: {{Mono|1=let ''f'' =}}''N''{{Mono|in}}''M''
By chaining such definitions, one can write a lambda calculus "program" as zero or more function definitions, followed by one lambda-term using those functions that constitutes the main body of the program.
By chaining such definitions, one can write a lambda calculus "program" as zero or more function definitions, followed by one lambda-term using those functions that constitutes the main body of the program.


Line 355: Line 363:


=== Recursion and fixed points ===
=== Recursion and fixed points ===
{{Main|Fixed-point combinator}}
{{Further|Fixed-point combinator}}
{{See also|SKI combinator calculus#Self-application and recursion}}
{{See also|SKI combinator calculus#Self-application and recursion}}
[[Recursion]] is the definition of a function using the function itself. A definition containing itself inside itself, by value, leads to the whole value being of infinite size. Other notations which support recursion natively overcome this by referring to the function definition ''by name''. Lambda calculus cannot express this: all functions are anonymous in lambda calculus, so we can't refer by name to a value which is yet to be defined, inside the lambda term defining that same value. However, a lambda expression can receive itself as its own argument, for example in&ensp; {{Mono|(λ''x''.''x'' ''x'') ''E''}}. Here ''E'' should be an abstraction, applying its parameter to a value to express recursion.
[[Recursion]] is the definition of a function invoking itself. A definition containing itself inside itself, by value, leads to the whole value being of infinite size. Other notations which support recursion natively overcome this by referring to the function definition ''by name''. Lambda calculus cannot express this: all functions are anonymous in lambda calculus, so we can't refer by name to a value which is yet to be defined, inside the lambda term defining that same value. However, a lambda expression can receive itself as its own argument, for example in&ensp; {{Mono|(λ''x''.''x'' ''x'') ''E''}}. Here ''E'' should be an abstraction, applying its parameter to a value to express recursion.


Consider the [[factorial]] function {{Mono|F(''n'')}} recursively defined by
Consider the [[factorial]] function {{Mono|F(''n'')}} recursively defined by
Line 376: Line 384:
::: with&ensp; {{Mono|1=''r'' ''x'' = F ''x'' = G ''r'' ''x''}} &ensp;to hold, so&ensp; {{Mono|1=''r'' = G ''r'' =: FIX G}} &ensp;and
::: with&ensp; {{Mono|1=''r'' ''x'' = F ''x'' = G ''r'' ''x''}} &ensp;to hold, so&ensp; {{Mono|1=''r'' = G ''r'' =: FIX G}} &ensp;and
: {{Mono|1=F := FIX G}} &ensp;where&ensp; {{Mono|1=FIX ''g'' := (''r'' where ''r'' = ''g'' ''r'') = ''g'' (FIX ''g'')}}
: {{Mono|1=F := FIX G}} &ensp;where&ensp; {{Mono|1=FIX ''g'' := (''r'' where ''r'' = ''g'' ''r'') = ''g'' (FIX ''g'')}}
::: so that&ensp; {{Mono|1=FIX G = G (FIX G) = (λ''n''.(1, if ''n'' = 0; else ''n'' × ((FIX G) (''n''−1)))) }}
::: so that&ensp; {{Mono|1=FIX G = G (FIX G) = (λ''n''.(1, if ''n'' = 0; else ''n'' × ((FIX G) (''n''−1))))}}


Given a lambda term with first argument representing recursive call (e.g. {{Mono|G}} here), the ''fixed-point'' combinator {{Mono|FIX}} will return a self-replicating lambda expression representing the recursive function (here, {{Mono|F}}). The function does not need to be explicitly passed to itself at any point, for the self-replication is arranged in advance, when it is created, to be done each time it is called. Thus the original lambda expression {{Mono|(FIX G)}} is re-created inside itself, at call-point, achieving [[self-reference]].
Given a lambda term with first argument representing recursive call (e.g. {{Mono|G}} here), the ''fixed-point'' combinator {{Mono|FIX}} will return a self-replicating lambda expression representing the recursive function (here, {{Mono|F}}). The function does not need to be explicitly passed to itself at any point, for the self-replication is arranged in advance, when it is created, to be done each time it is called. Thus the original lambda expression {{Mono|(FIX G)}} is re-created inside itself, at call-point, achieving [[self-reference]].
Line 394: Line 402:
Now, to perform our recursive call to the factorial function, we would simply call {{Mono|('''Y''' G) ''n''}},&nbsp; where ''n'' is the number we are calculating the factorial of. Given ''n'' = 4, for example, this gives:
Now, to perform our recursive call to the factorial function, we would simply call {{Mono|('''Y''' G) ''n''}},&nbsp; where ''n'' is the number we are calculating the factorial of. Given ''n'' = 4, for example, this gives:


: {{Mono|('''Y''' G) 4 }}
: {{Mono|('''Y''' G) 4}}
: {{Mono|G ('''Y''' G) 4 }}
: {{Mono|G ('''Y''' G) 4}}
: {{Mono|1=(λ''r''.λ''n''.(1, if ''n'' = 0; else ''n'' × (''r'' (''n''−1)))) ('''Y''' G) 4}}
: {{Mono|1=(λ''r''.λ''n''.(1, if ''n'' = 0; else ''n'' × (''r'' (''n''−1)))) ('''Y''' G) 4}}
: {{Mono|1=(λ''n''.(1, if ''n'' = 0; else ''n'' × (('''Y''' G) (''n''−1)))) 4}}
: {{Mono|1=(λ''n''.(1, if ''n'' = 0; else ''n'' × (('''Y''' G) (''n''−1)))) 4}}
Line 420: Line 428:


: {{anchor|I}} {{Mono|1='''I''' := λ''x''.''x''}}
: {{anchor|I}} {{Mono|1='''I''' := λ''x''.''x''}}
: {{anchor|S}} {{Mono|1='''S''' := λ''x''.λ''y''.λ''z''.''x'' ''z'' (''y'' ''z'') }}
: {{anchor|S}} {{Mono|1='''S''' := λ''x''.λ''y''.λ''z''.''x'' ''z'' (''y'' ''z'')}}
: {{anchor|K}} {{Mono|1='''K''' := λ''x''.λ''y''.''x''}}
: {{anchor|K}} {{Mono|1='''K''' := λ''x''.λ''y''.''x''}}
: {{anchor|B}} {{Mono|1='''B''' := λ''x''.λ''y''.λ''z''.''x'' (''y'' ''z'') }}
: {{anchor|B}} {{Mono|1='''B''' := λ''x''.λ''y''.λ''z''.''x'' (''y'' ''z'')}}
: {{anchor|C}} {{Mono|1='''C''' := λ''x''.λ''y''.λ''z''.''x'' ''z'' ''y''}}
: {{anchor|C}} {{Mono|1='''C''' := λ''x''.λ''y''.λ''z''.''x'' ''z'' ''y''}}
: {{anchor|W}} {{Mono|1='''W''' := λ''x''.λ''y''.''x'' ''y'' ''y''}}
: {{anchor|W}} {{Mono|1='''W''' := λ''x''.λ''y''.''x'' ''y'' ''y''}}
: {{anchor|omega}} {{Mono|1='''ω''' or '''Δ''' or '''U''' := λ''x''.''x'' ''x'' }}
: {{anchor|omega}} {{Mono|1='''ω''' or '''Δ''' or '''U''' := λ''x''.''x'' ''x''}}
: {{anchor|Omega}} {{Mono|1='''Ω''' := '''ω ω''' }}
: {{anchor|Omega}} {{Mono|1='''Ω''' := '''ω ω'''}}


{{Mono|'''I'''}} is the identity function. {{Mono|'''SK'''}} and {{Mono|'''BCKW'''}} form complete [[combinator calculus]] systems that can express any lambda term - see
{{Mono|'''I'''}} is the identity function. {{Mono|'''SK'''}} and {{Mono|'''BCKW'''}} form complete [[combinator calculus]] systems that can express any lambda term - see
Line 433: Line 441:


=== Abstraction elimination ===
=== Abstraction elimination ===
{{Main|Combinatory logic#Completeness of the S-K basis}}
{{Further|Combinatory logic#Completeness of the S-K basis}}
If ''N'' is a lambda-term without abstraction, but possibly containing named constants ([[combinatory logic|combinators]]), then there exists a lambda-term ''T''({{Mono|''x''}},''N'') which is equivalent to {{Mono|λ''x''.}}''N'' but lacks abstraction (except as part of the named constants, if these are considered non-atomic). This can also be viewed as anonymising variables, as ''T''({{Mono|''x''}},''N'') removes all occurrences of {{Mono|''x''}} from ''N'', while still allowing argument values to be substituted into the positions where ''N'' contains an {{Mono|''x''}}. The conversion function ''T'' can be defined by:
If ''N'' is a lambda-term without abstraction, but possibly containing named constants ([[combinatory logic|combinators]]), then there exists a lambda-term ''T''({{Mono|''x''}},''N'') which is equivalent to {{Mono|λ''x''.}}''N'' but lacks abstraction (except as part of the named constants, if these are considered non-atomic). This can also be viewed as anonymising variables, as ''T''({{Mono|''x''}},''N'') removes all occurrences of {{Mono|''x''}} from ''N'', while still allowing argument values to be substituted into the positions where ''N'' contains an {{Mono|''x''}}. The conversion function ''T'' can be defined by:
: ''T''({{Mono|''x''}}, {{Mono|''x''}}) := '''I'''
: ''T''({{Mono|''x''}}, {{Mono|''x''}}) := '''I'''
Line 443: Line 451:


== Typed lambda calculus ==
== Typed lambda calculus ==
{{Main|Typed lambda calculus}}
{{Further|Typed lambda calculus}}
A ''typed lambda calculus'' is a typed [[formalism (mathematics)|formalism]] that uses the lambda-symbol (<math>\lambda</math>) to denote anonymous function abstraction. In this context, types are usually objects of a syntactic nature that are assigned to lambda terms; the exact nature of a type depends on the calculus considered (see [[Typed lambda calculus#Kinds of typed lambda calculi|Kinds of typed lambda calculi]]). From a certain point of view, typed lambda calculi can be seen as refinements of the untyped lambda calculus but from another point of view, they can also be considered the more fundamental theory and ''untyped lambda calculus'' a special case with only one type.<ref>Types and Programming Languages, p. 273, Benjamin C. Pierce</ref>
A ''typed lambda calculus'' is a typed [[formalism (mathematics)|formalism]] that uses the lambda-symbol (<math>\lambda</math>) to denote anonymous function abstraction. In this context, types are usually objects of a syntactic nature that are assigned to lambda terms; the exact nature of a type depends on the calculus considered (see [[Typed lambda calculus#Kinds of typed lambda calculi|Kinds of typed lambda calculi]]). From a certain point of view, typed lambda calculi can be seen as refinements of the untyped lambda calculus but from another point of view, they can also be considered the more fundamental theory and ''untyped lambda calculus'' a special case with only one type.<ref>Types and Programming Languages, p. 273, Benjamin C. Pierce</ref>


Typed lambda calculi are foundational [[programming languages]] and are the base of typed [[functional programming languages]] such as [[ML programming language|ML]] and [[Haskell (programming language)|Haskell]] and, more indirectly, typed [[imperative programming]] languages. Typed lambda calculi play an important role in the design of [[type systems]] for programming languages; here typability usually captures desirable properties of the program, e.g. the program will not cause a memory access violation.
Typed lambda calculi are foundational programming languages and are the base of typed functional programming languages such as ML and Haskell and, more indirectly, typed [[imperative programming]] languages. Typed lambda calculi play an important role in the design of [[type system]]s for programming languages; here typability usually captures desirable properties of the program, e.g., the program will not cause a memory access violation.


Typed lambda calculi are closely related to [[mathematical logic]] and [[proof theory]] via the [[Curry–Howard isomorphism]] and they can be considered as the [[internal language]] of classes of [[category theory|categories]], e.g. the simply typed lambda calculus is the language of [[cartesian closed category|Cartesian closed categories]] (CCCs).
Typed lambda calculi are closely related to [[mathematical logic]] and [[proof theory]] via the [[Curry–Howard isomorphism]] and they can be considered as the [[internal language]] of classes of [[category theory|categories]], e.g., the simply typed lambda calculus is the language of a [[Cartesian closed category]] (CCC).


== Reduction strategies ==
== Reduction strategies ==
{{Main|Reduction strategy#Lambda calculus}}
{{Further|Reduction strategy#Lambda calculus}}
Whether a term is normalising or not, and how much work needs to be done in normalising it if it is, depends to a large extent on the reduction strategy used. Common lambda calculus reduction strategies include:<ref>{{cite book |last=Pierce |first=Benjamin C. |author-link=Benjamin C. Pierce |title=Types and Programming Languages |year=2002 |publisher=[[MIT Press]] |isbn=0-262-16209-1 | page=56|url=https://books.google.com/books?id=ti6zoAC9Ph8C&pg=PA56}}</ref><ref>{{cite book |last1=Sestoft |first1=Peter |title=The Essence of Computation |chapter=Demonstrating Lambda Calculus Reduction |series=Lecture Notes in Computer Science |date=2002 |volume=2566 |pages=420–435 |doi=10.1007/3-540-36377-7_19 |isbn=978-3-540-00326-7 |chapter-url=http://itu.dk/people/sestoft/papers/sestoft-lamreduce.pdf |access-date=22 August 2022}}</ref><ref>{{cite journal |last1=Biernacka |first1=Małgorzata |last2=Charatonik |first2=Witold |last3=Drab |first3=Tomasz |editor1-last=Andronick |editor1-first=June |editor2-last=de Moura |editor2-first=Leonardo |title=The Zoo of Lambda-Calculus Reduction Strategies, and Coq |journal=13th International Conference on Interactive Theorem Proving (ITP 2022) |date=2022 |volume=237 |pages=7:1–7:19 |publisher=Schloss Dagstuhl – Leibniz-Zentrum für Informatik |doi=10.4230/LIPIcs.ITP.2022.7 |doi-access=free |url=https://drops.dagstuhl.de/opus/volltexte/2022/16716/pdf/LIPIcs-ITP-2022-7.pdf |access-date=22 August 2022}}</ref>
Whether a term is normalising or not, and how much work needs to be done in normalising it if it is, depends to a large extent on the reduction strategy used. Common lambda calculus reduction strategies include:<ref>{{cite book |last1=Pierce |first1=Benjamin C. |author1-link=Benjamin C. Pierce |title=Types and Programming Languages |year=2002 |publisher=[[MIT Press]] |isbn=0-262-16209-1 |page=56|url=https://books.google.com/books?id=ti6zoAC9Ph8C&pg=PA56}}</ref><ref>{{cite book |last1=Sestoft |first1=Peter |title=The Essence of Computation |chapter=Demonstrating Lambda Calculus Reduction |series=Lecture Notes in Computer Science |date=2002 |volume=2566 |pages=420–435 |doi=10.1007/3-540-36377-7_19 |isbn=978-3-540-00326-7 |chapter-url=http://itu.dk/people/sestoft/papers/sestoft-lamreduce.pdf |access-date=22 August 2022}}</ref><ref>{{cite journal |last1=Biernacka |first1=Małgorzata |last2=Charatonik |first2=Witold |last3=Drab |first3=Tomasz |editor1-last=Andronick |editor1-first=June |editor2-last=de Moura |editor2-first=Leonardo |title=The Zoo of Lambda-Calculus Reduction Strategies, and Coq |journal=13th International Conference on Interactive Theorem Proving (ITP 2022) |date=2022 |volume=237 |pages=7:1–7:19 |publisher=Schloss Dagstuhl – Leibniz-Zentrum für Informatik |doi=10.4230/LIPIcs.ITP.2022.7 |doi-access=free |url=https://drops.dagstuhl.de/opus/volltexte/2022/16716/pdf/LIPIcs-ITP-2022-7.pdf |access-date=22 August 2022}}</ref>
; Normal order: The leftmost outermost redex is reduced first. That is, whenever possible, arguments are substituted into the body of an abstraction before the arguments are reduced. If a term has a beta-normal form, normal order reduction will always reach that normal form.
; Normal order: The leftmost outermost redex is reduced first. That is, whenever possible, arguments are substituted into the body of an abstraction before the arguments are reduced. If a term has a beta-normal form, normal order reduction will always reach that normal form.
; Applicative order: The leftmost innermost redex is reduced first. As a consequence, a function's arguments are always reduced before they are substituted into the function. Unlike normal order reduction, applicative order reduction may fail to find the beta-normal form of an expression, even if such a normal form exists. For example, the term <math>( \; \lambda x.y \;\; (\lambda z. (z z) \; \lambda z. (z z)) \; )</math> is reduced to itself by applicative order, while normal order reduces it to its beta-normal form <math>y</math>.
; Applicative order: The leftmost innermost redex is reduced first. As a consequence, a function's arguments are always reduced before they are substituted into the function. Unlike normal order reduction, applicative order reduction may fail to find the beta-normal form of an expression, even if such a normal form exists. For example, the term <math>( \; \lambda x.y \;\; (\lambda z. (z z) \; \lambda z. (z z)) \; )</math> is reduced to itself by applicative order, while normal order reduces it to its beta-normal form <math>y</math>.
Line 471: Line 479:


== Complexity ==
== Complexity ==
The notion of [[computational complexity theory|computational complexity]] for the lambda calculus is a bit tricky, because the cost of a β-reduction may vary depending on how it is implemented.<ref>{{cite book |last1=Frandsen |first1=Gudmund Skovbjerg |last2=Sturtivant |first2=Carl |title= Functional Programming Languages and Computer Architecture: 5th ACM Conference. Cambridge, MA, USA, August 26-30, 1991. Proceedings |chapter=What is an efficient implementation of the λ-calculus? |series=Lecture Notes in Computer Science |date=26 August 1991 |volume=523 |pages=289–312 |publisher=Springer-Verlag|doi=10.1007/3540543961_14 |isbn=9783540543961 |citeseerx=10.1.1.139.6913|contribution-url=https://scholar.archive.org/work/i7pfbnat3vdwfgalk5l6pkvbze}}</ref>
The notion of [[computational complexity theory|computational complexity]] for the lambda calculus is a bit tricky, because the cost of a β-reduction may vary depending on how it is implemented.<ref>{{cite book |last1=Frandsen |first1=Gudmund Skovbjerg |last2=Sturtivant |first2=Carl |title=Functional Programming Languages and Computer Architecture: 5th ACM Conference. Cambridge, MA, USA, August 26-30, 1991. Proceedings |chapter=What is an efficient implementation of the λ-calculus? |series=Lecture Notes in Computer Science |date=26 August 1991 |volume=523 |pages=289–312 |publisher=Springer-Verlag|doi=10.1007/3540543961_14 |isbn=9783540543961 |citeseerx=10.1.1.139.6913|contribution-url=https://scholar.archive.org/work/i7pfbnat3vdwfgalk5l6pkvbze}}</ref>
To be precise, one must somehow find the location of all of the occurrences of the bound variable {{Mono|''V''}} in the expression {{Mono|''E''}}, implying a time cost, or one must keep track of the locations of free variables in some way, implying a space cost. A naïve search for the locations of {{Mono|''V''}} in {{Mono|''E''}} is [[Big O notation|''O''(''n'')]] in the length ''n'' of {{Mono|''E''}}. [[Director string]]s were an early approach that traded this time cost for a quadratic space usage.<ref>{{cite journal|first=F.-R.|last=Sinot|url=http://www.lsv.fr/Publis/PAPERS/PDF/sinot-jlc05.pdf|title=Director Strings Revisited: A Generic Approach to the Efficient Representation of Free Variables in Higher-order Rewriting|journal=Journal of Logic and Computation|volume=15|number=2|pages=201–218|year=2005|doi=10.1093/logcom/exi010}}</ref> More generally this has led to the study of systems that use [[explicit substitution]].
To be precise, one must somehow find the location of all of the occurrences of the bound variable {{Mono|''V''}} in the expression {{Mono|''E''}}, implying a time cost, or one must keep track of the locations of free variables in some way, implying a space cost. A naïve search for the locations of {{Mono|''V''}} in {{Mono|''E''}} is [[Big O notation|''O''(''n'')]] in the length ''n'' of {{Mono|''E''}}. [[Director string]]s were an early approach that traded this time cost for a quadratic space usage.<ref>{{cite journal|first1=F.-R.|last1=Sinot|url=http://www.lsv.fr/Publis/PAPERS/PDF/sinot-jlc05.pdf|title=Director Strings Revisited: A Generic Approach to the Efficient Representation of Free Variables in Higher-order Rewriting|journal=Journal of Logic and Computation|volume=15|number=2|pages=201–218|year=2005|doi=10.1093/logcom/exi010}}</ref> More generally this has led to the study of systems that use [[explicit substitution]].


In 2014, it was shown that the number of β-reduction steps taken by normal order reduction to reduce a term is a ''reasonable'' time cost model, that is, the reduction can be simulated on a Turing machine in time polynomially proportional to the number of steps.<ref>{{cite book |last1=Accattoli |first1=Beniamino |last2=Dal Lago |first2=Ugo |title=Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS) |chapter=Beta reduction is invariant, indeed |date=14 July 2014 |pages=1–10 |doi=10.1145/2603088.2603105 |arxiv=1601.01233 |isbn=9781450328869 |s2cid=11485010 }}</ref> This was a long-standing open problem, due to ''size explosion'', the existence of lambda terms which grow exponentially in size for each β-reduction. The result gets around this by working with a compact shared representation. The result makes clear that the amount of space needed to evaluate a lambda term is not proportional to the size of the term during reduction. It is not currently known what a good measure of space complexity would be.<ref name=Reasonable>{{cite journal |last1=Accattoli |first1=Beniamino |title=(In)Efficiency and Reasonable Cost Models |journal=Electronic Notes in Theoretical Computer Science |date=October 2018 |volume=338 |pages=23–43 |doi=10.1016/j.entcs.2018.10.003 |doi-access=free }}</ref>
In 2014, it was shown that the number of β-reduction steps taken by normal order reduction to reduce a term is a ''reasonable'' time cost model, that is, the reduction can be simulated on a Turing machine in time polynomially proportional to the number of steps.<ref>{{cite book |last1=Accattoli |first1=Beniamino |last2=Dal Lago |first2=Ugo |title=Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS) |chapter=Beta reduction is invariant, indeed |date=14 July 2014 |pages=1–10 |doi=10.1145/2603088.2603105 |arxiv=1601.01233 |isbn=9781450328869 |s2cid=11485010}}</ref> This was a long-standing open problem, due to ''size explosion'', the existence of lambda terms which grow exponentially in size for each β-reduction. The result gets around this by working with a compact shared representation. The result makes clear that the amount of space needed to evaluate a lambda term is not proportional to the size of the term during reduction. It is not currently known what a good measure of space complexity would be.<ref name=Reasonable>{{cite journal |last1=Accattoli |first1=Beniamino |title=(In)Efficiency and Reasonable Cost Models |journal=Electronic Notes in Theoretical Computer Science |date=October 2018 |volume=338 |pages=23–43 |doi=10.1016/j.entcs.2018.10.003 |doi-access=free}}</ref>


An unreasonable model does not necessarily mean inefficient. [[Reduction strategy#Optimal reduction|Optimal reduction]] reduces all computations with the same label in one step, avoiding duplicated work, but the number of parallel β-reduction steps to reduce a given term to normal form is approximately linear in the size of the term. This is far too small to be a reasonable cost measure, as any Turing machine may be encoded in the lambda calculus in size linearly proportional to the size of the Turing machine. The true cost of reducing lambda terms is not due to β-reduction per se but rather the handling of the duplication of redexes during β-reduction.<ref name=Asperti>{{cite arXiv |last1=Asperti |first1=Andrea |title=About the efficient reduction of lambda terms |date=16 Jan 2017 |class=cs.LO |eprint=1701.04240v1 }}</ref> It is not known if optimal reduction implementations are reasonable when measured with respect to a reasonable cost model such as the number of leftmost-outermost steps to normal form, but it has been shown for fragments of the lambda calculus that the optimal reduction algorithm is efficient and has at most a quadratic overhead compared to leftmost-outermost.<ref name=Reasonable/> In addition the BOHM prototype implementation of optimal reduction outperformed both [[Caml|Caml Light]] and [[Haskell (programming language)|Haskell]] on pure lambda terms.<ref name=Asperti/>
An unreasonable model does not necessarily mean inefficient. [[Reduction strategy#Optimal reduction|Optimal reduction]] reduces all computations with the same label in one step, avoiding duplicated work, but the number of parallel β-reduction steps to reduce a given term to normal form is approximately linear in the size of the term. This is far too small to be a reasonable cost measure, as any Turing machine may be encoded in the lambda calculus in size linearly proportional to the size of the Turing machine. The true cost of reducing lambda terms is not due to β-reduction per se but rather the handling of the duplication of redexes during β-reduction.<ref name=Asperti>{{cite arXiv |last1=Asperti |first1=Andrea |title=About the efficient reduction of lambda terms |date=16 Jan 2017 |class=cs.LO |eprint=1701.04240v1}}</ref> It is not known if optimal reduction implementations are reasonable when measured with respect to a reasonable cost model such as the number of leftmost-outermost steps to normal form, but it has been shown for fragments of the lambda calculus that the optimal reduction algorithm is efficient and has at most a quadratic overhead compared to leftmost-outermost.<ref name=Reasonable/> In addition the BOHM prototype implementation of optimal reduction outperformed both [[Caml]] Light and Haskell on pure lambda terms.<ref name=Asperti/>


== Lambda calculus and programming languages ==
== Lambda calculus and programming languages ==
As pointed out by [[Peter Landin]]'s 1965 paper "A Correspondence between ALGOL 60 and Church's Lambda-notation",<ref>{{cite journal|title=A Correspondence between ALGOL 60 and Church's Lambda-notation|first=P. J. |last=Landin |author-link=Peter Landin |journal=Communications of the ACM|volume=8|issue=2|year=1965|pages=89–101|doi=10.1145/363744.363749|s2cid=6505810 |doi-access=free}}</ref> sequential [[procedural programming]] languages can be understood in terms of the lambda calculus, which provides the basic mechanisms for procedural abstraction and procedure (subprogram) application.
As pointed out by [[Peter Landin]]'s 1965 paper "''A Correspondence between [[ALGOL 60]] and Church's Lambda-notation''",<ref>{{cite journal |last1=Landin |first1=P. J. |author1-link=Peter Landin |year=1965 |title=A Correspondence between ALGOL 60 and Church's Lambda-notation |journal=Communications of the ACM |volume=8 |issue=2 |pages=89–101 |doi=10.1145/363744.363749 |s2cid=6505810 |doi-access=free}}</ref> sequential [[procedural programming]] languages can be understood in terms of the lambda calculus, which provides the basic mechanisms for procedural abstraction and procedure (subprogram) application.


=== Anonymous functions ===
=== Anonymous functions ===
{{Main|Anonymous function}}
{{Further|Anonymous function}}
For example, in [[Python (programming language)|Python]] the "square" function can be expressed as a lambda expression as follows:
For example, in [[Python (programming language)|Python]] the "square" function can be expressed as a lambda expression as follows:
<!-- Please do not add the same example in different languages to this article, see Anonymous function for that. Thank you! -->
<!-- Please do not add the same example in different languages to this article, see Anonymous function for that. Thank you! -->
Line 491: Line 499:
The above example is an expression that evaluates to a first-class function. The symbol <code>lambda</code> creates an anonymous function, given a list of parameter names, <code>x</code> – just a single argument in this case, and an expression that is evaluated as the body of the function, <code>x**2</code>. Anonymous functions are sometimes called lambda expressions.
The above example is an expression that evaluates to a first-class function. The symbol <code>lambda</code> creates an anonymous function, given a list of parameter names, <code>x</code> – just a single argument in this case, and an expression that is evaluated as the body of the function, <code>x**2</code>. Anonymous functions are sometimes called lambda expressions.


For example, [[Pascal (programming language)|Pascal]] and many other imperative languages have long supported passing [[subprograms]] as [[arguments]] to other subprograms through the mechanism of [[function pointers]]. However, function pointers are not a sufficient condition for functions to be [[First-class function|first class]] datatypes, because a function is a first class datatype if and only if new instances of the function can be created at run-time. And this run-time creation of functions is supported in [[Smalltalk]], [[JavaScript]] and [[Wolfram Language]], and more recently in [[Scala (programming language)|Scala]], [[Eiffel (programming language)|Eiffel]] ("agents"), [[C Sharp (programming language)|C#]] ("delegates") and [[C++11]], among others.
For example, [[Pascal (programming language)|Pascal]] and many other imperative languages have long supported passing [[Function (computer programming)|subprograms]] as [[Parameter (computer programming)|arguments]] to other subprograms through the mechanism of [[function pointer]]s. However, function pointers are an insufficient condition for functions to be [[First-class function|first class]] datatypes, because a function is a first class datatype if and only if new instances of the function can be created at [[Execution (computing)#runtime|runtime]]. Such runtime creation of functions is supported in [[Smalltalk]], [[JavaScript]], [[Wolfram Language]], and more recently in [[Scala (programming language)|Scala]], [[Eiffel (programming language)|Eiffel]] (as agents), [[C Sharp (programming language)|C#]] (as delegates) and [[C++11]], among others.


=== Parallelism and concurrency ===
=== Parallelism and concurrency ===
The [[Church–Rosser theorem|Church–Rosser]] property of the lambda calculus means that evaluation (β-reduction) can be carried out in ''any order'', even in parallel. This means that various [[Nondeterministic algorithm|nondeterministic]] [[evaluation strategy|evaluation strategies]] are relevant. However, the lambda calculus does not offer any explicit constructs for [[parallel computing|parallelism]]. One can add constructs such as [[Futures and promises|Futures]] to the lambda calculus. Other [[process calculi]] have been developed for describing communication and concurrency.
The [[Church–Rosser theorem|Church–Rosser]] property of the lambda calculus means that evaluation (β-reduction) can be carried out in ''any order'', even in parallel. This means that various [[Nondeterministic algorithm|nondeterministic]] [[evaluation strategy|evaluation strategies]] are relevant. However, the lambda calculus does not offer any explicit constructs for [[Parallel computing|parallelism]]. One can add constructs such as [[Futures and promises|futures]] to the lambda calculus. Other [[process calculi]] have been developed for describing communication and concurrency.


== Semantics ==
== Semantics ==
Line 519: Line 527:
* [[Typed lambda calculus]] – Lambda calculus with typed variables (and functions)
* [[Typed lambda calculus]] – Lambda calculus with typed variables (and functions)
* [[System F]] – A typed lambda calculus with type-variables
* [[System F]] – A typed lambda calculus with type-variables
* [[Calculus of constructions]] – A typed lambda calculus with [[type system|types]] as first-class values
* [[Calculus of constructions]] – A typed lambda calculus with types as first-class values


These [[formal system]]s are extensions of lambda calculus that are not in the lambda cube:
These [[formal system]]s are extensions of lambda calculus that are not in the lambda cube:
* [[Binary lambda calculus]] – A version of lambda calculus with binary I/O, a binary encoding of terms, and a designated universal machine.
* [[Binary lambda calculus]] – A version of lambda calculus with binary [[input/output]] (I/O), a binary encoding of terms, and a designated universal machine.
* [[Lambda-mu calculus]] – An extension of the lambda calculus for treating [[classical logic]]
* [[Lambda-mu calculus]] – An extension of the lambda calculus for treating [[classical logic]]


Line 533: Line 541:


== See also ==
== See also ==
{{portal|Mathematics}}
{{Portal|Mathematics}}
{{colbegin|colwidth=30em}}
{{Colbegin|colwidth=30em}}
* [[Applicative computing systems]] – Treatment of [[object (computer science)|objects]] in the style of the lambda calculus
* [[Applicative computing systems]] – Treatment of [[Object (computer science)|objects]] in the style of the lambda calculus
* [[Cartesian closed category]] – A setting for lambda calculus in [[category theory]]
* [[Cartesian closed category]] – A setting for lambda calculus in [[category theory]]
* [[Categorical abstract machine]] – A [[model of computation]] applicable to lambda calculus
* [[Categorical abstract machine]] – A [[model of computation]] applicable to lambda calculus
Line 545: Line 553:
* [[Evaluation strategy]] – Rules for the evaluation of expressions in [[programming language]]s
* [[Evaluation strategy]] – Rules for the evaluation of expressions in [[programming language]]s
* [[Explicit substitution]] – The theory of substitution, as used in [[#β-reduction|β-reduction]]
* [[Explicit substitution]] – The theory of substitution, as used in [[#β-reduction|β-reduction]]
* [[Functional programming]]
* [[Harrop formula]] – A kind of constructive logical formula such that proofs are lambda terms
* [[Harrop formula]] – A kind of constructive logical formula such that proofs are lambda terms
* [[Interaction nets]]
* [[Interaction nets]]
Line 558: Line 565:
* [[Scott–Curry theorem]] – A theorem about sets of lambda terms
* [[Scott–Curry theorem]] – A theorem about sets of lambda terms
* ''[[To Mock a Mockingbird]]'' – An introduction to [[combinatory logic]]
* ''[[To Mock a Mockingbird]]'' – An introduction to [[combinatory logic]]
* [[Universal Turing machine]] – A formal computing machine that is equivalent to lambda calculus
* [[Universal Turing machine]] – A formal computing machine equivalent to lambda calculus
* [[Unlambda]] – An [[esoteric programming language|esoteric]] functional programming language based on combinatory logic
* [[Unlambda]] – A functional [[esoteric programming language]] based on combinatory logic
{{colend}}
{{colend}}


Line 567: Line 574:
* Barendregt, Hendrik Pieter, [https://www.jstor.org/stable/421013 The Impact of the Lambda Calculus in Logic and Computer Science]. The Bulletin of Symbolic Logic, Volume 3, Number 2, June 1997.
* Barendregt, Hendrik Pieter, [https://www.jstor.org/stable/421013 The Impact of the Lambda Calculus in Logic and Computer Science]. The Bulletin of Symbolic Logic, Volume 3, Number 2, June 1997.
* Barendregt, Hendrik Pieter, ''The Type Free Lambda Calculus'' pp1091–1132 of ''Handbook of Mathematical Logic'', [[North-Holland Publishing Company|North-Holland]] (1977) {{isbn|0-7204-2285-X}}
* Barendregt, Hendrik Pieter, ''The Type Free Lambda Calculus'' pp1091–1132 of ''Handbook of Mathematical Logic'', [[North-Holland Publishing Company|North-Holland]] (1977) {{isbn|0-7204-2285-X}}
* Cardone, Felice and Hindley, J. Roger, 2006. [http://www.users.waitrose.com/~hindley/SomePapers_PDFs/2006CarHin,HistlamRp.pdf History of Lambda-calculus and Combinatory Logic] {{Webarchive|url=https://web.archive.org/web/20210506154120/http://www.users.waitrose.com/~hindley/SomePapers_PDFs/2006CarHin,HistlamRp.pdf |date=2021-05-06 }}. In Gabbay and Woods (eds.), ''Handbook of the History of Logic'', vol. 5. Elsevier.
* Cardone, Felice and Hindley, J. Roger, 2006. [http://www.users.waitrose.com/~hindley/SomePapers_PDFs/2006CarHin,HistlamRp.pdf History of Lambda-calculus and Combinatory Logic] {{Webarchive|url=https://web.archive.org/web/20210506154120/http://www.users.waitrose.com/~hindley/SomePapers_PDFs/2006CarHin,HistlamRp.pdf |date=2021-05-06}}. In Gabbay and Woods (eds.), ''Handbook of the History of Logic'', vol. 5. Elsevier.
* Church, Alonzo, ''An unsolvable problem of elementary number theory'', [[American Journal of Mathematics]], 58 (1936), pp.&nbsp;345–363. This paper contains the proof that the equivalence of lambda expressions is in general not decidable.
* Church, Alonzo, ''An unsolvable problem of elementary number theory'', [[American Journal of Mathematics]], 58 (1936), pp.&nbsp;345–363. This paper contains the proof that the equivalence of lambda expressions is in general not decidable.
* {{cite book |last1=Church |first1=Alonzo |url=https://archive.org/details/AnnalsOfMathematicalStudies6ChurchAlonzoTheCalculiOfLambdaConversionPrincetonUniversityPress1941 |title=The Calculi of Lambda-Conversion |location=Princeton |publisher=Princeton University Press |year=1941 |access-date=2020-04-14 }} ({{isbn|978-0-691-08394-0}})
* {{cite book |last1=Church |first1=Alonzo |year=1941 |url=https://archive.org/details/AnnalsOfMathematicalStudies6ChurchAlonzoTheCalculiOfLambdaConversionPrincetonUniversityPress1941 |title=The Calculi of Lambda-Conversion |location=Princeton |publisher=Princeton University Press |access-date=2020-04-14}} ({{isbn|978-0-691-08394-0}})
* {{cite journal|author-link=Orrin Frink|author=Frink Jr., Orrin|title=Review: ''The Calculi of Lambda-Conversion'' by Alonzo Church|journal=Bull. Amer. Math. Soc.|year=1944|volume=50|issue=3|pages=169–172|url=https://www.ams.org/bull/1944-50-03/S0002-9904-1944-08090-7/S0002-9904-1944-08090-7.pdf|doi=10.1090/s0002-9904-1944-08090-7|doi-access=free}}
* {{cite journal|author1-link=Orrin Frink |author=Frink Jr., Orrin |year=1944 |title=Review: ''The Calculi of Lambda-Conversion'' by Alonzo Church |journal=Bulletin of the American Mathematics Society |volume=50 |issue=3 |pages=169–172 |url=https://www.ams.org/bull/1944-50-03/S0002-9904-1944-08090-7/S0002-9904-1944-08090-7.pdf|doi=10.1090/s0002-9904-1944-08090-7 |doi-access=free}}
* Kleene, Stephen, ''A theory of positive integers in formal logic'', [[American Journal of Mathematics]], 57 (1935), pp.&nbsp;153–173 and 219–244. Contains the lambda calculus definitions of several familiar functions.
* Kleene, Stephen, ''A theory of positive integers in formal logic'', [[American Journal of Mathematics]], 57 (1935), pp.&nbsp;153–173 and 219–244. Contains the lambda calculus definitions of several familiar functions.
* [[Peter Landin|Landin, Peter]], ''A Correspondence Between ALGOL 60 and Church's Lambda-Notation'', [[Communications of the ACM]], vol. 8, no. 2 (1965), pages 89–101. Available from the [http://portal.acm.org/citation.cfm?id=363749&coll=portal&dl=ACM ACM site]. A classic paper highlighting the importance of lambda calculus as a basis for programming languages.
* [[Peter Landin|Landin, Peter]], ''A Correspondence Between ALGOL 60 and Church's Lambda-Notation'', [[Communications of the ACM]], vol. 8, no. 2 (1965), pages 89–101. Available from the [http://portal.acm.org/citation.cfm?id=363749&coll=portal&dl=ACM ACM site]. A classic paper highlighting the importance of lambda calculus as a basis for programming languages.
Line 576: Line 583:
*{{cite book |last1=Michaelson |first1=Greg |title=An Introduction to Functional Programming Through Lambda Calculus |date=10 April 2013 |publisher=Courier Corporation |isbn=978-0-486-28029-5 |language=en}}<ref>{{cite web |title=Greg Michaelson's Homepage |url=http://www.macs.hw.ac.uk/~greg/ |website=Mathematical and Computer Sciences |publisher=[[Heriot-Watt University]] |access-date=6 November 2022 |location=Riccarton, Edinburgh}}</ref>
*{{cite book |last1=Michaelson |first1=Greg |title=An Introduction to Functional Programming Through Lambda Calculus |date=10 April 2013 |publisher=Courier Corporation |isbn=978-0-486-28029-5 |language=en}}<ref>{{cite web |title=Greg Michaelson's Homepage |url=http://www.macs.hw.ac.uk/~greg/ |website=Mathematical and Computer Sciences |publisher=[[Heriot-Watt University]] |access-date=6 November 2022 |location=Riccarton, Edinburgh}}</ref>
* Schalk, A. and Simmons, H. (2005) ''[https://web.archive.org/web/20080307014129/http://www.cs.man.ac.uk/~hsimmons/BOOKS/lcalculus.pdf An introduction to λ-calculi and arithmetic with a decent selection of exercises]''. Notes for a course in the Mathematical Logic MSc at Manchester University.
* Schalk, A. and Simmons, H. (2005) ''[https://web.archive.org/web/20080307014129/http://www.cs.man.ac.uk/~hsimmons/BOOKS/lcalculus.pdf An introduction to λ-calculi and arithmetic with a decent selection of exercises]''. Notes for a course in the Mathematical Logic MSc at Manchester University.
* {{cite journal|last=de Queiroz|first=Ruy J.G.B.|author-link=Ruy de Queiroz|year=2008|title=On Reduction Rules, Meaning-as-Use and Proof-Theoretic Semantics|journal=[[Studia Logica]]|volume=90|issue=2|pages=211–247|doi=10.1007/s11225-008-9150-5|s2cid=11321602}} A paper giving a formal underpinning to the idea of 'meaning-is-use' which, even if based on proofs, it is different from proof-theoretic semantics as in the Dummett–Prawitz tradition since it takes reduction as the rules giving meaning.
* {{cite journal|last1=de Queiroz|first1=Ruy J.G.B.|author1-link=Ruy de Queiroz|year=2008|title=On Reduction Rules, Meaning-as-Use and Proof-Theoretic Semantics|journal=[[Studia Logica]]|volume=90|issue=2|pages=211–247|doi=10.1007/s11225-008-9150-5|s2cid=11321602}} A paper giving a formal underpinning to the idea of 'meaning-is-use' which, even if based on proofs, it is different from proof-theoretic semantics as in the Dummett–Prawitz tradition since it takes reduction as the rules giving meaning.
* Hankin, Chris, ''An Introduction to Lambda Calculi for Computer Scientists,'' {{isbn|0954300653}}
* Hankin, Chris, ''An Introduction to Lambda Calculi for Computer Scientists,'' {{isbn|0954300653}}


;Monographs/textbooks for graduate students:
;Monographs/textbooks for graduate students:
* Sørensen, Morten Heine and Urzyczyn, Paweł (2006), ''Lectures on the Curry–Howard isomorphism'', Elsevier, {{isbn|0-444-52077-5}} is a recent monograph that covers the main topics of lambda calculus from the type-free variety, to most [[typed lambda calculi]], including more recent developments like [[pure type system]]s and the [[lambda cube]]. It does not cover [[subtyping]] extensions.
* Sørensen, Morten Heine and Urzyczyn, Paweł (2006), ''Lectures on the Curry–Howard isomorphism'', Elsevier, {{isbn|0-444-52077-5}} is a recent monograph that covers the main topics of lambda calculus from the type-free variety, to most [[typed lambda calculi]], including more recent developments like [[pure type system]]s and the [[lambda cube]]. It does not cover [[subtyping]] extensions.
* {{Citation|last=Pierce|first=Benjamin|title=Types and Programming Languages|publisher=MIT Press|year=2002|isbn=0-262-16209-1}} covers lambda calculi from a practical type system perspective; some topics like dependent types are only mentioned, but subtyping is an important topic.
* {{Citation|last1=Pierce|first1=Benjamin|title=Types and Programming Languages|publisher=MIT Press|year=2002|isbn=0-262-16209-1}} covers lambda calculi from a practical type system perspective; some topics like dependent types are only mentioned, but subtyping is an important topic.


;Documents
;Documents
Line 596: Line 603:
== References ==
== References ==
''Some parts of this article are based on material from [[Free On-line Dictionary of Computing|FOLDOC]], used with [[Wikipedia:Foldoc license|permission]].''
''Some parts of this article are based on material from [[Free On-line Dictionary of Computing|FOLDOC]], used with [[Wikipedia:Foldoc license|permission]].''
{{Reflist|30em}}
{{Reflist}}


== External links ==
== External links ==
Line 607: Line 614:
* Georg P. Loczewski, [http://www.lambda-bound.com/book/lambdacalc/lcalconl.html ''The Lambda Calculus and A++'']
* Georg P. Loczewski, [http://www.lambda-bound.com/book/lambdacalc/lcalconl.html ''The Lambda Calculus and A++'']
* Bret Victor, ''[http://worrydream.com/AlligatorEggs/ Alligator Eggs: A Puzzle Game Based on Lambda Calculus]''
* Bret Victor, ''[http://worrydream.com/AlligatorEggs/ Alligator Eggs: A Puzzle Game Based on Lambda Calculus]''
* ''[http://www.safalra.com/science/lambda-calculus/ Lambda Calculus] {{Webarchive|url=https://web.archive.org/web/20121014180848/http://safalra.com/science/lambda-calculus/ |date=2012-10-14 }}'' on [http://www.safalra.com/ Safalra's Website] {{Webarchive|url=https://web.archive.org/web/20210502051230/https://safalra.com/ |date=2021-05-02 }}
* ''[http://www.safalra.com/science/lambda-calculus/ Lambda Calculus] {{Webarchive|url=https://web.archive.org/web/20121014180848/http://safalra.com/science/lambda-calculus/ |date=2012-10-14}}'' on [http://www.safalra.com/ Safalra's Website] {{Webarchive|url=https://web.archive.org/web/20210502051230/https://safalra.com/ |date=2021-05-02}}
* [https://chatziko.github.io/lci/ LCI Lambda Interpreter] a simple yet powerful pure calculus interpreter
* [https://chatziko.github.io/lci/ LCI Lambda Interpreter] a simple yet powerful pure calculus interpreter
* [http://lambda-the-ultimate.org/classic/lc.html Lambda Calculus links on Lambda-the-Ultimate]
* [http://lambda-the-ultimate.org/classic/lc.html Lambda Calculus links on Lambda-the-Ultimate]
Line 619: Line 626:
{{authority control}}
{{authority control}}
{{Formal semantics}}
{{Formal semantics}}
{{Functions navbox}}


[[Category:Lambda calculus| ]]
[[Category:Lambda calculus| ]]
[[Category:1936 in computing]]
[[Category:1936 in computing]]
[[Category:Articles with example Lisp (programming language) code]]
[[Category:Computability theory]]
[[Category:Computability theory]]
[[Category:Formal methods]]
[[Category:Formal methods]]
[[Category:Models of computation]]
[[Category:Models of computation]]
[[Category:Theoretical computer science]]
[[Category:Theoretical computer science]]
[[Category:Programming language comparisons]]
<!-- Hidden categories below -->
[[Category:Articles with example Lisp (programming language) code]]
[[Category:Articles with example Python (programming language) code]]

Latest revision as of 15:20, 9 December 2024

Lambda calculus (also written as λ-calculus) is a formal system in mathematical logic for expressing computation based on function abstraction and application using variable binding and substitution. Untyped lambda calculus, the topic of this article, is a universal machine, a model of computation that can be used to simulate any Turing machine (and vice versa). It was introduced by the mathematician Alonzo Church in the 1930s as part of his research into the foundations of mathematics. In 1936, Church found a formulation which was logically consistent, and documented it in 1940.

Lambda calculus consists of constructing lambda terms and performing reduction operations on them. A term is defined as any valid lambda calculus expression. In the simplest form of lambda calculus, terms are built using only the following rules:[a]

  1. : A variable is a character or string representing a parameter.
  2. : A lambda abstraction is a function definition, taking as input the bound variable (between the λ and the punctum/dot .) and returning the body .
  3. : An application, applying a function to an argument . Both and are lambda terms.

The reduction operations include:

  •  : α-conversion, renaming the bound variables in the expression. Used to avoid name collisions.
  •  : β-reduction,[b] replacing the bound variables with the argument expression in the body of the abstraction.

If De Bruijn indexing is used, then α-conversion is no longer required as there will be no name collisions. If repeated application of the reduction steps eventually terminates, then by the Church–Rosser theorem it will produce a β-normal form.

Variable names are not needed if using a universal lambda function, such as Iota and Jot, which can create any function behavior by calling it on itself in various combinations.

Explanation and applications

[edit]

Lambda calculus is Turing complete, that is, it is a universal model of computation that can be used to simulate any Turing machine.[3] Its namesake, the Greek letter lambda (λ), is used in lambda expressions and lambda terms to denote binding a variable in a function.

Lambda calculus may be untyped or typed. In typed lambda calculus, functions can be applied only if they are capable of accepting the given input's "type" of data. Typed lambda calculi are strictly weaker than the untyped lambda calculus, which is the primary subject of this article, in the sense that typed lambda calculi can express less than the untyped calculus can. On the other hand, typed lambda calculi allow more things to be proven. For example, in simply typed lambda calculus, it is a theorem that every evaluation strategy terminates for every simply typed lambda-term, whereas evaluation of untyped lambda-terms need not terminate (see below). One reason there are many different typed lambda calculi has been the desire to do more (of what the untyped calculus can do) without giving up on being able to prove strong theorems about the calculus.

Lambda calculus has applications in many different areas in mathematics, philosophy,[4] linguistics,[5][6] and computer science.[7] [8] Lambda calculus has played an important role in the development of the theory of programming languages. Functional programming languages implement lambda calculus. Lambda calculus is also a current research topic in category theory.[9]

History

[edit]

Lambda calculus was introduced by mathematician Alonzo Church in the 1930s as part of an investigation into the foundations of mathematics.[10][c] The original system was shown to be logically inconsistent in 1935 when Stephen Kleene and J. B. Rosser developed the Kleene–Rosser paradox.[11][12]

Subsequently, in 1936 Church isolated and published just the portion relevant to computation, what is now called the untyped lambda calculus.[13] In 1940, he also introduced a computationally weaker, but logically consistent system, known as the simply typed lambda calculus.[14]

Until the 1960s when its relation to programming languages was clarified, the lambda calculus was only a formalism. Thanks to Richard Montague and other linguists' applications in the semantics of natural language, the lambda calculus has begun to enjoy a respectable place in both linguistics[15] and computer science.[16]

Origin of the λ symbol

[edit]

There is some uncertainty over the reason for Church's use of the Greek letter lambda (λ) as the notation for function-abstraction in the lambda calculus, perhaps in part due to conflicting explanations by Church himself. According to Cardone and Hindley (2006):

By the way, why did Church choose the notation “λ”? In [an unpublished 1964 letter to Harald Dickson] he stated clearly that it came from the notation “” used for class-abstraction by Whitehead and Russell, by first modifying “” to “” to distinguish function-abstraction from class-abstraction, and then changing “” to “λ” for ease of printing.

This origin was also reported in [Rosser, 1984, p.338]. On the other hand, in his later years Church told two enquirers that the choice was more accidental: a symbol was needed and λ just happened to be chosen.

Dana Scott has also addressed this question in various public lectures.[17] Scott recounts that he once posed a question about the origin of the lambda symbol to Church's former student and son-in-law John W. Addison Jr., who then wrote his father-in-law a postcard:

Dear Professor Church,

Russell had the iota operator, Hilbert had the epsilon operator. Why did you choose lambda for your operator?

According to Scott, Church's entire response consisted of returning the postcard with the following annotation: "eeny, meeny, miny, moe".

Informal description

[edit]

Motivation

[edit]

Computable functions are a fundamental concept within computer science and mathematics. The lambda calculus provides simple semantics for computation which are useful for formally studying properties of computation. The lambda calculus incorporates two simplifications that make its semantics simple. The first simplification is that the lambda calculus treats functions "anonymously;" it does not give them explicit names. For example, the function

can be rewritten in anonymous form as

(which is read as "a tuple of x and y is mapped to ").[d] Similarly, the function

can be rewritten in anonymous form as

where the input is simply mapped to itself.[d]

The second simplification is that the lambda calculus only uses functions of a single input. An ordinary function that requires two inputs, for instance the function, can be reworked into an equivalent function that accepts a single input, and as output returns another function, that in turn accepts a single input. For example,

can be reworked into

This method, known as currying, transforms a function that takes multiple arguments into a chain of functions each with a single argument.

Function application of the function to the arguments (5, 2), yields at once

,

whereas evaluation of the curried version requires one more step

// the definition of has been used with in the inner expression. This is like β-reduction.
// the definition of has been used with . Again, similar to β-reduction.

to arrive at the same result.

The lambda calculus

[edit]

The lambda calculus consists of a language of lambda terms, that are defined by a certain formal syntax, and a set of transformation rules for manipulating the lambda terms. These transformation rules can be viewed as an equational theory or as an operational definition.

As described above, having no names, all functions in the lambda calculus are anonymous functions. They only accept one input variable, so currying is used to implement functions of several variables.

Lambda terms

[edit]

The syntax of the lambda calculus defines some expressions as valid lambda calculus expressions and some as invalid, just as some strings of characters are valid computer programs and some are not. A valid lambda calculus expression is called a "lambda term".

The following three rules give an inductive definition that can be applied to build all syntactically valid lambda terms:[e]

  • variable x is itself a valid lambda term.
  • if t is a lambda term, and x is a variable, then [f] is a lambda term (called an abstraction);
  • if t and s are lambda terms, then   is a lambda term (called an application).

Nothing else is a lambda term. That is, a lambda term is valid if and only if it can be obtained by repeated application of these three rules. For convenience, some parentheses can be omitted when writing a lambda term. For example, the outermost parentheses are usually not written. See § Notation, below, for an explicit description of which parentheses are optional. It is also common to extend the syntax presented here with additional operations, which allows making sense of terms such as The focus of this article is the pure lambda calculus without extensions, but lambda terms extended with arithmetic operations are used for explanatory purposes.

An abstraction denotes an § anonymous function[g] that takes a single input x and returns t. For example, is an abstraction representing the function defined by using the term for t. The name is superfluous when using abstraction. The syntax binds the variable x in the term t. The definition of a function with an abstraction merely "sets up" the function but does not invoke it.

An application   represents the application of a function t to an input s, that is, it represents the act of calling function t on input s to produce .

A lambda term may refer to a variable that has not been bound, such as the term (which represents the function definition ). In this term, the variable y has not been defined and is considered an unknown. The abstraction is a syntactically valid term and represents a function that adds its input to the yet-unknown y.

Parentheses may be used and might be needed to disambiguate terms. For example,

  1. is of form and is therefore an abstraction, while
  2. is of form   and is therefore an application.

The examples 1 and 2 denote different terms, differing only in where the parentheses are placed. They have different meanings: example 1 is a function definition, while example 2 is a function application. The lambda variable x is a placeholder in both examples.

Here, example 1 defines a function , where is , an anonymous function , with input ; while example 2,  , is M applied to N, where is the lambda term being applied to the input which is . Both examples 1 and 2 would evaluate to the identity function .

Functions that operate on functions

[edit]

In lambda calculus, functions are taken to be 'first class values', so functions may be used as the inputs, or be returned as outputs from other functions.

For example, the lambda term represents the identity function, . Further, represents the constant function , the function that always returns , no matter the input. As an example of a function operating on functions, the function composition can be defined as .

There are several notions of "equivalence" and "reduction" that allow lambda terms to be "reduced" to "equivalent" lambda terms.

Alpha equivalence

[edit]

A basic form of equivalence, definable on lambda terms, is alpha equivalence. It captures the intuition that the particular choice of a bound variable, in an abstraction, does not (usually) matter. For instance, and are alpha-equivalent lambda terms, and they both represent the same function (the identity function). The terms and are not alpha-equivalent, because they are not bound in an abstraction. In many presentations, it is usual to identify alpha-equivalent lambda terms.

The following definitions are necessary in order to be able to define β-reduction:

Free variables

[edit]

The free variables [h] of a term are those variables not bound by an abstraction. The set of free variables of an expression is defined inductively:

  • The free variables of are just
  • The set of free variables of is the set of free variables of , but with removed
  • The set of free variables of is the union of the set of free variables of and the set of free variables of .

For example, the lambda term representing the identity has no free variables, but the function has a single free variable, .

Capture-avoiding substitutions

[edit]

Suppose , and are lambda terms, and and are variables. The notation indicates substitution of for in in a capture-avoiding manner. This is defined so that:

  •  ; with substituted for , becomes
  • if  ; with substituted for , (which is not ) remains
  •  ; substitution distributes to both sides of an application
  •  ; a variable bound by an abstraction is not subject to substitution; substituting such variable leaves the abstraction unchanged
  • if and does not appear among the free variables of ( is said to be "fresh" for ) ; substituting a variable which is not bound by an abstraction proceeds in the abstraction's body, provided that the abstracted variable is "fresh" for the substitution term .

For example, , and .

The freshness condition (requiring that is not in the free variables of ) is crucial in order to ensure that substitution does not change the meaning of functions.

For example, a substitution that ignores the freshness condition could lead to errors: . This erroneous substitution would turn the constant function into the identity .

In general, failure to meet the freshness condition can be remedied by alpha-renaming first, with a suitable fresh variable. For example, switching back to our correct notion of substitution, in the abstraction can be renamed with a fresh variable , to obtain , and the meaning of the function is preserved by substitution.

β-reduction

[edit]

The β-reduction rule[b] states that an application of the form reduces to the term . The notation is used to indicate that β-reduces to . For example, for every , . This demonstrates that really is the identity. Similarly, , which demonstrates that is a constant function.

The lambda calculus may be seen as an idealized version of a functional programming language, like Haskell or Standard ML. Under this view, β-reduction corresponds to a computational step. This step can be repeated by additional β-reductions until there are no more applications left to reduce. In the untyped lambda calculus, as presented here, this reduction process may not terminate. For instance, consider the term . Here . That is, the term reduces to itself in a single β-reduction, and therefore the reduction process will never terminate.

Another aspect of the untyped lambda calculus is that it does not distinguish between different kinds of data. For instance, it may be desirable to write a function that only operates on numbers. However, in the untyped lambda calculus, there is no way to prevent a function from being applied to truth values, strings, or other non-number objects.

Formal definition

[edit]

Definition

[edit]

Lambda expressions are composed of:

  • variables v1, v2, ...;
  • the abstraction symbols λ (lambda) and . (dot);
  • parentheses ().

The set of lambda expressions, Λ, can be defined inductively:

  1. If x is a variable, then x ∈ Λ.
  2. If x is a variable and M ∈ Λ, then x.M) ∈ Λ.
  3. If M, N ∈ Λ, then (M N) ∈ Λ.

Instances of rule 2 are known as abstractions and instances of rule 3 are known as applications.[18] See § reducible expression

This set of rules may be written in Backus–Naur form as:

 <expression>  :== <abstraction> | <application> | <variable>
 <abstraction> :== λ <variable> . <expression>
 <application> :== ( <expression> <expression> )
 <variable>    :== v1 | v2 | ...

Notation

[edit]

To keep the notation of lambda expressions uncluttered, the following conventions are usually applied:

  • Outermost parentheses are dropped: M N instead of (M N).
  • Applications are assumed to be left associative: M N P may be written instead of ((M N) P).[19]
  • When all variables are single-letter, the space in applications may be omitted: MNP instead of M N P.[20]
  • The body of an abstraction extends as far right as possible: λx.M N means λx.(M N) and not (λx.M) N.
  • A sequence of abstractions is contracted: λxyz.N is abbreviated as λxyz.N.[21][19]

Free and bound variables

[edit]

The abstraction operator, λ, is said to bind its variable wherever it occurs in the body of the abstraction. Variables that fall within the scope of an abstraction are said to be bound. In an expression λx.M, the part λx is often called binder, as a hint that the variable x is getting bound by prepending λx to M. All other variables are called free. For example, in the expression λy.x x y, y is a bound variable and x is a free variable. Also a variable is bound by its nearest abstraction. In the following example the single occurrence of x in the expression is bound by the second lambda: λx.yx.z x).

The set of free variables of a lambda expression, M, is denoted as FV(M) and is defined by recursion on the structure of the terms, as follows:

  1. FV(x) = {x}, where x is a variable.
  2. FV(λx.M) = FV(M) \ {x}.[i]
  3. FV(M N) = FV(M) ∪ FV(N).[j]

An expression that contains no free variables is said to be closed. Closed lambda expressions are also known as combinators and are equivalent to terms in combinatory logic.

Reduction

[edit]

The meaning of lambda expressions is defined by how expressions can be reduced.[22]

There are three kinds of reduction:

  • α-conversion: changing bound variables;
  • β-reduction: applying functions to their arguments;
  • η-reduction: which captures a notion of extensionality.

We also speak of the resulting equivalences: two expressions are α-equivalent, if they can be α-converted into the same expression. β-equivalence and η-equivalence are defined similarly.

The term redex, short for reducible expression, refers to subterms that can be reduced by one of the reduction rules. For example, (λx.M) N is a β-redex in expressing the substitution of N for x in M. The expression to which a redex reduces is called its reduct; the reduct of (λx.M) N is M[x := N].[b]

If x is not free in M, λx.M x is also an η-redex, with a reduct of M.

α-conversion

[edit]

α-conversion (alpha-conversion), sometimes known as α-renaming,[23] allows bound variable names to be changed. For example, α-conversion of λx.x might yield λy.y. Terms that differ only by α-conversion are called α-equivalent. Frequently, in uses of lambda calculus, α-equivalent terms are considered to be equivalent.

The precise rules for α-conversion are not completely trivial. First, when α-converting an abstraction, the only variable occurrences that are renamed are those that are bound to the same abstraction. For example, an α-conversion of λxx.x could result in λyx.x, but it could not result in λyx.y. The latter has a different meaning from the original. This is analogous to the programming notion of variable shadowing.

Second, α-conversion is not possible if it would result in a variable getting captured by a different abstraction. For example, if we replace x with y in λxy.x, we get λyy.y, which is not at all the same.

In programming languages with static scope, α-conversion can be used to make name resolution simpler by ensuring that no variable name masks a name in a containing scope (see α-renaming to make name resolution trivial).

In the De Bruijn index notation, any two α-equivalent terms are syntactically identical.

Substitution

[edit]

Substitution, written M[x := N], is the process of replacing all free occurrences of the variable x in the expression M with expression N. Substitution on terms of the lambda calculus is defined by recursion on the structure of terms, as follows (note: x and y are only variables while M and N are any lambda expression):

x[x := N] = N
y[x := N] = y, if xy
(M1 M2)[x := N] = M1[x := N] M2[x := N]
x.M)[x := N] = λx.M
y.M)[x := N] = λy.(M[x := N]), if xy and y ∉ FV(N) See above for the FV

To substitute into an abstraction, it is sometimes necessary to α-convert the expression. For example, it is not correct for (λx.y)[y := x] to result in λx.x, because the substituted x was supposed to be free but ended up being bound. The correct substitution in this case is λz.x, up to α-equivalence. Substitution is defined uniquely up to α-equivalence. See Capture-avoiding substitutions above

β-reduction

[edit]

β-reduction (beta reduction) captures the idea of function application. β-reduction is defined in terms of substitution: the β-reduction of (λx.M) N is M[x := N].[b]

For example, assuming some encoding of 2, 7, ×, we have the following β-reduction: (λn.n × 2) 7 → 7 × 2.

β-reduction can be seen to be the same as the concept of local reducibility in natural deduction, via the Curry–Howard isomorphism.

η-reduction

[edit]

η-reduction (eta reduction) expresses the idea of extensionality,[24] which in this context is that two functions are the same if and only if they give the same result for all arguments. η-reduction converts between λx.f x and f whenever x does not appear free in f.

η-reduction can be seen to be the same as the concept of local completeness in natural deduction, via the Curry–Howard isomorphism.

Normal forms and confluence

[edit]

For the untyped lambda calculus, β-reduction as a rewriting rule is neither strongly normalising nor weakly normalising.

However, it can be shown that β-reduction is confluent when working up to α-conversion (i.e. we consider two normal forms to be equal if it is possible to α-convert one into the other).

Therefore, both strongly normalising terms and weakly normalising terms have a unique normal form. For strongly normalising terms, any reduction strategy is guaranteed to yield the normal form, whereas for weakly normalising terms, some reduction strategies may fail to find it.

Encoding datatypes

[edit]

The basic lambda calculus may be used to model arithmetic, Booleans, data structures, and recursion, as illustrated in the following sub-sections i, ii, iii, and § iv.

Arithmetic in lambda calculus

[edit]

There are several possible ways to define the natural numbers in lambda calculus, but by far the most common are the Church numerals, which can be defined as follows:

0 := λfx.x
1 := λfx.f x
2 := λfx.f (f x)
3 := λfx.f (f (f x))

and so on. Or using the alternative syntax presented above in Notation:

0 := λfx.x
1 := λfx.f x
2 := λfx.f (f x)
3 := λfx.f (f (f x))

A Church numeral is a higher-order function—it takes a single-argument function f, and returns another single-argument function. The Church numeral n is a function that takes a function f as argument and returns the n-th composition of f, i.e. the function f composed with itself n times. This is denoted f(n) and is in fact the n-th power of f (considered as an operator); f(0) is defined to be the identity function. Such repeated compositions (of a single function f) obey the laws of exponents, which is why these numerals can be used for arithmetic. (In Church's original lambda calculus, the formal parameter of a lambda expression was required to occur at least once in the function body, which made the above definition of 0 impossible.)

One way of thinking about the Church numeral n, which is often useful when analysing programs, is as an instruction 'repeat n times'. For example, using the PAIR and NIL functions defined below, one can define a function that constructs a (linked) list of n elements all equal to x by repeating 'prepend another x element' n times, starting from an empty list. The lambda term is

λnx.n (PAIR x) NIL

By varying what is being repeated, and varying what argument that function being repeated is applied to, a great many different effects can be achieved.

We can define a successor function, which takes a Church numeral n and returns n + 1 by adding another application of f, where '(mf)x' means the function 'f' is applied 'm' times on 'x':

SUCC := λnfx.f (n f x)

Because the m-th composition of f composed with the n-th composition of f gives the m+n-th composition of f, addition can be defined as follows:

PLUS := λmnfx.m f (n f x)

PLUS can be thought of as a function taking two natural numbers as arguments and returning a natural number; it can be verified that

PLUS 2 3

and

5

are β-equivalent lambda expressions. Since adding m to a number n can be accomplished by adding 1 m times, an alternative definition is:

PLUS := λmn.m SUCC n [25]

Similarly, multiplication can be defined as

MULT := λmnf.m (n f)[21]

Alternatively

MULT := λmn.m (PLUS n) 0

since multiplying m and n is the same as repeating the add n function m times and then applying it to zero. Exponentiation has a rather simple rendering in Church numerals, namely

POW := λbe.e b[1]

The predecessor function defined by PRED n = n − 1 for a positive integer n and PRED 0 = 0 is considerably more difficult. The formula

PRED := λnfx.ngh.h (g f)) (λu.x) (λu.u)

can be validated by showing inductively that if T denotes gh.h (g f)), then T(n)u.x) = (λh.h(f(n−1)(x))) for n > 0. Two other definitions of PRED are given below, one using conditionals and the other using pairs. With the predecessor function, subtraction is straightforward. Defining

SUB := λmn.n PRED m,

SUB m n yields mn when m > n and 0 otherwise.

Logic and predicates

[edit]

By convention, the following two definitions (known as Church Booleans) are used for the Boolean values TRUE and FALSE:

TRUE := λxy.x
FALSE := λxy.y

Then, with these two lambda terms, we can define some logic operators (these are just possible formulations; other expressions could be equally correct):

AND := λpq.p q p
OR := λpq.p p q
NOT := λp.p FALSE TRUE
IFTHENELSE := λpab.p a b

We are now able to compute some logic functions, for example:

AND TRUE FALSE
≡ (λpq.p q p) TRUE FALSE →β TRUE FALSE TRUE
≡ (λxy.x) FALSE TRUE →β FALSE

and we see that AND TRUE FALSE is equivalent to FALSE.

A predicate is a function that returns a Boolean value. The most fundamental predicate is ISZERO, which returns TRUE if its argument is the Church numeral 0, but FALSE if its argument were any other Church numeral:

ISZERO := λn.nx.FALSE) TRUE

The following predicate tests whether the first argument is less-than-or-equal-to the second:

LEQ := λmn.ISZERO (SUB m n),

and since m = n, if LEQ m n and LEQ n m, it is straightforward to build a predicate for numerical equality.

The availability of predicates and the above definition of TRUE and FALSE make it convenient to write "if-then-else" expressions in lambda calculus. For example, the predecessor function can be defined as:

PRED := λn.ngk.ISZERO (g 1) k (PLUS (g k) 1)) (λv.0) 0

which can be verified by showing inductively that ngk.ISZERO (g 1) k (PLUS (g k) 1)) (λv.0) is the add n − 1 function for n > 0.

Pairs

[edit]

A pair (2-tuple) can be defined in terms of TRUE and FALSE, by using the Church encoding for pairs. For example, PAIR encapsulates the pair (x,y), FIRST returns the first element of the pair, and SECOND returns the second.

PAIR := λxyf.f x y
FIRST := λp.p TRUE
SECOND := λp.p FALSE
NIL := λx.TRUE
NULL := λp.pxy.FALSE)

A linked list can be defined as either NIL for the empty list, or the PAIR of an element and a smaller list. The predicate NULL tests for the value NIL. (Alternatively, with NIL := FALSE, the construct lhtz.deal_with_head_h_and_tail_t) (deal_with_nil) obviates the need for an explicit NULL test).

As an example of the use of pairs, the shift-and-increment function that maps (m, n) to (n, n + 1) can be defined as

Φ := λx.PAIR (SECOND x) (SUCC (SECOND x))

which allows us to give perhaps the most transparent version of the predecessor function:

PRED := λn.FIRST (n Φ (PAIR 0 0)).

Additional programming techniques

[edit]

There is a considerable body of programming idioms for lambda calculus. Many of these were originally developed in the context of using lambda calculus as a foundation for programming language semantics, effectively using lambda calculus as a low-level programming language. Because several programming languages include the lambda calculus (or something very similar) as a fragment, these techniques also see use in practical programming, but may then be perceived as obscure or foreign.

Named constants

[edit]

In lambda calculus, a library would take the form of a collection of previously defined functions, which as lambda-terms are merely particular constants. The pure lambda calculus does not have a concept of named constants since all atomic lambda-terms are variables, but one can emulate having named constants by setting aside a variable as the name of the constant, using abstraction to bind that variable in the main body, and apply that abstraction to the intended definition. Thus to use f to mean N (some explicit lambda-term) in M (another lambda-term, the "main program"), one can say

f.M) N

Authors often introduce syntactic sugar, such as let,[k] to permit writing the above in the more intuitive order

let f =NinM

By chaining such definitions, one can write a lambda calculus "program" as zero or more function definitions, followed by one lambda-term using those functions that constitutes the main body of the program.

A notable restriction of this let is that the name f be not defined in N, for N to be outside the scope of the abstraction binding f; this means a recursive function definition cannot be used as the N with let. The letrec[l] construction would allow writing recursive function definitions.

Recursion and fixed points

[edit]

Recursion is the definition of a function invoking itself. A definition containing itself inside itself, by value, leads to the whole value being of infinite size. Other notations which support recursion natively overcome this by referring to the function definition by name. Lambda calculus cannot express this: all functions are anonymous in lambda calculus, so we can't refer by name to a value which is yet to be defined, inside the lambda term defining that same value. However, a lambda expression can receive itself as its own argument, for example in  x.x x) E. Here E should be an abstraction, applying its parameter to a value to express recursion.

Consider the factorial function F(n) recursively defined by

F(n) = 1, if n = 0; else n × F(n − 1).

In the lambda expression which is to represent this function, a parameter (typically the first one) will be assumed to receive the lambda expression itself as its value, so that calling it – applying it to an argument – will amount to recursion. Thus to achieve recursion, the intended-as-self-referencing argument (called r here) must always be passed to itself within the function body, at a call point:

G := λr. λn.(1, if n = 0; else n × (r r (n−1)))
with  r r x = F x = G r x  to hold, so  r = G  and
F := G G = (λx.x x) G

The self-application achieves replication here, passing the function's lambda expression on to the next invocation as an argument value, making it available to be referenced and called there.

This solves it but requires re-writing each recursive call as self-application. We would like to have a generic solution, without a need for any re-writes:

G := λr. λn.(1, if n = 0; else n × (r (n−1)))
with  r x = F x = G r x  to hold, so  r = G r =: FIX G  and
F := FIX G  where  FIX g := (r where r = g r) = g (FIX g)
so that  FIX G = G (FIX G) = (λn.(1, if n = 0; else n × ((FIX G) (n−1))))

Given a lambda term with first argument representing recursive call (e.g. G here), the fixed-point combinator FIX will return a self-replicating lambda expression representing the recursive function (here, F). The function does not need to be explicitly passed to itself at any point, for the self-replication is arranged in advance, when it is created, to be done each time it is called. Thus the original lambda expression (FIX G) is re-created inside itself, at call-point, achieving self-reference.

In fact, there are many possible definitions for this FIX operator, the simplest of them being:

Y := λg.(λx.g (x x)) (λx.g (x x))

In the lambda calculus, Y g  is a fixed-point of g, as it expands to:

Y g
h.(λx.h (x x)) (λx.h (x x))) g
x.g (x x)) (λx.g (x x))
g ((λx.g (x x)) (λx.g (x x)))
g (Y g)

Now, to perform our recursive call to the factorial function, we would simply call (Y G) n,  where n is the number we are calculating the factorial of. Given n = 4, for example, this gives:

(Y G) 4
G (Y G) 4
rn.(1, if n = 0; else n × (r (n−1)))) (Y G) 4
n.(1, if n = 0; else n × ((Y G) (n−1)))) 4
1, if 4 = 0; else 4 × ((Y G) (4−1))
4 × (G (Y G) (4−1))
4 × ((λn.(1, if n = 0; else n × ((Y G) (n−1)))) (4−1))
4 × (1, if 3 = 0; else 3 × ((Y G) (3−1)))
4 × (3 × (G (Y G) (3−1)))
4 × (3 × ((λn.(1, if n = 0; else n × ((Y G) (n−1)))) (3−1)))
4 × (3 × (1, if 2 = 0; else 2 × ((Y G) (2−1))))
4 × (3 × (2 × (G (Y G) (2−1))))
4 × (3 × (2 × ((λn.(1, if n = 0; else n × ((Y G) (n−1)))) (2−1))))
4 × (3 × (2 × (1, if 1 = 0; else 1 × ((Y G) (1−1)))))
4 × (3 × (2 × (1 × (G (Y G) (1−1)))))
4 × (3 × (2 × (1 × ((λn.(1, if n = 0; else n × ((Y G) (n−1)))) (1−1)))))
4 × (3 × (2 × (1 × (1, if 0 = 0; else 0 × ((Y G) (0−1))))))
4 × (3 × (2 × (1 × (1))))
24

Every recursively defined function can be seen as a fixed point of some suitably defined function closing over the recursive call with an extra argument, and therefore, using Y, every recursively defined function can be expressed as a lambda expression. In particular, we can now cleanly define the subtraction, multiplication and comparison predicate of natural numbers recursively.

Standard terms

[edit]

Certain terms have commonly accepted names:[27][28][29]

I := λx.x
S := λxyz.x z (y z)
K := λxy.x
B := λxyz.x (y z)
C := λxyz.x z y
W := λxy.x y y
ω or Δ or U := λx.x x
Ω := ω ω

I is the identity function. SK and BCKW form complete combinator calculus systems that can express any lambda term - see the next section. Ω is UU, the smallest term that has no normal form. YI is another such term. Y is standard and defined above, and can also be defined as Y=BU(CBU), so that Yg=g(Yg). TRUE and FALSE defined above are commonly abbreviated as T and F.

Abstraction elimination

[edit]

If N is a lambda-term without abstraction, but possibly containing named constants (combinators), then there exists a lambda-term T(x,N) which is equivalent to λx.N but lacks abstraction (except as part of the named constants, if these are considered non-atomic). This can also be viewed as anonymising variables, as T(x,N) removes all occurrences of x from N, while still allowing argument values to be substituted into the positions where N contains an x. The conversion function T can be defined by:

T(x, x) := I
T(x, N) := K N if x is not free in N.
T(x, M N) := S T(x, M) T(x, N)

In either case, a term of the form T(x,N) P can reduce by having the initial combinator I, K, or S grab the argument P, just like β-reduction of x.N) P would do. I returns that argument. K throws the argument away, just like x.N) would do if x has no free occurrence in N. S passes the argument on to both subterms of the application, and then applies the result of the first to the result of the second.

The combinators B and C are similar to S, but pass the argument on to only one subterm of an application (B to the "argument" subterm and C to the "function" subterm), thus saving a subsequent K if there is no occurrence of x in one subterm. In comparison to B and C, the S combinator actually conflates two functionalities: rearranging arguments, and duplicating an argument so that it may be used in two places. The W combinator does only the latter, yielding the B, C, K, W system as an alternative to SKI combinator calculus.

Typed lambda calculus

[edit]

A typed lambda calculus is a typed formalism that uses the lambda-symbol () to denote anonymous function abstraction. In this context, types are usually objects of a syntactic nature that are assigned to lambda terms; the exact nature of a type depends on the calculus considered (see Kinds of typed lambda calculi). From a certain point of view, typed lambda calculi can be seen as refinements of the untyped lambda calculus but from another point of view, they can also be considered the more fundamental theory and untyped lambda calculus a special case with only one type.[30]

Typed lambda calculi are foundational programming languages and are the base of typed functional programming languages such as ML and Haskell and, more indirectly, typed imperative programming languages. Typed lambda calculi play an important role in the design of type systems for programming languages; here typability usually captures desirable properties of the program, e.g., the program will not cause a memory access violation.

Typed lambda calculi are closely related to mathematical logic and proof theory via the Curry–Howard isomorphism and they can be considered as the internal language of classes of categories, e.g., the simply typed lambda calculus is the language of a Cartesian closed category (CCC).

Reduction strategies

[edit]

Whether a term is normalising or not, and how much work needs to be done in normalising it if it is, depends to a large extent on the reduction strategy used. Common lambda calculus reduction strategies include:[31][32][33]

Normal order
The leftmost outermost redex is reduced first. That is, whenever possible, arguments are substituted into the body of an abstraction before the arguments are reduced. If a term has a beta-normal form, normal order reduction will always reach that normal form.
Applicative order
The leftmost innermost redex is reduced first. As a consequence, a function's arguments are always reduced before they are substituted into the function. Unlike normal order reduction, applicative order reduction may fail to find the beta-normal form of an expression, even if such a normal form exists. For example, the term is reduced to itself by applicative order, while normal order reduces it to its beta-normal form .
Full β-reductions
Any redex can be reduced at any time. This means essentially the lack of any particular reduction strategy—with regard to reducibility, "all bets are off".

Weak reduction strategies do not reduce under lambda abstractions:

Call by value
Like applicative order, but no reductions are performed inside abstractions. This is similar to the evaluation order of strict languages like C: the arguments to a function are evaluated before calling the function, and function bodies are not even partially evaluated until the arguments are substituted in.
Call by name
Like normal order, but no reductions are performed inside abstractions. For example, λx.(λy.y)x is in normal form according to this strategy, although it contains the redex y.y)x.

Strategies with sharing reduce computations that are "the same" in parallel:

Optimal reduction
As normal order, but computations that have the same label are reduced simultaneously.
Call by need
As call by name (hence weak), but function applications that would duplicate terms instead name the argument. The argument may be evaluated "when needed," at which point the name binding is updated with the reduced value. This can save time compared to normal order evaluation.

Computability

[edit]

There is no algorithm that takes as input any two lambda expressions and outputs TRUE or FALSE depending on whether one expression reduces to the other.[13] More precisely, no computable function can decide the question. This was historically the first problem for which undecidability could be proven. As usual for such a proof, computable means computable by any model of computation that is Turing complete. In fact computability can itself be defined via the lambda calculus: a function F: NN of natural numbers is a computable function if and only if there exists a lambda expression f such that for every pair of x, y in N, F(x)=y if and only if f x =β y,  where x and y are the Church numerals corresponding to x and y, respectively and =β meaning equivalence with β-reduction. See the Church–Turing thesis for other approaches to defining computability and their equivalence.

Church's proof of uncomputability first reduces the problem to determining whether a given lambda expression has a normal form. Then he assumes that this predicate is computable, and can hence be expressed in lambda calculus. Building on earlier work by Kleene and constructing a Gödel numbering for lambda expressions, he constructs a lambda expression e that closely follows the proof of Gödel's first incompleteness theorem. If e is applied to its own Gödel number, a contradiction results.

Complexity

[edit]

The notion of computational complexity for the lambda calculus is a bit tricky, because the cost of a β-reduction may vary depending on how it is implemented.[34] To be precise, one must somehow find the location of all of the occurrences of the bound variable V in the expression E, implying a time cost, or one must keep track of the locations of free variables in some way, implying a space cost. A naïve search for the locations of V in E is O(n) in the length n of E. Director strings were an early approach that traded this time cost for a quadratic space usage.[35] More generally this has led to the study of systems that use explicit substitution.

In 2014, it was shown that the number of β-reduction steps taken by normal order reduction to reduce a term is a reasonable time cost model, that is, the reduction can be simulated on a Turing machine in time polynomially proportional to the number of steps.[36] This was a long-standing open problem, due to size explosion, the existence of lambda terms which grow exponentially in size for each β-reduction. The result gets around this by working with a compact shared representation. The result makes clear that the amount of space needed to evaluate a lambda term is not proportional to the size of the term during reduction. It is not currently known what a good measure of space complexity would be.[37]

An unreasonable model does not necessarily mean inefficient. Optimal reduction reduces all computations with the same label in one step, avoiding duplicated work, but the number of parallel β-reduction steps to reduce a given term to normal form is approximately linear in the size of the term. This is far too small to be a reasonable cost measure, as any Turing machine may be encoded in the lambda calculus in size linearly proportional to the size of the Turing machine. The true cost of reducing lambda terms is not due to β-reduction per se but rather the handling of the duplication of redexes during β-reduction.[38] It is not known if optimal reduction implementations are reasonable when measured with respect to a reasonable cost model such as the number of leftmost-outermost steps to normal form, but it has been shown for fragments of the lambda calculus that the optimal reduction algorithm is efficient and has at most a quadratic overhead compared to leftmost-outermost.[37] In addition the BOHM prototype implementation of optimal reduction outperformed both Caml Light and Haskell on pure lambda terms.[38]

Lambda calculus and programming languages

[edit]

As pointed out by Peter Landin's 1965 paper "A Correspondence between ALGOL 60 and Church's Lambda-notation",[39] sequential procedural programming languages can be understood in terms of the lambda calculus, which provides the basic mechanisms for procedural abstraction and procedure (subprogram) application.

Anonymous functions

[edit]

For example, in Python the "square" function can be expressed as a lambda expression as follows:

(lambda x: x**2)

The above example is an expression that evaluates to a first-class function. The symbol lambda creates an anonymous function, given a list of parameter names, x – just a single argument in this case, and an expression that is evaluated as the body of the function, x**2. Anonymous functions are sometimes called lambda expressions.

For example, Pascal and many other imperative languages have long supported passing subprograms as arguments to other subprograms through the mechanism of function pointers. However, function pointers are an insufficient condition for functions to be first class datatypes, because a function is a first class datatype if and only if new instances of the function can be created at runtime. Such runtime creation of functions is supported in Smalltalk, JavaScript, Wolfram Language, and more recently in Scala, Eiffel (as agents), C# (as delegates) and C++11, among others.

Parallelism and concurrency

[edit]

The Church–Rosser property of the lambda calculus means that evaluation (β-reduction) can be carried out in any order, even in parallel. This means that various nondeterministic evaluation strategies are relevant. However, the lambda calculus does not offer any explicit constructs for parallelism. One can add constructs such as futures to the lambda calculus. Other process calculi have been developed for describing communication and concurrency.

Semantics

[edit]

The fact that lambda calculus terms act as functions on other lambda calculus terms, and even on themselves, led to questions about the semantics of the lambda calculus. Could a sensible meaning be assigned to lambda calculus terms? The natural semantics was to find a set D isomorphic to the function space DD, of functions on itself. However, no nontrivial such D can exist, by cardinality constraints because the set of all functions from D to D has greater cardinality than D, unless D is a singleton set.

In the 1970s, Dana Scott showed that if only continuous functions were considered, a set or domain D with the required property could be found, thus providing a model for the lambda calculus.[40]

This work also formed the basis for the denotational semantics of programming languages.

Variations and extensions

[edit]

These extensions are in the lambda cube:

These formal systems are extensions of lambda calculus that are not in the lambda cube:

These formal systems are variations of lambda calculus:

These formal systems are related to lambda calculus:

  • Combinatory logic – A notation for mathematical logic without variables
  • SKI combinator calculus – A computational system based on the S, K and I combinators, equivalent to lambda calculus, but reducible without variable substitutions

See also

[edit]

Further reading

[edit]
  • Abelson, Harold & Gerald Jay Sussman. Structure and Interpretation of Computer Programs. The MIT Press. ISBN 0-262-51087-1.
  • Barendregt, Hendrik Pieter Introduction to Lambda Calculus.
  • Barendregt, Hendrik Pieter, The Impact of the Lambda Calculus in Logic and Computer Science. The Bulletin of Symbolic Logic, Volume 3, Number 2, June 1997.
  • Barendregt, Hendrik Pieter, The Type Free Lambda Calculus pp1091–1132 of Handbook of Mathematical Logic, North-Holland (1977) ISBN 0-7204-2285-X
  • Cardone, Felice and Hindley, J. Roger, 2006. History of Lambda-calculus and Combinatory Logic Archived 2021-05-06 at the Wayback Machine. In Gabbay and Woods (eds.), Handbook of the History of Logic, vol. 5. Elsevier.
  • Church, Alonzo, An unsolvable problem of elementary number theory, American Journal of Mathematics, 58 (1936), pp. 345–363. This paper contains the proof that the equivalence of lambda expressions is in general not decidable.
  • Church, Alonzo (1941). The Calculi of Lambda-Conversion. Princeton: Princeton University Press. Retrieved 2020-04-14. (ISBN 978-0-691-08394-0)
  • Frink Jr., Orrin (1944). "Review: The Calculi of Lambda-Conversion by Alonzo Church" (PDF). Bulletin of the American Mathematics Society. 50 (3): 169–172. doi:10.1090/s0002-9904-1944-08090-7.
  • Kleene, Stephen, A theory of positive integers in formal logic, American Journal of Mathematics, 57 (1935), pp. 153–173 and 219–244. Contains the lambda calculus definitions of several familiar functions.
  • Landin, Peter, A Correspondence Between ALGOL 60 and Church's Lambda-Notation, Communications of the ACM, vol. 8, no. 2 (1965), pages 89–101. Available from the ACM site. A classic paper highlighting the importance of lambda calculus as a basis for programming languages.
  • Larson, Jim, An Introduction to Lambda Calculus and Scheme. A gentle introduction for programmers.
  • Michaelson, Greg (10 April 2013). An Introduction to Functional Programming Through Lambda Calculus. Courier Corporation. ISBN 978-0-486-28029-5.[41]
  • Schalk, A. and Simmons, H. (2005) An introduction to λ-calculi and arithmetic with a decent selection of exercises. Notes for a course in the Mathematical Logic MSc at Manchester University.
  • de Queiroz, Ruy J.G.B. (2008). "On Reduction Rules, Meaning-as-Use and Proof-Theoretic Semantics". Studia Logica. 90 (2): 211–247. doi:10.1007/s11225-008-9150-5. S2CID 11321602. A paper giving a formal underpinning to the idea of 'meaning-is-use' which, even if based on proofs, it is different from proof-theoretic semantics as in the Dummett–Prawitz tradition since it takes reduction as the rules giving meaning.
  • Hankin, Chris, An Introduction to Lambda Calculi for Computer Scientists, ISBN 0954300653
Monographs/textbooks for graduate students
  • Sørensen, Morten Heine and Urzyczyn, Paweł (2006), Lectures on the Curry–Howard isomorphism, Elsevier, ISBN 0-444-52077-5 is a recent monograph that covers the main topics of lambda calculus from the type-free variety, to most typed lambda calculi, including more recent developments like pure type systems and the lambda cube. It does not cover subtyping extensions.
  • Pierce, Benjamin (2002), Types and Programming Languages, MIT Press, ISBN 0-262-16209-1 covers lambda calculi from a practical type system perspective; some topics like dependent types are only mentioned, but subtyping is an important topic.
Documents

Notes

[edit]
  1. ^ These rules produce expressions such as: . Parentheses can be dropped if the expression is unambiguous. For some applications, terms for logical and mathematical constants and operations may be included.
  2. ^ a b c d Barendregt, Barendsen (2000) call this form
    • axiom β: (λx.M[x]) N = M[N] , rewritten as (λx.M) N = M[x := N], "where M[x := N] denotes the substitution of N for every occurrence of x in M".[1]: 7  Also denoted M[N/x], "the substitution of N for x in M".[2]
  3. ^ For a full history, see Cardone and Hindley's "History of Lambda-calculus and Combinatory Logic" (2006).
  4. ^ a b is pronounced "maps to".
  5. ^ The expression e can be: variables x, lambda abstractions, or applications —in BNF, .— from Wikipedia's Simply typed lambda calculus#Syntax for untyped lambda calculus
  6. ^ is sometimes written in ASCII as
  7. ^ The lambda term represents the function written in anonymous form.
  8. ^ free variables in lambda Notation and its Calculus are comparable to linear algebra and mathematical concepts of the same name
  9. ^ The set of free variables of M, but with {x} removed
  10. ^ The union of the set of free variables of and the set of free variables of [1]
  11. ^ f.M) N can be pronounced "let f be N in M".
  12. ^ Ariola and Blom[26] employ 1) axioms for a representational calculus using well-formed cyclic lambda graphs extended with letrec, to detect possibly infinite unwinding trees; 2) the representational calculus with β-reduction of scoped lambda graphs constitute Ariola/Blom's cyclic extension of lambda calculus; 3) Ariola/Blom reason about strict languages using § call-by-value, and compare to Moggi's calculus, and to Hasegawa's calculus. Conclusions on p. 111.[26]

References

[edit]

Some parts of this article are based on material from FOLDOC, used with permission.

  1. ^ a b c Barendregt, Henk; Barendsen, Erik (March 2000), Introduction to Lambda Calculus (PDF)
  2. ^ explicit substitution at the nLab
  3. ^ Turing, Alan M. (December 1937). "Computability and λ-Definability". The Journal of Symbolic Logic. 2 (4): 153–163. doi:10.2307/2268280. JSTOR 2268280. S2CID 2317046.
  4. ^ Coquand, Thierry (8 February 2006). Zalta, Edward N. (ed.). "Type Theory". The Stanford Encyclopedia of Philosophy (Summer 2013 ed.). Retrieved November 17, 2020.
  5. ^ Moortgat, Michael (1988). Categorial Investigations: Logical and Linguistic Aspects of the Lambek Calculus. Foris Publications. ISBN 9789067653879.
  6. ^ Bunt, Harry; Muskens, Reinhard, eds. (2008). Computing Meaning. Springer. ISBN 978-1-4020-5957-5.
  7. ^ Mitchell, John C. (2003). Concepts in Programming Languages. Cambridge University Press. p. 57. ISBN 978-0-521-78098-8..
  8. ^ Chacón Sartori, Camilo (2023-12-05). Introduction to Lambda Calculus using Racket (Technical report). Archived from the original on 2023-12-07.
  9. ^ Pierce, Benjamin C. Basic Category Theory for Computer Scientists. p. 53.
  10. ^ Church, Alonzo (1932). "A set of postulates for the foundation of logic". Annals of Mathematics. Series 2. 33 (2): 346–366. doi:10.2307/1968337. JSTOR 1968337.
  11. ^ Kleene, Stephen C.; Rosser, J. B. (July 1935). "The Inconsistency of Certain Formal Logics". The Annals of Mathematics. 36 (3): 630. doi:10.2307/1968646. JSTOR 1968646.
  12. ^ Church, Alonzo (December 1942). "Review of Haskell B. Curry, The Inconsistency of Certain Formal Logics". The Journal of Symbolic Logic. 7 (4): 170–171. doi:10.2307/2268117. JSTOR 2268117.
  13. ^ a b Church, Alonzo (1936). "An unsolvable problem of elementary number theory". American Journal of Mathematics. 58 (2): 345–363. doi:10.2307/2371045. JSTOR 2371045.
  14. ^ Church, Alonzo (1940). "A Formulation of the Simple Theory of Types". Journal of Symbolic Logic. 5 (2): 56–68. doi:10.2307/2266170. JSTOR 2266170. S2CID 15889861.
  15. ^ Partee, B. B. H.; ter Meulen, A.; Wall, R. E. (1990). Mathematical Methods in Linguistics. Springer. ISBN 9789027722454. Retrieved 29 Dec 2016.
  16. ^ Alama, Jesse. Zalta, Edward N. (ed.). "The Lambda Calculus". The Stanford Encyclopedia of Philosophy (Summer 2013 ed.). Retrieved November 17, 2020.
  17. ^ Dana Scott, "Looking Backward; Looking Forward", Invited Talk at the Workshop in honour of Dana Scott’s 85th birthday and 50 years of domain theory, 7–8 July, FLoC 2018 (talk 7 July 2018). The relevant passage begins at 32:50. (See also this extract of a May 2016 talk at the University of Birmingham, UK.)
  18. ^ Barendregt, Hendrik Pieter (1984). The Lambda Calculus: Its Syntax and Semantics. Studies in Logic and the Foundations of Mathematics. Vol. 103 (Revised ed.). North Holland. ISBN 0-444-87508-5. (Corrections).
  19. ^ a b "Example for Rules of Associativity". Lambda-bound.com. Retrieved 2012-06-18.
  20. ^ "The Basic Grammar of Lambda Expressions". SoftOption. Some other systems use juxtaposition to mean application, so 'ab' means 'a@b'. This is fine except that it requires that variables have length one so that we know that 'ab' is two variables juxtaposed not one variable of length 2. But we want to labels like 'firstVariable' to mean a single variable, so we cannot use this juxtaposition convention.
  21. ^ a b Selinger, Peter (2008), Lecture Notes on the Lambda Calculus (PDF), vol. 0804, Department of Mathematics and Statistics, University of Ottawa, p. 9, arXiv:0804.3434, Bibcode:2008arXiv0804.3434S
  22. ^ de Queiroz, Ruy J. G. B. (1988). "A Proof-Theoretic Account of Programming and the Role of Reduction Rules". Dialectica. 42 (4): 265–282. doi:10.1111/j.1746-8361.1988.tb00919.x.
  23. ^ Turbak, Franklyn; Gifford, David (2008), Design concepts in programming languages, MIT press, p. 251, ISBN 978-0-262-20175-9
  24. ^ Luke Palmer (29 Dec 2010) Haskell-cafe: What's the motivation for η rules?
  25. ^ Felleisen, Matthias; Flatt, Matthew (2006), Programming Languages and Lambda Calculi (PDF), p. 26, archived from the original (PDF) on 2009-02-05; A note (accessed 2017) at the original location suggests that the authors consider the work originally referenced to have been superseded by a book.
  26. ^ a b Zena M. Ariola and Stefan Blom, Proc. TACS '94 Sendai, Japan 1997 (1997) Cyclic lambda calculi 114 pages.
  27. ^ Ker, Andrew D. "Lambda Calculus and Types" (PDF). p. 6. Retrieved 14 January 2022.
  28. ^ Dezani-Ciancaglini, Mariangiola; Ghilezan, Silvia (2014). "Preciseness of Subtyping on Intersection and Union Types" (PDF). Rewriting and Typed Lambda Calculi. Lecture Notes in Computer Science. Vol. 8560. p. 196. doi:10.1007/978-3-319-08918-8_14. hdl:2318/149874. ISBN 978-3-319-08917-1. Retrieved 14 January 2022.
  29. ^ Forster, Yannick; Smolka, Gert (August 2019). "Call-by-Value Lambda Calculus as a Model of Computation in Coq" (PDF). Journal of Automated Reasoning. 63 (2): 393–413. doi:10.1007/s10817-018-9484-2. S2CID 53087112. Retrieved 14 January 2022.
  30. ^ Types and Programming Languages, p. 273, Benjamin C. Pierce
  31. ^ Pierce, Benjamin C. (2002). Types and Programming Languages. MIT Press. p. 56. ISBN 0-262-16209-1.
  32. ^ Sestoft, Peter (2002). "Demonstrating Lambda Calculus Reduction" (PDF). The Essence of Computation. Lecture Notes in Computer Science. Vol. 2566. pp. 420–435. doi:10.1007/3-540-36377-7_19. ISBN 978-3-540-00326-7. Retrieved 22 August 2022.
  33. ^ Biernacka, Małgorzata; Charatonik, Witold; Drab, Tomasz (2022). Andronick, June; de Moura, Leonardo (eds.). "The Zoo of Lambda-Calculus Reduction Strategies, and Coq" (PDF). 13th International Conference on Interactive Theorem Proving (ITP 2022). 237. Schloss Dagstuhl – Leibniz-Zentrum für Informatik: 7:1–7:19. doi:10.4230/LIPIcs.ITP.2022.7. Retrieved 22 August 2022.
  34. ^ Frandsen, Gudmund Skovbjerg; Sturtivant, Carl (26 August 1991). "What is an efficient implementation of the λ-calculus?". Functional Programming Languages and Computer Architecture: 5th ACM Conference. Cambridge, MA, USA, August 26-30, 1991. Proceedings. Lecture Notes in Computer Science. Vol. 523. Springer-Verlag. pp. 289–312. CiteSeerX 10.1.1.139.6913. doi:10.1007/3540543961_14. ISBN 9783540543961.
  35. ^ Sinot, F.-R. (2005). "Director Strings Revisited: A Generic Approach to the Efficient Representation of Free Variables in Higher-order Rewriting" (PDF). Journal of Logic and Computation. 15 (2): 201–218. doi:10.1093/logcom/exi010.
  36. ^ Accattoli, Beniamino; Dal Lago, Ugo (14 July 2014). "Beta reduction is invariant, indeed". Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS). pp. 1–10. arXiv:1601.01233. doi:10.1145/2603088.2603105. ISBN 9781450328869. S2CID 11485010.
  37. ^ a b Accattoli, Beniamino (October 2018). "(In)Efficiency and Reasonable Cost Models". Electronic Notes in Theoretical Computer Science. 338: 23–43. doi:10.1016/j.entcs.2018.10.003.
  38. ^ a b Asperti, Andrea (16 Jan 2017). "About the efficient reduction of lambda terms". arXiv:1701.04240v1 [cs.LO].
  39. ^ Landin, P. J. (1965). "A Correspondence between ALGOL 60 and Church's Lambda-notation". Communications of the ACM. 8 (2): 89–101. doi:10.1145/363744.363749. S2CID 6505810.
  40. ^ Scott, Dana (1993). "A type-theoretical alternative to ISWIM, CUCH, OWHY" (PDF). Theoretical Computer Science. 121 (1–2): 411–440. doi:10.1016/0304-3975(93)90095-B. Retrieved 2022-12-01. Written 1969, widely circulated as an unpublished manuscript.
  41. ^ "Greg Michaelson's Homepage". Mathematical and Computer Sciences. Riccarton, Edinburgh: Heriot-Watt University. Retrieved 6 November 2022.
[edit]