Stable model semantics: Difference between revisions

From Wikipedia, the free encyclopedia
removed Category:Model theory using HotCat Not what mathematicians usually consider model theory.
Bm319 (talk | contribs)
No edit summary
Line 3: Line 3:


==Motivation==
==Motivation==
Research on the declarative semantics of negation in logic programming was motivated by the fact that the behavior of [[SLD resolution#SLDNF|SLDNF]] resolution — the generalization of [[SLD resolution]] used by [[Prolog]] in the presence of negation in the bodies of rules — does not fully match the [[truth tables]] familiar from classical [[propositional logic]]. Consider, for instance, the program
Research on the declarative semantics of negation in logic programming was motivated by the fact that the behavior of [[SLD resolution#SLDNF|SLDNF]] resolution—the generalization of [[SLD resolution]] used by [[Prolog]] in the presence of negation in the bodies of rules—does not fully match the [[truth tables]] familiar from classical [[propositional logic]]. Consider, for instance, the program


:<math>p\ </math>
:<math>p</math>
:<math>r \leftarrow p,\ q</math>
:<math>r \leftarrow p, q</math>
:<math>s \leftarrow p,\ \operatorname{not} q.</math>
:<math>s \leftarrow p, \operatorname{not} q.</math>


Given this program, the query {{mvar|p}} will succeed, because the program includes {{mvar|p}} as a fact; the query {{mvar|q}} will fail, because it does not occur in the head of any of the rules. The query {{mvar|r}} will fail also, because the only rule with {{mvar|r}} in the head contains the subgoal {{mvar|q}} in its body; as we have seen, that subgoal fails. Finally, the query {{mvar|s}} succeeds, because each of the subgoals {{mvar|p}}, <math>\operatorname{not} q</math> succeeds. (The latter succeeds because the corresponding positive goal {{mvar|q}} fails.) To sum up, the behavior of SLDNF resolution on the given program can be represented by the following truth assignment:
Given this program, the query {{mvar|p}} will succeed, because the program includes {{mvar|p}} as a fact; the query {{mvar|q}} will fail, because it does not occur in the head of any of the rules. The query {{mvar|r}} will fail also, because the only rule with {{mvar|r}} in the head contains the subgoal {{mvar|q}} in its body; as we have seen, that subgoal fails. Finally, the query {{mvar|s}} succeeds, because each of the subgoals {{mvar|p}}, <math>\operatorname{not} q</math> succeeds. (The latter succeeds because the corresponding positive goal {{mvar|q}} fails.) To sum up, the behavior of SLDNF resolution on the given program can be represented by the following truth assignment:
Line 45: Line 45:
==Relation to nonmonotonic logic==
==Relation to nonmonotonic logic==


The meaning of negation in logic programs is closely related to two theories of [[Nonmonotonic logic|nonmonotonic reasoning]]&nbsp; [[autoepistemic logic]] and [[default logic]]. The discovery of these relationships was a key step towards the invention of the stable model semantics.
The meaning of negation in logic programs is closely related to two theories of [[Nonmonotonic logic|nonmonotonic reasoning]]—[[autoepistemic logic]] and [[default logic]]. The discovery of these relationships was a key step towards the invention of the stable model semantics.


The syntax of autoepistemic logic uses a [[modal operator]] that allows us to distinguish between what is true and what is believed. [[Michael Gelfond]] [1987] proposed to read <math>\operatorname{not} p</math> in the body of a rule as "<math>p</math> is not believed", and to understand a rule with negation as the corresponding formula of autoepistemic logic. The stable model semantics, in its basic form, can be viewed as a reformulation of this idea that avoids explicit references to autoepistemic logic.
The syntax of autoepistemic logic uses a [[modal operator]] that allows us to distinguish between what is true and what is known. [[Michael Gelfond]] [1987] proposed to read <math>\operatorname{not} p</math> in the body of a rule as "<math>p</math> is not known", and to understand a rule with negation as the corresponding formula of autoepistemic logic. The stable model semantics, in its basic form, can be viewed as a reformulation of this idea that avoids explicit references to autoepistemic logic.


In default logic, a default is similar to an [[inference rule]], except that it includes, besides its premises and conclusion, a list of formulas called justifications. A default can be used to derive its conclusion under the assumption that its justifications are consistent with what is currently believed. Nicole Bidoit and Christine Froidevaux [1987] proposed to treat negated atoms in the bodies of rules as justifications. For instance, the rule
In default logic, a default is similar to an [[inference rule]], except that it includes, besides its premises and conclusion, a list of formulas called justifications. A default can be used to derive its conclusion under the assumption that its justifications are consistent with what is currently known. Nicole Bidoit and Christine Froidevaux [1987] proposed to treat negated atoms in the bodies of rules as justifications. For instance, the rule


:<math>s \leftarrow p,\ \operatorname{not} q</math>
:<math>s \leftarrow p, \operatorname{not} q</math>


can be understood as the default that allows us to derive <math>s</math> from <math>p</math> assuming that <math>\neg q</math> is consistent. The stable model semantics uses the same idea, but it does not explicitly refer to default logic.
can be understood as the default that allows us to derive <math>s</math> from <math>p</math> assuming that <math>\neg q</math> is consistent. The stable model semantics uses the same idea, but it does not explicitly refer to default logic.
Line 71: Line 71:
|}
|}


is identified with the set <math>\{p,s\}</math>. This convention allows us to use the set inclusion relation to compare truth assignments with each other. The smallest of all truth assignments <math>\emptyset</math> is the one that makes every atom false; the largest truth assignment makes every atom true.
is identified with the set <math>\{p,s\}</math>. This convention allows us to use the [[set inclusion]] relation to compare truth assignments with each other. The smallest of all truth assignments <math>\emptyset</math> is the one that makes every atom false; the largest truth assignment makes every atom true.


Second, a logic program with variables is viewed as shorthand for the set of all [[Ground expression|ground]] instances of its rules, that is, for the result of substituting variable-free terms for variables in the rules of the program in all possible ways. For instance, the logic programming definition of even numbers
Second, a logic program with variables is viewed as shorthand for the set of all [[Ground expression|ground]] instances of its rules, that is, for the result of substituting variable-free terms for variables in the rules of the program in all possible ways. For instance, the logic programming definition of even numbers


:<math>\operatorname{even}(0)\ </math>
:<math>\operatorname{even}(0) </math>


:<math>\operatorname{even}(s(X))\leftarrow \operatorname{not} \operatorname{even}(X)</math>
:<math>\operatorname{even}(s(X))\leftarrow \operatorname{not} \operatorname{even}(X)</math>
Line 81: Line 81:
is understood as the result of replacing {{mvar|X}} in this program by the ground terms
is understood as the result of replacing {{mvar|X}} in this program by the ground terms


:<math>0,\ s(0),\ s(s(0)),\dots.</math>
:<math>0, s(0), s(s(0)),\dots.</math>


in all possible ways. The result is the infinite ground program
in all possible ways. The result is the infinite ground program


:<math>\operatorname{even}(0)\ </math>
:<math>\operatorname{even}(0) </math>


:<math>\operatorname{even}(s(0))\leftarrow \operatorname{not} \operatorname{even}(0)</math>
:<math>\operatorname{even}(s(0))\leftarrow \operatorname{not} \operatorname{even}(0)</math>
Line 99: Line 99:
:<math>A \leftarrow B_{1},\dots,B_{m},\operatorname{not} C_{1},\dots,\operatorname{not} C_{n}</math>
:<math>A \leftarrow B_{1},\dots,B_{m},\operatorname{not} C_{1},\dots,\operatorname{not} C_{n}</math>


where <math>A,B_{1},\dots,B_{m},C_{1},\dots,C_{n}</math> are ground atoms. If {{mvar|P}} does not contain negation (<math>n=0</math> in every rule of the program) then, by definition, the only stable model of {{mvar|P}} is its model that is minimal relative to set inclusion.<ref>This approach to the semantics of logic programs without negation is due to Maarten van Emden and [[Robert Kowalski]] {{harvnb|van Emden|Kowalski|1976}}.</ref> (Any program without negation has exactly one minimal model.) To extend this definition to the case of programs with negation, we need the auxiliary concept of the reduct, defined as follows.
where <math>A,B_{1},\dots,B_{m},C_{1},\dots,C_{n}</math> are ground atoms. If {{mvar|P}} does not contain negation (<math>n=0</math> in every rule of the program) then, by definition, the only stable model of {{mvar|P}} is its model that is minimal relative to set inclusion.<ref>This approach to the semantics of logic programs without negation is due to Maarten van Emden and [[Robert Kowalski]]—{{harvnb|van Emden|Kowalski|1976}}.</ref> (Any program without negation has exactly one minimal model.) To extend this definition to the case of programs with negation, we need the auxiliary concept of the reduct, defined as follows.


For any set {{mvar|I}} of ground atoms, the ''reduct'' of {{mvar|P}} relative to {{mvar|I}} is the set of rules without negation obtained from {{mvar|P}} by first dropping every rule such that at least one of the atoms {{tmath|C_i}} in its body
For any set {{mvar|I}} of ground atoms, the ''reduct'' of {{mvar|P}} relative to {{mvar|I}} is the set of rules without negation obtained from {{mvar|P}} by first dropping every rule such that at least one of the atoms {{tmath|C_i}} in its body
Line 113: Line 113:
To illustrate these definitions, let us check that <math>\{p,s\}</math> is a stable model of the program
To illustrate these definitions, let us check that <math>\{p,s\}</math> is a stable model of the program


:<math>p\ </math>
:<math>p </math>


:<math>r \leftarrow p,\ q</math>
:<math>r \leftarrow p, q</math>


:<math>s \leftarrow p,\ \operatorname{not} q.</math>
:<math>s \leftarrow p, \operatorname{not} q.</math>


The reduct of this program relative to <math>\{p,s\}</math> is
The reduct of this program relative to <math>\{p,s\}</math> is


:<math>p\ </math>
:<math>p </math>


:<math>r \leftarrow p,\ q</math>
:<math>r \leftarrow p, q</math>


:<math>s \leftarrow p.</math>
:<math>s \leftarrow p.</math>


(Indeed, since <math>q\not\in\{p,s\}</math>, the reduct is obtained from the program by dropping the part <math>\operatorname{not} q.\ </math>) The stable model of the reduct is <math>\{p,s\}</math>. (Indeed, this set of atoms satisfies every rule of the reduct, and it has no proper subsets with the same property.) Thus after computing the stable model of the reduct we arrived at the same set <math>\{p,s\}</math> that we started with. Consequently, that set is a stable model.
(Indeed, since <math>q\not\in\{p,s\}</math>, the reduct is obtained from the program by dropping the part <math>\operatorname{not} q. </math>) The stable model of the reduct is <math>\{p,s\}</math>. (Indeed, this set of atoms satisfies every rule of the reduct, and it has no proper subsets with the same property.) Thus after computing the stable model of the reduct we arrived at the same set <math>\{p,s\}</math> that we started with. Consequently, that set is a stable model.


Checking in the same way the other 15 sets consisting of the atoms <math>p,\ q,\ r,\ s</math> shows that this program has no other stable models. For instance, the reduct of the program relative to <math>\{p,q,r\}</math> is
Checking in the same way the other 15 sets consisting of the atoms <math>p, q, r, s</math> shows that this program has no other stable models. For instance, the reduct of the program relative to <math>\{p,q,r\}</math> is


:<math>p\ </math>
:<math>p </math>


:<math>r \leftarrow p,\ q.</math>
:<math>r \leftarrow p, q.</math>


The stable model of the reduct is <math>\{p\}</math>, which is different from the set <math>\{p,q,r\}</math> that we started with.
The stable model of the reduct is <math>\{p\}</math>, which is different from the set <math>\{p,q,r\}</math> that we started with.
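The check described above can be sketched in Python. This is a minimal brute-force illustration, not an efficient solver. The rule encoding `(head, positive_body, negative_body)` and the function names are assumptions of this sketch. The reduct follows the usual definition: drop every rule with some negated atom in the candidate set, then drop the remaining negated parts.

```python
from itertools import combinations

# Hypothetical encoding: a rule is (head, positive_body, negative_body).
PROGRAM = [
    ("p", (), ()),            # p
    ("r", ("p", "q"), ()),    # r <- p, q
    ("s", ("p",), ("q",)),    # s <- p, not q
]


def reduct(program, interp):
    """Drop rules with a negated atom in interp; strip negation from the rest."""
    return [(head, pos) for head, pos, neg in program if not (set(neg) & interp)]


def least_model(positive_program):
    """Minimal model of a negation-free program, computed by fixpoint iteration."""
    model = set()
    changed = True
    while changed:
        changed = False
        for head, pos in positive_program:
            if head not in model and set(pos) <= model:
                model.add(head)
                changed = True
    return model


def stable_models(program, atoms):
    """Test every subset of atoms: I is stable if the reduct's minimal model is I."""
    atoms = sorted(atoms)
    found = []
    for size in range(len(atoms) + 1):
        for combo in combinations(atoms, size):
            candidate = set(combo)
            if least_model(reduct(program, candidate)) == candidate:
                found.append(sorted(candidate))
    return found


print(stable_models(PROGRAM, "pqrs"))                      # [['p', 's']]
print(sorted(least_model(reduct(PROGRAM, {"p", "q", "r"}))))  # ['p']
```

The enumeration visits all 16 subsets, so it is only practical for very small programs.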
Line 151: Line 151:
has no stable models.
has no stable models.


If we think of the stable model semantics as a description of the behavior of [[Prolog]] in the presence of negation then programs without a unique stable model can be judged unsatisfactory: they do not provide an unambiguous specification for Prolog-style query answering. For instance, the two programs above are not reasonable as Prolog programs&nbsp;— SLDNF resolution does not terminate on them.
If we think of the stable model semantics as a description of the behavior of [[Prolog]] in the presence of negation then programs without a unique stable model can be judged unsatisfactory: they do not provide an unambiguous specification for Prolog-style query answering. For instance, the two programs above are not reasonable as Prolog programs—SLDNF resolution does not terminate on them.


But the use of stable models in [[answer set programming]] provides a different perspective on such programs. In that programming paradigm, a given search problem is represented by a logic program so that the stable models of the program correspond to solutions. Then programs with many stable models correspond to problems with many solutions, and programs without stable models correspond to unsolvable problems. For instance, the [[eight queens puzzle]] has 92 solutions; to solve it using answer set programming, we encode it by a logic program with 92 stable models. From this point of view, logic programs with exactly one stable model are rather special in answer set programming, like polynomials with exactly one root in algebra.
But the use of stable models in [[answer set programming]] provides a different perspective on such programs. In that programming paradigm, a given search problem is represented by a logic program so that the stable models of the program correspond to solutions. Then programs with many stable models correspond to problems with many solutions, and programs without stable models correspond to unsolvable problems. For instance, the [[eight queens puzzle]] has 92 solutions; to solve it using answer set programming, we encode it by a logic program with 92 stable models. From this point of view, logic programs with exactly one stable model are rather special in answer set programming, like polynomials with exactly one root in algebra.
Line 179: Line 179:
:<math>p \leftarrow p</math>
:<math>p \leftarrow p</math>


is the [[tautology (logic)|tautology]] <math>p \leftrightarrow p</math>. The model <math>\emptyset</math> of this tautology is a stable model of <math>p \leftarrow p</math>, but its other model <math>\{p\}\ </math> is not. François Fages [1994] found a syntactic condition on logic programs that eliminates such counterexamples and guarantees the stability of every model of the program's completion. The programs that satisfy his condition are called ''tight''.
is the [[tautology (logic)|tautology]] <math>p \leftrightarrow p</math>. The model <math>\emptyset</math> of this tautology is a stable model of <math>p \leftarrow p</math>, but its other model <math>\{p\} </math> is not. François Fages [1994] found a syntactic condition on logic programs that eliminates such counterexamples and guarantees the stability of every model of the program's completion. The programs that satisfy his condition are called ''tight''.
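The gap between models of the completion and stable models can be checked mechanically for this one-rule program. The sketch below is restricted to negation-free propositional programs. The encoding `(head, body_atoms)` and the function names are assumptions made for illustration.

```python
from itertools import combinations

# Hypothetical encoding: a negation-free rule is (head, body_atoms).
PROGRAM = [("p", ("p",))]   # p <- p
ATOMS = ["p"]


def subsets(atoms):
    """Yield every subset of atoms, smallest first."""
    atoms = sorted(atoms)
    for size in range(len(atoms) + 1):
        for combo in combinations(atoms, size):
            yield set(combo)


def is_completion_model(program, atoms, interp):
    """An atom must be true exactly when the body of some rule for it is true."""
    for atom in atoms:
        supported = any(
            head == atom and set(body) <= interp for head, body in program
        )
        if (atom in interp) != supported:
            return False
    return True


def least_model(program):
    """Minimal model of a negation-free program; this is its only stable model."""
    model = set()
    changed = True
    while changed:
        changed = False
        for head, body in program:
            if head not in model and set(body) <= model:
                model.add(head)
                changed = True
    return model


completion_models = [
    sorted(m) for m in subsets(ATOMS) if is_completion_model(PROGRAM, ATOMS, m)
]
print(completion_models)            # [[], ['p']]
print(sorted(least_model(PROGRAM)))  # []
```

Both sets satisfy the completion, but only the empty set is the minimal model of the program.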


Fangzhen Lin and Yuting Zhao [2004] showed how to make the completion of a nontight program stronger so that all its nonstable models will be eliminated. The additional formulas that they add to the completion are called ''loop formulas''.
Fangzhen Lin and Yuting Zhao [2004] showed how to make the completion of a nontight program stronger so that all its nonstable models will be eliminated. The additional formulas that they add to the completion are called ''loop formulas''.
Line 205: Line 205:
From the perspective of [[knowledge representation]], a set of ground atoms can be thought of as a description of a complete state of knowledge: the atoms that belong to the set are known to be true, and the atoms that do not belong to the set are known to be false. A possibly ''incomplete'' state of knowledge can be described using a consistent but possibly incomplete set of literals; if an atom <math>p</math> does not belong to the set and its negation does not belong to the set either, then it is not known whether <math>p</math> is true or false.
From the perspective of [[knowledge representation]], a set of ground atoms can be thought of as a description of a complete state of knowledge: the atoms that belong to the set are known to be true, and the atoms that do not belong to the set are known to be false. A possibly ''incomplete'' state of knowledge can be described using a consistent but possibly incomplete set of literals; if an atom <math>p</math> does not belong to the set and its negation does not belong to the set either, then it is not known whether <math>p</math> is true or false.


In the context of logic programming, this idea leads to the need to distinguish between two kinds of negation&nbsp;— ''[[negation as failure]]'', discussed above, and ''strong negation'', which is denoted here by <math>\sim</math>.<ref>{{harvnb|Gelfond|Lifschitz|1991}} call the second negation ''classical'' and denote it by <math>\neg</math>.</ref> The following example, illustrating the difference between the two kinds of negation, belongs to [[John McCarthy (computer scientist)|John McCarthy]]. A school bus may cross railway tracks under the condition that there is no approaching train. If we do not necessarily know whether a train is approaching then the rule using negation as failure
In the context of logic programming, this idea leads to the need to distinguish between two kinds of negation—''[[negation as failure]]'', discussed above, and ''strong negation'', which is denoted here by <math>\sim</math>.<ref>{{harvnb|Gelfond|Lifschitz|1991}} call the second negation ''classical'' and denote it by <math>\neg</math>.</ref> The following example, illustrating the difference between the two kinds of negation, belongs to [[John McCarthy (computer scientist)|John McCarthy]]. A school bus may cross railway tracks under the condition that there is no approaching train. If we do not necessarily know whether a train is approaching then the rule using negation as failure


:<math>\hbox{Cross} \leftarrow \hbox{not Train}</math>
:<math>\hbox{Cross} \leftarrow \hbox{not Train}</math>
Line 223: Line 223:
to be either an atom or an atom prefixed with the strong negation symbol. Instead of stable models, this generalization uses ''answer sets'', which may include both atoms and atoms prefixed with strong negation.
to be either an atom or an atom prefixed with the strong negation symbol. Instead of stable models, this generalization uses ''answer sets'', which may include both atoms and atoms prefixed with strong negation.


An alternative approach [Ferraris and Lifschitz, 2005] treats strong negation as a part of an atom, and it does not require any changes in the definition of a stable model. In this theory of strong negation, we distinguish between atoms of two kinds, ''positive'' and ''negative'', and assume that each negative atom is an expression of the form <math>\sim A</math>, where <math>A\ </math> is a positive atom. A set of atoms is called ''coherent'' if it does not contain "complementary" pairs of atoms <math>\ A,\sim A</math>. Coherent stable models of a program are identical to its consistent answer sets in the sense of [Gelfond and Lifschitz, 1991].
An alternative approach [Ferraris and Lifschitz, 2005] treats strong negation as a part of an atom, and it does not require any changes in the definition of a stable model. In this theory of strong negation, we distinguish between atoms of two kinds, ''positive'' and ''negative'', and assume that each negative atom is an expression of the form <math>{\sim} A</math>, where <math>A </math> is a positive atom. A set of atoms is called ''coherent'' if it does not contain "complementary" pairs of atoms <math> A,{\sim} A</math>. Coherent stable models of a program are identical to its consistent answer sets in the sense of [Gelfond and Lifschitz, 1991].


For instance, the program
For instance, the program
Line 231: Line 231:
:<math>q \leftarrow \operatorname{not} p</math>
:<math>q \leftarrow \operatorname{not} p</math>


:<math>r\ </math>
:<math>r </math>


:<math>\sim r\leftarrow \operatorname{not}p</math>
:<math>{\sim} r\leftarrow \operatorname{not}p</math>


has two stable models, <math>\{p,r\}\ </math> and <math>\ \{q,r,\sim r\}</math>. The first model is coherent; the second is not, because it contains both the atom <math>\ r</math> and the atom <math>\ \sim r</math>.
has two stable models, <math>\{p,r\}</math> and <math>\{q,r,{\sim} r\}</math>. The first model is coherent; the second is not, because it contains both the atom <math>r</math> and the atom <math>{\sim} r</math>.


===Closed world assumption===
===Closed world assumption===


According to [Gelfond and Lifschitz, 1991], the [[closed world assumption]] for a predicate <math>p\ </math> can be expressed by the rule
According to [Gelfond and Lifschitz, 1991], the [[closed world assumption]] for a predicate <math>p</math> can be expressed by the rule


:<math>\sim p(X_1,\dots,X_n)\leftarrow\operatorname{not}p(X_1,\dots,X_n)</math>
:<math>\sim p(X_1,\dots,X_n)\leftarrow\operatorname{not}p(X_1,\dots,X_n)</math>


(the relation <math>p\ </math> does not hold for a tuple <math>X_1,\dots,X_n</math> if there is no evidence that it does). For instance, the stable model of the program
(the relation <math>p</math> does not hold for a tuple <math>X_1,\dots,X_n</math> if there is no evidence that it does). For instance, the stable model of the program


:<math>p(a,b)\ </math>
:<math>p(a,b)</math>


:<math>p(c,d)\ </math>
:<math>p(c,d)</math>


:<math>\sim p(X,Y)\leftarrow\operatorname{not}p(X,Y)</math>
:<math>\sim p(X,Y)\leftarrow\operatorname{not}p(X,Y)</math>
Line 253: Line 253:
consists of 2 positive atoms
consists of 2 positive atoms


:<math>p(a,b),\ p(c,d)\ </math>
:<math>p(a,b),p(c,d)</math>


and 14 negative atoms
and 14 negative atoms


:<math>\sim p(a,a),\ \sim p(a,c),\ \dots</math>
:<math>\sim p(a,a),{\sim} p(a,c),\dots</math>


i.e., the strong negations of all other positive ground atoms formed from <math>p,\ a,\ b,\ c,\ d</math>.
i.e., the strong negations of all other positive ground atoms formed from <math>p,a,b,c, d</math>.
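The count of 2 positive and 14 negative atoms can be reproduced with a short Python sketch. It grounds the closed world rule over the constants <math>a, b, c, d</math> and treats each strongly negated atom as an atom in its own right. The tuple encoding `("p", x, y)` and `("~p", x, y)` and the function name are assumptions of this sketch.

```python
from itertools import product

CONSTANTS = ("a", "b", "c", "d")
FACTS = {("p", "a", "b"), ("p", "c", "d")}


def reduct_least_model(interp):
    """Minimal model of the reduct of the ground program relative to interp.

    A ground instance  ~p(x,y) <- not p(x,y)  survives in the reduct, as the
    fact ~p(x,y), exactly when p(x,y) is not in interp.
    """
    model = set(FACTS)
    for x, y in product(CONSTANTS, repeat=2):
        if ("p", x, y) not in interp:
            model.add(("~p", x, y))
    return model


# Candidate: the two facts plus the strong negation of every other ground atom.
candidate = set(FACTS) | {
    ("~p", x, y)
    for x, y in product(CONSTANTS, repeat=2)
    if ("p", x, y) not in FACTS
}

positives = [atom for atom in candidate if atom[0] == "p"]
negatives = [atom for atom in candidate if atom[0] == "~p"]
is_stable = reduct_least_model(candidate) == candidate

print(len(positives), len(negatives), is_stable)  # 2 14 True
```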


A logic program with strong negation can include the closed world assumption rules for some of its predicates and leave the other predicates in the realm of the [[open world assumption]].
A logic program with strong negation can include the closed world assumption rules for some of its predicates and leave the other predicates in the realm of the [[open world assumption]].
Line 265: Line 265:
==Programs with constraints==
==Programs with constraints==


The stable model semantics has been generalized to many kinds of logic programs other than collections of "traditional" rules discussed above&nbsp;— rules of the form
The stable model semantics has been generalized to many kinds of logic programs other than collections of "traditional" rules discussed above—rules of the form


:<math>A \leftarrow B_{1},\dots,B_{m},\operatorname{not} C_{1},\dots,\operatorname{not} C_{n}</math>
:<math>A \leftarrow B_{1},\dots,B_{m},\operatorname{not} C_{1},\dots,\operatorname{not} C_{n}</math>


where <math>A,B_{1},\dots,B_{m},C_{1},\dots,C_{n}</math> are atoms. One simple extension allows programs to contain ''constraints''&nbsp;— rules with the empty head:
where <math>A,B_{1},\dots,B_{m},C_{1},\dots,C_{n}</math> are atoms. One simple extension allows programs to contain ''constraints''—rules with the empty head:


:<math>\leftarrow B_{1},\dots,B_{m},\operatorname{not}C_{1},\dots,\operatorname{not} C_{n}.</math>
:<math>\leftarrow B_{1},\dots,B_{m},\operatorname{not}C_{1},\dots,\operatorname{not} C_{n}.</math>
Line 277: Line 277:
:<math>\neg(B_{1}\land\cdots\land B_{m}\land\neg C_{1}\land\cdots\land\neg C_{n}).</math>
:<math>\neg(B_{1}\land\cdots\land B_{m}\land\neg C_{1}\land\cdots\land\neg C_{n}).</math>


We can now extend the definition of a stable model to programs with constraints. As in the case of traditional programs, we begin with programs that do not contain negation. Such a program may be inconsistent; then we say that it has no stable models. If such a program <math>P</math> is consistent then <math>P</math> has a unique minimal model, and that model is considered the only stable model of <math>P</math>.
We can now extend the definition of a stable model to programs with constraints. As in the case of traditional programs, to define stable models, we begin with programs that do not contain negation. Such a program may be inconsistent; then we say that it has no stable models. If such a program <math>P</math> is consistent then <math>P</math> has a unique minimal model, and that model is considered the only stable model of <math>P</math>.


Next, stable models of arbitrary programs with constraints are defined using reducts, formed in the same way as in the case of traditional programs (see the [[#Definition|definition of a stable model]] above). A set <math>I</math> of atoms is a ''stable model'' of a program <math>P</math> with constraints if the reduct of <math>P</math> relative to <math>I</math> has a stable model, and that stable model equals <math>I</math>.
Next, stable models of arbitrary programs with constraints are defined using reducts, formed in the same way as in the case of traditional programs (see the [[#Definition|definition of a stable model]] above). A set <math>I</math> of atoms is a ''stable model'' of a program <math>P</math> with constraints if the reduct of <math>P</math> relative to <math>I</math> has a stable model, and that stable model equals <math>I</math>.
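The two-step definition above can be sketched in Python. The example program is hypothetical and not taken from the text: it consists of the rules <math>p \leftarrow \operatorname{not} q</math> and <math>q \leftarrow \operatorname{not} p</math> together with the constraint <math>\leftarrow p</math>. The encoding, with `None` as the head of a constraint, is likewise an assumption of this sketch.

```python
from itertools import combinations

# Hypothetical encoding: (head, positive_body, negative_body); head None = constraint.
PROGRAM = [
    ("p", (), ("q",)),   # p <- not q
    ("q", (), ("p",)),   # q <- not p
    (None, ("p",), ()),  # <- p
]


def reduct(program, interp):
    """Drop rules with a negated atom in interp; strip negation from the rest."""
    return [(head, pos) for head, pos, neg in program if not (set(neg) & interp)]


def stable_model_of_positive(rules):
    """Minimal model of a negation-free program, or None if it is inconsistent."""
    model = set()
    changed = True
    while changed:
        changed = False
        for head, pos in rules:
            if head is not None and head not in model and set(pos) <= model:
                model.add(head)
                changed = True
    # A constraint is violated when its whole body holds in the minimal model.
    for head, pos in rules:
        if head is None and set(pos) <= model:
            return None
    return model


def stable_models(program, atoms):
    """I is stable if the reduct relative to I has a stable model equal to I."""
    atoms = sorted(atoms)
    found = []
    for size in range(len(atoms) + 1):
        for combo in combinations(atoms, size):
            candidate = set(combo)
            if stable_model_of_positive(reduct(program, candidate)) == candidate:
                found.append(sorted(candidate))
    return found


print(stable_models(PROGRAM, "pq"))      # [['q']]
print(stable_models(PROGRAM[:2], "pq"))  # [['p'], ['q']]
```

In this example, adding the constraint removes the stable model that contains <math>p</math> and leaves the other one unchanged.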
Line 297: Line 297:
For example, the set <math>\{p,r\}</math> is a stable model of the disjunctive program
For example, the set <math>\{p,r\}</math> is a stable model of the disjunctive program


:<math>p;q\ </math>
:<math>p;q</math>


:<math>r\leftarrow \operatorname{not} q</math>
:<math>r\leftarrow \operatorname{not} q</math>
Line 303: Line 303:
because it is one of two minimal models of the reduct
because it is one of two minimal models of the reduct


:<math>p;q\ </math>
:<math>p;q</math>


:<math>r.\ </math>
:<math>r.</math>


The program above has one more stable model, <math>\{q\}</math>.
The program above has one more stable model, <math>\{q\}</math>.
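Both stable models of this disjunctive program can be found by brute force. The sketch below assumes the encoding `(heads, positive_body, negative_body)`, where `heads` lists the atoms of the disjunction. A set counts as stable when it is a minimal model of the reduct relative to itself. The names are illustrative.

```python
from itertools import combinations

# Hypothetical encoding: (heads, positive_body, negative_body).
PROGRAM = [
    (("p", "q"), (), ()),   # p ; q
    (("r",), (), ("q",)),   # r <- not q
]


def subsets(atoms):
    """Yield every subset of atoms, smallest first."""
    atoms = sorted(atoms)
    for size in range(len(atoms) + 1):
        for combo in combinations(atoms, size):
            yield frozenset(combo)


def reduct(program, interp):
    """Drop rules with a negated atom in interp; strip negation from the rest."""
    return [(heads, pos) for heads, pos, neg in program if not (set(neg) & interp)]


def is_model(positive_program, interp):
    """Whenever a body holds, at least one atom of the head must hold."""
    return all(
        set(heads) & interp
        for heads, pos in positive_program
        if set(pos) <= interp
    )


def is_stable(program, interp):
    """interp must be a model of its reduct with no proper subset that is one."""
    red = reduct(program, interp)
    if not is_model(red, interp):
        return False
    return not any(is_model(red, s) for s in subsets(interp) if s != interp)


def stable_models(program, atoms):
    return [sorted(s) for s in subsets(atoms) if is_stable(program, s)]


print(stable_models(PROGRAM, "pqr"))  # [['q'], ['p', 'r']]
```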
Line 315: Line 315:
Rules, and even [[#Disjunctive programs|disjunctive rules]], have a rather special syntactic form, in comparison with arbitrary [[propositional formula]]s. Each disjunctive rule is essentially an implication such that its [[Antecedent (logic)|antecedent]] (the body of the rule) is a conjunction of [[Literal (mathematical logic)|literals]], and its [[consequent]] (head) is a disjunction of atoms. David Pearce [1997] and Paolo Ferraris [2005] showed how to extend the definition of a stable model to sets of arbitrary propositional formulas. This generalization has applications to [[answer set programming]].
Rules, and even [[#Disjunctive programs|disjunctive rules]], have a rather special syntactic form, in comparison with arbitrary [[propositional formula]]s. Each disjunctive rule is essentially an implication such that its [[Antecedent (logic)|antecedent]] (the body of the rule) is a conjunction of [[Literal (mathematical logic)|literals]], and its [[consequent]] (head) is a disjunction of atoms. David Pearce [1997] and Paolo Ferraris [2005] showed how to extend the definition of a stable model to sets of arbitrary propositional formulas. This generalization has applications to [[answer set programming]].


Pearce's formulation looks very different from the [[#Definition|original definition of a stable model]]. Instead of reducts, it refers to ''equilibrium logic''&nbsp;— a system of [[nonmonotonic logic]] based on [[Kripke semantics|Kripke models]]. Ferraris's formulation, on the other hand, is based on reducts, although the process of constructing the reduct that it uses differs from the one [[#Definition|described above]]. The two approaches to defining stable models for sets of propositional formulas are equivalent to each other.
Pearce's formulation looks very different from the [[#Definition|original definition of a stable model]]. Instead of reducts, it refers to ''equilibrium logic''—a system of [[nonmonotonic logic]] based on [[Kripke semantics|Kripke models]]. Ferraris's formulation, on the other hand, is based on reducts, although the process of constructing the reduct that it uses differs from the one [[#Definition|described above]]. The two approaches to defining stable models for sets of propositional formulas are equivalent to each other.


===General definition of a stable model===
===General definition of a stable model===
Line 323: Line 323:
For instance, the reduct of the set
For instance, the reduct of the set


:<math>\{p,\ p\land q \rightarrow r,\ p \land \neg q \rightarrow s\}</math>
:<math>\{p,p\land q \rightarrow r,p \land \neg q \rightarrow s\}</math>


relative to <math>\{p,s\}</math> is
relative to <math>\{p,s\}</math> is


:<math>\{p,\ \bot\rightarrow \bot,\ p \land \neg\bot \rightarrow s\}.</math>
:<math>\{p, \bot\rightarrow \bot, p \land \neg\bot \rightarrow s\}.</math>


Since <math>\{p,s\}</math> is a model of the reduct, and the proper subsets of that set are not models of the reduct, <math>\{p,s\}</math> is a stable model of the given set of formulas.
Since <math>\{p,s\}</math> is a model of the reduct, and the proper subsets of that set are not models of the reduct, <math>\{p,s\}</math> is a stable model of the given set of formulas.
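The reduct computed above can be reproduced with a small recursive function. The sketch assumes the usual reduct-based definition: every maximal subformula that the candidate set does not satisfy is replaced by <math>\bot</math>. Formulas are encoded as nested tuples; this encoding and the function names are assumptions of the sketch.

```python
from itertools import combinations

BOT = ("bot",)  # falsity; atoms are plain strings

# Hypothetical encoding: ("not", F), ("and", F, G), ("or", F, G), ("imp", F, G).
FORMULAS = [
    "p",
    ("imp", ("and", "p", "q"), "r"),
    ("imp", ("and", "p", ("not", "q")), "s"),
]


def holds(f, interp):
    """Classical satisfaction of formula f by the set of true atoms interp."""
    if f == BOT:
        return False
    if isinstance(f, str):
        return f in interp
    op = f[0]
    if op == "not":
        return not holds(f[1], interp)
    if op == "and":
        return holds(f[1], interp) and holds(f[2], interp)
    if op == "or":
        return holds(f[1], interp) or holds(f[2], interp)
    if op == "imp":
        return (not holds(f[1], interp)) or holds(f[2], interp)
    raise ValueError(f"unknown connective: {op}")


def reduct(f, interp):
    """Replace each maximal subformula not satisfied by interp with BOT."""
    if not holds(f, interp):
        return BOT
    if isinstance(f, str):
        return f
    return (f[0],) + tuple(reduct(g, interp) for g in f[1:])


def is_stable(formulas, interp):
    """interp must be a minimal model of the reduct of the formulas."""
    red = [reduct(f, interp) for f in formulas]
    if not all(holds(f, interp) for f in red):
        return False
    atoms = sorted(interp)
    for size in range(len(atoms)):  # proper subsets only
        for combo in combinations(atoms, size):
            if all(holds(f, set(combo)) for f in red):
                return False
    return True


print([reduct(f, {"p", "s"}) for f in FORMULAS])
print(is_stable(FORMULAS, {"p", "s"}))       # True
print(is_stable(FORMULAS, {"p", "q", "r"}))  # False
```

The first line of output lists the three formulas of the reduct in the tuple encoding: `p`, then <math>\bot \rightarrow \bot</math>, then <math>p \land \neg\bot \rightarrow s</math>.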
Line 355: Line 355:
==References==
==References==


*{{cite book |first1=N. |last1=Bidoit |first2=C. |last2=Froidevaux |chapter=Minimalism subsumes default logic and circumscription |chapter-url= |title=Proceedings: Symposium on Logic in Computer Science, Ithaca, New York, June 22-25, 1987 |publisher=IEEE Computer Society Press |date=1987 |isbn=978-0-8186-0793-6 |id=87CH2464-6 |pages=89–97 |url=}}
*{{cite book |first1=N. |last1=Bidoit |first2=C. |last2=Froidevaux |chapter=Minimalism subsumes default logic and circumscription |chapter-url= |title=Proceedings: [[Symposium on Logic in Computer Science]], Ithaca, New York, June 22-25, 1987 |publisher=IEEE Computer Society Press |date=1987 |isbn=978-0-8186-0793-6 |id=87CH2464-6 |pages=89–97 |url=}}
*{{cite book |first1=T. |last1=Eiter |first2=G. |last2=Gottlob |chapter=Complexity results for disjunctive logic programming and application to nonmonotonic logics |chapter-url=http://www.kr.tuwien.ac.at/staff/eiter/et-archive/ilps93.ps.gz |title=ILPS '93: Proceedings of the 1993 international symposium on Logic programming |publisher=MIT Press |date=1993 |isbn=978-0-262-63152-5 |pages=266–278 }}
*{{cite book |first1=T. |last1=Eiter |first2=G. |last2=Gottlob |author2link = Georg Gottlob|chapter=Complexity results for disjunctive logic programming and application to nonmonotonic logics |chapter-url=http://www.kr.tuwien.ac.at/staff/eiter/et-archive/ilps93.ps.gz |title=ILPS '93: Proceedings of the 1993 international symposium on Logic programming |publisher=MIT Press |date=1993 |isbn=978-0-262-63152-5 |pages=266–278 }}
*{{cite journal |first1=M. |last1=van Emden |author2-link=Robert Kowalski |first2=R. |last2=Kowalski |title=The semantics of predicate logic as a programming language |journal=Journal of the ACM |volume=23 |issue= 4|pages=733–742 |date=1976 |doi=10.1145/321978.321991 |citeseerx=10.1.1.64.9246 |s2cid=11048276 |url=http://www.doc.ic.ac.uk/~rak/papers/kowalski-van_emden.pdf}}
*{{cite journal |first1=M. |last1=van Emden |author2-link=Robert Kowalski |first2=R. |last2=Kowalski |title=The semantics of predicate logic as a programming language |journal=[[Journal of the ACM]] |volume=23 |issue= 4|pages=733–742 |date=1976 |doi=10.1145/321978.321991 |citeseerx=10.1.1.64.9246 |s2cid=11048276 |url=http://www.doc.ic.ac.uk/~rak/papers/kowalski-van_emden.pdf}}
*{{cite journal |first=F. |last=Fages |title=Consistency of Clark's completion and existence of stable models |journal=Journal of Methods of Logic in Computer Science |volume=1 |issue= |pages=51–60 |date=1994 |doi= |citeseerx=10.1.1.48.2157
*{{cite journal |first=F. |last=Fages |title=Consistency of Clark's completion and existence of stable models |journal=Journal of Methods of Logic in Computer Science |volume=1 |issue= |pages=51–60 |date=1994 |doi= |citeseerx=10.1.1.48.2157
|url=https://www.researchgate.net/publication/220492237}}
|url=https://www.researchgate.net/publication/220492237}}
Line 365: Line 365:
*{{cite book |first1=M. |last1=Gelfond |first2=V. |last2=Lifschitz |chapter=The stable model semantics for logic programming |chapter-url=http://www.cs.utexas.edu/users/vl/papers/stable.ps |title=Proceedings of the Fifth International Conference on Logic Programming (ICLP) |publisher=MIT Press |location= |date=1988 |isbn=978-0-262-61054-4 |pages=1070–80 |url=}}
*{{cite book |first1=M. |last1=Gelfond |first2=V. |last2=Lifschitz |chapter=The stable model semantics for logic programming |chapter-url=http://www.cs.utexas.edu/users/vl/papers/stable.ps |title=Proceedings of the Fifth International Conference on Logic Programming (ICLP) |publisher=MIT Press |location= |date=1988 |isbn=978-0-262-61054-4 |pages=1070–80 |url=}}
*{{cite journal |first1=M. |last1=Gelfond |first2=V. |last2=Lifschitz |title=Classical negation in logic programs and disjunctive databases |journal=New Generation Computing |volume=9 |issue= 3–4|pages=365–385 |date=1991 |doi=10.1007/BF03037169 |url=http://www.cs.utexas.edu/users/vl/papers/clnegdd.ps |citeseerx=10.1.1.49.9332|s2cid=13036056 }}
*{{cite journal |first1=M. |last1=Gelfond |first2=V. |last2=Lifschitz |title=Classical negation in logic programs and disjunctive databases |journal=New Generation Computing |volume=9 |issue= 3–4|pages=365–385 |date=1991 |doi=10.1007/BF03037169 |url=http://www.cs.utexas.edu/users/vl/papers/clnegdd.ps |citeseerx=10.1.1.49.9332|s2cid=13036056 }}
*{{cite journal |first1=S. |last1=Hanks |author2-link=Drew McDermott |first2=D. |last2=McDermott |title=Nonmonotonic logic and temporal projection |journal=Artificial Intelligence |volume=33 |issue= 3|pages=379–412 |date=1987 |doi=10.1016/0004-3702(87)90043-9 |url=https://dx.doi.org/10.1016/0004-3702%2887%2990043-9}}
*{{cite journal |first1=S. |last1=Hanks |author2-link=Drew McDermott |first2=D. |last2=McDermott |title=Nonmonotonic logic and temporal projection |journal=[[Artificial Intelligence (journal)|Artificial Intelligence]]|volume=33 |issue= 3|pages=379–412 |date=1987 |doi=10.1016/0004-3702(87)90043-9 |url=https://dx.doi.org/10.1016/0004-3702%2887%2990043-9}}
*{{cite journal |first1=F. |last1=Lin |first2=Y. |last2=Zhao |title=ASSAT: Computing answer sets of a logic program by SAT solvers |journal=Artificial Intelligence |volume=157 |issue=1–2 |pages=115–137 |date=2004 |doi=10.1016/j.artint.2004.04.004 |s2cid=514581 |url=http://www.cs.ust.hk/faculty/flin/papers/assat-aij-revised.pdf}}
*{{cite journal |first1=F. |last1=Lin |first2=Y. |last2=Zhao |title=ASSAT: Computing answer sets of a logic program by SAT solvers |journal=Artificial Intelligence |volume=157 |issue=1–2 |pages=115–137 |date=2004 |doi=10.1016/j.artint.2004.04.004 |s2cid=514581 |url=http://www.cs.ust.hk/faculty/flin/papers/assat-aij-revised.pdf}}
*{{cite book |first1=V. |last1=Marek |first2=V.S. |last2=Subrahmanian |chapter=The relationship between logic program semantics and non-monotonic reasoning |chapter-url= |title=Logic Programming: Proceedings of the Sixth International Conference |publisher=MIT Press |date=1989 |isbn=978-0-262-62065-9 |pages=600–617 |url=}}
*{{cite book |first1=V. |last1=Marek |first2=V.S. |last2=Subrahmanian |chapter=The relationship between logic program semantics and non-monotonic reasoning |chapter-url= |title=Logic Programming: Proceedings of the Sixth International Conference |publisher=MIT Press |date=1989 |isbn=978-0-262-62065-9 |pages=600–617 |url=}}

Revision as of 23:12, 1 October 2023

The concept of a stable model, or answer set, is used to define a declarative semantics for logic programs with negation as failure. This is one of several standard approaches to the meaning of negation in logic programming, along with program completion and the well-founded semantics. The stable model semantics is the basis of answer set programming.

Motivation

Research on the declarative semantics of negation in logic programming was motivated by the fact that the behavior of SLDNF resolution (the generalization of SLD resolution used by Prolog in the presence of negation in the bodies of rules) does not fully match the truth tables familiar from classical propositional logic. Consider, for instance, the program

p
r ← p, q
s ← p, not q.

Given this program, the query p will succeed, because the program includes p as a fact; the query q will fail, because it does not occur in the head of any of the rules. The query r will fail also, because the only rule with r in the head contains the subgoal q in its body; as we have seen, that subgoal fails. Finally, the query s succeeds, because each of the subgoals p, not q succeeds. (The latter succeeds because the corresponding positive goal q fails.) To sum up, the behavior of SLDNF resolution on the given program can be represented by the following truth assignment:

p q r s
T F F T.

On the other hand, the rules of the given program can be viewed as propositional formulas if we identify the comma with conjunction ∧, the symbol not with negation ¬, and agree to treat F ← G as the implication G → F written backwards. For instance, the last rule of the given program is, from this point of view, alternative notation for the propositional formula

p ∧ ¬q → s.

If we calculate the truth values of the rules of the program for the truth assignment shown above then we will see that each rule gets the value T. In other words, that assignment is a model of the program. But this program also has other models, for instance

p q r s
T T T F.

Thus one of the models of the given program is special in the sense that it correctly represents the behavior of SLDNF resolution. What are the mathematical properties of that model that make it special? An answer to this question is provided by the definition of a stable model.
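The claim that the program has several classical models can be checked by brute force. The following is a minimal sketch, assuming the program p, r ← p, q, s ← p, not q is read as the formulas p, p ∧ q → r, p ∧ ¬q → s; the names `ATOMS`, `satisfies` and `models` are illustrative and not part of the article.

```python
from itertools import product

ATOMS = ("p", "q", "r", "s")

def satisfies(v):
    """Return True if the assignment v (dict atom -> bool) makes all three rules true."""
    rule1 = v["p"]                                   # p
    rule2 = (not (v["p"] and v["q"])) or v["r"]      # p ∧ q → r
    rule3 = (not (v["p"] and not v["q"])) or v["s"]  # p ∧ ¬q → s
    return rule1 and rule2 and rule3

# Each model is written as the set of atoms that get the value T.
models = [
    {a for a, val in zip(ATOMS, values) if val}
    for values in product((True, False), repeat=len(ATOMS))
    if satisfies(dict(zip(ATOMS, values)))
]
```

Under this reading, `models` contains both {p, s} (the assignment T F F T) and {p, q, r} (the assignment T T T F), among others, so classical satisfaction alone does not single out the assignment produced by SLDNF resolution.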

Relation to nonmonotonic logic

The meaning of negation in logic programs is closely related to two theories of nonmonotonic reasoning: autoepistemic logic and default logic. The discovery of these relationships was a key step towards the invention of the stable model semantics.

The syntax of autoepistemic logic uses a modal operator that allows us to distinguish between what is true and what is known. Michael Gelfond [1987] proposed to read not p in the body of a rule as "p is not known", and to understand a rule with negation as the corresponding formula of autoepistemic logic. The stable model semantics, in its basic form, can be viewed as a reformulation of this idea that avoids explicit references to autoepistemic logic.

In default logic, a default is similar to an inference rule, except that it includes, besides its premises and conclusion, a list of formulas called justifications. A default can be used to derive its conclusion under the assumption that its justifications are consistent with what is currently known. Nicole Bidoit and Christine Froidevaux [1987] proposed to treat negated atoms in the bodies of rules as justifications. For instance, the rule

s ← p, not q

can be understood as the default that allows us to derive s from p assuming that ¬q is consistent. The stable model semantics uses the same idea, but it does not explicitly refer to default logic.

Stable models

The definition of a stable model below, reproduced from [Gelfond and Lifschitz, 1988], uses two conventions. First, a truth assignment is identified with the set of atoms that get the value T. For instance, the truth assignment

p q r s
T F F T.

is identified with the set {p, s}. This convention allows us to use the set inclusion relation to compare truth assignments with each other. The smallest of all truth assignments, ∅, is the one that makes every atom false; the largest truth assignment makes every atom true.

Second, a logic program with variables is viewed as shorthand for the set of all ground instances of its rules, that is, for the result of substituting variable-free terms for variables in the rules of the program in all possible ways. For instance, the logic programming definition of even numbers

even(0)
even(s(X)) ← not even(X)

is understood as the result of replacing X in this program by the ground terms

0, s(0), s(s(0)), …

in all possible ways. The result is the infinite ground program

even(0)
even(s(0)) ← not even(0)
even(s(s(0))) ← not even(s(0))
…

Definition

Let P be a set of rules of the form

A ← B₁, …, Bₘ, not C₁, …, not Cₙ

where A, B₁, …, Bₘ, C₁, …, Cₙ are ground atoms. If P does not contain negation (n = 0 in every rule of the program) then, by definition, the only stable model of P is its model that is minimal relative to set inclusion.[1] (Any program without negation has exactly one minimal model.) To extend this definition to the case of programs with negation, we need the auxiliary concept of the reduct, defined as follows.

For any set I of ground atoms, the reduct of P relative to I is the set of rules without negation obtained from P by first dropping every rule such that at least one of the atoms Cᵢ in its body

B₁, …, Bₘ, not C₁, …, not Cₙ

belongs to I, and then dropping the parts not C₁, …, not Cₙ from the bodies of all remaining rules.

We say that I is a stable model of P if I is the stable model of the reduct of P relative to I. (Since the reduct does not contain negation, its stable model has been already defined.) As the term "stable model" suggests, every stable model of P is a model of P.

Example

To illustrate these definitions, let us check that {p, s} is a stable model of the program

p
r ← p, q
s ← p, not q.

The reduct of this program relative to {p, s} is

p
r ← p, q
s ← p.

(Indeed, since q ∉ {p, s}, the reduct is obtained from the program by dropping the part not q.) The stable model of the reduct is {p, s}. (Indeed, this set of atoms satisfies every rule of the reduct, and it has no proper subsets with the same property.) Thus after computing the stable model of the reduct we arrived at the same set that we started with. Consequently, that set is a stable model.

Checking in the same way the other 15 sets consisting of the atoms p, q, r, s shows that this program has no other stable models. For instance, the reduct of the program relative to {p, q, r} is

p
r ← p, q.

The stable model of the reduct is {p}, which is different from the set {p, q, r} that we started with.
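The check over all 16 candidate sets can be mechanized. The following is a minimal sketch, assuming a ground rule is encoded as a triple (head, positive body, negated atoms); the encoding and the names `reduct`, `minimal_model` and `stable_models` are illustrative choices, not notation from the article.

```python
from itertools import combinations

# p.   r ← p, q.   s ← p, not q.
PROGRAM = [
    ("p", (), ()),
    ("r", ("p", "q"), ()),
    ("s", ("p",), ("q",)),
]

def reduct(program, candidate):
    """Drop every rule with a negated atom in candidate; strip 'not' parts from the rest."""
    return [(head, pos) for head, pos, neg in program if not set(neg) & candidate]

def minimal_model(positive_program):
    """Minimal model of a negation-free program, computed by forward chaining."""
    model = set()
    changed = True
    while changed:
        changed = False
        for head, pos in positive_program:
            if head not in model and set(pos) <= model:
                model.add(head)
                changed = True
    return model

def stable_models(program, atoms):
    """Return every subset I of atoms that equals the minimal model of its own reduct."""
    result = []
    for size in range(len(atoms) + 1):
        for combo in combinations(atoms, size):
            candidate = set(combo)
            if minimal_model(reduct(program, candidate)) == candidate:
                result.append(candidate)
    return result
```

With this encoding, `stable_models(PROGRAM, ["p", "q", "r", "s"])` tries all 16 subsets and keeps only {p, s}. The enumeration is exponential in the number of atoms, so it is meant as an illustration of the definition rather than a practical solver.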

Programs without a unique stable model

A program with negation may have many stable models or no stable models. For instance, the program

p ← not q
q ← not p

has two stable models {p}, {q}. The one-rule program

p ← not p

has no stable models.
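Both situations can be reproduced with a small guess-and-check procedure. This is a sketch under the assumption that a ground rule is a triple (head, positive body, negated atoms); `TWO_MODELS` encodes p ← not q, q ← not p, and `NO_MODELS` encodes p ← not p. All names are hypothetical.

```python
from itertools import combinations

def least_model(positive_rules):
    """Least model of negation-free rules (head, positive_body), by forward chaining."""
    model = set()
    changed = True
    while changed:
        changed = False
        for head, pos in positive_rules:
            if head not in model and set(pos) <= model:
                model.add(head)
                changed = True
    return model

def stable_models(program):
    """Enumerate candidate sets and keep those reproduced by their own reduct."""
    atoms = sorted({a for head, pos, neg in program for a in (head, *pos, *neg)})
    found = []
    for size in range(len(atoms) + 1):
        for combo in combinations(atoms, size):
            candidate = set(combo)
            reduct = [(h, pos) for h, pos, neg in program if not set(neg) & candidate]
            if least_model(reduct) == candidate:
                found.append(candidate)
    return found

TWO_MODELS = [("p", (), ("q",)), ("q", (), ("p",))]  # p ← not q,  q ← not p
NO_MODELS = [("p", (), ("p",))]                      # p ← not p
```

For the second program, the candidate ∅ yields the reduct {p}, whose least model is {p}, while the candidate {p} yields the empty reduct, whose least model is ∅; neither candidate reproduces itself.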

If we think of the stable model semantics as a description of the behavior of Prolog in the presence of negation then programs without a unique stable model can be judged unsatisfactory: they do not provide an unambiguous specification for Prolog-style query answering. For instance, the two programs above are not reasonable as Prolog programs—SLDNF resolution does not terminate on them.

But the use of stable models in answer set programming provides a different perspective on such programs. In that programming paradigm, a given search problem is represented by a logic program so that the stable models of the program correspond to solutions. Then programs with many stable models correspond to problems with many solutions, and programs without stable models correspond to unsolvable problems. For instance, the eight queens puzzle has 92 solutions; to solve it using answer set programming, we encode it by a logic program with 92 stable models. From this point of view, logic programs with exactly one stable model are rather special in answer set programming, like polynomials with exactly one root in algebra.

Properties of the stable model semantics

In this section, as in the definition of a stable model above, by a logic program we mean a set of rules of the form

A ← B₁, …, Bₘ, not C₁, …, not Cₙ

where A, B₁, …, Bₘ, C₁, …, Cₙ are ground atoms.

Head atoms
If an atom A belongs to a stable model of a logic program P then A is the head of one of the rules of P.
Minimality
Any stable model of a logic program P is minimal among the models of P relative to set inclusion.
The antichain property
If I and J are stable models of the same logic program then I is not a proper subset of J. In other words, the set of stable models of a program is an antichain.
NP-completeness
Testing whether a finite ground logic program has a stable model is NP-complete.

Relation to other theories of negation as failure

Program completion

Any stable model of a finite ground program is not only a model of the program itself, but also a model of its completion [Marek and Subrahmanian, 1989]. The converse, however, is not true. For instance, the completion of the one-rule program

p ← p

is the tautology p ↔ p. The model ∅ of this tautology is a stable model of p ← p, but its other model {p} is not. François Fages [1994] found a syntactic condition on logic programs that eliminates such counterexamples and guarantees the stability of every model of the program's completion. The programs that satisfy his condition are called tight.
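The gap between completion models and stable models for the program p ← p can be seen in a few lines. This is a minimal sketch for finite ground programs, assuming rules are triples (head, positive body, negated atoms) and that the completion requires each atom to be true exactly when the body of at least one of its rules is true; the function names are illustrative.

```python
from itertools import combinations

PROGRAM = [("p", ("p",), ())]  # the one-rule program p ← p
ATOMS = ["p"]

def subsets(atoms):
    """Yield all subsets of atoms, smallest first."""
    for size in range(len(atoms) + 1):
        for combo in combinations(atoms, size):
            yield set(combo)

def body_holds(pos, neg, interp):
    return set(pos) <= interp and not set(neg) & interp

def is_completion_model(program, atoms, interp):
    """Each atom must be true iff the body of some rule with that head is true."""
    return all(
        (a in interp) == any(body_holds(pos, neg, interp)
                             for head, pos, neg in program if head == a)
        for a in atoms
    )

def least_model(positive_rules):
    model = set()
    changed = True
    while changed:
        changed = False
        for head, pos in positive_rules:
            if head not in model and set(pos) <= model:
                model.add(head)
                changed = True
    return model

def is_stable(program, interp):
    reduct = [(h, pos) for h, pos, neg in program if not set(neg) & interp]
    return least_model(reduct) == interp

completion_models = [i for i in subsets(ATOMS) if is_completion_model(PROGRAM, ATOMS, i)]
stable = [i for i in subsets(ATOMS) if is_stable(PROGRAM, i)]
```

Here `completion_models` contains both ∅ and {p}, while `stable` contains only ∅: the atom p in {p} is supported only by itself, which the completion accepts and the minimality requirement rejects.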

Fangzhen Lin and Yuting Zhao [2004] showed how to make the completion of a nontight program stronger so that all its nonstable models will be eliminated. The additional formulas that they add to the completion are called loop formulas.

Well-founded semantics

The well-founded model of a logic program partitions all ground atoms into three sets: true, false and unknown. If an atom is true in the well-founded model of P then it belongs to every stable model of P. The converse, generally, does not hold. For instance, the program

p ← not q
q ← not p
r ← p
r ← q

has two stable models, {p, r} and {q, r}. Even though r belongs to both of them, its value in the well-founded model is unknown.

Furthermore, if an atom is false in the well-founded model of a program then it does not belong to any of its stable models. Thus the well-founded model of a logic program provides a lower bound on the intersection of its stable models and an upper bound on their union.

Strong negation

Representing incomplete information

From the perspective of knowledge representation, a set of ground atoms can be thought of as a description of a complete state of knowledge: the atoms that belong to the set are known to be true, and the atoms that do not belong to the set are known to be false. A possibly incomplete state of knowledge can be described using a consistent but possibly incomplete set of literals; if an atom p does not belong to the set and its negation does not belong to the set either then it is not known whether p is true or false.

In the context of logic programming, this idea leads to the need to distinguish between two kinds of negation: negation as failure, discussed above, and strong negation, which is denoted here by ∼.[2] The following example, illustrating the difference between the two kinds of negation, is due to John McCarthy. A school bus may cross railway tracks under the condition that there is no approaching train. If we do not necessarily know whether a train is approaching then the rule using negation as failure

Cross ← not Train

is not an adequate representation of this idea: it says that it's okay to cross in the absence of information about an approaching train. The weaker rule, which uses strong negation in the body, is preferable:

Cross ← ∼Train.

It says that it's okay to cross if we know that no train is approaching.

Coherent stable models

To incorporate strong negation in the theory of stable models, Gelfond and Lifschitz [1991] allowed each of the expressions A, Bᵢ, Cᵢ in a rule

A ← B₁, …, Bₘ, not C₁, …, not Cₙ

to be either an atom or an atom prefixed with the strong negation symbol. Instead of stable models, this generalization uses answer sets, which may include both atoms and atoms prefixed with strong negation.

An alternative approach [Ferraris and Lifschitz, 2005] treats strong negation as a part of an atom, and it does not require any changes in the definition of a stable model. In this theory of strong negation, we distinguish between atoms of two kinds, positive and negative, and assume that each negative atom is an expression of the form ∼A, where A is a positive atom. A set of atoms is called coherent if it does not contain "complementary" pairs of atoms A, ∼A. Coherent stable models of a program are identical to its consistent answer sets in the sense of [Gelfond and Lifschitz, 1991].

For instance, the program

p ← not q
q ← not p
r
∼r ← not p

has two stable models, {p, r} and {q, r, ∼r}. The first model is coherent; the second is not, because it contains both the atom r and the atom ∼r.

Closed world assumption

According to [Gelfond and Lifschitz, 1991], the closed world assumption for a predicate p can be expressed by the rule

∼p(X₁, …, Xₙ) ← not p(X₁, …, Xₙ)

(the relation p does not hold for a tuple X₁, …, Xₙ if there is no evidence that it does). For instance, the stable model of the program

p(a, b)
p(c, d)
∼p(X, Y) ← not p(X, Y)

consists of 2 positive atoms

p(a, b), p(c, d)

and 14 negative atoms

∼p(a, a), ∼p(a, c), …

i.e., the strong negations of all other positive ground atoms formed from p, a, b, c, d.

A logic program with strong negation can include the closed world assumption rules for some of its predicates and leave the other predicates in the realm of the open world assumption.
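The count of 2 positive and 14 negative atoms can be reproduced by grounding the closed world assumption rule over the four constants. The sketch below assumes the example program consists of the facts p(a, b) and p(c, d) plus the rule ∼p(X, Y) ← not p(X, Y), and encodes ∼p as a separate predicate name "~p", in the spirit of treating strong negation as part of the atom. It builds the expected set directly and then verifies it against its reduct, since enumerating all 2³² candidate sets would be impractical. All names are illustrative.

```python
from itertools import product

CONSTANTS = ("a", "b", "c", "d")
FACTS = {("p", "a", "b"), ("p", "c", "d")}

# Ground program as triples (head, positive body, negated atoms):
# the two facts, plus one instance of  ~p(X,Y) ← not p(X,Y)  per pair of constants.
ground = [(fact, (), ()) for fact in sorted(FACTS)]
ground += [(("~p", x, y), (), (("p", x, y),)) for x, y in product(CONSTANTS, repeat=2)]

def least_model(positive_rules):
    """Least model of negation-free rules (head, positive_body), by forward chaining."""
    model = set()
    changed = True
    while changed:
        changed = False
        for head, pos in positive_rules:
            if head not in model and set(pos) <= model:
                model.add(head)
                changed = True
    return model

# Candidate: the facts plus ~p(x, y) for every pair that is not a fact.
candidate = FACTS | {
    ("~p", x, y) for x, y in product(CONSTANTS, repeat=2) if ("p", x, y) not in FACTS
}

# Stability check: the least model of the reduct must reproduce the candidate.
reduct = [(head, pos) for head, pos, neg in ground if not set(neg) & candidate]
is_stable = least_model(reduct) == candidate
negative = {atom for atom in candidate if atom[0] == "~p"}
```

The candidate passes the stability check, and `negative` has 14 elements, one for each of the 16 ground pairs other than (a, b) and (c, d).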

Programs with constraints

The stable model semantics has been generalized to many kinds of logic programs other than collections of "traditional" rules discussed above, that is, rules of the form

A ← B₁, …, Bₘ, not C₁, …, not Cₙ

where A, B₁, …, Bₘ, C₁, …, Cₙ are atoms. One simple extension allows programs to contain constraints, that is, rules with the empty head:

← B₁, …, Bₘ, not C₁, …, not Cₙ.

Recall that a traditional rule can be viewed as alternative notation for a propositional formula if we identify the comma with conjunction ∧, the symbol not with negation ¬, and agree to treat F ← G as the implication G → F written backwards. To extend this convention to constraints, we identify a constraint with the negation of the formula corresponding to its body:

¬(B₁ ∧ … ∧ Bₘ ∧ ¬C₁ ∧ … ∧ ¬Cₙ).

We can now extend the definition of a stable model to programs with constraints. As in the case of traditional programs, to define stable models, we begin with programs that do not contain negation. Such a program may be inconsistent; then we say that it has no stable models. If such a program P is consistent then P has a unique minimal model, and that model is considered the only stable model of P.

Next, stable models of arbitrary programs with constraints are defined using reducts, formed in the same way as in the case of traditional programs (see the definition of a stable model above). A set I of atoms is a stable model of a program P with constraints if the reduct of P relative to I has a stable model, and that stable model equals I.

The properties of the stable model semantics stated above for traditional programs hold in the presence of constraints as well.

Constraints play an important role in answer set programming because adding a constraint to a logic program P affects the collection of stable models of P in a very simple way: it eliminates the stable models that violate the constraint. In other words, for any program P with constraints and any constraint C, the stable models of P ∪ {C} can be characterized as the stable models of P that satisfy C.
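The filtering effect of a constraint can be illustrated on a two-rule program. This is a sketch under stated assumptions: rules are triples (head, positive body, negated atoms), a constraint is encoded with head `None`, `P` stands for the program p ← not q, q ← not p, and `C` for the constraint ← p. These encodings and names are hypothetical.

```python
from itertools import combinations

def least_model(positive_rules):
    """Least model of negation-free rules (head, positive_body), by forward chaining."""
    model = set()
    changed = True
    while changed:
        changed = False
        for head, pos in positive_rules:
            if head not in model and set(pos) <= model:
                model.add(head)
                changed = True
    return model

def stable_models(program, atoms):
    found = []
    for size in range(len(atoms) + 1):
        for combo in combinations(atoms, size):
            candidate = set(combo)
            reduct = [(h, pos) for h, pos, neg in program if not set(neg) & candidate]
            rules = [(h, pos) for h, pos in reduct if h is not None]
            constraints = [pos for h, pos in reduct if h is None]
            model = least_model(rules)
            # The negation-free reduct is consistent iff no constraint body holds in its least model.
            consistent = not any(set(body) <= model for body in constraints)
            if consistent and model == candidate:
                found.append(candidate)
    return found

def satisfies(constraint, interp):
    """A constraint is satisfied when its body is false in interp."""
    _, pos, neg = constraint
    return not (set(pos) <= interp and not set(neg) & interp)

P = [("p", (), ("q",)), ("q", (), ("p",))]  # p ← not q,  q ← not p
C = (None, ("p",), ())                      # the constraint ← p
```

For this example, `stable_models(P + [C], ["p", "q"])` and the filtered list `[m for m in stable_models(P, ["p", "q"]) if satisfies(C, m)]` agree: both contain only {q}.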

Disjunctive programs

In a disjunctive rule, the head may be the disjunction of several atoms:

A₁; …; Aₖ ← B₁, …, Bₘ, not C₁, …, not Cₙ

(the semicolon is viewed as alternative notation for disjunction ∨). Traditional rules correspond to k = 1, and constraints to k = 0. To extend the stable model semantics to disjunctive programs [Gelfond and Lifschitz, 1991], we first define that in the absence of negation (n = 0 in each rule) the stable models of a program are its minimal models. The definition of the reduct for disjunctive programs remains the same as before. A set I of atoms is a stable model of P if I is a stable model of the reduct of P relative to I.

For example, the set {p, r} is a stable model of the disjunctive program

p; q
r ← not q

because it is one of two minimal models of the reduct

p; q
r.

The program above has one more stable model, {q}.

As in the case of traditional programs, each element of any stable model of a disjunctive program P is a head atom of P, in the sense that it occurs in the head of one of the rules of P. As in the traditional case, the stable models of a disjunctive program are minimal and form an antichain. Testing whether a finite disjunctive program has a stable model is Σ₂ᴾ-complete [Eiter and Gottlob, 1993].
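For disjunctive programs the reduct may have several minimal models, so the stability test becomes a minimality test. The following is a brute-force sketch, assuming a rule is a triple (head atoms, positive body, negated atoms) and taking the program p; q, r ← not q as the example; the names are illustrative.

```python
from itertools import combinations

# p ; q.    r ← not q.
PROGRAM = [
    (("p", "q"), (), ()),
    (("r",), (), ("q",)),
]
ATOMS = ("p", "q", "r")

def subsets(atoms):
    """Yield all subsets of atoms, smallest first."""
    items = sorted(atoms)
    for size in range(len(items) + 1):
        for combo in combinations(items, size):
            yield set(combo)

def is_model(positive_rules, interp):
    """A rule holds if its body is not contained in interp or some head atom is in interp."""
    return all(
        not set(pos) <= interp or bool(set(heads) & interp)
        for heads, pos in positive_rules
    )

def stable_models(program, atoms):
    result = []
    for cand in subsets(atoms):
        red = [(heads, pos) for heads, pos, neg in program if not set(neg) & cand]
        # cand is stable iff it is a minimal model of its own reduct.
        if is_model(red, cand) and not any(
            is_model(red, sub) for sub in subsets(cand) if sub != cand
        ):
            result.append(cand)
    return result
```

On this program the procedure returns {q} and {p, r}. The nested enumeration of subsets reflects the extra layer of complexity: checking a single candidate already requires ruling out all smaller models of the reduct.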

Stable models of a set of propositional formulas

Rules, and even disjunctive rules, have a rather special syntactic form, in comparison with arbitrary propositional formulas. Each disjunctive rule is essentially an implication such that its antecedent (the body of the rule) is a conjunction of literals, and its consequent (head) is a disjunction of atoms. David Pearce [1997] and Paolo Ferraris [2005] showed how to extend the definition of a stable model to sets of arbitrary propositional formulas. This generalization has applications to answer set programming.

Pearce's formulation looks very different from the original definition of a stable model. Instead of reducts, it refers to equilibrium logic—a system of nonmonotonic logic based on Kripke models. Ferraris's formulation, on the other hand, is based on reducts, although the process of constructing the reduct that it uses differs from the one described above. The two approaches to defining stable models for sets of propositional formulas are equivalent to each other.

General definition of a stable model

According to [Ferraris, 2005], the reduct of a propositional formula F relative to a set I of atoms is the formula obtained from F by replacing each maximal subformula that is not satisfied by I with the logical constant ⊥ (false). The reduct of a set P of propositional formulas relative to I consists of the reducts of all formulas from P relative to I. As in the case of disjunctive programs, we say that a set I of atoms is a stable model of P if I is minimal (with respect to set inclusion) among the models of the reduct of P relative to I.

For instance, the reduct of the set

{p, p ∧ q → r, p ∧ ¬q → s}

relative to {p, s} is

{p, ⊥ → ⊥, p ∧ ¬⊥ → s}.

Since {p, s} is a model of the reduct, and the proper subsets of that set are not models of the reduct, {p, s} is a stable model of the given set of formulas.

We have seen that {p, s} is also a stable model of the same formula, written in logic programming notation, in the sense of the original definition. This is an instance of a general fact: in application to a set of (formulas corresponding to) traditional rules, the definition of a stable model according to Ferraris is equivalent to the original definition. The same is true, more generally, for programs with constraints and for disjunctive programs.
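The reduct for arbitrary formulas can be sketched as a recursive function on syntax trees. The code below assumes formulas are nested tuples tagged "atom", "bot", "not", "and", "or" and "imp"; this encoding, and the names `sat`, `reduct` and `stable_models`, are illustrative and not taken from [Ferraris, 2005].

```python
from itertools import combinations

BOT = ("bot",)

def atom(name):
    return ("atom", name)

def sat(f, interp):
    """Classical satisfaction of formula f by the set of atoms interp."""
    tag = f[0]
    if tag == "atom":
        return f[1] in interp
    if tag == "bot":
        return False
    if tag == "not":
        return not sat(f[1], interp)
    if tag == "and":
        return sat(f[1], interp) and sat(f[2], interp)
    if tag == "or":
        return sat(f[1], interp) or sat(f[2], interp)
    if tag == "imp":
        return (not sat(f[1], interp)) or sat(f[2], interp)
    raise ValueError(f"unknown connective: {tag}")

def reduct(f, interp):
    """Replace each maximal subformula not satisfied by interp with ⊥."""
    if not sat(f, interp):
        return BOT
    if f[0] == "atom":
        return f
    return (f[0],) + tuple(reduct(g, interp) for g in f[1:])

def subsets(atoms):
    items = sorted(atoms)
    for size in range(len(items) + 1):
        for combo in combinations(items, size):
            yield set(combo)

def stable_models(formulas, atoms):
    result = []
    for cand in subsets(atoms):
        red = [reduct(f, cand) for f in formulas]
        def is_model(i):
            return all(sat(g, i) for g in red)
        # cand must be minimal among the models of its own reduct.
        if is_model(cand) and not any(is_model(s) for s in subsets(cand) if s != cand):
            result.append(cand)
    return result

p, q, r, s = atom("p"), atom("q"), atom("r"), atom("s")
FORMULAS = [p, ("imp", ("and", p, q), r), ("imp", ("and", p, ("not", q)), s)]
```

For the set {p, p ∧ q → r, p ∧ ¬q → s}, the reduct relative to {p, s} comes out as p, ⊥ → ⊥, p ∧ ¬⊥ → s, and {p, s} is the only set that passes the minimality test. Applied to the single formula p ∨ ¬p, the same procedure accepts both ∅ and {p}.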

Properties of the general stable model semantics

The theorem asserting that all elements of any stable model of a program P are head atoms of P can be extended to sets of propositional formulas, if we define head atoms as follows. An atom A is a head atom of a set P of propositional formulas if at least one occurrence of A in a formula from P is neither in the scope of a negation nor in the antecedent of an implication. (We assume here that equivalence is treated as an abbreviation, not a primitive connective.)

The minimality and the antichain property of stable models of a traditional program do not hold in the general case. For instance, (the singleton set consisting of) the formula

p ∨ ¬p

has two stable models, ∅ and {p}. The latter is not minimal, and it is a proper superset of the former.

Testing whether a finite set of propositional formulas has a stable model is Σ₂ᴾ-complete, as in the case of disjunctive programs.

See also

Notes

  1. ^ This approach to the semantics of logic programs without negation is due to Maarten van Emden and Robert Kowalski (van Emden & Kowalski 1976).
  2. ^ Gelfond & Lifschitz 1991 call the second negation classical and denote it by ¬.

References