Side effect (computer science): Difference between revisions
reading data per se is normally not observable, needs clarification if people feel necessary to reintroduce |
m Add link to Effect system that keeps track of effects and Frame problem related to treatment of state in AI |
||
(155 intermediate revisions by 91 users not shown) | |||
Line 1: | Line 1: | ||
{{short description|Of a function, an additional effect besides returning a value}} |
|||
In [[computer science]], a [[subroutine|function]] or [[expression (programming)|expression]] is said to have a '''side effect''' if, in addition to producing a value, it also modifies some [[state (computer science)|state]] or has an ''observable'' interaction with calling functions or the outside world. For example, a function might modify a global or a [[static variable]], modify one of its arguments, raise an exception, write data to a display or file, or call other side-effecting functions. In the presence of side effects, a program's behavior depends on history; that is, the order of evaluation matters. Because understanding an effectful program requires thinking about all possible histories, side effects often make a program harder to understand. |
|||
{{use dmy dates|date=December 2021|cs1-dates=y}} |
|||
{{use list-defined references|date=August 2022}} |
|||
{{redir|Hidden side effect|similar uses|hidden variable (disambiguation)}} |
|||
In [[computer science]], an operation, [[subroutine|function]] or [[expression (programming)|expression]] is said to have a '''side effect''' if it has any observable effect other than its primary effect of reading the value of its arguments and returning a value to the invoker of the operation. Example side effects include modifying a [[non-local variable]], a [[static local variable]] or a mutable argument [[Evaluation_strategy#Call by reference|passed by reference]]; raising errors or exceptions; performing [[I/O]]; or calling other functions with side-effects.<ref name="Spuler-Sajeev_1994"/> In the presence of side effects, a program's behaviour may depend on history; that is, the order of evaluation matters. Understanding and debugging a function with side effects requires knowledge about the context and its possible histories.<ref name="Hughes_1990"/><ref name="Collberg_2005"/> |
|||
Side effects play an important role in the design and analysis of [[programming language]]s. The degree to which side effects are used depends on the programming paradigm. For example, [[imperative programming]] is commonly used to produce side effects, to update a system's state. By contrast, [[declarative programming]] is commonly used to report on the state of system, without side effects. |
|||
[[Functional programming]] aims to minimize or eliminate side effects. The lack of side effects makes it easier to do [[formal verification]] of a program. The functional language [[Haskell (programming language)|Haskell]] eliminates side effects such as [[Input/output|I/O]] and other stateful computations by replacing them with [[Monad (functional programming)|monadic]] actions.<ref name="Haskell_1998"/><ref name="Jones-Wadler_1993"/> Functional languages such as [[Standard ML]], [[Scheme (programming language)|Scheme]] and [[Scala (programming language)|Scala]] do not restrict side effects, but it is customary for programmers to avoid them.<ref name="Felleisen_2014"/> |
|||
Side effects are the most common way to enable a program to interact with the outside world (people, filesystems, other computers on networks). But the degree to which side effects are used depends on the programming paradigm. [[Imperative programming]] is known for uncontrolled, promiscuous use of side effects. In [[functional programming]], side effects are rarely used. Functional languages such as [[Standard ML]] and [[Scheme (programming language)|Scheme]] do not restrict side effects, but it is customary for programmers to avoid them.<ref>[http://www.htdp.org/ Matthias Felleisen et al., ''How To Design Programs'', MIT Press]</ref> The functional language [[Haskell (programming language)|Haskell]] restricts side effects with a static [[type system]]; it uses the concept of [[Monad_(functional_programming)|monads]] to do stateful and IO computations. <ref>Haskell 98 report, http://www.haskell.org.</ref><ref>''Imperative Functional Programming'', Simon Peyton Jones and Phil Wadler, <em>Conference Record of the 20th Annual ACM Symposium on Principles of Programming Languages</em>, pages 71–84, 1993</ref> |
|||
[[Effect system]]s extend types to keep track of effects, permitting concise notation for functions with effects, while maintaining information about the extent and nature of side effects. In particular, functions without effects correspond to pure functions. |
|||
Assembly-language programmers must be aware of ''hidden'' side effects — instructions that modify parts of the processor state which are not mentioned in the instruction's mnemonic. A classic example of a hidden side effect is an arithmetic instruction which explicitly modifies a register (an overt effect) and implicitly modifies condition codes (a hidden side effect). One defect of an instruction set with many hidden side effects is that if many instructions all have side effects on a single piece of state, like condition codes, then the logic required to update that state sequentially may become a performance bottleneck. The problem is particularly acute on processors designed with [[instruction pipeline]] (since 1990) or with out-of-order execution. Such a processor may require additional control circuitry to detect hidden side effects and stall the pipeline if the next instruction depends on the results of those effects. |
|||
[[Assembly language]] programmers must be aware of ''hidden'' side effects—instructions that modify parts of the processor state which are not mentioned in the instruction's mnemonic. A classic example of a hidden side effect is an arithmetic instruction that implicitly modifies [[status register|condition codes]] (a hidden side effect) while it explicitly modifies a [[processor register|register]] (the intended effect). One potential drawback of an [[instruction set]] with hidden side effects is that, if many instructions have side effects on a single piece of state, like condition codes, then the logic required to update that state sequentially may become a performance bottleneck. The problem is particularly acute on some processors designed with [[instruction pipeline|pipelining]] (since 1990) or with [[out-of-order execution]]. Such a processor may require additional control circuitry to detect hidden side effects and stall the pipeline if the next instruction depends on the results of those effects. |
|||
== Referential transparency == |
== Referential transparency == |
||
{{ |
{{Main|Referential transparency}} |
||
Absence of side effects is necessary but not sufficient for referential transparency. Referential transparency means that an expression (such as a function call) can be replaced with its value |
Absence of side effects is a necessary, but not sufficient, condition for referential transparency. Referential transparency means that an expression (such as a function call) can be replaced with its value. This requires that the expression is [[pure function|pure]], that is to say the expression must be [[deterministic algorithm|deterministic]] (always give the same [[value (computer science)|value]] for the same input) and side-effect free. |
||
== Temporal side effects == |
== Temporal side effects == |
||
Side effects |
Side effects caused by the time taken for an operation to execute are usually ignored when discussing side effects and referential transparency. There are some cases, such as with hardware timing or testing, where operations are inserted specifically for their temporal side effects e.g. <code>sleep(5000)</code> or <code>for (int i = 0; i < 10000; ++i) {}</code>. These instructions do not change state other than taking an amount of time to complete. |
||
== Idempotence == |
== Idempotence == |
||
{{Main|Idempotence#Computer science meaning}} |
|||
A side effect free function is always [[Idempotence|idempotent]] (by the computing definition, not the mathematical definition). |
|||
A [[subroutine]] with side effects is idempotent if multiple applications of the subroutine have the same effect on the system state as a single application, in other words if the function from the system state space to itself associated with the subroutine is idempotent in the [[Idempotence#Definition|mathematical sense]]. For instance, consider the following [[Python (programming language)|Python]] program: |
|||
<syntaxhighlight lang="python"> |
|||
x = 0 |
|||
def setx(n): |
|||
global x |
|||
x = n |
|||
setx(3) |
|||
assert x == 3 |
|||
setx(3) |
|||
assert x == 3 |
|||
</syntaxhighlight> |
|||
<code>setx</code> is idempotent because the second application of <code>setx</code> to 3 has the same effect on the system state as the first application: <code>x</code> was already set to 3 after the first application, and it is still set to 3 after the second application. |
|||
A [[pure function]] is idempotent if it is idempotent in the [[idempotence#Definition|mathematical sense]]. For instance, consider the following Python program: |
|||
<syntaxhighlight lang="python"> |
|||
def abs(n): |
|||
return -n if n < 0 else n |
|||
assert abs(abs(-3)) == abs(-3) |
|||
</syntaxhighlight> |
|||
<code>abs</code> is idempotent because the second application of <code>abs</code> to the return value of the first application to -3 returns the same value as the first application to -3. |
|||
== Example == |
|||
One common demonstration of side effect behavior is that of the [[assignment operator]] in [[C (programming language)|C]]. The assignment <code>a = b</code> is an expression that evaluates to the same value as the expression <code>b</code>, with the side effect of storing the [[value (computer science)#lrvalue|R-value]] of <code>b</code> into the [[value (computer science)#lrvalue|L-value]] of <code>a</code>. This allows multiple assignment: |
|||
<syntaxhighlight lang="c"> |
|||
a = (b = 3); // b = 3 evaluates to 3, which then gets assigned to a |
|||
</syntaxhighlight> |
|||
Because the operator [[operator associativity#Right-associativity of assignment operators|right associates]], this is equivalent to |
|||
<syntaxhighlight lang="c"> |
|||
a = b = 3; |
|||
</syntaxhighlight> |
|||
This presents a potential hangup for novice programmers who may confuse |
|||
<syntaxhighlight lang="c"> |
|||
while (b == 3) {} // tests if b evaluates to 3 |
|||
</syntaxhighlight> |
|||
with |
|||
<syntaxhighlight lang="c"> |
|||
while (b = 3) {} // b = 3 evaluates to 3, which then casts to true so the loop is infinite |
|||
</syntaxhighlight> |
|||
== See also == |
== See also == |
||
* [[Action at a distance (computer programming)]] |
|||
*[[Sequence point]] |
|||
* [[Don't-care term]] |
|||
* [[Sequence point]] |
|||
* [[Side-channel attack]] |
|||
* [[Undefined behaviour]] |
|||
* [[Unspecified behaviour]] |
|||
* [[Frame problem]] |
|||
== References == |
== References == |
||
{{Reflist|refs= |
|||
<ref name="Spuler-Sajeev_1994">{{cite book |title=Compiler Detection of Function Call Side Effects |author-last1=Spuler |author-first1=David A. |author-last2=Sajeev |author-first2=A. Sayed Muhammed |date=January 1994 |publisher=[[James Cook University]] |citeseerx=10.1.1.70.2096 |quote=The term Side effect refers to the modification of the nonlocal environment. Generally this happens when a function (or a procedure) modifies a global variable or arguments passed by reference parameters. But here are other ways in which the nonlocal environment can be modified. We consider the following causes of side effects through a function call: 1. Performing I/O. 2. Modifying global variables. 3. Modifying local permanent variables (like static variables in C). 4. Modifying an argument passed by reference. 5. Modifying a local variable, either automatic or static, of a function higher up in the function call sequence (usually via a pointer).}}</ref> |
|||
<ref name="Hughes_1990">{{cite book |title=Research Topics in Functional Programming |editor-first=David A. |editor-last=Turner |editor-link=David Turner (computer scientist) |publisher=[[Addison-Wesley]] |date=1990 |pages=17–42}} Via {{cite web |author-last=Hughes |author-first=John |title=Why Functional Programming Matters |url=http://www.cs.utexas.edu/~shmat/courses/cs345/whyfp.pdf |access-date=2022-08-06 |url-status=live |archive-url=https://web.archive.org/web/20220614042648/https://www.cs.utexas.edu/~shmat/courses/cs345/whyfp.pdf |archive-date=2022-06-14}}</ref> |
|||
<ref name="Collberg_2005">{{cite web |author-first=Christian S. |author-last=Collberg |date=2005-04-22 |title=CSc 520 Principles of Programming Languages |publisher=Department of Computer Science, [[University of Arizona]] |url=http://www.cs.arizona.edu/~collberg/Teaching/520/2005/Html/Html-24/index.html |access-date=2022-08-06 |archive-url=https://web.archive.org/web/20220806155136/https://www2.cs.arizona.edu/~collberg/Teaching/520/2005/Html/Html-24/index.html |archive-date=2022-08-06}}</ref> |
|||
<ref name="Felleisen_2014">{{cite web |author-first1=Matthias |author-last1=Felleisen |author-link1=Matthias Felleisen |author-first2=Robert Bruce |author-last2=Findler |author-first3=Matthew |author-last3=Flatt |author-first4=Shriram |author-last4=Krishnamurthi |author-link4=Shriram Krishnamurthi |title=How To Design Programs |publisher=[[MIT Press]] |date=2014-08-01 |edition=2 |url=http://www.htdp.org/}}</ref> |
|||
<ref name="Haskell_1998">{{cite web |title=Haskell 98 report |date=1998 |url=http://www.haskell.org}}</ref> |
|||
<ref name="Jones-Wadler_1993">{{cite conference |title=Imperative Functional Programming |author-first1=Simon Peyton |author-last1=Jones |author-first2=Phil |author-last2=Wadler |conference=Conference Record of the 20th Annual ACM Symposium on Principles of Programming Languages |pages=71–84 |date=1993}}</ref> |
|||
}} |
|||
{{Program analysis}} |
|||
<references/> |
|||
{{DEFAULTSORT:Side Effect (Computer Science)}} |
{{DEFAULTSORT:Side Effect (Computer Science)}} |
||
[[Category:Computer programming]] |
[[Category:Computer programming]] |
||
[[Category:Programming language theory]] |
|||
[[Category:Functional programming]] |
|||
[[cs:Vedlejší účinek]] |
|||
[[Category:Articles with example C code]] |
|||
[[de:Wirkung (Informatik)]] |
|||
[[Category:Articles with example Python (programming language) code]] |
|||
[[el:Παρενέργεια (υπολογιστές)]] |
|||
[[es:Efecto secundario (informática)]] |
|||
[[fr:Effet de bord (informatique)]] |
|||
[[ko:부작용 (전산학)]] |
|||
[[it:Effetto collaterale (informatica)]] |
|||
[[nl:Neveneffect]] |
|||
[[ja:副作用 (プログラム)]] |
|||
[[pl:Skutek uboczny (informatyka)]] |
|||
[[ru:Побочный эффект (программирование)]] |
|||
[[sv:Sidoeffekt (datorprogrammering)]] |
Latest revision as of 16:22, 16 November 2024
In computer science, an operation, function or expression is said to have a side effect if it has any observable effect other than its primary effect of reading the value of its arguments and returning a value to the invoker of the operation. Example side effects include modifying a non-local variable, a static local variable or a mutable argument passed by reference; raising errors or exceptions; performing I/O; or calling other functions with side-effects.[1] In the presence of side effects, a program's behaviour may depend on history; that is, the order of evaluation matters. Understanding and debugging a function with side effects requires knowledge about the context and its possible histories.[2][3] Side effects play an important role in the design and analysis of programming languages. The degree to which side effects are used depends on the programming paradigm. For example, imperative programming is commonly used to produce side effects, to update a system's state. By contrast, declarative programming is commonly used to report on the state of system, without side effects.
Functional programming aims to minimize or eliminate side effects. The lack of side effects makes it easier to do formal verification of a program. The functional language Haskell eliminates side effects such as I/O and other stateful computations by replacing them with monadic actions.[4][5] Functional languages such as Standard ML, Scheme and Scala do not restrict side effects, but it is customary for programmers to avoid them.[6]
Effect systems extend types to keep track of effects, permitting concise notation for functions with effects, while maintaining information about the extent and nature of side effects. In particular, functions without effects correspond to pure functions.
Assembly language programmers must be aware of hidden side effects—instructions that modify parts of the processor state which are not mentioned in the instruction's mnemonic. A classic example of a hidden side effect is an arithmetic instruction that implicitly modifies condition codes (a hidden side effect) while it explicitly modifies a register (the intended effect). One potential drawback of an instruction set with hidden side effects is that, if many instructions have side effects on a single piece of state, like condition codes, then the logic required to update that state sequentially may become a performance bottleneck. The problem is particularly acute on some processors designed with pipelining (since 1990) or with out-of-order execution. Such a processor may require additional control circuitry to detect hidden side effects and stall the pipeline if the next instruction depends on the results of those effects.
Referential transparency
[edit]Absence of side effects is a necessary, but not sufficient, condition for referential transparency. Referential transparency means that an expression (such as a function call) can be replaced with its value. This requires that the expression is pure, that is to say the expression must be deterministic (always give the same value for the same input) and side-effect free.
Temporal side effects
[edit]Side effects caused by the time taken for an operation to execute are usually ignored when discussing side effects and referential transparency. There are some cases, such as with hardware timing or testing, where operations are inserted specifically for their temporal side effects e.g. sleep(5000)
or for (int i = 0; i < 10000; ++i) {}
. These instructions do not change state other than taking an amount of time to complete.
Idempotence
[edit]A subroutine with side effects is idempotent if multiple applications of the subroutine have the same effect on the system state as a single application, in other words if the function from the system state space to itself associated with the subroutine is idempotent in the mathematical sense. For instance, consider the following Python program:
x = 0
def setx(n):
global x
x = n
setx(3)
assert x == 3
setx(3)
assert x == 3
setx
is idempotent because the second application of setx
to 3 has the same effect on the system state as the first application: x
was already set to 3 after the first application, and it is still set to 3 after the second application.
A pure function is idempotent if it is idempotent in the mathematical sense. For instance, consider the following Python program:
def abs(n):
return -n if n < 0 else n
assert abs(abs(-3)) == abs(-3)
abs
is idempotent because the second application of abs
to the return value of the first application to -3 returns the same value as the first application to -3.
Example
[edit]One common demonstration of side effect behavior is that of the assignment operator in C. The assignment a = b
is an expression that evaluates to the same value as the expression b
, with the side effect of storing the R-value of b
into the L-value of a
. This allows multiple assignment:
a = (b = 3); // b = 3 evaluates to 3, which then gets assigned to a
Because the operator right associates, this is equivalent to
a = b = 3;
This presents a potential hangup for novice programmers who may confuse
while (b == 3) {} // tests if b evaluates to 3
with
while (b = 3) {} // b = 3 evaluates to 3, which then casts to true so the loop is infinite
See also
[edit]- Action at a distance (computer programming)
- Don't-care term
- Sequence point
- Side-channel attack
- Undefined behaviour
- Unspecified behaviour
- Frame problem
References
[edit]- ^ Spuler, David A.; Sajeev, A. Sayed Muhammed (January 1994). Compiler Detection of Function Call Side Effects. James Cook University. CiteSeerX 10.1.1.70.2096.
The term Side effect refers to the modification of the nonlocal environment. Generally this happens when a function (or a procedure) modifies a global variable or arguments passed by reference parameters. But here are other ways in which the nonlocal environment can be modified. We consider the following causes of side effects through a function call: 1. Performing I/O. 2. Modifying global variables. 3. Modifying local permanent variables (like static variables in C). 4. Modifying an argument passed by reference. 5. Modifying a local variable, either automatic or static, of a function higher up in the function call sequence (usually via a pointer).
- ^ Turner, David A., ed. (1990). Research Topics in Functional Programming. Addison-Wesley. pp. 17–42. Via Hughes, John. "Why Functional Programming Matters" (PDF). Archived (PDF) from the original on 2022-06-14. Retrieved 2022-08-06.
- ^ Collberg, Christian S. (2005-04-22). "CSc 520 Principles of Programming Languages". Department of Computer Science, University of Arizona. Archived from the original on 2022-08-06. Retrieved 2022-08-06.
- ^ "Haskell 98 report". 1998.
- ^ Jones, Simon Peyton; Wadler, Phil (1993). Imperative Functional Programming. Conference Record of the 20th Annual ACM Symposium on Principles of Programming Languages. pp. 71–84.
- ^ Felleisen, Matthias; Findler, Robert Bruce; Flatt, Matthew; Krishnamurthi, Shriram (2014-08-01). "How To Design Programs" (2 ed.). MIT Press.