Futures and promises
This article is written like a personal reflection, personal essay, or argumentative essay that states a Wikipedia editor's personal feelings or presents an original argument about a topic. (November 2010) |
In computer science, future, promise, and delay refer to constructs used for synchronization in some concurrent programming languages. They describe an object that acts as a proxy for a result that is initially not known, usually because the computation of its value has not yet completed.
The term "promise" was proposed in 1976 by Daniel P. Friedman and David Wise,[1] and Peter Hibbard called it "eventual".[2] A somewhat similar concept "future" was introduced in 1977 in a paper by Henry Baker and Carl Hewitt. [3]
The terms "future", "promise", and "delay" are often used interchangeably, although some differences in usage between "future" and "promise" are discussed below. Setting the value of a future is also called "resolving", "fulfilling", or "binding" it.
Implicit vs explicit
Use of futures may be implicit (any use of the future automatically obtains its value, as if it were an ordinary reference) or explicit (the user must call a function to obtain the value, such as the get method of java.util.concurrent.Future in Java). Obtaining the value of an explicit future can be called "stinging" or "forcing". Explicit futures may be implemented as a library, whereas implicit futures require language support.
The original Baker and Hewitt paper described implicit futures, which are naturally supported in the Actor model of computation and pure object-oriented programming languages like Smalltalk. The Friedman and Wise paper described only explicit futures, probably reflecting the difficulty of efficiently implementing implicit futures on stock hardware. The difficulty is that stock hardware does not deal with futures for primitive data types like integers. For example, an add instruction does not know how to deal with 3 + future factorial(100000). In pure object or Actor languages this problem can be solved by sending future factorial(100000) the message +[3], which asks the future to add 3 to itself and return the result. Note that the message passing approach works regardless of when factorial(100000) finishes computation and that no stinging/forcing is required.
Promise pipelining
The use of futures can dramatically reduce latency in distributed systems. For instance, futures enable promise pipelining,[4][5] as implemented in the E and Joule programming languages, which was also called call-stream in the Argus programming language.
Consider an expression involving conventional remote procedure calls, such as:
t3 := ( x.a() ).c( y.b() )
which could be expanded to
t1 := x.a();
t2 := y.b();
t3 := t1.c(t2);
Each statement requires a message to be sent and a reply received before the next statement can proceed. Suppose, for example, that x, y, t1, and t2 are all located on the same remote machine. In this case, two complete network round-trips to that machine must take place before the third statement can begin to execute. The third statement will then cause yet another round-trip to the same remote machine.
Using futures, the above expression could be written
t3 := (x <- a()) <- c(y <- b())
which could be expanded to
t1 := x <- a();
t2 := y <- b();
t3 := t1 <- c(t2);
The syntax used here is that of the E programming language, where x <- a() means to send the message a() asynchronously to x. All three variables are immediately assigned futures for their results, and execution proceeds to subsequent statements. Later attempts to resolve the value of t3 may cause a delay; however, pipelining can reduce the number of round-trips required. If, as in the previous example, x, y, t1, and t2 are all located on the same remote machine, a pipelined implementation can compute t3 with one round-trip instead of three. Because all three messages are destined for objects which are on the same remote machine, only one request need be sent and only one response need be received containing the result. Note also that the send t1 <- c(t2) would not block even if t1 and t2 were on different machines to each other, or to x or y.
Promise pipelining should be distinguished from parallel asynchronous message passing. In a system supporting parallel message passing but not pipelining, the message sends x <- a() and y <- b() in the above example could proceed in parallel, but the send of t1 <- c(t2) would have to wait until both t1 and t2 had been received, even when x, y, t1, and t2 are on the same remote machine. The relative latency advantage of pipelining becomes even greater in more complicated situations involving many messages.
Promise pipelining also should not be confused with pipelined message processing in Actor systems, where it is possible for an actor to specify and begin executing a behaviour for the next message before having completed processing of the current message.
Read-only views
In some programming languages such as Oz, E, and AmbientTalk, it is possible to obtain a "read-only view" of a future, which allows reading its value when resolved, but does not permit resolving it:
- In Oz, the !! operator is used to obtain a read-only view.
- In E and AmbientTalk, a future is represented by a pair of values called a "promise/resolver pair". The promise represents the read-only view, and the resolver is required in order to set the future's value.
- In C++11 a
std::future
provides a read-only view. The value is set directly by using astd::promise
, or set to the result of a function call usingstd::packaged_task
orstd::async
. - In the Dojo Toolkit's Deferred API as of version 1.5, a "consumer-only promise object" represents a read-only view.[6]
- In Alice ML, futures only provide a "read-only view", whereas a promise contains both a future and the ability to resolve the future[7][8]
Support for read-only views is consistent with the Principle of Least Authority, since it enables the ability to set the value to be restricted to subjects that need to set it. In a system that also supports pipelining, the sender of an asynchronous message (with result) receives the read-only promise for the result, and the target of the message receives the resolver.
Thread-specific futures
Some languages, such as Alice ML, define futures that are associated with a specific thread that computes the future's value.[8] This computation may be started either eagerly when the future is created, or lazily when its value is first needed. A lazy future is similar to a thunk (in the sense of a delayed computation).
Alice ML also supports futures that can be resolved by any thread, and calls these "promises".[7] Note that this usage of "promise" is different from its usage in E as described above: an Alice promise is not a read-only view, and Alice also does not support pipelining for promises themselves. Instead, pipelining naturally happens for futures (including ones associated with promises).
Blocking vs non-blocking semantics
If the value of a future is accessed asynchronously, for example by sending a message to it, or by explicitly waiting for it using a construct such as when
in E, then there is no difficulty in delaying until the future is resolved before the message can be received or the wait completes. This is the only case to be considered in purely asynchronous systems such as pure Actor languages.
However, in some systems it may also be possible to attempt to "immediately" or "synchronously" access a future's value. Then there is a design choice to be made:
- the access could block the current thread or process until the future is resolved (possibly with a timeout). This is the semantics of dataflow variables in the Oz programming language.
- the attempted synchronous access could always signal an error, for example throwing an exception. This is the semantics of remote promises in E.[9]
- potentially, the access could succeed if the future is already resolved, but signal an error if it is not. This would have the disadvantage of introducing nondeterminism and the potential for race conditions, and does not appear to be a common design choice.
As an example of the first possibility, in C++0x, a thread that needs the value of a future can block until it is available by calling the wait()
or get()
member functions. You can also specify a timeout on the wait using the wait_for()
or wait_until()
member functions to avoid indefinite blocking. If the future arose from a call to std::async
then a blocking wait (without a timeout) may cause synchronous invocation of the function to compute the result on the waiting thread.
Related constructs
An I-var (as in the Id programming language) is a future with blocking semantics as defined above. An I-structure is a data structure containing I-vars. A related synchronization construct that can be set multiple times with different values is called an M-var. M-vars support atomic operations to "take" or "put" the current value, where taking the value also sets the M-var back to its initial "empty" state.[10]
A concurrent logic variable is similar to a future, but is updated by unification, in the same way as logic variables in logic programming. Thus it can be bound more than once to unifiable values (but cannot be set back to an empty or unresolved state). The dataflow variables of Oz act as concurrent logic variables, and also have blocking semantics as mentioned above.
A concurrent constraint variable is a generalization of concurrent logic variables to support constraint logic programming: the constraint may be narrowed multiple times, indicating smaller sets of possible values. Typically there is a way to specify a thunk that should be run whenever the constraint is narrowed further; this is necessary to support constraint propagation.
Relations between the expressiveness of different forms of future
Eager thread-specific futures can be straightforwardly implemented in terms of non-thread-specific futures, by creating a thread to calculate the value at the same time as creating the future. In this case it is desirable to return a read-only view to the client, so that only the newly created thread is able to resolve this future.
To implement implicit lazy thread-specific futures (as provided by Alice ML, for example) in terms in non-thread-specific futures, requires a mechanism to determine when the future's value is first needed (for example, the WaitNeeded construct in Oz[11]). If all values are objects, then the ability to implement transparent forwarding objects is sufficient, since the first message sent to the forwarder indicates that the future's value is needed.
Non-thread-specific futures can be implemented in terms of thread-specific futures, assuming that the system supports message passing, by having the resolving thread send a message to the future's own thread. However, this could be argued to be unnecessary complexity: in programming languages based on threads, the most expressive approach appears to be to provide a combination of non-thread-specific futures, read-only views, and either a 'WaitNeeded' construct or support for transparent forwarding.
Relation to lazy evaluation
Lazy futures, where the computation of the future's value starts when the value is first needed, are closely related to lazy evaluation. However, the term lazy evaluation is most often used to refer to an evaluation strategy for all computation in a language, whereas lazy futures represent specific values that are computed lazily, even in a language where computation is normally strict or eager. In C++0x such lazy futures can be created by passing the std::launch::sync
launch policy to std::async
, along with the function to compute the value.
Semantics of futures in the Actor model
In the Actor model, an expression of the form future <Expression> is defined by how it responds to an Eval message with environment E and customer C as follows: The future expression responds to the Eval message by sending the customer C a newly created actor F (the proxy for the response of evaluating <Expression>) as a return value concurrently with sending <Expression> an Eval message with environment E and customer F. The default behavior of F is as follows:
- When F receives a request R, then it checks to see if it has already received a response (that can either be a return value or a thrown exception) from evaluating <Expression> proceeding as follows:
- 1) If it already has a response V, then
- If V is a return value, then it is sent the request R.
- If V is an exception, then it is thrown to the customer of the request R.
- 2) If it does not already have a response, then R is stored in the queue of requests inside the F.
- When F receives the response V from evaluating <Expression>, then V is stored in F and
- If V is a return value, then all of the queued requests are sent to V.
- If V is an exception, then it is thrown to the customer of the each queued request.
However, some futures can deal with requests in special ways to provide greater parallelism. For example, the expression 1 + future factorial(n) can create a new future that will behave like the number 1+factorial(n). This trick does not always work. For example the following conditional expression:
- if m>future factorial(n) then print("bigger") else print("smaller")
suspends until the future for factorial(n) has responded to the request asking if m is greater than itself.
History
The future and/or promise constructs were first implemented in programming languages such as MultiLisp and Act 1. The use of logic variables for communication in concurrent logic programming languages was quite similar to futures. These started with "Prolog with Freeze" and "IC Prolog", and became a true concurrency primitive with Relational Language, Concurrent Prolog, Guarded Horn Clauses (GHC), Parlog, Vulcan, Janus, Mozart/Oz, Flow Java, and Alice ML. The single-assignment "I-var" from dataflow programming languages, originating in Id and included in Reppy's "Concurrent ML", is much like the concurrent logic variable.
The promise pipelining technique (using futures to overcome latency) was invented by Barbara Liskov and Liuba Shrira in 1988,[12] and independently by Mark S. Miller, Dean Tribble and Rob Jellinghaus in the context of Project Xanadu circa 1989.[13]
The term "promise" was coined by Liskov and Shrira, although they referred to the pipelining mechanism by the name "call-stream", which is now rarely used.
Both the design described in Liskov and Shrira's paper, and the implementation of promise pipelining in Xanadu, had the limitation that promise values were not first-class: an argument to, or the value returned by a call or send could not directly be a promise (so the example of promise pipelining given earlier, which uses a promise for the result of one send as an argument to another, would not have been directly expressible in the call-stream design or in the Xanadu implementation). It appears that promises and call-streams were never implemented in any public release of Argus[14] (the programming language used in the Liskov and Shrira paper); Argus development stopped around 1988.[15] The Xanadu implementation of promise pipelining only became publicly available with the release of the source code for Udanax Gold[16] in 1999, and was never explained in any published document.[17] The later implementations in Joule and E support fully first-class promises and resolvers.
Several early Actor languages, including the Act series of languages,[18][19] supported both parallel message passing and pipelined message processing, but not promise pipelining. (Although it is technically possible to implement the last of these features in terms of the first two, there is no evidence that the Act languages did so.)
List of implementations
Languages supporting futures, promises, concurrent logic variables, dataflow variables, or I-vars include:
- Id (I-vars and M-vars only)
- Glasgow Haskell (I-vars and M-vars only)
- Alice ML
- Clojure[20]
- Io[21]
- Oz version 3[22]
- Oxygene
- MultiLisp
- ABCL/f [23]
- Lucid (dataflow only)
- Scheme
- Racket[24]
- AmbientTalk (including first-class resolvers and read-only promises)
- R (promises for lazy evaluation - still single threaded)
- Flow Java
- μC++
- Python 3.2, as proposed by the PEP 3148
- Proposed for C# 5 (through the await keyword)
- Dart
Languages also supporting promise pipelining include:
Library-based implementations of futures:
- C++11 (new revision of the C++ standard), the Boost library and the Dlib C++ Library
- The Dojo Toolkit supplies both promises and Twisted style Deferreds for JavaScript.
- The Parallel Extensions library for C#
- OCaml's Lazy module implements lazy explicit futures.
- pythonfutures and Twisted's Deferreds in Python
- The Promise gem in Ruby
- Sub::Future and Reflex for Perl
- Eager Future2 and PCall for Common Lisp
References
- ^ Friedman, Daniel (1976). "The Impact of Applicative Programming on Multiprocessing". International Conference on Parallel Processing, pp. 263-272.
{{cite conference}}
: Unknown parameter|booktitle=
ignored (|book-title=
suggested) (help); Unknown parameter|coauthor=
ignored (|author=
suggested) (help) - ^ Hibbard, Peter (1976). "Parallel Processing Facilities". New Directions in Algorithmic Languages, (ed.) Stephen A. Schuman, IRIA, 1976.
{{cite conference}}
: Unknown parameter|booktitle=
ignored (|book-title=
suggested) (help) - ^ Henry Baker and Carl Hewitt (1977). "The Incremental Garbage Collection of Processes". Proceedings of the Symposium on Artificial Intelligence Programming Languages, SIGPLAN Notices 12.
{{cite conference}}
: Unknown parameter|booktitle=
ignored (|book-title=
suggested) (help); Unknown parameter|month=
ignored (help) - ^ Promise Pipelining at erights.org
- ^ Promise Pipelining on the C2 wiki
- ^ Robust promises with Dojo deferred, Site Pen, 2010-05-03.
- ^ a b "Promise", Alice Manual, DE: Uni-SB.
- ^ a b "Future", Alice manual, DE: Uni-SB.
- ^ Promise, E rights.
- ^ Control Concurrent MVar, Haskell.
- ^ WaitNeeded, Mozart Oz.
- ^ Barbara Liskov and Liuba Shrira (1988). "Promises: Linguistic Support for Efficient Asynchronous Procedure Calls in Distributed Systems". Proceedings of the SIGPLAN '88 Conference on Programming Language Design and Implementation; Atlanta, Georgia, United States, pp. 260–267. ISBN 0-89791-269-1, published by ACM. Also published in ACM SIGPLAN Notices, Volume 23, Issue 7, July 1988. doi:10.1145/53990.54016.
{{cite journal}}
: Cite journal requires|journal=
(help) - ^ Promise, Sunless Sea, archived from the original on October 23, 2007.
- ^ Argus, MIT.
- ^ Liskov, Barbara, Dsitributed computing and Argus, Oral history, IEEE GHN.
- ^ Gold, Udanax.
- ^ Pipeline, E rights.
- ^ Henry Lieberman (1981). "A Preview of Act 1". MIT AI memo 625.
{{cite journal}}
: Cite journal requires|journal=
(help); Unknown parameter|month=
ignored (help) - ^ Henry Lieberman (1981). "Thinking About Lots of Things at Once without Getting Confused: Parallelism in Act 1". MIT AI memo 626.
{{cite journal}}
: Cite journal requires|journal=
(help); Unknown parameter|month=
ignored (help) - ^ Rich Hickey (2009). "changes.txt at 1.1.x from richhickey's clojure".
- ^ Steve Dekorte (2005). "Io, The Programming Language".
- ^ Seif Haridi. "Tutorial of Oz". MOzart Global User Library. Retrieved 12 April 2011.
{{cite web}}
: Unknown parameter|coauthors=
ignored (|author=
suggested) (help) - ^ Kenjiro Taura, Satoshi Matsuoka, and Akinori Yonezawa (1994). "ABCL/f: A Future-Based Polymorphic Typed Concurrent Object-Oriented Language -- Its Design and Implementation.". In Proceedings of the DIMACS workshop on Specification of Parallel Algorithms, number 18 in Dimacs Series in Discrete Mathematics and Theoretical Computer Science. American Mathematical Society. pp. 275–292. doi:10.1.1.23.1161.
{{cite conference}}
: Check|doi=
value (help); Unknown parameter|booktitle=
ignored (|book-title=
suggested) (help)CS1 maint: multiple names: authors list (link) - ^ "Parallelism with Futures". PLT. Retrieved 02 March 2012.
{{cite web}}
: Check date values in:|accessdate=
(help); Cite has empty unknown parameter:|coauthors=
(help)
External links
- "Composable Futures with Akka 2.0", featuring Java and Scala code examples
- Future Value and Promise Pipelining at the Portland Pattern Repository
- Easy Threading with Futures in Python