Jump to content

CAP theorem: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
leanxcale spammer
m add WP:TEMPLATECAT to remove from template; genfixes
 
(52 intermediate revisions by 35 users not shown)
Line 1: Line 1:
{{short description|Need to sacrifice consistency or availability in the presence of network partitions}}
{{short description|Need to sacrifice consistency or availability in the presence of network partitions}}
In [[theoretical computer science]], the '''CAP theorem''', also named '''Brewer's theorem''' after computer scientist [[Eric Brewer (scientist)|Eric Brewer]], states that any [[distributed data store]] can only provide [[Trilemma|two of the following three]] guarantees:<ref name="Gilbert Lynch">Seth Gilbert and Nancy Lynch, [http://dl.acm.org/citation.cfm?id=564601&CFID=609557487&CFTOKEN=15997970 "Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services"], ''ACM SIGACT News'', Volume 33 Issue 2 (2002), pg. 51–59. {{doi|10.1145/564585.564601}}.</ref><ref>[http://www.julianbrowne.com/article/viewer/brewers-cap-theorem "Brewer's CAP Theorem"], julianbrowne.com, Retrieved 02-Mar-2010</ref><ref>[https://www.royans.net/2010/02/brewers-cap-theorem-on-distributed.html "Brewers CAP theorem on distributed systems"], royans.net</ref>
In [[database theory]], the '''CAP theorem''', also named '''Brewer's theorem''' after computer scientist [[Eric Brewer (scientist)|Eric Brewer]], states that any [[distributed data store]] can provide only [[Trilemma|two of the following three]] guarantees:<ref name="Gilbert Lynch">{{cite journal | last1=Gilbert | first1=Seth | last2=Lynch | first2=Nancy | title=Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services | journal=ACM SIGACT News | publisher=Association for Computing Machinery (ACM) | volume=33 | issue=2 | year=2002 | issn=0163-5700 | doi=10.1145/564585.564601 | pages=51–59| s2cid=15892169 }}</ref><ref>{{cite web | title=Brewer's CAP Theorem | website=julianbrowne.com | date=2009-01-11 | url=https://www.julianbrowne.com/article/brewers-cap-theorem/}}</ref><ref>{{cite web | title=Brewers CAP Theorem on distributed systems | website=royans.net | date=2010-02-14 | url=https://www.royans.net/2010/02/brewers-cap-theorem-on-distributed.html}}</ref>
; [[Consistency model|Consistency]]: Every read receives the most recent write or an error. Note that consistency as defined in the CAP theorem is quite different from the consistency guaranteed in [[ACID]] [[database transaction]]s.<ref>{{cite web |last1=Liochon |first1=Nicolas |title=The confusing CAP and ACID wording |url=http://blog.thislongrun.com/2015/03/the-confusing-cap-and-acid-wording.html |access-date=1 February 2019 |website=This long run}}</ref>
; [[Consistency model|Consistency]]: Every read receives the most recent write or an error.
; [[Availability]]: Every request received by a non-failing node in the system must result in a response. This is the definition of availability in CAP theorem as defined by Gilbert and Lynch.<ref name="Gilbert Lynch"/> Note that availability as defined in CAP theorem is different from [[high availability]] in software architecture.<ref name="Fowler 2015">{{Cite book |last=Fowler |first=Adam |title=NoSQL For Dummies |publisher=For Dummies |year=2015 |isbn=978-8126554904}}</ref>
; [[Availability]]: Every request receives a (non-error) response, without the guarantee that it contains the most recent write.
; [[Network partitioning|Partition tolerance]]: The system continues to operate despite an arbitrary number of messages being dropped (or delayed) by the network between nodes.
; [[Network partitioning|Partition tolerance]]: The system continues to operate despite an arbitrary number of messages being dropped (or delayed) by the network between nodes.


When a network partition failure happens, it must be decided whether to
When a [[network partition]] failure happens, it must be decided whether to do one of the following:
* cancel the operation and thus decrease the availability but ensure consistency or to
* cancel the operation and thus decrease the availability but ensure consistency
* proceed with the operation and thus provide availability but risk inconsistency.
* proceed with the operation and thus provide availability but risk inconsistency. Note this doesn't necessarily mean that system is [[High availability|highly available]] to its users.<ref name="Fowler 2015"/>


[[File:CAP Theorem Venn Diagram.png|thumb|CAP theorem Euler diagram]]
Thus, if there is a network partition, one has to choose between consistency and availability. Note that consistency as defined in the CAP theorem is quite different from the consistency guaranteed in [[ACID]] [[database transaction]]s.<ref>{{cite web |last1=Liochon |first1=Nicolas |title=The confusing CAP and ACID wording |url=http://blog.thislongrun.com/2015/03/the-confusing-cap-and-acid-wording.html |website=This long run |access-date=1 February 2019}}</ref>


Thus, if there is a network partition, one has to choose between consistency or availability.
[[Eric Brewer (scientist)|Eric Brewer]] argues that the often-used "two out of three" concept can be somewhat misleading because system designers only need to sacrifice consistency or availability in the presence of partitions, but that in many systems partitions are rare.<ref name="Brewer2012" /><ref name=":0" />


== Explanation ==
== Explanation ==
No distributed system is safe from network failures, thus [[network partition]]ing generally has to be tolerated.<ref>{{cite journal |last1=Kleppmann |first1=Martin |title=A Critique of the CAP Theorem |date=2015-09-18 |doi=10.17863/CAM.13083 |url=https://www.repository.cam.ac.uk/handle/1810/267054 |access-date=24 November 2019|bibcode=2015arXiv150905393K |arxiv=1509.05393 |s2cid=1991487 }}</ref><ref>{{cite web |last1=Martin |first1=Kleppmann |title=Please stop calling databases CP or AP |url=https://martin.kleppmann.com/2015/05/11/please-stop-calling-databases-cp-or-ap.html |website=Martin Kleppmann's Blog |access-date=24 November 2019}}</ref> In the presence of a partition, one is then left with two options: consistency or [[availability]]. When choosing consistency over availability, the system will return an error or a time out if particular information cannot be guaranteed to be up to date due to network partitioning. When choosing availability over consistency, the system will always process the query and try to return the most recent available version of the information, even if it cannot guarantee it is up to date due to network partitioning.
No distributed system is safe from network failures, thus network partitioning generally has to be tolerated.<ref>{{cite report|last1=Kleppmann |first1=Martin |title=A Critique of the CAP Theorem |date=2015-09-18 |publisher=Apollo - University of Cambridge Repository |doi=10.17863/CAM.13083 |url=https://www.repository.cam.ac.uk/handle/1810/267054 |access-date=24 November 2019|bibcode=2015arXiv150905393K |arxiv=1509.05393 |s2cid=1991487 }}</ref><ref>{{cite web |last1=Martin |first1=Kleppmann |title=Please stop calling databases CP or AP |url=https://martin.kleppmann.com/2015/05/11/please-stop-calling-databases-cp-or-ap.html |website=Martin Kleppmann's Blog |access-date=24 November 2019}}</ref> In the presence of a partition, one is then left with two options: consistency or [[availability]]. When choosing consistency over availability, the system will return an error or a time out if particular information cannot be guaranteed to be up to date due to network partitioning. When choosing availability over consistency, the system will always process the query and try to return the most recent available version of the information, even if it cannot guarantee it is up to date due to network partitioning.


CAP is often misunderstood as a choice at all times of which one of the three guarantees to abandon. In fact, the choice is between consistency and availability only when a network partition or failure happens. When there is no network failure, both availability and consistency can be satisfied.<ref>{{Cite web|url=https://dzone.com/articles/better-explaining-cap-theorem|title=Better explaining the CAP Theorem|website=DZone Big Data|access-date=2016-09-02}}</ref><ref>{{Cite web|url=http://dbmsmusings.blogspot.com/2010/04/problems-with-cap-and-yahoos-little.html|title=DBMS Musings: Problems with CAP, and Yahoo's little known NoSQL system|last=Abadi|first=Daniel|date=2010-04-23|website=DBMS Musings|access-date=2018-01-23}}</ref>
In the absence of a partition, both availability and consistency can be satisfied.<ref name="paclec">{{Cite web|url=http://dbmsmusings.blogspot.com/2010/04/problems-with-cap-and-yahoos-little.html|title=DBMS Musings: Problems with CAP, and Yahoo's little known NoSQL system|last=Abadi|first=Daniel|date=2010-04-23|website=DBMS Musings|access-date=2018-01-23}}</ref>


Database systems designed with traditional [[ACID]] guarantees in mind such as [[Relational database management system|RDBMS]] choose [[Consistency (database systems)|consistency]] over availability, whereas systems designed around the [[Eventual consistency|BASE]] philosophy, common in the [[NoSQL]] movement for example, choose availability over consistency.<ref name="Brewer2012" />
Database systems designed with traditional [[ACID]] guarantees in mind such as [[Relational database management system|RDBMS]] choose [[Consistency (database systems)|consistency]] over availability, whereas systems designed around the [[Eventual consistency|BASE]] philosophy, common in the [[NoSQL]] movement for example, choose availability over consistency.<ref name="Brewer2012" />


Some cloud services choose strong consistency but use worldwide private fiber networks and GPS clock synchronization to minimize the frequency of network partitions{{citation needed|date=November 2024}}. Finally, consistent shared-nothing architectures may use techniques such as geographic sharding to maintain availability of data owned by the queried node, but without being available for arbitrary requests during a network partition{{citation needed|date=November 2024}}.
The [[PACELC theorem]] builds on CAP by stating that even in the absence of partitioning, there is another trade-off between latency and consistency.


== History ==
== History ==


According to [[University of California, Berkeley]] computer scientist [[Eric Brewer (scientist)|Eric Brewer]], the theorem first appeared in autumn 1998.<ref name="Brewer2012">Eric Brewer, [http://www.infoq.com/articles/cap-twelve-years-later-how-the-rules-have-changed "CAP twelve years later: How the 'rules' have changed"], ''Computer'', Volume 45, Issue 2 (2012), pg. 23–29. {{doi|10.1109/MC.2012.37}}.</ref> It was published as the CAP principle in 1999<ref name="Brewer1999">Armando Fox and Eric Brewer, "Harvest, Yield and Scalable Tolerant Systems", ''Proc. 7th Workshop Hot Topics in Operating Systems (HotOS 99)'', IEEE CS, 1999, pg. 174–178. {{DOI|10.1109/HOTOS.1999.798396}}</ref> and presented as a [[conjecture]] by Brewer at the 2000 [[Symposium on Principles of Distributed Computing]] (PODC).<ref name="Brewer2000">Eric Brewer, [http://www.cs.berkeley.edu/~brewer/cs262b-2004/PODC-keynote.pdf "Towards Robust Distributed Systems"]</ref> In 2002, [[Seth Gilbert]] and [[Nancy Lynch]] of [[MIT]] published a formal proof of Brewer's conjecture, rendering it a [[theorem]].<ref name="Gilbert Lynch"/>
According to computer scientist [[Eric Brewer (scientist)|Eric Brewer]] of the [[University of California, Berkeley]], the theorem first appeared in autumn 1998.<ref name="Brewer2012">{{cite journal | last=Brewer | first=Eric | title=CAP twelve years later: How the "rules" have changed | journal=Computer | publisher=Institute of Electrical and Electronics Engineers (IEEE) | volume=45 | issue=2 | year=2012 | issn=0018-9162 | doi=10.1109/mc.2012.37 | pages=23–29 | s2cid=890105 |url=http://www.infoq.com/articles/cap-twelve-years-later-how-the-rules-have-changed}}</ref> It was published as the CAP principle in 1999<ref name="Brewer1999">{{cite conference |author1=Armando Fox |author2=Eric Brewer |title=Harvest, Yield and Scalable Tolerant Systems |conference=Proc. 7th Workshop Hot Topics in Operating Systems (HotOS 99) |publisher=IEEE CS |year=1999 |pages=174–178 |doi=10.1109/HOTOS.1999.798396}}</ref> and presented as a [[conjecture]] by Brewer at the 2000 [[Symposium on Principles of Distributed Computing]] (PODC).<ref name="Brewer2000">{{cite web |author=Eric Brewer |url=http://www.cs.berkeley.edu/~brewer/cs262b-2004/PODC-keynote.pdf |title=Towards Robust Distributed Systems}}</ref> In 2002, [[Seth Gilbert]] and [[Nancy Lynch]] of [[MIT]] published a formal proof of Brewer's conjecture, rendering it a [[theorem]].<ref name="Gilbert Lynch"/>


In 2012, Brewer clarified some of his positions, including why the often-used "two out of three" concept can be somewhat misleading because system designers only need to sacrifice consistency or availability in the presence of partitions; partition management and recovery techniques exist. Brewer also noted the different definition of consistency used in the CAP theorem relative to the definition used in [[ACID]].<ref name="Brewer2012" /><ref name=":0">{{Cite web|last=Carpenter|first=Jeff|last2=Hewitt|first2=Eben|date=July 2016|title=Cassandra: The Definitive Guide, 2nd Edition [Book]|url=https://www.oreilly.com/library/view/cassandra-the-definitive/9781491933657/|url-status=live|archive-url=https://web.archive.org/web/20200807235405/https://www.oreilly.com/library/view/cassandra-the-definitive/9781491933657/ |archive-date=2020-08-07 |access-date=2020-12-21|website=www.oreilly.com|language=en|quote=In February 2012, Eric Brewer provided an updated perspective on his CAP theorem [..] Brewer now describes the “2 out of 3” axiom as somewhat misleading. He notes that designers only need sacrifice consistency or availability in the presence of partitions, and that advances in partition recovery techniques have made it possible for designers to achieve high levels of both consistency and availability.}}</ref>
In 2012, Brewer clarified some of his positions, including why the often-used "two out of three" concept can be somewhat misleading because system designers only need to sacrifice consistency or availability in the presence of partitions; partition management and recovery techniques exist. Brewer also noted the different definition of consistency used in the CAP theorem relative to the definition used in [[ACID]].<ref name="Brewer2012" /><ref name=":0">{{Cite book |last1=Carpenter |first1=Jeff |url=https://www.oreilly.com/library/view/cassandra-the-definitive/9781491933657/ |title=Cassandra: The Definitive Guide |last2=Hewitt |first2=Eben |date=July 2016 |publisher=O'Reilly Media |isbn=9781491933657 |edition=2nd |quote=In February 2012, Eric Brewer provided an updated perspective on his CAP theorem{{nbsp}}... Brewer now describes the “2 out of 3” axiom as somewhat misleading. He notes that designers only need sacrifice consistency or availability in the presence of partitions, and that advances in partition recovery techniques have made it possible for designers to achieve high levels of both consistency and availability.}}</ref>


A similar theorem stating the trade-off between consistency and availability in distributed systems was published by Birman and Friedman in 1996.<ref>Ken Birman and Roy Friedman, "[https://ecommons.cornell.edu/handle/1813/7235 Trading Consistency for Availability in Distributed Systems]", April 1996. {{hdl|1813/7235}}.</ref> Birman and Friedman's result restricted this lower bound to non-commuting operations.
A similar theorem stating the trade-off between consistency and availability in distributed systems was published by Birman and Friedman in 1996.<ref>{{cite web |author1=Ken Birman |author2=Roy Friedman |url=https://ecommons.cornell.edu/handle/1813/7235 |title=Trading Consistency for Availability in Distributed Systems |date=April 1996 |hdl=1813/7235}}</ref> Birman and Friedman's result restricted this lower bound to non-commuting operations.


The [[PACELC theorem]], introduced in 2010,<ref name="paclec" /> builds on CAP by stating that even in the absence of partitioning, there is another trade-off between latency and consistency. PACELC means, if partition (P) happens, the trade-off is between availability (A) and consistency (C); Else (E), the trade-off is between latency (L) and consistency (C). Some experts like Marc Brooker argue that the CAP theorem is particularly relevant in intermittently connected environments, such as those related to the [[Internet of things|Internet of Things]] (IoT) and [[mobile app]]lications. In these contexts, devices may become partitioned due to challenging physical conditions, such as [[power outage]]s or when entering confined spaces like elevators. For [[Distributed computing|distributed systems]], such as [[Cloud computing|cloud applications]], it is more appropriate to use the [[PACELC theorem]], which is more comprehensive and considers trade-offs such as [[Latency (engineering)|latency]] and [[Consistency (database systems)|consistency]] even in the absence of network partitions.<ref>{{Cite book |title=Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems |publisher=O'Reilly Media |isbn=978-1449373320}}</ref>
[[Blockchain|Blockchain technology]] sacrifices immediate consistency for availability and partition tolerance, by requiring a specific number of "confirmations", Blockchain consensus algorithms are basically reduced to eventual consistency. <ref>Bashir, Imran. (2018). ''Mastering blockchain''. Birmingham, England: Packt Publishing. p. 41. {{ISBN|978-1-78883-904-4}}.</ref>


== See also ==
== See also ==
* [[Fallacies of distributed computing]]
* [[Fallacies of distributed computing]]
* [[Lambda architecture]] (solution)
* [[PACELC theorem]]
* [[PACELC theorem]]
* [[Paxos (computer science)]]
* [[Paxos (computer science)]]
* [[Raft (computer science)]]
* [[Raft (computer science)]]
* [[Zooko's triangle]]
* [[Zooko's triangle]]
* [[Inconsistent triad]]
* [[Trilemma]]


== References ==
== References ==
{{reflist}}
{{reflist}}


== External links ==
* [http://www.infoq.com/articles/cap-twelve-years-later-how-the-rules-have-changed CAP Twelve Years Later: How the "Rules" Have Changed] Brewer's 2012 article on CRDTs (conflict free replicated data types)
* [https://research.google.com/pubs/pub45855.html Spanner, TrueTime and the CAP Theorem]
* [https://www.cl.cam.ac.uk/research/dtg/www/files/publications/public/mk428/cap-critique.pdf A Critique of the CAP Theorem]
* [https://martin.kleppmann.com/2015/05/11/please-stop-calling-databases-cp-or-ap.html Please stop calling databases CP or AP] Kleppmann's 2015 blog post corresponding with the publication of "A Critique of the CAP Theorem"
{{Databases}}
{{Databases}}

{{DEFAULTSORT:Cap Theorem}}
{{DEFAULTSORT:Cap Theorem}}
[[Category:Distributed computing]]
[[Category:Distributed computing]]
[[Category:Database management systems]]

Latest revision as of 20:33, 5 December 2024

In database theory, the CAP theorem, also named Brewer's theorem after computer scientist Eric Brewer, states that any distributed data store can provide only two of the following three guarantees:[1][2][3]

Consistency
Every read receives the most recent write or an error. Note that consistency as defined in the CAP theorem is quite different from the consistency guaranteed in ACID database transactions.[4]
Availability
Every request received by a non-failing node in the system must result in a response. This is the definition of availability in CAP theorem as defined by Gilbert and Lynch.[1] Note that availability as defined in CAP theorem is different from high availability in software architecture.[5]
Partition tolerance
The system continues to operate despite an arbitrary number of messages being dropped (or delayed) by the network between nodes.

When a network partition failure happens, it must be decided whether to do one of the following:

  • cancel the operation and thus decrease the availability but ensure consistency
  • proceed with the operation and thus provide availability but risk inconsistency. Note this doesn't necessarily mean that system is highly available to its users.[5]
CAP theorem Euler diagram

Thus, if there is a network partition, one has to choose between consistency or availability.

Explanation

[edit]

No distributed system is safe from network failures, thus network partitioning generally has to be tolerated.[6][7] In the presence of a partition, one is then left with two options: consistency or availability. When choosing consistency over availability, the system will return an error or a time out if particular information cannot be guaranteed to be up to date due to network partitioning. When choosing availability over consistency, the system will always process the query and try to return the most recent available version of the information, even if it cannot guarantee it is up to date due to network partitioning.

In the absence of a partition, both availability and consistency can be satisfied.[8]

Database systems designed with traditional ACID guarantees in mind such as RDBMS choose consistency over availability, whereas systems designed around the BASE philosophy, common in the NoSQL movement for example, choose availability over consistency.[9]

Some cloud services choose strong consistency but use worldwide private fiber networks and GPS clock synchronization to minimize the frequency of network partitions[citation needed]. Finally, consistent shared-nothing architectures may use techniques such as geographic sharding to maintain availability of data owned by the queried node, but without being available for arbitrary requests during a network partition[citation needed].

History

[edit]

According to computer scientist Eric Brewer of the University of California, Berkeley, the theorem first appeared in autumn 1998.[9] It was published as the CAP principle in 1999[10] and presented as a conjecture by Brewer at the 2000 Symposium on Principles of Distributed Computing (PODC).[11] In 2002, Seth Gilbert and Nancy Lynch of MIT published a formal proof of Brewer's conjecture, rendering it a theorem.[1]

In 2012, Brewer clarified some of his positions, including why the often-used "two out of three" concept can be somewhat misleading because system designers only need to sacrifice consistency or availability in the presence of partitions; partition management and recovery techniques exist. Brewer also noted the different definition of consistency used in the CAP theorem relative to the definition used in ACID.[9][12]

A similar theorem stating the trade-off between consistency and availability in distributed systems was published by Birman and Friedman in 1996.[13] Birman and Friedman's result restricted this lower bound to non-commuting operations.

The PACELC theorem, introduced in 2010,[8] builds on CAP by stating that even in the absence of partitioning, there is another trade-off between latency and consistency. PACELC means, if partition (P) happens, the trade-off is between availability (A) and consistency (C); Else (E), the trade-off is between latency (L) and consistency (C). Some experts like Marc Brooker argue that the CAP theorem is particularly relevant in intermittently connected environments, such as those related to the Internet of Things (IoT) and mobile applications. In these contexts, devices may become partitioned due to challenging physical conditions, such as power outages or when entering confined spaces like elevators. For distributed systems, such as cloud applications, it is more appropriate to use the PACELC theorem, which is more comprehensive and considers trade-offs such as latency and consistency even in the absence of network partitions.[14]

See also

[edit]

References

[edit]
  1. ^ a b c Gilbert, Seth; Lynch, Nancy (2002). "Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services". ACM SIGACT News. 33 (2). Association for Computing Machinery (ACM): 51–59. doi:10.1145/564585.564601. ISSN 0163-5700. S2CID 15892169.
  2. ^ "Brewer's CAP Theorem". julianbrowne.com. 2009-01-11.
  3. ^ "Brewers CAP Theorem on distributed systems". royans.net. 2010-02-14.
  4. ^ Liochon, Nicolas. "The confusing CAP and ACID wording". This long run. Retrieved 1 February 2019.
  5. ^ a b Fowler, Adam (2015). NoSQL For Dummies. For Dummies. ISBN 978-8126554904.
  6. ^ Kleppmann, Martin (2015-09-18). A Critique of the CAP Theorem (Report). Apollo - University of Cambridge Repository. arXiv:1509.05393. Bibcode:2015arXiv150905393K. doi:10.17863/CAM.13083. S2CID 1991487. Retrieved 24 November 2019.
  7. ^ Martin, Kleppmann. "Please stop calling databases CP or AP". Martin Kleppmann's Blog. Retrieved 24 November 2019.
  8. ^ a b Abadi, Daniel (2010-04-23). "DBMS Musings: Problems with CAP, and Yahoo's little known NoSQL system". DBMS Musings. Retrieved 2018-01-23.
  9. ^ a b c Brewer, Eric (2012). "CAP twelve years later: How the "rules" have changed". Computer. 45 (2). Institute of Electrical and Electronics Engineers (IEEE): 23–29. doi:10.1109/mc.2012.37. ISSN 0018-9162. S2CID 890105.
  10. ^ Armando Fox; Eric Brewer (1999). Harvest, Yield and Scalable Tolerant Systems. Proc. 7th Workshop Hot Topics in Operating Systems (HotOS 99). IEEE CS. pp. 174–178. doi:10.1109/HOTOS.1999.798396.
  11. ^ Eric Brewer. "Towards Robust Distributed Systems" (PDF).
  12. ^ Carpenter, Jeff; Hewitt, Eben (July 2016). Cassandra: The Definitive Guide (2nd ed.). O'Reilly Media. ISBN 9781491933657. In February 2012, Eric Brewer provided an updated perspective on his CAP theorem ... Brewer now describes the "2 out of 3" axiom as somewhat misleading. He notes that designers only need sacrifice consistency or availability in the presence of partitions, and that advances in partition recovery techniques have made it possible for designers to achieve high levels of both consistency and availability.
  13. ^ Ken Birman; Roy Friedman (April 1996). "Trading Consistency for Availability in Distributed Systems". hdl:1813/7235.
  14. ^ Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems. O'Reilly Media. ISBN 978-1449373320.