Jump to content

Kubernetes

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Rckrd (talk | contribs) at 19:56, 25 March 2017. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Kubernetes
Initial release6 June 2014 (2014-06-06)[1]
Stable release
1.5.5[2] / March 7, 2017 (2017-03-07)
Repository
Written inGo
Operating systemCross-platform
TypeCluster management software
LicenseApache License 2.0
Websitekubernetes.io

Kubernetes (commonly referred to as "K8s") is an open source container cluster manager originally designed by Google and donated to the Cloud Native Computing Foundation. It aims to provide a "platform for automating deployment, scaling, and operations of application containers across clusters of hosts".[3] It usually works with the Docker container tool and coordinates between a wide cluster of hosts running Docker.

History

Kubernetes (from κυβερνήτης: Greek for "helmsman" or "pilot") was founded by Joe Beda, Brendan Burns and Craig McLuckie,[4] was quickly joined by other Google engineers including Brian Grant and Tim Hockin, and was first announced by Google in mid-2014.[5] Its development and design are heavily influenced by Google's Borg system,[6][7] and many of the top contributors to the project previously worked on Borg. The original name for Kubernetes within Google was project Seven of Nine, a reference to a Star Trek character that is a 'friendlier' Borg.[8] After Google's lawyers rebuked McLuckie, Burns and Beda for the Project Seven Name, McLuckie came up with the Kubernetes name.[citation needed] The seven spokes on the wheel of the Kubernetes logo is a small acknowledgment of Kubernetes' original name.

Kubernetes v1.0 was released on July 21, 2015.[9] Along with the Kubernetes v1.0 release, Google partnered with the Linux Foundation to form the Cloud Native Computing Foundation (CNCF)[10] and offered Kubernetes as a seed technology.

It is also being used by Red Hat for its OpenShift product [11][12] and CoreOS for its Tectonic product.

Design

Kubernetes defines a set of building blocks ("primitives") which collectively provide mechanisms for deploying, maintaining, and scaling applications. The components which make up Kubernetes are designed to be loosely coupled and extensible so that it can meet a wide variety of different workloads. The extensibility is provided in large part by the Kubernetes API, which is used by internal components as well as extensions and containers running on Kubernetes.[13]

Pods

The basic scheduling unit in Kubernetes is called a "pod". It adds a higher level of abstraction to containerized components. A pod consists of one or more containers that are guaranteed to be co-located on the host machine and can share resources.[13] Each pod in Kubernetes is assigned a unique (within the cluster) IP address, which allows applications to use ports without the risk of conflict.[14] A pod can define a volume, such as a local disk directory or a network disk, and expose it to the containers in the pod.[15] Pods can be manually managed through the Kubernetes API, or their management can be delegated to a controller.[13]

Labels and selectors

Kubernetes enables clients (users or internal components) to attach key-value pairs called "labels" to any API object in the system, such as pods and nodes[definition needed]. Correspondingly, "label selectors" are queries against labels that resolve to matching objects.[13]

Labels and selectors are the primary grouping mechanism in Kubernetes, and are used to determine the components to which to apply an operation.[16]

For example, if the Pods of an application have labels for a system tier ("front-end", "back-end", for example) and a release_track ("canary", "production", for example), then an operation on all of the "back-end" and "canary" nodes could use a label selector such as the following:[17]

tier=back-end AND release_track=canary

Controllers

A controller is a reconciliation loop that drives actual cluster state toward the desired cluster state.[18] It does this by managing a set of pods. One kind of controller is a “Replication Controller”, which handles replication and scaling by running a specified number of copies of a pod across the cluster. It also handles creating replacement pods if the underlying node fails.[18] Other controllers that are part of the core Kubernetes system include a “DaemonSet Controller” for running exactly one pod on every machine (or some subset of machines), and a “Job Controller” for running pods that run to completion, e.g. as part of a batch job.[19] The set of pods that a controller manages is determined by label selectors that are part of the controller’s definition.[17]

Services

A Kubernetes service is a set of pods that work together, such as one tier of a multi-tier application. The set of pods that constitute a service are defined by a label selector.[13] Kubernetes provides service discovery and request routing by assigning a stable IP address and DNS name to the service, and load balances traffic in a round-robin manner to network connections of that IP address among the pods matching the selector (even as failures cause the pods to move from machine to machine).[14] By default a service is exposed inside a cluster (e.g. back end pods might be grouped into a service, with requests from the front-end pods load-balanced among them), but a service can also be exposed outside a cluster (e.g. for clients to reach frontend pods)[20]

Architecture

Kubernetes architecture diagram

Kubernetes follows the master-slave architecture.The components of Kubernetes can be divided into those that manage an individual node and those that are part of the control plane.[13][21]

Kubernetes control plane

The Kubernetes Master is the main controlling unit of the cluster that manages its workload and directs communication across the system. The Kubernetes control plane consists of various components, each its own process, that can run both on a single master node or on multiple masters supporting high-availability clusters.[21] The various components of Kubernetes control plane are as follows:

etcd

etcd is a persistent lightweight, distributed key-value data store developed by CoreOS that reliably stores the configuration data of the cluster, representing the overall state of the cluster at any given point of time. Other components watch for changes to this store to bring themselves into the desired state.[21]

API server

The API server is a key component and serves the Kubernetes API using JSON over HTTP, which provides both the internal and external interface to Kubernetes.[13][22] The API server processes and validates REST requests and updates state of the API objects in etcd, thereby allowing clients to configure workloads and containers across Worker nodes.

Scheduler

The scheduler is the pluggable component that selects which node an unscheduled pod should run on based on resource availability. Scheduler tracks resource utilization on each node to ensure that workload is not scheduled in excess of the available resources. For this purpose, the scheduler must know the resource availability and their existing workloads assigned across servers.

Controller manager

The controller manager is the process that the core Kubernetes controllers like DaemonSet Controller and Replication Controller run in. The controllers communicate with the API server to create, update and delete the resources they manage (pods, service endpoints, etc.)[22]

Kubernetes node

The Node also known as Worker or Minion is the single machine (or virtual machine) where containers(workloads) are deployed. Every node in the cluster must run the container runtime (such as Docker), as well as the below mentioned components, for communication with master for network configuration of these containers.

Kubelet

Kubelet is responsible for the running state of each node (that is, ensuring that all containers on the node are healthy). It takes care of starting, stopping, and maintaining application containers (organized into pods) as directed by the control plane.[13][23]

Kubelet monitors the state of a pod and if not in the desired state, the pod will be redeployed to the same node. The node status is relayed every few seconds via heartbeat messages to the master. Once the master detects a node failure, the Replication Controller observes this state change and launches pods on other healthy nodes.[citation needed]

Kube-proxy

The kube-proxy is an implementation of a network proxy and a load balancer, and it supports the service abstraction along with other networking operation.[13] It is responsible for routing traffic to the appropriate container based on IP and port number of the incoming request.

cAdvisor

cAdvisor is an agent that monitors and gathers resource usage and performance metrics such as CPU, memory, file and network usage of containers on each node.

References

  1. ^ "First GitHub commit for Kubernetes". github.com. 2014-06-07.
  2. ^ "GitHub Releases page". github.com. 2017-03-21.
  3. ^ "What is Kubernetes". Kubernetes. Retrieved 28 August 2015.
  4. ^ "Google Made Its Secret Blueprint Public to Boost Its Cloud". Retrieved 2016-06-27.
  5. ^ "Google Open Sources Its Secret Weapon in Cloud Computing". Wired. Retrieved 24 September 2015.
  6. ^ Abhishek Verma; Luis Pedrosa; Madhukar R. Korupolu; David Oppenheimer; Eric Tune; John Wilkes (April 21–24, 2015). "Large-scale cluster management at Google with Borg". Proceedings of the European Conference on Computer Systems (EuroSys).
  7. ^ "Borg, Omega, and Kubernetes - ACM Queue". queue.acm.org. Retrieved 2016-06-27.
  8. ^ "Early Stage Startup Heptio Aims to Make Kubernetes Friendly". Retrieved 2016-12-06.
  9. ^ "As Kubernetes Hits 1.0, Google Donates Technology To Newly Formed Cloud Native Computing Foundation". TechCrunch. Retrieved 24 September 2015.
  10. ^ "Cloud Native Computing Foundation".
  11. ^ "OpenShift v3 Platform Combines Docker, Kubernetes". blog.openshift.com. Retrieved 14 August 2014.
  12. ^ "Why Red Hat Chose Kubernetes for OpenShift". blog.openshift.com. Retrieved 7 November 2016.
  13. ^ a b c d e f g h i "An Introduction to Kubernetes". DigitalOcean. Retrieved 24 September 2015.
  14. ^ a b Langemak, Jon (2015-02-11). "Kubernetes 101 – Networking". Das Blinken Lichten. Retrieved 2015-11-02.
  15. ^ Strachan, James (2015-05-21). "Kubernetes for Developers". Medium (publishing platform). Retrieved 2015-11-02.
  16. ^ Surana, Ramit (2015-09-16). "Containerizing Docker on Kubernetes". LinkedIn. Retrieved 2015-11-02.
  17. ^ a b "Intro: Docker and Kubernetes training - Day 2". Red Hat. 2015-10-20. Retrieved 2015-11-02.
  18. ^ a b "Overview of a Replication Controller". Documentation. CoreOS. Retrieved 2015-11-02.
  19. ^ Sanders, Jake (2015-10-02). "Kubernetes: Exciting Experimental Features". Livewyer. Retrieved 2015-11-02.
  20. ^ Langemak, Jon (2015-02-15). "Kubernetes 101 – External Access Into The Cluster". Das Blinken Lichten. Retrieved 2015-11-02.
  21. ^ a b c "Kubernetes Infrastructure". OpenShift Community Documentation. OpenShift. Retrieved 24 September 2015.
  22. ^ a b Marhubi, Kamal (2015-09-26). "Kubernetes from the ground up: API server". kamalmarhubi.com. Retrieved 2015-11-02.
  23. ^ Marhubi, Kamal (2015-08-27). "What [..] is a Kubelet?". kamalmarhubi.com. Retrieved 2015-11-02.