User:Madiyarturar/sandbox
CamCube is a direct-connect network topology, in which servers are connected directly to each other forming a 3D torus. Direct connections make wiring efficient due to the possibility of using short cables. Moreover, topology is resilient because any source and destination have several paths making it multipath architecture. CamCube is used in IBM BlueGene/L and Cray XT3/Red Storm supercomputers.
Problem Description
[edit]Scalability is one of the main problems of data center networks. With increasing popularity of cloud paradigm the scale of data center also increases and it raises the necessity in high network bandwidth [1]. Even with high cost IP switches and routers it is possible to achieve only 50% of aggregate bandwidth available at the servers [1] [2].
DCN’s performance and throughput is significantly dependent on its architecture. For example: the most frequent architecture for DCNs, three-tier architecture [2] has poor cross section bandwidth and high over-subscription ratio near the root [1] [2] [3]. Fat-tree architecture solves oversubscription ratio making it 1:1, nevertheless, it raises another problem of low scalability (limitation is the number of ports of a switch) [2]. Newer hybrid architecture DCell tackles scalability problem, but cannot perform well under heavy network load [2].
CamCube Topology
[edit]CamCube networking topology is based on a direct-connect topology in which servers are connected directly to each other via 1 Gbps Ethernet cross-over cables, forming 3D torus shape[4]. It is resilient to failure of servers and links, since 3D interconnected torus provides different kind of paths between any source and destination. In this topology, switches are used to connect CamCube servers only with external network but are not responsible for routing internal traffic between the servers. Therefore, not all CamCube servers should be connected to switches. To achieve better application-level performance, each application can implement their own routing protocols with benefit of 3D torus topology and flexibility of CamCube API [5]. However, low routing efficiency compared to other designs could be observed due to comparatively long routing paths in this design[6].
CamCube Services
[edit]CamKey
[edit]CamCube supports a key-based routing service where packets are routed based on the 3D coordinate (x, y, z) keys rather than server address (IP/MAC addresses)[4][5]. This addressing system is constructed in a way where physical topology is same as addressed virtual topology (CamKey) which makes it easier to find location of servers. Keys in CamCube are expressed with 160-bit identifiers. For example, in case CamCube topology has k-nary 3D-cube, only most significant k bits will be used to generate keys (x, y, z) (figure 1). If the server can be found then the assigned coordinate will be mapped to the server, otherwise the coordinate will be mapped to its neighbors. Hence, the not used 160-k bit will be used as weight to identify neighbor server. In case neighbor of the server is also unreachable then this weight will be assigned to another server[4][5]. By continuing this kind of cascading failure algorithm CamKey will construct deterministic map of CamCube topology[4].
Camdoop
[edit]Camdoop is a MapReduce-like model in CamCube. It is an efficient in-network aggregation model designed for parallel processing of big sets of data [4]. The difference from MapReduce is that Camdoop benefits from CamCube’s ability of custom forwarding and processing of packets on path. It is enabled due to the topology’s direct-connect nature, meaning that servers are interconnected directly between each other. This ability enables to perform multiple steps of aggregation, consequently decreasing traffic transportation in network without losing performance speed of an algorithm as opposed to a reference of Hadoop and Dryad/DryadLINQ [4].
CamGraph
[edit]A special supporting infrastructure as CamGraph can be used to increase computational performance of CamCube graph algorithms. Firstly, CamGraph divides the factor graph to the partitions using equal factor method and performs message transmission through the partitions. Main benefits of CamGraph is in its accuracy and simplicity of usage in factor graphs. On the other hand, CamGraph’s performance is slower than Map-Reduce as it gets high overheads while running partitioned graph algorithm[7].
TCP/IP service
[edit]In CamCube architecture we can use unmodified TCP/IP application without any struggle using them in applications. In this system packets will be generated by TCP/IP, encapsulated and sent by tunneling through CamCube architecture. Consequently, destination server injects packets in TCP/IP stack. The main goal of using TCP/IP service is as always requested to get maximum throughput[3]. There can be used different ways of using routing services of TCP/IP. One of them is using multiple shortest paths where routing algorithm calculates all these paths and uses all of them to achieve the maximum throughput. The main drawbacks of this routing algorithm are in its congestion and packet loss problem as the paths can cross each other to make collisions in the network topology[3]. Other routing algorithm is to get paths which are link-disjoint. In this case we use CamCube 3D structure to set links for three axes, in case they do not have any neighbor on the specific axis, routing algorithm adds next neighboring servers to the link list as it implements in CamKey identifiers[4]. As a result, the routing algorithm creates links meanwhile dividing CamCube into mini-cubes. At the end in this routing process there are will be at least three shortest paths and higher throughput while cost of increase of packet delivery jitter[3].
VM Service
[edit]To send large files to various servers (multicasting) the virtual machine (VM) distribution service can be implemented to CamCube structure. VM service works with link-disjoint paths, where union of shortest paths looks like tree structure with small number of interior serves[3]. As routing algorithm creates mini-cube structure in coordinate space of CamCube, VM structure will be hierarchical were smaller cube will be inside of bigger cubes in coordinate system. Consequently, there will be VM distribution tree in CamCube[3].
CamCube API
[edit]There are several core services provided by CamCube API, functioning on all servers. Some of core services are defining 3-dimensional coordinates of each server, exposing coordinate of one-hop neighbors and determining the size of the coordinate space[5]. Furthermore, there is a service, which monitors liveness of one-hop neighbors, in case when some server or link will fail. In order to route packets to servers, there are also multi-hop routing service, which provides link state-based protocol. Services that runs the routing service can modify, intercept or drop a packet. For example, if a server at destination is not reachable, packets are dropped. The routing service uses key-based routing as well as server-based routing. Except these services, all other services run on top of CamCube API[3] .
References
[edit]- ^ a b c Al-Fares, Mohammad; Loukissas, Alexander; Vahdat, Amin (2008). "A scalable, commodity data center network architecture". Proceedings of the ACM SIGCOMM 2008 Conference on Data Communication - SIGCOMM '08. New York, New York, USA: ACM Press: 63. doi:10.1145/1402958.1402967. ISBN 9781605581750. S2CID 65842.
- ^ a b c d e Bilal, Kashif; Khan, Samee U.; Zhang, Limin; Li, Hongxiang; Hayat, Khizar; Madani, Sajjad A.; Min-Allah, Nasro; Wang, Lizhe; Chen, Dan (2012-12-20). "Quantitative comparisons of the state-of-the-art data center architectures". Concurrency and Computation: Practice and Experience. 25 (12): 1771–1783. doi:10.1002/cpe.2963. ISSN 1532-0626. S2CID 17712213.
- ^ a b c d e f g Abu-Libdeh, Hussam; Costa, Paolo; Rowstron, Antony; O'Shea, Greg; Donnelly, Austin (2010). "Symbiotic routing in future data centers". Proceedings of the ACM SIGCOMM 2010 Conference on SIGCOMM - SIGCOMM '10. New York, New York, USA: ACM Press: 51. doi:10.1145/1851182.1851191. ISBN 9781450302012. S2CID 14956298.
- ^ a b c d e f g Costa Paolo; Donnelly Austin; O’Shea Greg; Rowstron Antony. "Camdoop: Exploiting In-network Aggregation for Big Data Applications" (PDF). Yale University. Retrieved 08/06/2018.
{{cite web}}
: Check date values in:|access-date=
(help)CS1 maint: multiple names: authors list (link) - ^ a b c d Costa Paolo; Donnelly Austin; O’Shea Greg; Rowstron Antony. "CamCubeOS: A Key-based Network Stack for 3D Torus Cluster Topologies" (PDF). Microsoft. Retrieved 08/06/2018.
{{cite web}}
: Check date values in:|access-date=
(help)CS1 maint: multiple names: authors list (link) - ^ Ting Wang; Zhiyang Su; Yu Xia; Hamdi, Mounir (2014). "Rethinking the Data Center Networking: Architecture, Network Protocols, and Resource Sharing". IEEE Access. 2: 1481–1496. doi:10.1109/access.2014.2383439. ISSN 2169-3536. S2CID 12071997.
- ^ Rowstron, Antony. "Nobody ever got fired for using Hadoop on a cluster" (PDF). Proceedings of the 1st International Workshop on Hot Topics in Cloud Data Processing: 2.