Geo-replication: Difference between revisions
Revision as of 20:04, 28 March 2008
Geo Replication Software
Geo-replication software is a network performance-enhancing technology designed to improve access to portal or intranet content for users at the most remote parts of large organizations. It is based on the principle of storing complete replicas of portal content on local servers, and then keeping the content on those servers up to date using heavily compressed data updates.
Geo-replication technologies replicate the content of portals such as Microsoft SharePoint, as well as intranets, web applications and other data, between servers across wide area networks (WANs), so that users at remote sites can access central content at LAN speeds.
In most situations, geo-replication technology can substantially improve the performance of data networks that suffer from limited bandwidth, high latency and periodic disconnection. Terabytes of data can be replicated efficiently over a WAN, giving remote sites rapid access to web applications.
Geo-replication solutions typically use a combination of compression and web virtualization technologies. Differencing technologies can also be employed to reduce the volume of data that has to be transmitted to keep portal content accurate across all servers. Some geo-replication technologies are able to reduce the data transmitted to keep portal content current across a global deployment by over 90%. This update compression reduces the load that portal traffic places on networks, and improves the end-user experience of a portal by accelerating its performance.
To deliver this reduction in the size of the data updates required across a portal, geo-replication systems often use differencing engine technologies, such as the Epsilon compression system used by Infonic. This system is able to difference the content of each portal server right down to the byte level. Because it knows what content is already on each server, the system can rebuild a change made on one server on each of the other servers in the deployment, using content those servers already host. This type of differencing system ensures that no content, at the byte level, is ever sent to a server twice.
What does this mean in practice? Here is a very simplified example:
1) Let's say a document called "Cat.doc" exists on all the servers in a global portal deployment (called servers A, B, C and D). This document's content is the sentence "The big fat cat sat on the nice brown mat".
2) Now let's say a new document is created by a local user of server C. The user calls it "Dog.doc" and its content is the sentence "The big fat dog sat on the nice brown rug".
3) The differencing engine identifies that almost all of the content of the new document Dog.doc, which is only on server C, exists on servers A, B and D inside another document called Cat.doc.
4) It then identifies the elements of the content of Dog.doc that are different from the content of Cat.doc. These are the words "dog" and "rug".
5) The software then sends instructions to servers A, B and D on how to build a replica of Dog.doc from the content of Cat.doc by replacing the word "cat" with "dog" and "mat" with "rug".
Compression systems like Epsilon perform the task above, but at the byte level, meaning that no byte pattern is ever sent to a server on the network twice.
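The Cat.doc and Dog.doc example can be sketched in a few lines of Python. This is only an illustration of byte-level differencing in general, built on the standard library's difflib module; it is not the Epsilon algorithm, whose internals are not described here. The function names make_delta and apply_delta and the "copy"/"data" instruction format are hypothetical, chosen for this sketch.

```python
from difflib import SequenceMatcher


def make_delta(old: bytes, new: bytes) -> list:
    """Describe `new` as byte ranges copied from `old` plus literal bytes.

    Hypothetical instruction format used only in this sketch:
      ("copy", start, end)  reuse old[start:end], already on the remote server
      ("data", payload)     literal bytes that must actually be transmitted
    """
    delta = []
    matcher = SequenceMatcher(None, old, new, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            delta.append(("copy", i1, i2))
        elif tag in ("replace", "insert"):
            delta.append(("data", new[j1:j2]))
        # "delete" needs no instruction: those old bytes are simply not copied
    return delta


def apply_delta(old: bytes, delta: list) -> bytes:
    """Rebuild the new document on a server that only holds `old`."""
    out = bytearray()
    for op in delta:
        if op[0] == "copy":
            out += old[op[1]:op[2]]
        else:
            out += op[1]
    return bytes(out)


cat = b"The big fat cat sat on the nice brown mat"  # on servers A, B, C and D
dog = b"The big fat dog sat on the nice brown rug"  # new, only on server C

delta = make_delta(cat, dog)          # computed where Dog.doc was created
rebuilt = apply_delta(cat, delta)     # run on servers A, B and D
literal_bytes = sum(len(op[1]) for op in delta if op[0] == "data")

print(delta)
print(rebuilt == dog, literal_bytes, "of", len(dog), "bytes sent as literal data")
```

In this sketch only the bytes for "dog" and "rug" travel as literal data; everything else is expressed as copy instructions against content the receiving servers already hold. A production system would also need a compact wire encoding for the instructions, which is outside the scope of this illustration.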
Geo-replication systems often also employ a further technology called web virtualization, which lets them create replicas of server-based portal content on devices such as laptops that do not have the storage capacity to hold a genuine cache of the server content. Web virtualization gives mobile users access to a full replica of their business portal on a standard laptop.