Jump to content

XMODEM: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
 
(91 intermediate revisions by 45 users not shown)
Line 1: Line 1:
{{Short description|File transfer protocol}}
{{multiple issues|
{{primary sources|date=April 2013}}
{{More footnotes|date=January 2009}}
{{More footnotes|date=January 2009}}
}}

{{Infobox networking protocol
{{Infobox networking protocol
| title = XMODEM
| title = XMODEM
Line 13: Line 10:
| is stack =
| is stack =
| purpose = file transfer protocol
| purpose = file transfer protocol
| developer = [[Ward Christensen]]<ref>[https://books.google.com/books?id=9eJxx_ZGKngC&dq=%22Ward+Christensen%22&pg=PA451 Telecommunications: XMODEM: A Standard Is Born], By Alfred Glossbrenner, PC Mag, 17 April 1984, Page 451-452, ''... but the protocol itself was long ago placed in the public domain by its creator, Chicagoan Ward Christensen. Since its introduction in 1978, XMODEM ...''</ref><ref>[https://books.google.com/books?id=EzAEAAAAMBAJ&dq=%22Ward+Christensen%22&pg=PA26 In Focus: History lesson: Ward Christensen's free free-exchange software], By Michael Swaine, InfoWorld, 1 Nov 1982, Page 26</ref>
| developer = [[Ward Christensen]]
| date = {{Start date and age| 1977 | | }}<!--Fill in: Year (4 digits), month and day (2 digits)-->
| date = {{Start date and age| 1977 | | }}<!--Fill in: Year (4 digits), month and day (2 digits)-->
| based on =
| based on =
Line 22: Line 19:
| hardware = [[modem]]s
| hardware = [[modem]]s
}}
}}
'''XMODEM''' is a simple [[file transfer]] protocol developed as a quick [[Hacker (hobbyist)|hack]] by [[Ward Christensen]] for use in his 1977 '''MODEM.ASM''' [[terminal program]]. It allowed users to transmit files between their computers when both sides used MODEM. Keith Petersen made a minor update to always turn on "quiet mode", and called the result XMODEM.<ref>Ward Christensen, [http://www.bbsdocumentary.com/software/AAA/AAA/CBBS/memories.txt "Memories"], 25 November 1992</ref>
'''XMODEM''' is a simple [[file transfer]] protocol developed as a quick [[Hacker (hobbyist)|hack]] by [[Ward Christensen]] for use in his 1977 '''MODEM.ASM''' [[terminal program]]. It allowed users to transmit files between their computers when both sides used MODEM. Keith Petersen made a minor update to always turn on "quiet mode", and called the result XMODEM.<ref>Ward Christensen, [http://www.bbsdocumentary.com/software/AAA/AAA/CBBS/memories.txt "Memories"], 25 November 1992</ref><ref name="meeks198902">{{Cite magazine |last=Meeks |first=Brock |date=February 1989 |title=The ABCs of X-, Y-, and ZMODEM |url=https://archive.org/details/eu_BYTE-1989-02_OCR/page/n217/mode/2up?view=theater |access-date=2024-10-08 |magazine=BYTE |pages=163-166}}</ref>


XMODEM, like most file transfer protocols, breaks up the original data into a series of "[[Packet (information technology)|packets]]" that are sent to the receiver, along with additional information allowing the receiver to determine whether that packet was correctly received. If an error is detected, the receiver requests that the packet be re-sent. A string of bad packets causes the transfer to abort.
XMODEM became extremely popular in the early [[bulletin board system]] (BBS) market, largely because it was so simple to implement. It was also fairly inefficient, and as modem speeds increased this problem led to the development of a number of modified versions of XMODEM to improve performance or address other problems with the protocol. Christensen believed his original XMODEM to be "the single most modified program in computing history".<ref>{{cite web|url=http://www.well.com/user/hlr/vcbook/vcbook4.html|title=The Virtual Community}}</ref> [[Chuck Forsberg]] collected a number of modifications into his [[YMODEM]] protocol, but poor implementation led to a further fracturing before they were re-unified by his later [[ZMODEM]] protocol.


XMODEM became extremely popular in the early [[bulletin board system]] (BBS) market, largely because it was simple to implement. It was also fairly inefficient, and as modem speeds increased, this problem led to the development of a number of modified versions of XMODEM to improve performance or address other problems with the protocol.{{r|meeks198902}} Christensen believed his original XMODEM to be "the single most modified program in computing history".<ref>{{cite web|url=http://www.well.com/user/hlr/vcbook/vcbook4.html|title=The Virtual Community}}</ref>
XMODEM, like most file transfer protocols, breaks up the original data into a series of "[[Packet (information technology)|packets]]" that are sent to the receiver, along with additional information allowing the receiver to determine whether that packet was correctly received.

{{TOC right}}
[[Chuck Forsberg]] collected a number of common modifications into his [[YMODEM]] protocol, but poor implementation led to a further fracturing before they were re-unified by his later [[ZMODEM]] protocol. ZMODEM became very popular, but never completely replaced XMODEM in the BBS market.


==Packet structure==
==Packet structure==


The original XMODEM used a 128-byte data packet, the basic block size used on [[CP/M]] [[floppy disk]]s. The packet was prefixed by a simple 3-byte header containing a <tt><[[C0 and C1 control codes|SOH]]></tt> character, a "block number" from 0-255, and the "inverse" block number—255 minus the block number. Block numbering starts with 1 for the first block sent, not 0.
The original XMODEM used a 128-byte data packet, the [[Block (data storage)|block size]] used on [[CP/M]] [[floppy disk]]s. The packet was prefixed by a simple 3-byte header containing a <kbd><[[C0 and C1 control codes|SOH]]></kbd> character, a "block number" from 1-255, and the "inverse" block number—255 minus the block number. Block numbering starts with 1 for the first block sent, not 0. The header was followed by the 128 bytes of data, and then a single-byte [[checksum]]. The checksum was the sum of all 128 data bytes in the packet [[modulo operation|modulo]] 256. The complete packet was thus 132 bytes long, containing 128 bytes of [[Payload (computing)|payload data]], for a total [[channel efficiency]] of about 97%.

The packet was also suffixed with a single-byte [[checksum]] of the data bytes. The checksum was the sum of all bytes in the packet [[modulo operation|modulo]] 256. The modulo operation was easily computed by discarding all but the eight [[least significant bit]]s of the result, or alternatively on an eight bit machine, ignoring [[arithmetic overflow]] which would produce the same effect automatically. In this way the checksum was restricted to an eight bit quantity which was able to be expressed using a single byte. For example, if this checksum method was used on a tiny data packet containing only two bytes carrying the values 130 and 130, the total of these codes is 260 and the resulting checksum is 4.

The complete packet was thus 132 bytes long, containing 128 bytes of data, for a total [[channel efficiency]] of about 97%.{{clarify|reason="2/132 would be 1.5%, not 97%"|date=July 2015}}


The file was marked "complete" with a <tt><[[End-of-transmission character|EOT]]></tt> character sent after the last block. This character was not in a packet, but sent alone as a single byte. Since the file length was not sent as part of the protocol, the last packet was padded out with a "known character" that could be dropped. In the original specification this defaulted to <tt><nowiki><SUB></nowiki></tt> or 26 decimal, which CP/M used as the end-of-file marker inside its own disk format. The standard suggested any character could be used for padding, but there was no way for it to be changed ''within the protocol'' itself – if an implementation changed the padding character, only clients using the same implementation would correctly interpret the new padding character.
The file was marked "complete" with a <kbd><[[End-of-transmission character|EOT]]></kbd> character sent after the last block. This character was not in a packet, but sent alone as a single byte. Since the file length was not sent as part of the protocol, the last packet was padded out with a "known character" that could be dropped. In the original specification, this defaulted to <kbd><nowiki><SUB></nowiki></kbd> or 26 decimal, which CP/M used as the end-of-file marker inside its own disk format. The standard suggested any character could be used for padding, but there was no way for it to be changed ''within the protocol'' itself – if an implementation changed the padding character, only clients using the same implementation would correctly interpret the new padding character.


==Transfer details==
==Transfer details==


Files were transferred one packet at a time. When received, the packet's checksum was calculated by the receiver and compared to the one received from the sender at the end of the packet. If the two matched, the receiver sent an <tt><[[Acknowledge character|ACK]]></tt> message back to the sender, which then sent the next packet in sequence. If there was a problem with the checksum, the receiver instead sent a <tt><[[NAK]]></tt>. If a <tt><[[NAK]]></tt> was received, the sender would re-send the packet, and continued to try several times, normally ten, before aborting the transfer.
Files were transferred one packet at a time. When received, the packet's checksum was calculated by the receiver and compared to the one received from the sender at the end of the packet. If the two matched, the receiver sent an <kbd><[[Acknowledge character|ACK]]></kbd> message back to the sender, which then sent the next packet in sequence. If there was a problem with the checksum, the receiver instead sent a <kbd><[[Negative acknowledge character|NAK]]></kbd>. If a <kbd><NAK></kbd> was received, the sender would re-send the packet,{{r|meeks198902}} and continued to try several times, normally ten, before aborting the transfer.


A <tt><NAK></tt> was also sent if the receiver did not receive a valid packet within ten seconds while still expecting data due to the lack of a <tt><EOT></tt> character. A seven-second timeout was also used ''within'' a packet, guarding against dropped connections in mid-packet.
A <kbd><NAK></kbd> was also sent if the receiver did not receive a valid packet within ten seconds while still expecting data due to the lack of a <kbd><EOT></kbd> character. A seven-second timeout was also used ''within'' a packet, guarding against dropped connections in mid-packet.


The block numbers were also examined in a simple way to check for errors. After receiving a packet successfully, the next packet should have a one-higher number. If it instead received the same block number this was not considered serious, it was implied that the <tt><ACK></tt> had not been received by the sender, which had then re-sent the packet.
The block numbers were also examined in a simple way to check for errors. After receiving a packet successfully, the next packet should have a one-higher number. If it instead received the same block number this was not considered serious, it was implied that the <kbd><ACK></kbd> had not been received by the sender, which had then re-sent the packet. Any other packet number signalled that packets had been lost.


Transfers were receiver-driven; the transmitter would not send any data until an initial <tt><[[Acknowledge character|NAK]]></tt> was sent by the receiver. This was a logical outcome of the way the user interacted with the sending machine, which would be remotely located. The user would navigate to the requested file on the sending machine, and then ask that machine to transfer it. Once this command was issued, the user would then execute a command in their local software to start receiving. Since the delay between asking the remote system for the file and issuing a local command to receive was unknown, XMODEM allowed up to 90 seconds for the receiver to begin issuing requests for data packets.
Transfers were receiver-driven; the transmitter would not send any data until an initial <kbd><NAK></kbd> was sent by the receiver. This was a logical outcome of the way the user interacted with the sending machine, which would be remotely located. The user would navigate to the requested file on the sending machine, and then ask that machine to transfer it. Once this command was issued, the user would then execute a command in their local software to start receiving. Since the delay between asking the remote system for the file and issuing a local command to receive was unknown, XMODEM allowed up to 90 seconds for the receiver to begin issuing requests for data packets.


==Problems==
==Problems==
Although XMODEM was robust enough for a journalist in 1982 to transmit stories from Pakistan to the United States with an [[Osborne 1]] and [[acoustic coupler]] over poor-quality telephone lines,<ref name="kline198207">{{cite news | url=https://archive.org/stream/kilobaudmagazine-1982-07/Microcomputing_1982_July#page/n43/mode/2up | title=Osborne—Behind Guerrilla Lines | work=Microcomputing | date=July 1982 | access-date=15 February 2016 | author=Kline, David | pages=42–50}}</ref> the protocol had several flaws.



===Minor problems===
===Minor problems===


XMODEM was written for [[CP/M]] machines, and bears several marks of that [[operating system]]. Notably, files on CP/M were always multiples of 128 bytes, and their end was marked within a block with the <tt><EOT></tt> character. These characteristics were transplanted directly into XMODEM. However, other operating systems did not feature either of these peculiarities, and the widespread introduction of [[MS-DOS]] in the early 1980s led to XMODEM having to be updated to notice either a <tt><EOT></tt> ''or'' <tt><EOF></tt> as the end-of-file marker.
XMODEM was written for [[CP/M]] machines, and bears several marks of that [[operating system]]. Notably, files on CP/M were always multiples of 128 bytes, and their end was marked within a block with the <kbd><EOT></kbd> character. These characteristics were transplanted directly into XMODEM. However, other operating systems did not feature either of these peculiarities, and the widespread introduction of [[MS-DOS]] in the early 1980s led to XMODEM having to be updated to notice either a <kbd><EOT></kbd> ''or'' <kbd><EOF></kbd> as the end-of-file marker.


For some time it was suggested that sending a <tt><CAN></tt> character instead of an <tt><ACK></tt> or <tt><NAK></tt> should be supported in order to easily abort the transfer from the receiving end. Likewise, a <tt><CAN></tt> received in place of the <tt><SOH></tt> indicated the sender wished to cancel the transfer. However, this character could be easily "created" via simple noise-related errors of what was meant to be an <tt><ACK></tt> or <tt><NAK></tt>. A double-<tt><CAN></tt> was proposed to avoid this problem, but it is not clear if this was widely implemented.
For some time it was suggested that sending a <kbd><CAN></kbd> character instead of an <kbd><ACK></kbd> or <kbd><NAK></kbd> should be supported in order to easily abort the transfer from the receiving end. Likewise, a <kbd><CAN></kbd> received in place of the <kbd><SOH></kbd> indicated the sender wished to cancel the transfer. However, this character could be easily "created" via simple noise-related errors of what was meant to be an <kbd><ACK></kbd> or <kbd><NAK></kbd>. A double-<kbd><CAN></kbd> was proposed to avoid this problem, but it is not clear if this was widely implemented.


===Major problems===
===Major problems===


XMODEM was designed for simplicity, without much knowledge of other file transfer protocols – which were fairly rare anyway. Due to its simplicity, there were a number of very basic errors that could cause a transfer to fail, or worse, result in an incorrect file which went unnoticed by the protocol. Most of this was due to the use of a simple checksum for error correction, which is susceptible to missing errors in the data if ''two'' bits are reversed, which can happen with a suitably short burst of noise. Additionally, similar damage to the header or checksum could lead to a failed transfer in cases where the data itself was undamaged.
XMODEM was designed for simplicity, without much knowledge of other file transfer protocols – which were fairly rare anyway. Due to its simplicity, there were a number of very basic errors that could cause a transfer to fail, or worse, result in an incorrect file which went unnoticed by the protocol. Most of this was due to the use of a simple checksum for error correction,{{r|meeks198902}} which is susceptible to missing errors in the data if ''two'' bits are reversed, which can happen with a suitably short burst of noise. Additionally, similar damage to the header or checksum could lead to a failed transfer in cases where the data itself was undamaged.


Many authors introduced extensions to XMODEM to address these and other problems. Many asked for these extensions to be included as part of a new XMODEM standard. However, Ward Christensen refused to do this, as it was precisely the ''lack'' of these features, and the associated coding needed to support them, that led to XMODEM's widespread use. As he explained:
Many authors introduced extensions to XMODEM to address these and other problems. Many asked for these extensions to be included as part of a new XMODEM standard. However, Ward Christensen refused to do this, as it was precisely the ''lack'' of these features, and the associated coding needed to support them, which led to XMODEM's widespread use. As he explained:


:It was a quick hack I threw together, very unplanned (like everything I do), to satisfy a personal need to communicate with some other people. ONLY the fact that it was done in 8/77, and that I put it in the public domain immediately, made it become the standard that it is...
:It was a quick hack I threw together, very unplanned (like everything I do), to satisfy a personal need to communicate with some other people. ONLY the fact that it was done in 8/77, and that I put it in the public domain immediately, made it become the standard that it is...
:...People who suggest I make SIGNIFICANT changes to the protocol, such as 'full duplex', 'multiple outstanding blocks', 'multiple destinations', etc etc don't understand that the incredible simplicity of the protocol is one of the reasons it survived.
:...People who suggest I make SIGNIFICANT changes to the protocol, such as 'full duplex', 'multiple outstanding blocks', 'multiple destinations', etc etc don't understand that the incredible simplicity of the protocol is one of the reasons it survived.


==Batch Transfers==
==Batch transfers==
Another problem with XMODEM was that it required the transfer to be user-driven. Typically this meant the user would navigate on the sender's system to select the file they wanted, and then invoke the transfer from their end using a command in their terminal emulator. If the user wanted to transfer another file, they would have to repeat this process again.
Another problem with XMODEM was that it required the transfer to be user-driven rather than automated.{{r|meeks198902}} Typically this meant the user would navigate on the sender's system to select the file they wanted, and then use a command to put that system into the "ready to send" mode. They would then trigger the transfer from their end using a command in their terminal emulator. If the user wanted to transfer another file, they would have to repeat this process again.


For automated transfers between two sites, a number of add-ons to the XMODEM protocol were implemented over time. These generally assumed the sender would continue sending file after file, with the receiver attempting to trigger the next file by sending a <tt><NAK></tt> as normal at the start of a transfer. When the <tt><NAK></tt>'s timed out, it could be assumed that either there were no more files, or the link was broken anyway.
For automated transfers between two sites, a number of add-ons to the XMODEM protocol were implemented over time. These generally assumed the sender would continue sending file after file, with the receiver attempting to trigger the next file by sending a <code>NAK</code> as normal at the start of a transfer. When the <code>NAK</code>s timed out, it could be assumed that either there were no more files, or the link was broken anyway.


===MODEM7===
===MODEM7===
'''MODEM7''', also known as '''MODEM7 batch''' or '''Batch XMODEM''', was the first known extension of the XMODEM protocol. A normal XMODEM file transfer starts with the receiver sending a single <tt><NAK></tt> character to the sender, which then starts sending packets of 128-bytes of data prefixed with a <tt><SOH></tt>. MODEM7 changed this behaviour only slightly, by sending the filename, in [[8.3 filename]] format, before the first data packet. Each character was sent individually and had to be echoed by the receiver as a form of error correction. For a non-aware XMODEM implementation this data would simply be ignored while it waited for the <tt><SOH></tt> to arrive, so the characters would not be echoed and the implementation could fall back to conventional XMODEM. With "aware" software, the file name could be used to save the file locally. Transfers could continue with another <tt><NAK></tt>, each file being saved under the name being sent to the receiver.
'''MODEM7''', also known as '''MODEM7 batch''' or '''Batch XMODEM''', was the first known extension of the XMODEM protocol. A normal XMODEM file transfer starts with the receiver sending a single <code>NAK</code> character to the sender, which then starts sending a single <code>SOH</code> to indicate the start of the data, and then packets of data.
MODEM7 changed this behavior only slightly, by sending the filename, in [[8.3 filename]] format, before the <kbd><SOH></kbd>. Each character was sent individually and had to be echoed by the receiver as a form of error correction. For a non-aware XMODEM implementation, this data would simply be ignored while it waited for the <code>SOH</code> to arrive, so the characters would not be echoed and the implementation could fall back to conventional XMODEM. With "aware" software, the file name could be used to save the file locally. Transfers could continue with another <code><NAK></code>, each file is saved under the name being sent to the receiver.

[[Jerry Pournelle]] in 1983 described MODEM7 as "probably the most popular microcomputer communications program in existence".<ref name="pournelle198307">{{cite news | url=https://archive.org/stream/byte-magazine-1983-07-rescan/1983_07_BYTE_08-07_Videotex#page/n334/mode/2up | title=Interstellar Drives, Osborne Accessories, DEDICATE/32, and Death Valley | work=BYTE | date=July 1983 | access-date=28 August 2016 | author=Pournelle, Jerry | page=334}}</ref>


===TeLink===
===TeLink===
MODEM7 sent the filename as normal text, which meant it could be corrupted by the same problems that XMODEM was attempting to avoid. This led to the introduction of '''TeLink''' by [[Tom Jennings]], author of the original [[FidoNet]] mailers.
MODEM7 sent the filename as normal text, which meant it could be corrupted by the same problems that XMODEM was attempting to avoid. This led to the introduction of '''TeLink''' by [[Tom Jennings]], author of the original [[FidoNet]] mailers.


TeLink avoided MODEM7's problems by standardizing a new "zero packet" containing information about the original file. This included the file's name, size, and [[timestamp]], which were placed in a regular 128 byte XMODEM block. Whereas a normal XMODEM transfer would start with the sender sending "block 1", the TeLink header packet was labeled "block 0".
TeLink avoided MODEM7's problems by standardizing a new "zero packet" containing information about the original file. This included the file's name, size, and [[timestamp]], which were placed in a regular 128 byte XMODEM block. Whereas a normal XMODEM transfer would start with the sender sending "block 1" with a <code><SOH></code> header, the TeLink header packet was labeled "block 0" and began with a <code><SYN></code>. The packet contained the file creation date and time, filename up to 16 characters, the file size as a 4-byte value, and the name of the program sending the file.{{sfn|Bush|1995|p=G.1}}


Again, a normal XMODEM implementation would simply discard the packet, the assumption being that the packet number had been corrupted. But this led to a potential time delay if the packet were discarded, as the sender could not be sure it was being <tt><NAK></tt>'ed because it did not understand the "block 0", or because there was a transmission error. However, TeLink was generally limited to [[FidoNet]] software, which demanded it as part of the FidoNet standards. During early stages of FidoNet's development, the "mailer" programs called each other at known times early in the morning, when it was safe to assume the receiver was another mailer that also implemented TeLink.
A normal XMODEM implementation would simply discard the packet, the assumption being that the packet number had been corrupted. But this led to a potential time delay if the packet were discarded, as the sender could not tell whether the receiver had responded with a <code><NAK></code> because it did not understand the zero packet or because there was a transmission error. As TeLink was normally used only by [[FidoNet]] software, which demanded it as part of the FidoNet standards, this did not present a real-world problem as both ends would always support this standard.{{sfn|Bush|1995|p=G.1}}


The basic "block 0" system became a standard in the FidoNet community, and was re-used by a number of future protocols like [[SEAlink]] and [[YMODEM]].
The basic "block 0" system became a standard in the FidoNet community, and was re-used by a number of future protocols like [[SEAlink]] and [[YMODEM]].


==XMODEM-CRC==
==XMODEM-CRC==
The checksum used in the original protocol was extremely simple, and errors within the packet could go unnoticed. This led to the introduction of '''XMODEM-CRC''' by John Byrns,<ref>{{cite web
The checksum used in the original protocol was extremely simple, and errors within the packet could go unnoticed. This led to the introduction of '''XMODEM-CRC''' by John Byrns,{{sfn|Christensen|1982}}{{sfn|Forsberg|1986}} which used a 16-bit [[Cyclic redundancy check|CRC]] in place of the 8-bit checksum.{{r|meeks198902}} CRCs encode not only the data in the packet, but its location as well, allowing it to notice the bit-replacement errors that a checksum would miss. Statistically, this made the chance of detecting an error less than 16 bits long 99.9969%, and even higher for longer error bit strings.{{sfn|Boswell|1986}}
| url = http://www.techheap.com/communication/modems/xmodem.html
| title = XMODEM Protocol Overview
| first = Ward
| last = Christensen
| authorlink = Ward Christensen
| date = 1 January 1982
}}</ref><ref>{{cite web
| url = http://www.techheap.com/communication/modems/xmodem-ymodem_reference.html
| title = XMODEM/YMODEM PROTOCOL REFERENCE
| first = Chuck
| last = Forsberg
| authorlink = Chuck Forsberg
| date = 11 September 1986
}}</ref> which used a 16-bit [[Cyclic redundancy check|CRC]] in place of the 8-bit checksum. CRC's encode not only the data in the packet, but its location as well, allowing it to notice the bit-replacement errors that a checksum would miss. Statistically, this made the chance of detecting an error less than 16 bits long 99.9969%, and even higher for longer error bit strings.


XMODEM-CRC was designed to be backwardly compatible with XMODEM. To do this, the receiver simply sent a <tt>C</tt> (capital C) character instead of a <tt><NAK></tt> to start the transfer. If the sender responded by sending a packet, it was assumed the sender "knew" XMODEM-CRC, and the receiver continued sending <tt>C</tt>'s. If no packet was forthcoming, the receiver assumed the sender did not know the protocol, and sent an <tt><NAK></tt> to start a "traditional" XMODEM transfer.
XMODEM-CRC was designed to be backwardly compatible with XMODEM. To do this, the receiver sent a <kbd>C</kbd> (capital C) character instead of a <kbd><NAK></kbd> to start the transfer. If the sender responded by sending a packet, it was assumed the sender "knew" XMODEM-CRC, and the receiver continued sending <kbd>C</kbd>'s. If no packet was forthcoming, the receiver assumed the sender did not know the protocol, and sent an <kbd><NAK></kbd> to start a "traditional" XMODEM transfer.{{sfn|Boswell|1986}}


Unfortunately this attempt at backward compatibility had a downside. Since it was possible that the initial <tt>C</tt> character would be lost or corrupted, it could not be assumed that the receiver did not support XMODEM-CRC if the first attempt to trigger the transfer failed. The receiver thus tried to start the transfer three times with <tt>C</tt>, waiting three seconds between each attempt. This meant that if the user selected XMODEM-CRC while attempting to talk to ''any'' XMODEM, as it was intended, there was a potential 10 second delay before the transfer started.
Unfortunately, this attempt at backward compatibility had a downside. Since it was possible that the initial <kbd>C</kbd> character would be lost or corrupted, it could not be assumed that the receiver did not support XMODEM-CRC if the first attempt to trigger the transfer failed. The receiver thus tried to start the transfer three times with <kbd>C</kbd>, waiting three seconds between each attempt. This meant that if the user selected XMODEM-CRC while attempting to talk to ''any'' XMODEM, as it was intended, there was a potential 10 second delay before the transfer started.{{sfn|Boswell|1986}}


To avoid the delay, the sender and receiver would generally list XMODEM-CRC separately from XMODEM, allowing the user to select "basic" XMODEM if the sender didn't explicitly list it. Ironically, any software that ''did'' support -CRC in their basic XMODEM transfer, as it was intended, surreptitiously suggested the user should not attempt to use -CRC. To the average user, XMODEM-CRC was essentially a "second protocol", and treated as such.
To avoid the delay, the sender and receiver would generally list XMODEM-CRC separately from XMODEM, allowing the user to select "basic" XMODEM if the sender didn't explicitly list it. To the average user, XMODEM-CRC was essentially a "second protocol", and treated as such. This was not true of FidoNet mailers, however, where CRC was defined as the standard for all TeLink transfers.{{sfn|Bush|1995|p=G.1}}


==Higher throughput==
==Higher throughput==
Since the XMODEM protocol required the sender to stop and wait for an <tt><ACK></tt> or <tt><NAK></tt> message from the receiver, it tended to be quite slow. In the era of 300 bit/s modems, the entire 132-byte packet required just over 3.5 seconds to send (132 bytes * 8 bits per byte / 300 bits per second). If it then took 0.2 seconds for the receiver's <tt><ACK></tt> to make it back to the sender and the next packet to start hitting the receiver (0.1 seconds in both directions), the overall time for one packet would be 3.7&nbsp;seconds, just over 92% throughput.
Since the XMODEM protocol required the sender to stop and wait for an <kbd><ACK></kbd> or <kbd><NAK></kbd> message from the receiver, it tended to be quite slow. In the era of 300&nbsp;bit/s modems, the entire 132-byte packet required 4.4&nbsp;seconds to send (132 bytes * (8 bits per byte + 1 start bit + 1 stop bit) / 300 bits per second). Assuming it takes 0.2&nbsp;seconds for the receiver's <kbd><ACK></kbd> to make it back to the sender and the next packet to start hitting the receiver (0.1&nbsp;seconds in both directions), the overall time for one packet would be 4.6&nbsp;seconds, just over 92% channel efficiency.


As modem speeds increased, the fixed delay needed to send the <tt><ACK></tt>/<tt><NAK></tt> grows in proportion to time needed to send the packet. For instance, at 2400&nbsp;bit/s the packets took only 0.44&nbsp;seconds to send, so if the <tt><ACK></tt>/<tt><NAK></tt> still took 0.2&nbsp;seconds to make it back (this is ''latency'' in the network, not throughput), the throughput has fallen to under 60%. At 9600&nbsp;bit/s it is under 30% – more time is spent waiting for the reply than is needed to send the packet.
The time for the <kbd><ACK></kbd>/<kbd><NAK></kbd> process was a fixed function of the underlying communications network, not of the performance of the modems. As modem speeds increased, the fixed delay grew in proportion to time needed to send the packet. For instance, at 2400&nbsp;bit/s the packets took only 0.55&nbsp;seconds to send, so if the <kbd><ACK></kbd>/<kbd><NAK></kbd> still took 0.2&nbsp;seconds to make it back to the user's machine, the efficiency has fallen to 71%. At 9600&nbsp;bit/s it is just under 40% – more time is spent waiting for the reply than is needed to send the packet.


A number of new versions of XMODEM were introduced in order to address these problems. Like earlier extensions, these versions tended to be backward-compatible with the original XMODEM, and like those extensions, this led to a further fracturing of the XMODEM landscape in the user's terminal emulator. In the end, dozens of versions of XMODEM would emerge.
A number of new versions of XMODEM were introduced in order to address these problems. Like earlier extensions, these versions tended to be backward-compatible with the original XMODEM, and like those extensions, this led to further fracturing of the XMODEM landscape in the user's terminal emulator. In the end, dozens of versions of XMODEM emerged.

===WXModem===
'''WXmodem''', short for "Windowed Xmodem", is a variant of XMODEM developed by Peter Boswell in 1986 for use on high-latency lines, specifically public [[X.25]] systems and [[PC Pursuit]]. These have latencies that are far higher than the [[plain-old telephone service]], which leads to very poor efficiency in XMODEM. Additionally, these networks often use [[control character]]s for [[Flow control (data)|flow control]] and other tasks, notably [[XON/XOFF]] will stop the data flow. Finally, in the case of an error that required a resend, it was sometimes difficult to know whether a <code>SOH</code> was a packet indicator or more noise. WXmodem adapted XMODEM-CRC to address these problems.{{sfn|Boswell|1986}}

One change was to escape a small set of control characters: <code>DLE</code>, <code>XON</code>, <code>XOFF</code> and <code>SYN</code>. These were escaped by inserting a <code>DLE</code> in front of them, and then modifying the character by XORing it with 64. In theory, this meant the packet might be as long as 264 bytes if it originally consisted entirely of characters that required escaping. These inserted and modified characters are not part of the CRC calculation, they are removed and converted at the receiving end before calculating the CRC.{{sfn|Boswell|1986}}

Additionally, all packets were prefixed with a <code>SYN</code> character, which meant the packet lead-in was <code>SYN</code><code>SOH</code>, reducing the chance that a stray <code>SOH</code> would be confused for a packet header in various error cases. An unescaped <code>SYN</code> found in the body of a packet was an error.{{sfn|Boswell|1986}}

The major change in WXMODEM is the use of a [[sliding window]] to improve throughput on high-latency links. To do so, the <code>ACK</code> messages were followed by the packet number they were <code>ACK</code>ing or <code>NAK</code>ing. The receiver does not have to <code>ACK</code> every packet; it is allowed to <code>ACK</code> any number between one and four packets. An <code>ACK</code> with the fourth packet sequence number is assumed to <code>ACK</code> all four packets. An error causes a <code>NAK</code> to be sent immediately, with all packets from that number and after being re-sent.{{sfn|Boswell|1986}}

Requiring an <code>ACK</code> every four packets makes the system work like it has a packet size of 512&nbsp;bytes, but in the case of an error, typically only requires 128&nbsp;bytes to be re-sent. Moreover, it reduces the amount of data flowing in the reverse direction by four times. This is of little interest in the typical modem's [[full duplex]] operation, but is important in [[half duplex]] systems like [[Telebit]] models which have 19&nbsp;kB speed in one direction and 75&nbsp;bits/s in the return channel.


===SEAlink===
===SEAlink===
One of the first third party mailers for the [[FidoNet]] system was '''SEAdog''', written by the same author as the then-popular [[.arc]] [[data compression]] format. SEAdog included a wide variety of improvements, including [[SEAlink]], an improved transfer protocol.
One of the first third-party mailers for the [[FidoNet]] system was '''SEAdog''', written by the same author as the then-popular [[.arc]] [[data compression]] format. SEAdog included a wide variety of improvements, including [[SEAlink]], an improved transfer protocol based on the same sliding window concept as WXmodem.{{sfn|SEAlink|1987}} It differed from WXmodem mostly in details.


One difference is that SEAlink supported the "zero packet" introduced by TeLink, which is needed in order to operate as a drop-in replacement for TeLink in FidoNet systems where the header was expected. <code>ACK</code>s and <code>NAK</code>s were extended to three-byte "packets", starting with the <code>ACK</code> or <code>NAK</code>, then the packet number, then the complement of the packet number, in the same fashion as the original XMODEM packet header. The window size was normally set to six packets.{{sfn|SEAlink|1987}}
SEAlink used a method known as ''[[sliding window]]s'' to avoid the inter-packet delay. To do this, the protocol did not wait for the <tt><ACK></tt>/<tt><NAK></tt> to arrive, and immediately moved onto the next packet. It was only after a defined number of packets had been sent, the ''window'', that the protocol would stop and wait. If the <tt><ACK></tt> arrived before the window ended, the protocol would remove that packet from the window and add another. In this fashion the system, under ideal conditions, never reached the end of the window, and continued sending packets continually. In order for this to work, SEAlink needed to know which packet the receiver was <tt><ACK></tt>/<tt><NAK></tt>ing, which it did by appending the packet number to the <tt><ACK></tt> or <tt><NAK></tt> character.


SEAlink was not expected to operate over X.25 or similar links, and thus did not perform escaping. This was also needed so the zero packet would work properly, as this standard used the <code>SYN</code> character that WXmodem had re-purposed.{{sfn|SEAlink|1987}} On top of these changes, it added an "Overdrive" mode for half duplex links. This suppressed ACKs for packets that were successfully transferred, in effect making the window of infinite size. This mode was indicated by a flag in the zero block.{{sfn|SEAlink|1987}}
SEAlink later added a number of other improvements, and was a useful general-purpose protocol. However it remained rare outside the FidoNet world, and was rarely seen in user-facing software.

SEAlink later added a number of other improvements and was a useful general-purpose protocol. However, it remained rare outside the FidoNet world, and was rarely seen in user-facing software.


===XMODEM-1K===
===XMODEM-1K===
Another way to solve the throughput problem is to increase the packet size. Although the fundamental problem of latency remains, the speed at which it becomes a problem is higher. XMODEM-1K with 1024-byte packets was the most popular such solution. In this case, the throughput at 9600 bit/s is 81%, given the same assumptions as above.
Another way to solve the throughput problem is to increase the packet size. Although the fundamental problem of latency remains, the speed at which it becomes a problem is higher. XMODEM-1K with 1024-byte packets{{r|meeks198902}} was the most popular such solution. In this case, the throughput at 9600&nbsp;bit/s is 81%, given the same assumptions as above.


XMODEM-1K was an expanded version of XMODEM-CRC, which indicated the longer block size in the ''sender'' by starting a packet with the <tt><STX></tt> character instead of <tt><SOH></tt>. Like other backward-compatible XMODEM extensions, it was intended that a -1K transfer could be started with any implementation of XMODEM on the other end, backing off features as required.
XMODEM-1K was an expanded version of XMODEM-CRC, which indicated the longer block size in the ''sender'' by starting a packet with the <kbd><STX></kbd> character instead of <kbd><SOH></kbd>. Like other backward-compatible XMODEM extensions, it was intended that a -1K transfer could be started with any implementation of XMODEM on the other end, backing off features as required.


XMODEM-1K was originally one of the many improvements to XMODEM introduced by [[Chuck Forsberg]] in his [[YMODEM]] protocol. Forsberg suggested that the various improvements were optional, expecting software authors to implement as many of them as possible. Instead they generally implemented the bare minimum, leading to a profusion of semi-compatible implementations, and eventually, the splitting out of the name "YMODEM" into "XMODEM-1K" and a variety of YMODEMs. Thus XMODEM-1K actually post-dates YMODEM, but remained fairly common anyway.
XMODEM-1K was originally one of the many improvements to XMODEM introduced by [[Chuck Forsberg]] in his [[YMODEM]] protocol. Forsberg suggested that the various improvements were optional, expecting software authors to implement as many of them as possible. Instead, they generally implemented the bare minimum, leading to a profusion of semi-compatible implementations, and eventually, the splitting out of the name "YMODEM" into "XMODEM-1K" and a variety of YMODEMs. Thus XMODEM-1K actually post-dates YMODEM, but remained fairly common anyway.


===NMODEM===
A backwards compatible extensions of XMODEM with 32k and 64k block lengths was created by Adontec for better performance on high-speed error free connections like ISDN or TCP/IP networks.
NMODEM is a [[file transfer]] protocol developed by L. B. Neal, who released it in 1990. NMODEM is essentially a version of XMODEM-CRC using larger 2048 byte blocks, as opposed to XMODEM's 128 byte blocks.
NMODEM was implemented as a separate program, written in Turbo Pascal 5.0 for the [[IBM PC compatible]] family of computers. The block size was chosen to match the common cluster size of the [[MS-DOS]] [[File Allocation Table|FAT]] file system on contemporary [[hard drive]]s, making buffering data for writing simpler.<ref>{{cite web |url=http://www.cpeterso.com/protocols/NMODM112.ARJ |title=NMODEM 1.12 program and source code |archive-url=https://web.archive.org/web/20110807213052/http://www.cpeterso.com/code/protocols/NMODM112.ARJ |archive-date=2011-08-07 |url-status=dead |access-date=2020-02-13 }}</ref><ref>{{cite web |url=http://www.cpeterso.com/protocols/NMODEM.TXT |title=NMODEM documentation |archive-url=https://web.archive.org/web/20160409012002/https://www.cpeterso.com/code/protocols/NMODEM.TXT |archive-date=2016-04-09 |url-status=dead |access-date=2020-02-13 }}</ref>


===Pre-acknowledge===
===Protocol spoofing===
Over reliable (error-free) connections, the receiver could eliminate the latency issue by "pre-acknowledging" the packets. The receiver would already send ACK while the packet was still being transmitted. This effectively breaks error-correction since a packet is always acknowledged regardless of its integrity (which can only be checked after it has been completely received). Since this feature is only an alteration of the receiver-side behaviour, it does not require any changes in the protocol or on the sender's side.
Over reliable (error-free) connections, it is possible to eliminate latency by "pre-acknowledging" the packets, a technique known more generally as "[[protocol spoofing]]". This is normally accomplished in the link hardware, notably Telebit modems. The modems, when the option was turned on, would notice the XMODEM header and immediately sent an <code>ACK</code>. This would cause the sending XMODEM program to immediately send the next packet, making the transfer continuous, like an infinite-sized window. The modems also suppress the <code>ACK</code> being sent by the XMODEM software at the far end, thereby freeing up the low-speed return channel.


The system can also be implemented in the protocol itself, and variations of XMODEM offered this feature. In these cases, the receiver would send the <code>ACK</code> as soon as the packet started, in the same fashion as the Telebit modems. Since this feature is only an alteration of the receiver-side behavior, it does not require any changes in the protocol on the sender's side. [[YMODEM]] formalized this system.
Pre-acknowledge was also possible for [[YMODEM]]. It was made obsolete by variants such as YMODEM-g or [[ZMODEM]].

This concept should be contrasted with the one used in SEAlink, which changes the behavior on both sides of the link. In SEAlink, the receiver stops sending the <code>ACK</code> entirely, and the sender changes its behavior to not expect them.

==See also==
* [[BLAST (protocol)]]
* [[Kermit (protocol)]]


==References==
==References==
===Citations===
{{Reflist}}
{{Reflist}}

===Bibliography===
* {{cite tech report
|url=http://ftsc.org/docs/fts-0001.016
|title=FidoNet Technical Standard FTS-0001
|first=Randy |last=Bush
|date=30 September 1995
}}
* {{cite web
|title = XMODEM Protocol Overview
|first = Ward
|last = Christensen
|author-link = Ward Christensen
|date = 1 January 1982
|url = http://techheap.packetizer.com/communication/modems/xmodem.html
}}
* {{cite web
|title = XMODEM/YMODEM PROTOCOL REFERENCE
|first = Chuck
|last = Forsberg
|author-link = Chuck Forsberg
|date = 11 September 1986
|url = http://techheap.packetizer.com/communication/modems/xmodem-ymodem_reference.html
}}
* {{cite web
|url=http://wiki.synchro.net/ref:xmodem
|title= XMODEM, CRC XMODEM, WXMODEM File Transfer Protocols
|at=CRC XMODEM
|first=Peter |last=Boswell
|date=20 June 1986
}}
* {{cite tech report
|url=https://github.com/cpeterso/sealink/blob/master/sealink.txt
|title= SEALINK File Transfer Protocol
|date=24 August 1987
|ref=CITEREFSEAlink1987
}}


==External links==
==External links==
* [http://www.vintagecomputer.net/fjkraan/comp/mirror/z80cpu.eu/archive/rlee/L/LOOSECPM/224/MODEM.ASM MODEM.ASM], original source code, Ward Christensen, October 10, 1977.
* [http://textfiles.com/programming/ymodem.txt XMODEM / YMODEM Protocol Reference by Chuck Forsberg], October 10, 1985
* [http://textfiles.com/programming/ymodem.txt XMODEM / YMODEM Protocol Reference by Chuck Forsberg], October 10, 1985
* [http://pauillac.inria.fr/~doligez/zmodem/ymodem.txt XMODEM / YMODEM Protocol Reference by Chuck Forsberg], June 18, 1988 (document reformatted October 14, 1988) [http://www.techfest.com/hardware/modem/xymodem.htm (HTML version with text issues)]
* [http://pauillac.inria.fr/~doligez/zmodem/ymodem.txt XMODEM / YMODEM Protocol Reference by Chuck Forsberg], June 18, 1988 (document reformatted October 14, 1988) [https://web.archive.org/web/20121110151951/http://www.techfest.com/hardware/modem/xymodem.htm (HTML version with text issues)]
* [http://wiki.synchro.net/ref:xmodem XMODEM / XMODEM-CRC / WXMODEM File Transfer Protocols], synchro.net
* [http://wiki.synchro.net/ref:xmodem XMODEM / XMODEM-CRC / WXMODEM File Transfer Protocols], synchro.net
* [http://www.adontec.com/xmodem_e.htm Adontec XMODEM/32k and XMODEM/64k extensions], adontec.com
* [http://www.adontec.com/xmodem_e.htm Adontec XMODEM/32k and XMODEM/64k extensions], adontec.com
Line 147: Line 191:


[[Category:BBS file transfer protocols]]
[[Category:BBS file transfer protocols]]
[[Category:1977 introductions]]
[[Category:Computer-related introductions in 1977]]

Latest revision as of 18:54, 9 December 2024

XMODEM
Communication protocol
Purposefile transfer protocol
Developer(s)Ward Christensen[1][2]
Introduction1977; 48 years ago (1977)
InfluencedYMODEM, many others
Hardwaremodems

XMODEM is a simple file transfer protocol developed as a quick hack by Ward Christensen for use in his 1977 MODEM.ASM terminal program. It allowed users to transmit files between their computers when both sides used MODEM. Keith Petersen made a minor update to always turn on "quiet mode", and called the result XMODEM.[3][4]

XMODEM, like most file transfer protocols, breaks up the original data into a series of "packets" that are sent to the receiver, along with additional information allowing the receiver to determine whether that packet was correctly received. If an error is detected, the receiver requests that the packet be re-sent. A string of bad packets causes the transfer to abort.

XMODEM became extremely popular in the early bulletin board system (BBS) market, largely because it was simple to implement. It was also fairly inefficient, and as modem speeds increased, this problem led to the development of a number of modified versions of XMODEM to improve performance or address other problems with the protocol.[4] Christensen believed his original XMODEM to be "the single most modified program in computing history".[5]

Chuck Forsberg collected a number of common modifications into his YMODEM protocol, but poor implementation led to a further fracturing before they were re-unified by his later ZMODEM protocol. ZMODEM became very popular, but never completely replaced XMODEM in the BBS market.

Packet structure

[edit]

The original XMODEM used a 128-byte data packet, the block size used on CP/M floppy disks. The packet was prefixed by a simple 3-byte header containing a <SOH> character, a "block number" from 1-255, and the "inverse" block number—255 minus the block number. Block numbering starts with 1 for the first block sent, not 0. The header was followed by the 128 bytes of data, and then a single-byte checksum. The checksum was the sum of all 128 data bytes in the packet modulo 256. The complete packet was thus 132 bytes long, containing 128 bytes of payload data, for a total channel efficiency of about 97%.

The file was marked "complete" with a <EOT> character sent after the last block. This character was not in a packet, but sent alone as a single byte. Since the file length was not sent as part of the protocol, the last packet was padded out with a "known character" that could be dropped. In the original specification, this defaulted to <SUB> or 26 decimal, which CP/M used as the end-of-file marker inside its own disk format. The standard suggested any character could be used for padding, but there was no way for it to be changed within the protocol itself – if an implementation changed the padding character, only clients using the same implementation would correctly interpret the new padding character.

Transfer details

[edit]

Files were transferred one packet at a time. When received, the packet's checksum was calculated by the receiver and compared to the one received from the sender at the end of the packet. If the two matched, the receiver sent an <ACK> message back to the sender, which then sent the next packet in sequence. If there was a problem with the checksum, the receiver instead sent a <NAK>. If a <NAK> was received, the sender would re-send the packet,[4] and continued to try several times, normally ten, before aborting the transfer.

A <NAK> was also sent if the receiver did not receive a valid packet within ten seconds while still expecting data due to the lack of a <EOT> character. A seven-second timeout was also used within a packet, guarding against dropped connections in mid-packet.

The block numbers were also examined in a simple way to check for errors. After receiving a packet successfully, the next packet should have a one-higher number. If it instead received the same block number this was not considered serious, it was implied that the <ACK> had not been received by the sender, which had then re-sent the packet. Any other packet number signalled that packets had been lost.

Transfers were receiver-driven; the transmitter would not send any data until an initial <NAK> was sent by the receiver. This was a logical outcome of the way the user interacted with the sending machine, which would be remotely located. The user would navigate to the requested file on the sending machine, and then ask that machine to transfer it. Once this command was issued, the user would then execute a command in their local software to start receiving. Since the delay between asking the remote system for the file and issuing a local command to receive was unknown, XMODEM allowed up to 90 seconds for the receiver to begin issuing requests for data packets.

Problems

[edit]

Although XMODEM was robust enough for a journalist in 1982 to transmit stories from Pakistan to the United States with an Osborne 1 and acoustic coupler over poor-quality telephone lines,[6] the protocol had several flaws.

Minor problems

[edit]

XMODEM was written for CP/M machines, and bears several marks of that operating system. Notably, files on CP/M were always multiples of 128 bytes, and their end was marked within a block with the <EOT> character. These characteristics were transplanted directly into XMODEM. However, other operating systems did not feature either of these peculiarities, and the widespread introduction of MS-DOS in the early 1980s led to XMODEM having to be updated to notice either a <EOT> or <EOF> as the end-of-file marker.

For some time it was suggested that sending a <CAN> character instead of an <ACK> or <NAK> should be supported in order to easily abort the transfer from the receiving end. Likewise, a <CAN> received in place of the <SOH> indicated the sender wished to cancel the transfer. However, this character could be easily "created" via simple noise-related errors of what was meant to be an <ACK> or <NAK>. A double-<CAN> was proposed to avoid this problem, but it is not clear if this was widely implemented.

Major problems

[edit]

XMODEM was designed for simplicity, without much knowledge of other file transfer protocols – which were fairly rare anyway. Due to its simplicity, there were a number of very basic errors that could cause a transfer to fail, or worse, result in an incorrect file which went unnoticed by the protocol. Most of this was due to the use of a simple checksum for error correction,[4] which is susceptible to missing errors in the data if two bits are reversed, which can happen with a suitably short burst of noise. Additionally, similar damage to the header or checksum could lead to a failed transfer in cases where the data itself was undamaged.

Many authors introduced extensions to XMODEM to address these and other problems. Many asked for these extensions to be included as part of a new XMODEM standard. However, Ward Christensen refused to do this, as it was precisely the lack of these features, and the associated coding needed to support them, which led to XMODEM's widespread use. As he explained:

It was a quick hack I threw together, very unplanned (like everything I do), to satisfy a personal need to communicate with some other people. ONLY the fact that it was done in 8/77, and that I put it in the public domain immediately, made it become the standard that it is...
...People who suggest I make SIGNIFICANT changes to the protocol, such as 'full duplex', 'multiple outstanding blocks', 'multiple destinations', etc etc don't understand that the incredible simplicity of the protocol is one of the reasons it survived.

Batch transfers

[edit]

Another problem with XMODEM was that it required the transfer to be user-driven rather than automated.[4] Typically this meant the user would navigate on the sender's system to select the file they wanted, and then use a command to put that system into the "ready to send" mode. They would then trigger the transfer from their end using a command in their terminal emulator. If the user wanted to transfer another file, they would have to repeat this process again.

For automated transfers between two sites, a number of add-ons to the XMODEM protocol were implemented over time. These generally assumed the sender would continue sending file after file, with the receiver attempting to trigger the next file by sending a NAK as normal at the start of a transfer. When the NAKs timed out, it could be assumed that either there were no more files, or the link was broken anyway.

MODEM7

[edit]

MODEM7, also known as MODEM7 batch or Batch XMODEM, was the first known extension of the XMODEM protocol. A normal XMODEM file transfer starts with the receiver sending a single NAK character to the sender, which then starts sending a single SOH to indicate the start of the data, and then packets of data.

MODEM7 changed this behavior only slightly, by sending the filename, in 8.3 filename format, before the <SOH>. Each character was sent individually and had to be echoed by the receiver as a form of error correction. For a non-aware XMODEM implementation, this data would simply be ignored while it waited for the SOH to arrive, so the characters would not be echoed and the implementation could fall back to conventional XMODEM. With "aware" software, the file name could be used to save the file locally. Transfers could continue with another <NAK>, each file is saved under the name being sent to the receiver.

Jerry Pournelle in 1983 described MODEM7 as "probably the most popular microcomputer communications program in existence".[7]

[edit]

MODEM7 sent the filename as normal text, which meant it could be corrupted by the same problems that XMODEM was attempting to avoid. This led to the introduction of TeLink by Tom Jennings, author of the original FidoNet mailers.

TeLink avoided MODEM7's problems by standardizing a new "zero packet" containing information about the original file. This included the file's name, size, and timestamp, which were placed in a regular 128 byte XMODEM block. Whereas a normal XMODEM transfer would start with the sender sending "block 1" with a <SOH> header, the TeLink header packet was labeled "block 0" and began with a <SYN>. The packet contained the file creation date and time, filename up to 16 characters, the file size as a 4-byte value, and the name of the program sending the file.[8]

A normal XMODEM implementation would simply discard the packet, the assumption being that the packet number had been corrupted. But this led to a potential time delay if the packet were discarded, as the sender could not tell whether the receiver had responded with a <NAK> because it did not understand the zero packet or because there was a transmission error. As TeLink was normally used only by FidoNet software, which demanded it as part of the FidoNet standards, this did not present a real-world problem as both ends would always support this standard.[8]

The basic "block 0" system became a standard in the FidoNet community, and was re-used by a number of future protocols like SEAlink and YMODEM.

XMODEM-CRC

[edit]

The checksum used in the original protocol was extremely simple, and errors within the packet could go unnoticed. This led to the introduction of XMODEM-CRC by John Byrns,[9][10] which used a 16-bit CRC in place of the 8-bit checksum.[4] CRCs encode not only the data in the packet, but its location as well, allowing it to notice the bit-replacement errors that a checksum would miss. Statistically, this made the chance of detecting an error less than 16 bits long 99.9969%, and even higher for longer error bit strings.[11]

XMODEM-CRC was designed to be backwardly compatible with XMODEM. To do this, the receiver sent a C (capital C) character instead of a <NAK> to start the transfer. If the sender responded by sending a packet, it was assumed the sender "knew" XMODEM-CRC, and the receiver continued sending C's. If no packet was forthcoming, the receiver assumed the sender did not know the protocol, and sent an <NAK> to start a "traditional" XMODEM transfer.[11]

Unfortunately, this attempt at backward compatibility had a downside. Since it was possible that the initial C character would be lost or corrupted, it could not be assumed that the receiver did not support XMODEM-CRC if the first attempt to trigger the transfer failed. The receiver thus tried to start the transfer three times with C, waiting three seconds between each attempt. This meant that if the user selected XMODEM-CRC while attempting to talk to any XMODEM, as it was intended, there was a potential 10 second delay before the transfer started.[11]

To avoid the delay, the sender and receiver would generally list XMODEM-CRC separately from XMODEM, allowing the user to select "basic" XMODEM if the sender didn't explicitly list it. To the average user, XMODEM-CRC was essentially a "second protocol", and treated as such. This was not true of FidoNet mailers, however, where CRC was defined as the standard for all TeLink transfers.[8]

Higher throughput

[edit]

Since the XMODEM protocol required the sender to stop and wait for an <ACK> or <NAK> message from the receiver, it tended to be quite slow. In the era of 300 bit/s modems, the entire 132-byte packet required 4.4 seconds to send (132 bytes * (8 bits per byte + 1 start bit + 1 stop bit) / 300 bits per second). Assuming it takes 0.2 seconds for the receiver's <ACK> to make it back to the sender and the next packet to start hitting the receiver (0.1 seconds in both directions), the overall time for one packet would be 4.6 seconds, just over 92% channel efficiency.

The time for the <ACK>/<NAK> process was a fixed function of the underlying communications network, not of the performance of the modems. As modem speeds increased, the fixed delay grew in proportion to time needed to send the packet. For instance, at 2400 bit/s the packets took only 0.55 seconds to send, so if the <ACK>/<NAK> still took 0.2 seconds to make it back to the user's machine, the efficiency has fallen to 71%. At 9600 bit/s it is just under 40% – more time is spent waiting for the reply than is needed to send the packet.

A number of new versions of XMODEM were introduced in order to address these problems. Like earlier extensions, these versions tended to be backward-compatible with the original XMODEM, and like those extensions, this led to further fracturing of the XMODEM landscape in the user's terminal emulator. In the end, dozens of versions of XMODEM emerged.

WXModem

[edit]

WXmodem, short for "Windowed Xmodem", is a variant of XMODEM developed by Peter Boswell in 1986 for use on high-latency lines, specifically public X.25 systems and PC Pursuit. These have latencies that are far higher than the plain-old telephone service, which leads to very poor efficiency in XMODEM. Additionally, these networks often use control characters for flow control and other tasks, notably XON/XOFF will stop the data flow. Finally, in the case of an error that required a resend, it was sometimes difficult to know whether a SOH was a packet indicator or more noise. WXmodem adapted XMODEM-CRC to address these problems.[11]

One change was to escape a small set of control characters: DLE, XON, XOFF and SYN. These were escaped by inserting a DLE in front of them, and then modifying the character by XORing it with 64. In theory, this meant the packet might be as long as 264 bytes if it originally consisted entirely of characters that required escaping. These inserted and modified characters are not part of the CRC calculation, they are removed and converted at the receiving end before calculating the CRC.[11]

Additionally, all packets were prefixed with a SYN character, which meant the packet lead-in was SYNSOH, reducing the chance that a stray SOH would be confused for a packet header in various error cases. An unescaped SYN found in the body of a packet was an error.[11]

The major change in WXMODEM is the use of a sliding window to improve throughput on high-latency links. To do so, the ACK messages were followed by the packet number they were ACKing or NAKing. The receiver does not have to ACK every packet; it is allowed to ACK any number between one and four packets. An ACK with the fourth packet sequence number is assumed to ACK all four packets. An error causes a NAK to be sent immediately, with all packets from that number and after being re-sent.[11]

Requiring an ACK every four packets makes the system work like it has a packet size of 512 bytes, but in the case of an error, typically only requires 128 bytes to be re-sent. Moreover, it reduces the amount of data flowing in the reverse direction by four times. This is of little interest in the typical modem's full duplex operation, but is important in half duplex systems like Telebit models which have 19 kB speed in one direction and 75 bits/s in the return channel.

[edit]

One of the first third-party mailers for the FidoNet system was SEAdog, written by the same author as the then-popular .arc data compression format. SEAdog included a wide variety of improvements, including SEAlink, an improved transfer protocol based on the same sliding window concept as WXmodem.[12] It differed from WXmodem mostly in details.

One difference is that SEAlink supported the "zero packet" introduced by TeLink, which is needed in order to operate as a drop-in replacement for TeLink in FidoNet systems where the header was expected. ACKs and NAKs were extended to three-byte "packets", starting with the ACK or NAK, then the packet number, then the complement of the packet number, in the same fashion as the original XMODEM packet header. The window size was normally set to six packets.[12]

SEAlink was not expected to operate over X.25 or similar links, and thus did not perform escaping. This was also needed so the zero packet would work properly, as this standard used the SYN character that WXmodem had re-purposed.[12] On top of these changes, it added an "Overdrive" mode for half duplex links. This suppressed ACKs for packets that were successfully transferred, in effect making the window of infinite size. This mode was indicated by a flag in the zero block.[12]

SEAlink later added a number of other improvements and was a useful general-purpose protocol. However, it remained rare outside the FidoNet world, and was rarely seen in user-facing software.

XMODEM-1K

[edit]

Another way to solve the throughput problem is to increase the packet size. Although the fundamental problem of latency remains, the speed at which it becomes a problem is higher. XMODEM-1K with 1024-byte packets[4] was the most popular such solution. In this case, the throughput at 9600 bit/s is 81%, given the same assumptions as above.

XMODEM-1K was an expanded version of XMODEM-CRC, which indicated the longer block size in the sender by starting a packet with the <STX> character instead of <SOH>. Like other backward-compatible XMODEM extensions, it was intended that a -1K transfer could be started with any implementation of XMODEM on the other end, backing off features as required.

XMODEM-1K was originally one of the many improvements to XMODEM introduced by Chuck Forsberg in his YMODEM protocol. Forsberg suggested that the various improvements were optional, expecting software authors to implement as many of them as possible. Instead, they generally implemented the bare minimum, leading to a profusion of semi-compatible implementations, and eventually, the splitting out of the name "YMODEM" into "XMODEM-1K" and a variety of YMODEMs. Thus XMODEM-1K actually post-dates YMODEM, but remained fairly common anyway.

NMODEM

[edit]

NMODEM is a file transfer protocol developed by L. B. Neal, who released it in 1990. NMODEM is essentially a version of XMODEM-CRC using larger 2048 byte blocks, as opposed to XMODEM's 128 byte blocks. NMODEM was implemented as a separate program, written in Turbo Pascal 5.0 for the IBM PC compatible family of computers. The block size was chosen to match the common cluster size of the MS-DOS FAT file system on contemporary hard drives, making buffering data for writing simpler.[13][14]

Protocol spoofing

[edit]

Over reliable (error-free) connections, it is possible to eliminate latency by "pre-acknowledging" the packets, a technique known more generally as "protocol spoofing". This is normally accomplished in the link hardware, notably Telebit modems. The modems, when the option was turned on, would notice the XMODEM header and immediately sent an ACK. This would cause the sending XMODEM program to immediately send the next packet, making the transfer continuous, like an infinite-sized window. The modems also suppress the ACK being sent by the XMODEM software at the far end, thereby freeing up the low-speed return channel.

The system can also be implemented in the protocol itself, and variations of XMODEM offered this feature. In these cases, the receiver would send the ACK as soon as the packet started, in the same fashion as the Telebit modems. Since this feature is only an alteration of the receiver-side behavior, it does not require any changes in the protocol on the sender's side. YMODEM formalized this system.

This concept should be contrasted with the one used in SEAlink, which changes the behavior on both sides of the link. In SEAlink, the receiver stops sending the ACK entirely, and the sender changes its behavior to not expect them.

See also

[edit]

References

[edit]

Citations

[edit]
  1. ^ Telecommunications: XMODEM: A Standard Is Born, By Alfred Glossbrenner, PC Mag, 17 April 1984, Page 451-452, ... but the protocol itself was long ago placed in the public domain by its creator, Chicagoan Ward Christensen. Since its introduction in 1978, XMODEM ...
  2. ^ In Focus: History lesson: Ward Christensen's free free-exchange software, By Michael Swaine, InfoWorld, 1 Nov 1982, Page 26
  3. ^ Ward Christensen, "Memories", 25 November 1992
  4. ^ a b c d e f g Meeks, Brock (February 1989). "The ABCs of X-, Y-, and ZMODEM". BYTE. pp. 163–166. Retrieved 2024-10-08.
  5. ^ "The Virtual Community".
  6. ^ Kline, David (July 1982). "Osborne—Behind Guerrilla Lines". Microcomputing. pp. 42–50. Retrieved 15 February 2016.
  7. ^ Pournelle, Jerry (July 1983). "Interstellar Drives, Osborne Accessories, DEDICATE/32, and Death Valley". BYTE. p. 334. Retrieved 28 August 2016.
  8. ^ a b c Bush 1995, p. G.1.
  9. ^ Christensen 1982.
  10. ^ Forsberg 1986.
  11. ^ a b c d e f g Boswell 1986.
  12. ^ a b c d SEAlink 1987.
  13. ^ "NMODEM 1.12 program and source code". Archived from the original on 2011-08-07. Retrieved 2020-02-13.
  14. ^ "NMODEM documentation". Archived from the original on 2016-04-09. Retrieved 2020-02-13.

Bibliography

[edit]
[edit]