IRC
- This article discusses the Internet Chat Protocol IRC. For other meanings of IRC, see IRC (disambiguation).
Internet protocol suite |
---|
Application layer |
Transport layer |
Internet layer |
Link layer |
Internet Relay Chat (IRC) is a form of instant communication over the Internet. It is mainly designed for group (many-to-many) communication in discussion forums called channels, but also allows one-to-one communication.
IRC was created by Jarkko Oikarinen (nickname "WiZ") in late August 1988 to replace a program called MUT (MultiUser Talk) on a BBS called OuluBox in Finland. Oikarinen found inspiration in Bitnet Relay Chat which operated on the Bitnet network.
IRC gained prominence when it was used to report on the Soviet coup attempt of 1991 throughout a media blackout. It was later used in a similar fashion by Kuwaitis during the Iraqi invasion.
Technical information
IRC is an open protocol that uses TCP and optionally SSL. An IRC server can connect to other IRC servers to expand the IRC network. Users access IRC networks by connecting a client to a server. There are many client and server implementations. Most IRC servers do not require users to log in, but a user will have to set a nickname before being connected.
IRC is a plaintext protocol, which means that it is fully possible (though quite inconvenient) to use IRC via a basic byte-stream client such as netcat or telnet. However, the protocol only uses a slightly modified version of ASCII, and does not originally provide any support for non-ASCII characters in text, with the result that many different, incompatible character encodings (such as ISO 8859-1 and UTF-8) are used.
Because most IRC implementations use an acyclic graph as their connection model, there is no redundancy, and outage of a server or a link can cause a netsplit.
Evolution
All client-to-server IRC protocols in use today are descended from the protocol implemented in the irc2.8 version of the IRC2server, and documented in RFC 1459. Since RFC 1459 was published, the new features in the irc2.10 implementation led to the publication of several revised protocol documents; RFC 2810, RFC 2811, RFC 2812 and RFC 2813, however these protocol changes have not been widely adopted among other implementations. IRC 2.10 is most widely used on the IRCnet network. The IRC protocol was extended by Microsoft in 1998 via its IRCX protocol that solves many of the traditional problems that legacy IRC networks faced, along with some features that most users felt were 'ahead of its time'. Although many specifications on the IRC protocol have been published, there is no official specification, as the protocol remains dynamic. Virtually no clients and very few servers rely strictly on the above RFCs as a reference.
While the client-to-server protocols are at least functionally similar, server-to-server protocols differ widely (TS5, P10, and ND/CD are several widely-used and incompatible server protocols), making it very difficult to "link" two separate implementations of the IRC server. Some "bridge" servers do exist, to allow linking of, for example, 2.10 servers to TS5 servers, but these are often accompanied with restrictions of which parts of each protocol may be used, and are not widely deployed.
In its first incarnations, IRC did not have many features that are taken for granted today, such as named channels and channel operators. Channels were numbered -- channel 4 and channel 57, for example -- and the channel topic described the kind of conversation that took place in the channel. One holdover of this is that joining channel 0 causes a client to leave all the channels it is presently on: "CHANNEL 0" being the original command to leave the current channel.
The first major change to IRC, in version 2.5, was to add named channels -- "+channels". "+channels" were later replaced with "#channels" in version 2.7, numeric channels were removed entirely and channel bans (mode +b) were implemented. irc2.8 added "&channels" (those that exist only on the current server, rather than the entire network) and "!channels" (those that are theoretically safe from suffering from the many ways that a user could exploit a channel by "riding a netsplit"), and is the baseline release from which nearly all current implementations are derived.
Significant releases based on 2.8 include:
- 2.8.21+CS, developed by Comstud
- 2.8+th, Taner's patchset, which later became
- 2.8/hybrid, originally developed by Jon Lusky (Rodder) and Diane Bruce (Dianora), later joined by a large development team.
- 2.9, 2.10, 2.11, ... continue the development of the original codebase, mainly for use on the IRCnet network. This development line produced the 4 IRC RFCs released after RFC 1459, which document this server protocol exclusively.
2.8.21+CS and 2.8/hybrid continue to be used on EFnet, with ircd-ratbox (an offshoot of 2.8/hybrid) as of 2004 being the most popular.
Undernet's IRC server, ircu, is one of the few servers not descended from irc2.8 that are based on the original ircd; it was forked from the irc2.7 codebase.
Many modern IRC servers have been coded from scratch, such as csircd (also from Comstud), ConferenceRoom, Microsoft Exchange Chat Service, and IRCPlus/IRCXPro.
Channels and Modes
The basic means of communication in an established IRC session is a channel which users can join and then send messages to, which are then relayed to all other users in the same channel. Channels which are available across an entire IRC network are prepended with a '#', while those local to a server use '&'. Other (non-standard) channel types include '+' channels—'modeless' channels without operators, and '!' channels, a form of timestamped channel on normally non-timestamped networks.
Both users and channels may have modes, which are some kind of attributes or switches. Modes are abbreviated by single letters so you can string them together concisely. An example for an user mode is 'i', which stands for invisible. (You cannot tell whether or not an invisible user is on a channel unless you join that channel or use the whois command on its nick.) A simple channel-mode example is 'm' (moderated), specifying that only 'voiced' users and channel operators are allowed to speak on the channel. This, along with 'k' (keyed - requires a password to join the channel) and 'i' (invite-only - requires an invitation from a channel operator) modes can be used to keep abuse out of the channel. There are five types of channelmodes, four of which will accept an argument, type A accepting an argument to add/remove values from a list (such as 'b'), type B accepting an argument that is used when turning the mode 'on' and 'off' (such as 'k'), type C accepting an argument only when the mode is turned 'on' (such as 'l'), type D which accepts no arguments and is simply a boolean flag (such as 'm', 'n', and 't'), and type E (usually called 'class' or 'prefix' modes) that give/take a privilege from a user on a channel (such as 'o').
Type E modes (channel classes) specify which users on a channel have privileges, and what level of those privileges they have. Originally only 'channel operator' (mode 'o') and 'voice' (mode 'v') existed. Channel operator (usually abbreviated chanop or simply 'op') privileges allow a user to kick users, set modes, and change the topic if the channel is '+t'. Voice privileges allow a user to speak on a channel if it is moderated (mode 'm'). Additions to these classes are 'channel owner' (mode 'q') created by Microsoft in its IRCX implementation (and later used by UnrealIrcd); 'half-operator' (mode 'h') which is similar to a chanop, except they cannot set certain modes and can only kick normal users; 'protected' (mode 'a'); 'administrator' (mode 'a' or 'u'); and many more.
Each channel class has an associated prefix that is shown beside a user's nickname whenever associated with that channel. The most common prefixes are '@' for channel operator, '+' for voice, '%' for half-op, '.' or '~' for channel owner, '&' for protected user, '!' or '*' for administrator.
Unless the channel is moderated, the only effect of +v (voice) is the plus sign appearing beside the nick name. On many channels this is used to indicate seniority or regularity of use, or a kind of "trusted user" flag in case the channel does have to be moderated.
Most IRC networks feature a lot of extra modes not specified in any RFC document. This is a very simple feat for clients to adapt to since a list of all the valid user and channelmodes are sent to clients in the RPL_MYINFO reply upon logon. In addition, the list of channelmodes (and what type of arguments they accept), and the prefixes for class modes are specified in the protocol control reply (RPL_PROTOCTL or 005) sent from most IRC servers when a client connects. This message is used to tell clients what features the server supports, and what its limits are (for example, the maximum number of users you can have on your notify list, or the maximum length of your nickname).
There are also users whose privileges extend to whole servers or networks of servers; these are called IRC Operators. On some IRC implementations, IRC operators are also given channel operator status in every channel, although many people believe that administration of channels and administration of the network should be kept separate, and that IRC operator status does not confer the right to interfere with a particular channel's operation.
Because IRC connections are unencrypted and typically span long time periods, they are an attractive target for malicious hackers. Because of this, careful security policy is necessary to ensure that an IRC network is not susceptible to an attack such as an IRC takeover war. IRC networks also k-line or g-line users or networks that tend to have a harming effect.
IRC served as an early laboratory for many kinds of Internet attacks, such as using fake ICMP unreachable messages to break TCP-based IRC connections ("nuking") to annoy users or facilitate takeovers.
Abuse prevention: timestamping vs. nick/channel delay protocol
One of the most contentious technical issues surrounding IRC implementations, which survives to this day, is the merit of "Nick/Channel Delay" vs. "TimeStamp" protocols. Both methods exist to solve the problem of denial-of-service attacks, but take very different approaches.
The problem with the original IRC protocol as implemented was that when two servers split and rejoined, the two sides of the network would simply merge their channels. If a user could join on a "split" server, where a channel which existed on the other side of the network was empty, and gain operator status, they would become a channel operator of the "combined" channel after the netsplit ended; if a user took a nickname which existed on the other side of the network, the server would kill both users when rejoining.
This was often abused to "mass-kill" all users on a channel, thus creating "opless" channels: where no operators were present to deal with abuse. Apart from causing problems within IRC, this encouraged people to conduct denial of service attacks against IRC servers in order to cause netsplits, which they would then abuse.
Nick/channel delay
The nick/channel delay (abbreviated ND/CD) solution to this problem was very simple. After a user signed off and the nickname became available, or a channel ceased to exist because all its users left (as often happens during a netsplit), the server would not allow any user to use that nickname or join that channel, respectively, until a certain period of time (the delay) had passed. The idea behind this was that even if a netsplit occurred, it was useless to an abuser because they could not take the nickname or gain operator status on a channel, and thus no collision of a nickname or 'merging' of a channel could occur. To some extent, this inconvenienced legitimate users, who might be forced to briefly use a different name (appending an underscore was popular) after rejoining.
Timestamping
The alternative, the timestamp or TS protocol, took a different approach. Every nickname and channel on the network was assigned a timestamp -- the date and time when it was created. When a netsplit occurred, two users on each side were free to use the same nickname or channel, but when the two sides were joined, only one could survive. In the case of nicknames, the newer user, according to their TS, was killed; when a channel collided, the members (users on the channel) were merged, but the channel operators on the "losing" side of the split were de-opped.
TS is a much more complicated protocol than ND/CD, both in design and implementation, and despite having gone through several revisions, some implementations still have problems with "desyncs" (where two servers on the same network disagree about the current state of the network), and allowing too much leniency in what was allowed by the 'losing' side. Under the original TS protocols, for example, there was no protection against users setting bans or other modes in the losing channel which would then be merged when the split rejoined, even though the users who had set those modes were no longer opped. Some modern TS-based IRC servers have also incorporated some form of ND and/or CD in addition to timestamping in an attempt to further curb abuse.
There is not, and likely never will be, a consensus on timestamping vs. delay; however most networks today use the timestamping approach. It was part of the issues and disagreements which caused several servers to split away from EFnet and form the newer IRCnet (EFnet after the split moving to a TS protocol, and IRCnet using ND/CD), and supporters on both sides were known for heated arguments regarding the merits of their solution.
Networks
Today there are several thousand running IRC networks in the world. They run various implementations of IRC servers, and are administered by various groups of IRC Operators, but the protocol exposed to IRC users is very similar, and all IRC networks can be accessed by the same client software.
The largest IRC networks have traditionally been grouped in The Big Four — a designation for networks that top the statistics. Currently this includes four networks regularly frequented by around or over one hundred thousand clients, namely:
Other large networks include:
- Aitvaras
- DALnet (once held QuakeNet's seat, before a series of DDoS attacks)
- Enter The Game
- Freenode
- IRCHighway
- GameSurge
- LinkNet
- rizon
For network statistics, rankings, and a list of smaller networks, see netsplit.de and Search IRC
Clients
- See list of IRC clients for more detail.
mIRC is widely believed to be the most popular IRC client on Windows based systems. ircII is the canonical Unix IRC client, but its userbase has declined with the appearance of competing clients such as ircII-EPIC, BitchX, irssi, X-Chat etc.
A framework designed to incorporate IRC into various other applications, such as games, is LibIRC, although it is still heavily under development.
Mac OS X can also run most Unix-like command line and X11 IRC clients.
Bots
There are also many automated clients, called bots. The first bot was written by Greg Lindahl and provided moderation for the game of Wumpus, but most modern bots are usually used to exercise operator privileges (controlling channels and acting quickly in case of abuse), to annoy other users (perhaps by spamming them with lots of messages), to answer repetitive user questions and provide help when channels are not attended, or serve as permanent points of contact for information exchange (an answering machine, file transfer, etc.). The most popular IRC bots today are Eggdrop and EnergyMech.
More recently, bots have been written using the mIRC client's built-in scripting language.
The modern IRC services are implemented via bots. They're often used in channel maintenance: banning users, keeping a list of operators, keeping the channel topic, etc.
Bouncer
A program that runs as a daemon on a server and functions as a persistent proxy is known as a bouncer. A bouncer's purpose is to maintain a connection to an IRC server, acting as a relay between it and the connecting client. Should the client lose network connectivity, the bouncer will archive all traffic for later delivery, allowing the user to resume his IRC session without externally perceptible disruption. Two of the most popular bouncers are muh and psyBNC. Muh is exclusively for single user connections, while psyBNC supports multiple users. Another feature-rich bouncer is ZNC.
Modern IRC
IRC has changed much over its life on the Internet. New server software has added a multitude of new features.
- Services: Network-Operated bots to facilitate registration of nicknames and channels, sending messages for offline users and network operator functions.
- Extra Modes: While the original IRC system used a set of standard user and channel modes, new servers add many new modes for such features as removing color codes from text, or obscuring a user's hostmask ("cloaking") to protect from denial of service attacks.
- Proxy Detection: Most modern servers support detection of users attempting to connect through an insecure (misconfigured or exploited) proxy, which can then be denied a connection. An example is the Blitzed Open Proxy Monitor or BOPM, used by several networks.
- Additional Commands: New commands can be such things as shorthand commands to issue commands to Services, to network operator only commands to manipulate a user's hostmask.
- Encryption: For the client-to-server leg of the connection SSL might be used (messages cease to be secure once they are relayed to other users on standard connections, but it makes eavesdropping on or wiretapping an individual's IRC sessions difficult). For client-to-client communication, SDCC (secure DCC) can be used.
- Ident: Provides identification to the IRC server.
- Connection Protocol: IRC can be connected to via IPv4, the current standard version of the Internet Protocol, or by IPv6, the next-generation version of the Protocol.
Abuse
In addition to the denial of service attacks and netsplit abuses mentioned above, there are varied groups with malicious intent on the servers. One group consists mostly of Leet users knowledgeable about computers. They tell new users that ask questions about how to use IRC to type in specific harmful commands. Examples are:
- format C: /Y (reformats hard drive in Windows)
- rm -rf / (wipes a Unix/Linux system)
- Ctrl+Alt+Delete twice (forces a reboot in earlier versions of Windows)
- Alt+F4 (closes current program in Windows)
- Alt+Z (closes the current channel window in mIRC, a popular IRC client)
Other methods of abuse include typing in questions the answers to which cause the user to quit the server (like "Two friends are sitting in a garden: /exit and /quit. /exit walks away, who is left?") and trying to get another user to quit by flooding him with packets. Advertising of channels or websites (sometimes through the use of bots) in public and private channels is a problem, too.
File sharing
Using scripts like Sysreset and Polaris users can create file servers that allow them to share files with others. In addition to the normal pros and cons of file-sharing (see Copyright infringement of software), there are also groups that set up anime fansubbing networks, allowing American audiences to see anime that would normally be unavailable in English and outside of Japan.
Due to the large amount of people who use IRC solely for file sharing, some think that IRC is a form of P2P file sharing (along with the client mIRC). Paradoxically, many users try to defeat this view by persistently discouraging it or refusing to help with it. Technically, IRC is not for file sharing, although it does posess some advanced file transfer mechanisms which, most importantly, support resuming.
However, IRC-based "DCC" transfers should not be considered to be P2P, as the download source is from one, single, individual; typically an "XDCC" bot or another user. While there is much legitimate material on IRC for download, there is also a strong background of "warez", which is namely the presence of illegal or copyrighted material.
See also
- Bash.org
- Depot channel
- Direct Client-to-Client
- Idle RPG - A role playing game for IRC
- Instant messaging
- IRC Services
- IRCX
- Multicast - IRC is one of the few technologies equipped with a real one-to-many strategy.
- Online chat
- PalTalk
- Peer-to-peer
- XDCC
- IRC floods
- SILC (protocol) - a promising alternative to IRC
- Shell Account
- BBS
- Chat
- Chat room
- Internet forum
- List of smiley codes
- List of IRC commands
- List of IRC clients
External links
- IRC.org - Technical and Historical IRC6 information
- History of IRC summarized by Daniel Stenberg
- mIRC server list, most of the larger IRC networks
- Andreas Gelhausen's extensive IRC statistics
- List of channel modes that various IRC Daemons use
- Large archive of IRC-related documents, somewhat EFNet biased
- cross network whois search at Search IRC, an IRC search engine
- Raw images spidered from IRC channels
- IRC Junkie - IRC news
- IRC Help - Learning the basics of IRC, including netiquette
- An extensive list of different numerics and modes used by various IRC networks
- Technical Information about the IRC Protocol