Jump to content

HTTP 404

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Cjjm (talk | contribs) at 05:39, 28 September 2010 (Reinstatement of lost wiki code at end of '404 Page Widgets' section. What does: SONU KUMAR SONI mean?). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The 404 or Not Found error message is a HTTP standard response code indicating that the client was able to communicate with the server, but the server could not find what was requested. 404 errors should not be confused with "server not found" or similar errors, in which a connection to the destination server could not be made at all. A 404 error indicates that the requested resource may be available again in the future.

Overview

When communicating via HTTP, a server is required to respond to a request, such as a web browser's request for an HTML document (web page), with a numeric response code and an optional, mandatory, or disallowed (based upon the status code) message. In the code 404, the first "4" indicates a client error, such as a mistyped URL. The following two digits indicate the specific error encountered. HTTP's use of three-digit codes is similar to the use of such codes in earlier protocols such as FTP and NNTP.

At the HTTP level, a 404 response code is followed by a human-readable "reason phrase". The HTTP specification suggests the phrase "Not Found"[1] and many web servers by default issue an HTML page that includes both the 404 code and the "Not Found" phrase.

A 404 error is often returned when pages have been moved or deleted. In the first case, a better response is to return a 301 Moved Permanently response, which can be configured in most server configuration files, or through URL rewriting; in the second case, a 410 Gone should be returned. Because these two options require special server configuration, most websites do not make use of them.

404 errors should not be confused with DNS errors, which appear when the given URL refers to a server name that does not exist. A 404 error indicates that the server itself was found, but that the server was not able to retrieve the requested page.

Custom error pages

A screenshot of a 404 error page on Wikipedia in the Camino browser

Webservers can typically be configured to display a customised error page, including a more natural description, the parent site's branding or sometimes a search form. The protocol level phrase, which is hidden from the user, is rarely customized.

Internet Explorer (before Internet Explorer 7), however, will not display custom pages unless they are larger than 512 bytes, opting to instead display a "friendly" error page. Google Chrome includes similar functionality, where the 404 is replaced with alternative suggestions generated by Google algorithms, if the page is under 512 bytes in size.

False 404 errors

Some websites report a "not found" error by returning a standard web page with a "200 OK" response code; this is known as a soft 404. Soft 404s are problematic for automated methods of discovering whether a link is broken. Some search engines, like Yahoo, use automated processes to detect soft 404s.[2] Soft 404s can occur as a result of configuration errors when using certain HTTP server software, for example with the Apache software, when an Error Document 404 (specified in a .htaccess file) is specified as an absolute path (e.g. http://example.com/error.html) rather than a relative path (/error.html).[3]

Some proxy servers generate a 404 error when the remote host is not present, rather than returning the correct 500-range code when errors such as hostname resolution failures or refused TCP connections prevent the proxy server from satisfying the request. This can confuse programs that expect and act on specific responses, as they can no longer easily distinguish between an absent web server and a missing web page on a web server that is present.

In July 2004, the UK telecom provider BT Group deployed the Cleanfeed content blocking system, which returns a 404 error to any request for content identified as potentially illegal by the Internet Watch Foundation.[4] Other ISPs return a HTTP 403 "forbidden" error in the same circumstances.[5] The practice of employing fake 404 errors as a means to conceal censorship has also been reported in Thailand[6] and Tunisia.[7] In Tunisia, where censorship is reportedly severe, people have become aware of the nature of the fake 404 errors and have created an imaginary character named "Ammar 404" who represents "the invisible censor".[8]

Slang usage

In 2008, a study found that "404" had become a slang synonym for "clueless" in the UK/Ireland areas. Slang lexicographer Jonathon Green said that "404" as a slang term had been driven by the "influence of technology" and young people, but at the current time, such usage was relatively confined to London and other urban areas. The study, carried out by the telecommunications arm of the Post Office, concluded that writing abbreviations such as "404" was expedient to the sender, while an Australian study found that such phrases impaired the intelligibility of the message.[9]

404 page widgets

While many websites send additional information in a 404 error message—such as a link to the homepage of a website or a search box—there are also much more advanced extensions available as widgets that endeavor to find the correct web page the user wanted.[10]

See also

References

  1. ^ "6.1.1 Status Code and Reason Phrase". Retrieved 22 June 2008.
  2. ^ ""Why is your crawler asking for strange URLs that have never existed on my site?", Yahoo Web Crawler Help page". Retrieved 4 October 2009.
  3. ^ ""Farewell to soft 404s", Google Official Blog". Retrieved 20 September 2008.
  4. ^ LINX Public Affairs » Cleanfeed: the facts
  5. ^ "DEMON - Error 403"
  6. ^ "The old fake '404 Not Found' routine". Bangkok Post. 2009-02-18. Retrieved 2010-09-12.
  7. ^ http://opennet.net/node/950
  8. ^ ""Anti-censorship movement in Tunisia: creativity, courage and hope!"". Advocacy.globalvoicesonline.org. 2010-05-27. Retrieved 2010-08-28.
  9. ^ "Don't be 404, know the tech slang". BBC News. 2008-12-10. Retrieved 2008-12-13.
  10. ^ Swenson, Sahala (August 19, 2008). "Make your 404 pages more useful". Official Google Webmaster Central Blog. Google, Inc. Retrieved August 28, 2009.