English in computing: Difference between revisions
Jack Frost (talk | contribs) m Reverted edits by 93.183.169.165 (talk) (HG) (3.4.10) |
IDN Date start is June 2003 according to https://en.wikipedia.org/wiki/Internationalized_domain_name, and it mentions a source, so I put that source |
||
(30 intermediate revisions by 23 users not shown) | |||
Line 1: | Line 1: | ||
⚫ | |||
{{Short description|English as lingua franca of programming and computer science}} |
{{Short description|English as lingua franca of programming and computer science}} |
||
{{Multiple issues| |
|||
⚫ | |||
{{original research|date=June 2021}} |
{{original research|date=June 2021}} |
||
{{ |
{{more footnotes|date=June 2021}} |
||
}} |
|||
⚫ | |||
⚫ | |||
⚫ | The [[English language]] is sometimes described as the ''[[lingua franca]]'' of computing. In comparison to other sciences, where [[Latin]] and [[Greek language|Greek]] are often the principal sources of vocabulary, [[computer science]] borrows more extensively from English. In the past, due to the technical limitations of early computers, and the lack of international standardization on the [[Internet]], computer users were limited to using English and the Latin alphabet. However, this historical limitation is less present today, due to innovations in internet infrastructure and increases in computer speed. Most software products are [[Internationalization and localization|localized]] in numerous languages and the invention of the [[Unicode]] character encoding has resolved problems with non-Latin alphabets. Some limitations have changed since June 2003<ref>Mohan, Ram, [https://www.iana.org/domains/idn-tables/tables/info_de_1.0.html German IDN], [http://www.afilias.info/register/idn/DE-INFO_IDN_Language_Table-v1-1.pdf German Language Table] {{Webarchive|url=https://web.archive.org/web/20061218142725/http://www.afilias.info/register/idn/DE-INFO_IDN_Language_Table-v1-1.pdf |date=2006-12-18 }}, March 2003</ref> such as with [[domain name]]s, which previously allowed only [[ASCII]] characters. |
||
⚫ | |||
==History== |
|||
⚫ | The [[English language]] is sometimes described as the ''[[lingua franca]]'' of computing. In comparison to other sciences, where [[Latin]] and [[Greek language|Greek]] are the principal sources of vocabulary, [[computer science]] borrows more extensively from English. |
||
Computer science has an ultimately mathematical foundation which was laid by non-English speaking cultures. The first mathematically literate societies in the Ancient Near East recorded methods for solving mathematical problems in steps.<ref>{{cite book |last=Chabert |first=Jean-Luc |date=1994 |title=A History of Algorithms |location=Paris |publisher=Springer |page=7}}</ref> The word 'algorithm' comes from the name of a famous medieval Arabic mathematician who contributed to the spread of [[Hindu–Arabic numeral system|Hindu-Arabic numerals]], [[al-Khwārizmī]].<ref>{{cite book |last=O'Regan |first=Gerard |date=2021 |title=A Brief History of Computing |location=Cham, Switzerland |publisher=Springer |page=29}}</ref> The first systematic treatment of binary numbers was completed by [[Leibniz]],<ref>{{cite book |last=O'Regan |first=Gerard |date=2021 |title=A Brief History of Computing |location=Cham, Switzerland |publisher=Springer |page=38}}</ref> a German mathematician. Leibniz wrote his treatise on the topic in French, the lingua franca of science at the time,<ref>{{cite web |url=http://www.andaman.org/BOOK/reprints/weber/rep-weber.htm |title=Top Languages |last=Weber |first=George |date=2003 |website=www.andaman.org |archive-url=https://web.archive.org/web/20080312042140/http://www.andaman.org/BOOK/reprints/weber/rep-weber.htm |archive-date=12 March 2008 |url-status=dead}}</ref> and innovations in what is now called [[computer hardware]] occurred outside of an English tradition, with Pascal inventing the first mechanical calculator, and Leibniz improving it.<ref>{{cite book |last=O'Regan |first=Gerard |date=2021 |title=A Brief History of Computing |location=Cham, Switzerland |publisher=Springer |page=36}}</ref> |
|||
Interest in building computing machines first emerged in the 19th century, with the coming of the [[Second Industrial Revolution]]. The origins of computing in an English tradition began in this era with [[Charles Babbage]]'s conceptualization of the [[Difference Engine|Difference]] and [[Analytical Engine]], [[George Boole]]'s work on logic, and [[Herman Hollerith]]'s invention of the tabulating machine for specific use in the [[1890 United States census]].<ref>{{cite book |last=O'Regan |first=Gerard |date=2021 |title=A Brief History of Computing |location=Cham, Switzerland |publisher=Springer |pages=35–88}}</ref> At the time, Britain enjoyed near complete hegemonic power in the West at the height of the ''[[Pax Britannica]]'', and America was experiencing an economic and demographic boom. By the time of the interwar period in the early 20th century, the most important mathematics related to the development of computing was being done in English, which was also beginning to become the new lingua franca of science.<ref>{{cite book |last=Kaplan |first=Robert |date=2001 |title=The Dominance of English as a Language of Science |location=Berlin, New York |publisher=Mouton De Gruyter |page=9}}</ref> |
|||
⚫ | |||
==Influence on other languages== |
==Influence on other languages== |
||
The computing terminology of many languages borrows from English. Some language communities |
The computing terminology of many languages borrows from English. Some language communities actively resist this trend, and in other cases English is used extensively and more directly. This section gives some examples of the use of English loans in other languages and mentions any notable differences. |
||
===Bulgarian=== |
===Bulgarian=== |
||
Both English and Russian have influence over [[Bulgarian language|Bulgarian]] computing vocabulary. |
Both English and Russian have had influence over [[Bulgarian language|Bulgarian]] computing vocabulary. In many cases, however, the borrowed word is translated into Bulgarian rather than transcribed phonetically from English. Combined with the use of [[Cyrillic script|Cyrillic]] this can make it difficult to recognize loanwords. For example, the Bulgarian term for [[motherboard]] is {{lang|bg|дънна платка}} ({{IPA|bg|ˈdɤnna ˈplatka|ipa}}), literally "bottom board". |
||
* {{lang|bg|компютър}} {{IPA| |
* {{lang|bg|компютър}} ({{IPA|bg|komˈpjutɤr|ipa}}) – computer |
||
* {{lang|bg|твърд диск}} {{IPA| |
* {{lang|bg|твърд диск}} ({{IPA|bg|tvɤrt disk|ipa}}) – hard disk (i.e., [[Hard disk drive|hard drive]]) |
||
* {{lang|bg|дискета}} {{IPA| |
* {{lang|bg|дискета}} ({{IPA|bg|diˈskɛtɐ|ipa}}) – [[floppy disk]]; like the French {{lang|fr|disquette}} |
||
* {{lang|bg| |
* {{lang|bg|уебсайт}} ({{IPA|bg|ˈwɛbsɐjt|ipa}}) – web site; but also {{lang|bg|интернет страница}} ({{IPA|bg|ˈintɛrnɛt ˈstranit͡sɐ|ipa}}), literally "internet page" |
||
===Faroese=== |
===Faroese=== |
||
The [[Faroese language]] has a sparse scientific vocabulary ''based on the language itself''. Many Faroese scientific words are borrowed and/or modified versions of especially Nordic and English equivalents. The vocabulary is constantly evolving and thus new words often die out, and only a few survive and become widely used. Examples of successful words include e.g. "telda" (computer), "kurla" (at sign) and "ambætari" (server).<ref>{{Cite web |url=http://www.fmn.fo/listar/teldord/teldord-f-e-d3.htm |title=List of Faroese-English-Danish IT words |access-date=June 29, 2010 |archive-url=https://web.archive.org/web/20130531144706/http://www.fmn.fo/listar/teldord/teldord-f-e-d3.htm |archive-date=May 31, 2013 |url-status=dead }}</ref> |
The [[Faroese language]] has a sparse scientific vocabulary ''based on the language itself''. Many Faroese scientific words are borrowed and/or modified versions of especially Nordic and English equivalents. The vocabulary is constantly evolving and thus new words often die out, and only a few survive and become widely used. Examples of successful words include e.g. "{{lang|fo|telda}}" (computer), "{{lang|fo|kurla}}" (at sign) and "{{lang|fo|ambætari}}" (server).<ref>{{Cite web |url=http://www.fmn.fo/listar/teldord/teldord-f-e-d3.htm |title=List of Faroese-English-Danish IT words |access-date=June 29, 2010 |archive-url=https://web.archive.org/web/20130531144706/http://www.fmn.fo/listar/teldord/teldord-f-e-d3.htm |archive-date=May 31, 2013 |url-status=dead }}</ref> |
||
===French=== |
===French=== |
||
{{see also|Franglais}} |
{{see also|Franglais}} |
||
In [[French language|French]], there are some generally accepted English |
In [[French language|French]], there are some generally accepted English loanwords, but there is also a distinct effort to avoid them. In France, the {{lang|fr|[[Académie française]]|italic=no}} is responsible for the standardisation of the language and often coins new technological terms. Some of them are accepted in practice, but oftentimes the English loans remain predominant. In Quebec, the {{lang|fr|[[Office québécois de la langue française]]|italic=no}} has a similar function. |
||
* |
* {{lang|fr|email}}/{{lang|fr|mail}} (in Europe); {{lang|fr|courriel}} (mainly in [[French-speaking Canada]], but increasingly used in French-speaking Europe); {{lang|fr|mél.}} (only used as an abbreviation, similar to "{{lang|fr|tél.}}"<ref>[http://www.academie-francaise.fr/questions-de-langue#26_strong-em-courriel-ml-em-strong "Questions de langue" on the Académie Française's website]</ref>); more formally {{lang|fr|courrier électronique}} |
||
* |
* {{lang|fr|pourriel}} – spam |
||
* |
* {{lang|fr|hameçonnage}}, phishing – phishing |
||
* |
* {{lang|fr|télécharger}} – to download |
||
* |
* {{lang|fr|site web}} – website |
||
* |
* {{lang|fr|lien}}, {{lang|fr|hyperlien}} – website [[hyper-link]] |
||
* |
* {{lang|fr|base de données}} – database |
||
* |
* {{lang|fr|caméra web}}, {{lang|fr|webcaméra}}, short {{lang|fr|webcam}} – webcam |
||
* |
* {{lang|fr|amorcer}}, {{lang|fr|démarrer}}, {{lang|fr|booter}} – to boot |
||
* |
* {{lang|fr|redémarrer}}, {{lang|fr|rebooter}} – to reboot |
||
* |
* {{lang|fr|arrêter}}, {{lang|fr|éteindre}} – to shut down |
||
* |
* {{lang|fr|amorçable}}, {{lang|fr|bootable}} – bootable |
||
* |
* {{lang|fr|surfréquençage}}, {{lang|fr|surcadençage}}, overclocking – overclocking |
||
* |
* {{lang|fr|refroidissement à l'eau}} – watercooling |
||
* |
* {{lang|fr|tuning PC}} – [[case modding]] |
||
=== German === |
=== German === |
||
{{see also|Denglisch}} |
{{see also|Denglisch}} |
||
In [[German language|German]], English words are very often used as well: |
In [[German language|German]], English words are very often used as well: |
||
* nouns: Computer, Website, Software, E-Mail, Blog |
* nouns: {{lang|de|Computer}}, {{lang|de|Website}}, {{lang|de|Software}}, {{lang|de|E-Mail}}, {{lang|de|Blog}} |
||
* verbs: downloaden, booten, crashen |
* verbs: {{lang|de|downloaden}}, {{lang|de|booten}}, {{lang|de|crashen}} |
||
=== Japanese === |
=== Japanese === |
||
{{See also|Japanese input methods}} |
{{See also|Japanese input methods}} |
||
[[Japanese language|Japanese]] uses the [[katakana]] alphabet for foreign |
[[Japanese language|Japanese]] uses the [[katakana]] alphabet for foreign loanwords, a wide variety of which are in use today. English computing terms remain prevalent in modern Japanese vocabulary. |
||
* |
* コンピューター ({{lang|ja-latn|konpyūtā}}) - [[computer]] |
||
* コーダー ( |
* コーダー ({{lang|ja-latn|kōdā}}) - [[Programmer|coder]] |
||
* コーデック |
* コーデック ({{lang|ja-latn|kōdekku}}) - [[codec]] |
||
* |
* ダウンロード ({{lang|ja-latn|daunrōdo}}) - [[download]] |
||
* リンク ({{lang|ja-latn|rinku}}) - [[Hyperlink|link]] |
|||
Utilizing a [[Computer keyboard|keyboard]] layout suitable for [[romanization of Japanese]], a user may type in the [[Latin script]] in order to display Japanese, inclusive of [[hiragana]], [[katakana]], and [[Japanese kanji]].<blockquote>Usually when writing in Japanese on a computer keyboard, the text is input in roman transcription, optionally according to [[Hepburn romanization|Hepburn]], [[Kunrei-shiki romanization|Kunrei]], or [[Nihon-shiki romanization|Nippon romanization]]; the common Japanese word processing programs allow for all three. Long vowels are input according to how they are written in kana; for example, a long ''o'' is input as ''ou'', instead of an ''o'' with a circumflex or macron (''ô'' or ''ō''). As letters are keyed in, they are automatically converted, as specified, into either hiragana or katakana. And these kana phrases are in turn converted, as desired, into kanji.<ref>{{Cite web|url=https://www.hadamitzky.de/english/lp_romanization_sys.htm|title=Romanization systems|website=www.hadamitzky.de|access-date=2019-05-15}}</ref></blockquote> |
Utilizing a [[Computer keyboard|keyboard]] layout suitable for [[romanization of Japanese]], a user may type in the [[Latin script]] in order to display Japanese, inclusive of [[hiragana]], [[katakana]], and [[Japanese kanji]].<blockquote>Usually when writing in Japanese on a computer keyboard, the text is input in roman transcription, optionally according to [[Hepburn romanization|Hepburn]], [[Kunrei-shiki romanization|Kunrei]], or [[Nihon-shiki romanization|Nippon romanization]]; the common Japanese word processing programs allow for all three. Long vowels are input according to how they are written in kana; for example, a long ''o'' is input as ''ou'', instead of an ''o'' with a circumflex or macron (''ô'' or ''ō''). As letters are keyed in, they are automatically converted, as specified, into either hiragana or katakana. And these kana phrases are in turn converted, as desired, into kanji.<ref>{{Cite web|url=https://www.hadamitzky.de/english/lp_romanization_sys.htm|title=Romanization systems|website=www.hadamitzky.de|access-date=2019-05-15}}</ref></blockquote> |
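The first conversion step described in the quotation (Latin letters keyed in and turned into kana) can be sketched as a greedy longest-match lookup. This is only an illustration: the table `ROMAJI_TO_HIRAGANA` and the function `romaji_to_hiragana` are hypothetical names, the table covers a handful of syllables, and the later kana-to-kanji step is not modelled at all.

```python
# Toy romaji-to-hiragana converter. The table is a tiny illustrative subset,
# not the mapping used by any real Japanese input method.
ROMAJI_TO_HIRAGANA = {
    "a": "あ", "i": "い", "u": "う", "e": "え", "o": "お",
    "ka": "か", "ki": "き", "ku": "く", "ke": "け", "ko": "こ",
    "ta": "た", "te": "て", "to": "と",
    "ni": "に", "ho": "ほ",
    "kyo": "きょ",
    "n": "ん",
}


def romaji_to_hiragana(text: str) -> str:
    """Convert Latin-script input to hiragana by greedy longest match."""
    out = []
    i = 0
    while i < len(text):
        # Try the longest candidate first so that "kyo" wins over "k" + "yo".
        for length in (3, 2, 1):
            chunk = text[i:i + length]
            if chunk in ROMAJI_TO_HIRAGANA:
                out.append(ROMAJI_TO_HIRAGANA[chunk])
                i += len(chunk)
                break
        else:
            # Unknown input is passed through unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)


# The long "o" is typed as "ou", as the quotation describes.
print(romaji_to_hiragana("toukyou"))  # とうきょう
```

A real input method would also handle doubled consonants, the ambiguity of "n", katakana output, and the dictionary-based conversion to kanji.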
||
=== Icelandic === |
=== Icelandic === |
||
The [[Icelandic language]] has [[Icelandic vocabulary#Modern neologisms|its own vocabulary of scientific terms]] |
The [[Icelandic language]] has [[Icelandic vocabulary#Modern neologisms|its own vocabulary of scientific terms]]. Still, English loans exist, and are mostly used in casual conversation, whereas the Icelandic words might be longer or not as widespread. |
||
=== Norwegian === |
=== Norwegian === |
||
It's quite common to use English words |
It is quite common to use English words with regard to computing in all Scandinavian languages. |
||
nouns: {{lang|no| |
nouns: {{lang|no|mail}} (referring to e-mail), {{lang|no|software}}, {{lang|no|blogg}} (from "blog"), {{lang|no|spam}} |
||
verbs: å boote, å spamme, å blogge |
verbs: {{lang|no|å boote}}, {{lang|no|å spamme}}, {{lang|no|å blogge}} |
||
=== Polish === |
=== Polish === |
||
[[Polish language]] |
[[Polish language|Polish]] terminology derived from English: |
||
* ''[[wikt:dżojstik|dżojstik]]'': [[joystick]]<ref>{{cite web|url=http://sjp.pwn.pl/slownik/2562347/joystick|title=dżojstik|publisher=[[Polish Scientific Publishers PWN]]|work=[[Słownik języka polskiego]]|accessdate=26 September 2012}}</ref> |
* ''[[wikt:dżojstik|dżojstik]]'': [[joystick]]<ref>{{cite web|url=http://sjp.pwn.pl/slownik/2562347/joystick|title=dżojstik|publisher=[[Polish Scientific Publishers PWN]]|work=[[Słownik języka polskiego]]|accessdate=26 September 2012}}</ref> |
||
* ''[[:pl:wikt:kartridż|kartrydż, kartridż]]'': [[wikt:cartridge|cartridge]] |
* ''[[:pl:wikt:kartridż|kartrydż, kartridż]]'': [[wikt:cartridge|cartridge]] |
||
Line 79: | Line 87: | ||
=== Russian === |
=== Russian === |
||
* [[History of computer hardware in |
* [[History of computer hardware in Eastern Bloc countries]] |
||
* [[Computer Russification]] |
* [[Computer Russification]] |
||
Line 86: | Line 94: | ||
;Frequently untranslated, and their Spanish equivalent |
;Frequently untranslated, and their Spanish equivalent |
||
*email: |
*email: {{lang|es|correo electrónico}} |
||
*mouse (only in Latin America): |
*mouse (only in Latin America): {{lang|es|ratón}} (mainly in Spain) |
||
*messenger: |
*messenger: {{lang|es|mensajero}} (only in Spain) |
||
*webcam: |
*webcam: {{lang|es|cámara web}}, {{lang|es|webcam}} |
||
*website: |
*website: {{lang|es|página web}}, {{lang|es|sitio web}} |
||
*blog: |
*blog: {{lang|es|bitácora}}, {{lang|es|blog}} |
||
*ban/banned: |
*ban/banned: {{lang|es|baneado}} (Latin America), {{lang|es|vetar}}, {{lang|es|vetado}} |
||
*web: |
*web: {{lang|es|red}}, {{lang|es|web}} |
||
;Not translated |
;Not translated |
||
*flog |
*flog |
||
*descargar |
|||
;Undecided |
;Undecided |
||
Many computing terms in Spanish share a common root with their English counterpart. In these cases, both terms are understood, but the Spanish is preferred for formal use: |
Many computing terms in Spanish share a common root with their English counterpart. In these cases, both terms are understood, but the Spanish is preferred for formal use: |
||
*link vs |
*link vs {{lang|es|enlace}} or {{lang|es|vínculo}} |
||
*net vs |
*net vs {{lang|es|red}} |
||
== Character encoding == |
== Character encoding == |
||
Early computer software and hardware had very little support for character sets other than the Latin alphabet. As a result, it was difficult or impossible to represent languages based on other scripts. The [[ASCII]] character encoding, created in the 1960s, supports only 128 different characters in a 7-bit format. With the use of additional software it was possible to provide support for some languages, for instance those based on the Cyrillic alphabet. However, languages written in complex or [[Logogram|logographic]] scripts, such as Chinese or Japanese, need more characters than the 256-character limit imposed by 8-bit character encodings. Some computers created in the former [[Soviet Union|USSR]] had native support for the Cyrillic alphabet. |
|||
The |
The widespread adoption of [[Unicode]], and [[UTF-8]] on the web, resolved most of these historical limitations. ASCII remains the de facto standard for command interpreters, [[programming language]]s and text-based communication protocols, but it is slowly dying out. |
||
* [[Mojibake]] – Text presented as "unreadable" when software fails due to character encoding issues. |
* [[Mojibake]] – Text presented as "unreadable" when software fails due to character encoding issues. |
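The limits described in this section can be checked with a short sketch using Python's built-in codecs. The sample word is the Bulgarian term for "computer" cited earlier in this article; the choice of Latin-1 as the "wrong" decoder is an assumption made purely for illustration.

```python
text_latin = "computer"
text_cyrillic = "компютър"  # Bulgarian for "computer"

# ASCII is a 7-bit code, so every character it can hold has a code point below 128.
fits_in_ascii = all(ord(ch) < 128 for ch in text_latin)

# Cyrillic letters lie outside that range, so encoding them as ASCII fails.
try:
    text_cyrillic.encode("ascii")
    cyrillic_is_ascii = True
except UnicodeEncodeError:
    cyrillic_is_ascii = False

# UTF-8 represents each of these Cyrillic letters with two bytes.
utf8_bytes = text_cyrillic.encode("utf-8")
print(len(text_cyrillic), len(utf8_bytes))  # 8 16

# Mojibake: the same bytes decoded with the wrong 8-bit encoding
# yield one unrelated character per byte instead of the original text.
mojibake = utf8_bytes.decode("latin-1")

# Decoding with the correct encoding restores the original word.
round_trip = utf8_bytes.decode("utf-8")
```

The byte sequence is identical in both decoding steps; only the interpretation differs, which is why mojibake is an encoding-label problem rather than data loss.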
||
Line 112: | Line 121: | ||
== Programming language == |
== Programming language == |
||
{{see also|Non-English-based programming languages}} |
{{see also|Non-English-based programming languages}} |
||
The syntax of most programming languages uses English keywords, and therefore it could be argued some knowledge of English is required in order to use them. However, it is important to recognize all [[programming languages]] are in the class of [[formal languages]]. They are very different from any natural language, including English. |
The syntax of most programming languages uses English keywords, and therefore it could be argued some knowledge of English is required in order to use them. Some studies have shown that programmers who are not native English speakers report English as their biggest obstacle to programming proficiency.<ref>{{cite journal |last1=Ben Idris |first1=Mrwan |last2=Ammar |first2=Hany |date=March 2018 |title=The Correlation between Arabic Student's English Proficiency and Their Computer Programming Ability at the University Level |url=https://www.researchgate.net/publication/324254171 |journal=International Journal of Managing Public Sector Information and Communication Technologies |volume=9 |pages=01–10 |doi=10.5121/ijmpict.2018.9101|doi-access=free }}</ref> However, all [[programming languages]] belong to the class of [[formal languages]] and are very different from any natural language, including English. |
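As a small illustration of the distinction between keywords and the rest of a program, the sketch below uses Python, whose reserved words are English-derived but whose identifiers may use non-Latin letters (Python 3 accepts Unicode identifiers). The Bulgarian function and parameter names are hypothetical examples chosen for this sketch.

```python
import keyword

# The reserved words of the language are fixed and English-derived.
print("while" in keyword.kwlist)     # True
print(keyword.iskeyword("докато"))   # False: the Bulgarian word for "while" is not a keyword


# Identifiers, by contrast, are chosen by the programmer and need not be English.
# площ = "area", ширина = "width", височина = "height"
def площ(ширина, височина):
    return ширина * височина


print(площ(3, 4))  # 12
```

Only the small fixed vocabulary (`def`, `return`, `while`, and so on) has to be memorised; it behaves as formal notation rather than as English prose.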
||
Some examples of non-English programming languages: |
Some examples of non-English programming languages: |
||
Line 125: | Line 134: | ||
* [[Languages of India|Indian Languages]]: [[Hindawi Programming System]] |
* [[Languages of India|Indian Languages]]: [[Hindawi Programming System]] |
||
* [[Russian language|Russian]]: [[Glagol]] |
* [[Russian language|Russian]]: [[Glagol]] |
||
* [[Spanish language|Spanish]]: [[Lexico (programming language)|Lexico]] |
|||
* [[Portuguese language|Portuguese]]: [[Portugol]] |
* [[Portuguese language|Portuguese]]: [[Portugol]] |
||
== Communication protocols == |
== Communication protocols == |
||
Many application protocols use text strings for requests and parameters, rather than the binary values commonly used in lower layer protocols. The request strings are generally based on English words, although in some cases the strings are contractions or acronyms of English expressions, which |
Many application protocols use text strings for requests and parameters, rather than the binary values commonly used in lower layer protocols. The request strings are generally based on English words, although in some cases the strings are contractions or acronyms of English expressions, which can render them somewhat cryptic to anyone not familiar with the protocol, whatever their proficiency in English. Nevertheless, the use of word-like strings is a convenient mnemonic device that allows a person skilled in the art (and with sufficient knowledge of English) to execute the protocol manually from a keyboard, usually for the purpose of finding a problem with the service. |
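To make the idea of word-like request strings concrete, the sketch below builds the text of a request by hand. HTTP/1.1 is used here only as a familiar text-based protocol chosen for illustration; the helper `build_http_get` and the host name `example.org` are hypothetical, and no network connection is made.

```python
def build_http_get(host: str, path: str = "/") -> bytes:
    """Assemble the text of a minimal HTTP/1.1 GET request."""
    lines = [
        f"GET {path} HTTP/1.1",  # request line: an English verb, a path, a version
        f"Host: {host}",         # header names are English words too
        "Connection: close",
        "",                      # an empty line ends the header block
        "",
    ]
    # Lines are separated by CRLF and sent as plain ASCII text.
    return "\r\n".join(lines).encode("ascii")


request = build_http_get("example.org")
print(request.decode("ascii"))
```

Because the request is ordinary text, the same lines could be typed manually into a raw connection when diagnosing a service, which is the use the paragraph above describes.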
||
Examples: |
Examples: |
||
Line 160: | Line 168: | ||
==== Native speakers ==== |
==== Native speakers ==== |
||
English-language users appear to be a plurality of web users, consistently cited as around one-third of the overall (near one billion). This reflects the relative affluence of English-speaking countries and high Internet penetration rates in them. This lead may be eroding due mainly to a rapid increase of Chinese users.<ref> |
English-language users appear to be a plurality of web users, consistently cited as around one-third of the overall (near one billion). This reflects the relative affluence of English-speaking countries and high Internet penetration rates in them. This lead may be eroding due mainly to a rapid increase of Chinese users.<ref>{{Cite web |last=Johnson |first=Bobbie |date=August 16, 2005 |title=English grip on internet being eroded |url=http://blogs.guardian.co.uk/online/archives/2005/08/16/english_grip_on_internet_being_eroded.html |url-status=dead |archive-url=https://web.archive.org/web/20050910205352/http://blogs.guardian.co.uk/online/archives/2005/08/16/english_grip_on_internet_being_eroded.html |archive-date=Sep 10, 2005 |website=Onlineblog |publisher=Guardian Unlimited}}</ref> |
||
First-language users among other relatively affluent countries appear generally stable, the two largest being German and Japanese, which each have between 5% and 10% of the overall share. |
First-language users among other relatively affluent countries appear generally stable, the two largest being German and Japanese, which each have between 5% and 10% of the overall share. |
||
=== World Wide Web content === |
=== World Wide Web content === |
||
One widely quoted figure for the amount of [[web content]] in English is 80%.<ref> |
One widely quoted figure for the amount of [[web content]] in English is 80%.<ref>{{Cite web |title=What percentage of the internet is in English? |url=http://www.englishenglish.com/english_facts_8.htm |url-status=dead |archive-url=https://web.archive.org/web/20200128201340/http://www.englishenglish.com/english_facts_8.htm |archive-date=Jan 28, 2020 |website=English English}}</ref> Other sources show figures five to fifteen points lower, though still well over 50%.<ref>{{cite web|title=Usage of content languages for websites|url=http://w3techs.com/technologies/overview/content_language/all|work=W3Techs |accessdate=30 December 2011 |url-status=live |archive-url=https://archive.today/20170530202731/https://w3techs.com/technologies/overview/content_language/all |archive-date= 30 May 2017 }}</ref><ref>{{cite web |url=http://www.verisign.com/press_releases/pr/page_029135.html |title=VeriSign Announces Plan to Further Enhance .com and .net Global Internet Constellation Sites with Regional Resolution Servers |website=VeriSign |date=April 6, 2005 |accessdate=31 January 2014 |url-status=dead |archiveurl=https://web.archive.org/web/20051224063055/http://www.verisign.com/press_releases/pr/page_029135.html |archivedate=24 December 2005 }}</ref><ref>{{Cite web |last=Bowen |first=Ted Smalley |date=November 21, 2001 |title=English could snowball on Net |url=http://www.trnmag.com/Stories/2001/112101/English_could_snowball_on_Net_112101.html |url-status=live |archive-url=https://web.archive.org/web/20230924160226/http://trnmag.com/Stories/2001/112101/English_could_snowball_on_Net_112101.html |archive-date=Sep 24, 2023 |website=Technology Research News}}</ref> There are two notable facts about these percentages: |
||
The English web content is greater than the number of first-language English users by as much as 2 to 1.{{Citation needed|date=May 2009}} |
The English web content is greater than the number of first-language English users by as much as 2 to 1.{{Citation needed|date=May 2009}} |
||
Line 171: | Line 179: | ||
Given the enormous lead it already enjoys and its increasing use as a ''lingua franca'' in other spheres, English web content may continue to dominate even as English first-language Internet users decline. This is a classic [[positive feedback loop]]: new Internet users find it helpful to learn English and employ it online, thus reinforcing the language's prestige and forcing subsequent new users to learn English as well. |
Given the enormous lead it already enjoys and its increasing use as a ''lingua franca'' in other spheres, English web content may continue to dominate even as English first-language Internet users decline. This is a classic [[positive feedback loop]]: new Internet users find it helpful to learn English and employ it online, thus reinforcing the language's prestige and forcing subsequent new users to learn English as well. |
||
Certain other factors (some predating the medium's appearance) have propelled English into a majority web-content position. Most notable in this regard is the tendency for researchers and professionals to publish in English to ensure maximum exposure. The largest database of medical bibliographical information, for example, shows English was the majority language choice for the past forty years and its share has continually increased over the same period.<ref> |
Certain other factors (some predating the medium's appearance) have propelled English into a majority web-content position. Most notable in this regard is the tendency for researchers and professionals to publish in English to ensure maximum exposure. The largest database of medical bibliographical information, for example, shows English was the majority language choice for the past forty years and its share has continually increased over the same period.<ref>{{Cite journal |last1=Loria |first1=Alvar |last2=Arroyo |first2=Pedro |date=July 2005 |title=Language and country preponderance trends in MEDLINE and its causes |journal=J Med Libr Assoc |volume=93 |issue=3 |pages=381–385 |pmid=16059428 |pmc=1175804 }}</ref> |
||
The fact that non-Anglophones regularly publish in English only reinforces the language's dominance. English has a rich technical vocabulary (largely because native and non-native speakers alike use it to communicate technical ideas) and many IT and technical professionals use English regardless of country of origin ([[Linus Torvalds]], for instance, comments his code in English, despite being from Finland and having Swedish as his first language). |
The fact that non-Anglophones regularly publish in English only reinforces the language's dominance. English has a rich technical vocabulary (largely because native and non-native speakers alike use it to communicate technical ideas) and many IT and technical professionals use English regardless of country of origin ([[Linus Torvalds]], for instance, comments his code in English, despite being from Finland and having Swedish as his first language). |
||
Line 178: | Line 186: | ||
{{reflist}} |
{{reflist}} |
||
[[Category:English language|computing in]] |
|||
[[Category:English language]] |
|||
[[Category:Computing and society]] |
[[Category:Computing and society]] |
||
[[Category:Internet culture]] |
[[Category:Internet culture]] |
||
[[Category:Natural language and computing]] |
[[Category:Natural language and computing]] |
||
[[Category:English as a global language]] |
[[Category:English as a global language|computing]] |
Latest revision as of 20:04, 17 September 2024
The English language is sometimes described as the lingua franca of computing. In comparison to other sciences, where Latin and Greek are often the principal sources of vocabulary, computer science borrows more extensively from English. In the past, due to the technical limitations of early computers and the lack of international standardization on the Internet, computer users were limited to using English and the Latin alphabet. This historical limitation is less significant today, due to innovations in internet infrastructure and increases in computer speed. Most software products are localized in numerous languages, and the invention of the Unicode character encoding has resolved problems with non-Latin alphabets. Some limitations have been lifted since June 2003,[1] for example for domain names, which previously allowed only ASCII characters.
English is seen as having this role due to the prominence of the United States and the United Kingdom, both English-speaking countries, in the development and popularization of computer systems, computer networks, software and information technology.
History
Computer science has an ultimately mathematical foundation which was laid by non-English speaking cultures. The first mathematically literate societies in the Ancient Near East recorded methods for solving mathematical problems in steps.[2] The word 'algorithm' comes from the name of a famous medieval Arabic mathematician who contributed to the spread of Hindu-Arabic numerals, al-Khwārizmī.[3] The first systematic treatment of binary numbers was completed by Leibniz,[4] a German mathematician. Leibniz wrote his treatise on the topic in French, the lingua franca of science at the time,[5] and innovations in what is now called computer hardware occurred outside of an English tradition, with Pascal inventing the first mechanical calculator, and Leibniz improving it.[6]
Interest in building computing machines first emerged in the 19th century, with the coming of the Second Industrial Revolution. The origins of computing in an English tradition began in this era with Charles Babbage's conceptualization of the Difference and Analytical Engine, George Boole's work on logic, and Herman Hollerith's invention of the tabulating machine for specific use in the 1890 United States census.[7] At the time, Britain enjoyed near complete hegemonic power in the West at the height of the Pax Britannica, and America was experiencing an economic and demographic boom. By the time of the interwar period in the early 20th century, the most important mathematics related to the development of computing was being done in English, which was also beginning to become the new lingua franca of science.[8]
Influence on other languages
The computing terminology of many languages borrows from English. Some language communities actively resist this trend, and in other cases English is used extensively and more directly. This section gives some examples of the use of English loans in other languages and mentions any notable differences.
Bulgarian
Both English and Russian have had influence over Bulgarian computing vocabulary. In many cases, however, the borrowed word is translated into Bulgarian rather than transcribed phonetically from English. Combined with the use of Cyrillic this can make it difficult to recognize loanwords. For example, the Bulgarian term for motherboard is дънна платка (IPA: [ˈdɤnna ˈplatka]), literally "bottom board".
- компютър (IPA: [komˈpjutɤr]) – computer
- твърд диск (IPA: [tvɤrt disk]) – hard disk (i.e., hard drive)
- дискета (IPA: [diˈskɛtɐ]) – floppy disk; like the French disquette
- уебсайт (IPA: [ˈwɛbsɐjt]) – web site; but also интернет страница (IPA: [ˈintɛrnɛt ˈstranit͡sɐ]), literally "internet page"
Faroese
The Faroese language has a sparse scientific vocabulary based on the language itself. Many Faroese scientific words are borrowed or modified versions of equivalents from other languages, especially the Nordic languages and English. The vocabulary is constantly evolving, so new words often die out, and only a few survive and become widely used. Successful examples include "telda" (computer), "kurla" (at sign) and "ambætari" (server).[9]
French
In French, there are some generally accepted English loanwords, but there is also a distinct effort to avoid them. In France, the Académie française is responsible for the standardisation of the language and often coins new technological terms. Some of them are accepted in practice, but oftentimes the English loans remain predominant. In Quebec, the Office québécois de la langue française has a similar function.
- email/mail (in Europe); courriel (mainly in French-speaking Canada, but increasingly used in French-speaking Europe); mél. (only used as an abbreviation, similar to "tél."[10]); more formally courrier électronique
- pourriel – spam
- hameçonnage, phishing – phishing
- télécharger – to download
- site web – website
- lien, hyperlien – link, hyperlink
- base de données – database
- caméra web, webcaméra, short webcam – webcam
- amorcer, démarrer, booter – to boot
- redémarrer, rebooter – to reboot
- arrêter, éteindre – to shut down
- amorçable, bootable – bootable
- surfréquençage, surcadençage, overclocking – overclocking
- refroidissement à l'eau – watercooling
- tuning PC – case modding
German
In German, English words are very often used as well:
- nouns: Computer, Website, Software, E-Mail, Blog
- verbs: downloaden, booten, crashen
Japanese
Japanese uses the katakana syllabary for foreign loanwords, a wide variety of which are in use today. English computing terms remain prevalent in modern Japanese vocabulary.
- コンピューター (konpyūtā) - computer
- コーダー (kōdā) - coder
- コーデック (kōdekku) - codec
- ダウンロード (daunrōdo) - download
- リンク (rinku) - link
With a keyboard layout suited to romanized Japanese, a user can type in the Latin script and have the text displayed in Japanese, including hiragana, katakana, and kanji.
Usually when writing in Japanese on a computer keyboard, the text is input in roman transcription, optionally according to Hepburn, Kunrei, or Nippon romanization; the common Japanese word processing programs allow for all three. Long vowels are input according to how they are written in kana; for example, a long o is input as ou, instead of an o with a circumflex or macron (ô or ō). As letters are keyed in, they are automatically converted, as specified, into either hiragana or katakana. And these kana phrases are in turn converted, as desired, into kanji.[11]
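The conversion from typed Latin letters to kana can be illustrated with a minimal Python sketch. It is not how any particular input method is implemented: the lookup table is a hypothetical, deliberately tiny sample, and the function names are invented for illustration. It relies only on the fact that, within Unicode, each hiragana character in the range U+3041 to U+3096 has a katakana counterpart 0x60 code points higher.

```python
# Hypothetical, incomplete sample table; a real input method covers every syllable.
ROMAJI_TO_HIRAGANA = {
    "kyo": "きょ",
    "to": "と",
    "ri": "り",
    "ku": "く",
    "n": "ん",
    "u": "う",
}


def romaji_to_hiragana(text: str) -> str:
    """Convert romaji to hiragana by greedy longest-match lookup."""
    out = []
    i = 0
    while i < len(text):
        # Try the longest candidate first so "kyo" wins over shorter matches.
        for size in (3, 2, 1):
            chunk = text[i:i + size]
            if chunk in ROMAJI_TO_HIRAGANA:
                out.append(ROMAJI_TO_HIRAGANA[chunk])
                i += len(chunk)
                break
        else:
            raise ValueError(f"no kana for input at position {i}: {text[i:]!r}")
    return "".join(out)


def hiragana_to_katakana(text: str) -> str:
    """Shift hiragana (U+3041..U+3096) to the matching katakana code points."""
    return "".join(
        chr(ord(ch) + 0x60) if "\u3041" <= ch <= "\u3096" else ch
        for ch in text
    )


# The long "o" sounds are typed as "ou", as described above.
print(romaji_to_hiragana("toukyou"))
# A loanword is typed in romaji and then shown in katakana.
print(hiragana_to_katakana(romaji_to_hiragana("rinku")))
```

The final step described above, turning kana phrases into kanji, needs a dictionary and user selection and is left out of this sketch.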
Icelandic
The Icelandic language has its own vocabulary of scientific terms. Still, English loans exist, and are mostly used in casual conversation, whereas the Icelandic words might be longer or not as widespread.
Norwegian
It is quite common to use English words for computing terms in all Scandinavian languages.
- nouns: mail (referring to e-mail), software, blogg (from "blog"), spam
- verbs: å boote, å spamme, å blogge
Polish
Polish terminology derived from English:
Russian
Spanish
English influence on the software industry and the internet in Latin America has contributed many borrowings to the Castilian lexicon.
- Frequently untranslated, and their Spanish equivalent
- email: correo electrónico
- mouse (only in Latin America): ratón (mainly in Spain)
- messenger: mensajero (only in Spain)
- webcam: cámara web, webcam
- website: página web, sitio web
- blog: bitácora, blog
- ban/banned: baneado (Latin America), vetar, vetado
- web: red, web
- Not translated
- flog
- descargar
- Undecided
Many computing terms in Spanish share a common root with their English counterpart. In these cases, both terms are understood, but the Spanish is preferred for formal use:
- link vs enlace or vínculo
- net vs red
Character encoding
Early computer software and hardware had very little support for character sets other than the Latin alphabet. As a result, it was difficult or impossible to represent languages based on other scripts. The ASCII character encoding, created in the 1960s, supported only 128 different characters in a 7-bit format. With the use of additional software it was possible to provide support for some languages, for instance those based on the Cyrillic alphabet. However, complex-script and logographic languages like Chinese or Japanese need more characters than the 256 limit imposed by 8-bit character encodings. Some computers created in the former USSR had native support for the Cyrillic alphabet.
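These limits can be observed with Python's built-in codecs. The sketch below is only illustrative: KOI8-R is used here as one example of an 8-bit Cyrillic encoding (an assumption of this example, not an encoding named above), and the sample word is the Bulgarian дискета from the list earlier in this article.

```python
word = "дискета"  # Bulgarian for "floppy disk"

# ASCII defines only code points 0..127, so Cyrillic letters cannot be encoded.
try:
    word.encode("ascii")
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

# An 8-bit Cyrillic encoding such as KOI8-R stores one byte per letter.
koi8_bytes = word.encode("koi8_r")

# The same 256-entry table has no room for Chinese characters.
try:
    "中".encode("koi8_r")
    chinese_ok = True
except UnicodeEncodeError:
    chinese_ok = False

print(ascii_ok, len(koi8_bytes), chinese_ok)
```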
The widespread adoption of Unicode, and UTF-8 on the web, resolved most of these historical limitations. ASCII remains the de facto standard for command interpreters, programming languages and text-based communication protocols, but it is slowly dying out.
- Mojibake – Text presented as "unreadable" when software fails due to character encoding issues.
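A short Python sketch shows how UTF-8 represents non-Latin text and how mojibake arises when the bytes are read with the wrong decoder. Latin-1 is chosen as the wrong decoder purely for illustration, because it accepts every byte value. The variable names are hypothetical.

```python
word = "компютър"  # Bulgarian for "computer"

# UTF-8 uses two bytes for each of these Cyrillic letters.
utf8_bytes = word.encode("utf-8")

# Decoding the same bytes as Latin-1 yields mojibake: one wrong character per byte.
garbled = utf8_bytes.decode("latin-1")

# No information was lost, so reversing the mistake restores the text.
repaired = garbled.encode("latin-1").decode("utf-8")

print(len(word), len(utf8_bytes), len(garbled))
print(repr(garbled))
print(repaired == word)
```

Because the damage here is a pure misinterpretation of intact bytes, it is reversible. Mojibake becomes permanent only when the garbled text is further transformed or saved through a lossy conversion.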
Programming language
The syntax of most programming languages uses English keywords, so it could be argued that some knowledge of English is required in order to use them. Some studies have shown that programmers who are not native English speakers self-report that English is their biggest obstacle to programming proficiency.[13] However, all programming languages belong to the class of formal languages, and they are very different from any natural language, including English.
Some examples of non-English programming languages:
- Arabic: ARLOGO, قلب
- Bengali: BangaBhasha
- Chinese: Chinese BASIC
- Dutch: Superlogo
- French: LSE, WinDev, Pascal (although the English version is more widespread)
- Hebrew: Hebrew Programming Language
- Icelandic: Fjölnir
- Indian Languages: Hindawi Programming System
- Russian: Glagol
- Portuguese: Portugol
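As a small illustration of English keywords in a mainstream language, the following Python sketch inspects the language's own list of reserved words. Python is used only as an example. The Cyrillic variable name is hypothetical and shows that identifiers may be written in other scripts while the keywords themselves stay fixed.

```python
import keyword

# Python's reserved words are English or English-derived tokens.
print(keyword.kwlist[:5])

# Identifiers may use other scripts, but the keywords cannot be translated.
число = 3          # "number" in Russian; a hypothetical variable name
while число > 0:   # 'while' must be written in its English form
    число -= 1

# 'while' is reserved; its Russian counterpart "пока" is an ordinary name.
print(keyword.iskeyword("while"), keyword.iskeyword("пока"))
```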
Communication protocols
Many application protocols use text strings for requests and parameters, rather than the binary values commonly used in lower layer protocols. The request strings are generally based on English words, although in some cases the strings are contractions or acronyms of English expressions, which can render them somewhat cryptic to anyone not familiar with the protocol, whatever their proficiency in English. Nevertheless, the use of word-like strings is a convenient mnemonic device that allows a person skilled in the art (and with sufficient knowledge of English) to execute the protocol manually from a keyboard, usually for the purpose of finding a problem with the service.
Examples:
- FTP: USER, PASS (password), PASV (passive), PORT, RETR (retrieve), STOR (store), QUIT
- SMTP: HELO (hello), MAIL, RCPT (recipient), DATA, QUIT
- HTTP: GET, PUT, POST, HEAD (headers), DELETE, TRACE, OPTIONS
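The text-based nature of such requests can be sketched in Python without opening a network connection. The helper names and the host `example.org` are hypothetical, and the request is a minimal HTTP/1.1 form rather than everything a real client would send.

```python
def build_http_request(method: str, path: str, host: str) -> bytes:
    """Compose a minimal HTTP/1.1 request as plain ASCII text."""
    lines = [
        f"{method} {path} HTTP/1.1",
        f"Host: {host}",
        "Connection: close",
        "",  # blank line ends the header section
        "",
    ]
    return "\r\n".join(lines).encode("ascii")


def command_word(raw: bytes) -> str:
    """Return the leading English-based command word of a protocol line."""
    return raw.decode("ascii").split(maxsplit=1)[0]


request = build_http_request("GET", "/", "example.org")
print(request.decode("ascii"))
print(command_word(request))
print(command_word(b"RCPT TO:<user@example.org>"))
```

The same bytes could be typed by hand into a raw TCP session, which is what makes these protocols convenient to debug manually.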
Notably, response codes, that is, the strings sent back by the recipient of a request, are typically numeric. For instance, HTTP defines the following codes, some of which have been borrowed by other protocols:
- 200 OK: the request succeeded
- 301 Moved Permanently: redirects the request to a new address
- 404 Not Found: the requested page does not exist
This is because response codes also need to convey unambiguous information, but can have various nuances that the requester may optionally use to vary its subsequent actions. To convey all such "sub-codes" with alphabetic words would be unwieldy, and negate the advantage of using pseudo-English words. Since responses are usually generated by software they do not need to be mnemonic. Numeric codes are also more easily analyzed and categorized when they are processed by software, instead of a human testing the protocol by manual input.
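How software can categorize numeric codes is sketched below in Python, using the standard library's `http.HTTPStatus` enumeration for the reason phrases. The `STATUS_CLASSES` table and the `status_class` function are names invented for this example. The grouping by leading digit follows the HTTP convention.

```python
from http import HTTPStatus

# The first digit of an HTTP status code identifies its general class.
STATUS_CLASSES = {
    1: "informational",
    2: "success",
    3: "redirection",
    4: "client error",
    5: "server error",
}


def status_class(code: int) -> str:
    """Classify a three-digit status code by its leading digit."""
    return STATUS_CLASSES[code // 100]


for code in (200, 301, 404):
    # The English phrase is for humans; software can branch on the number alone.
    print(code, HTTPStatus(code).phrase, "->", status_class(code))
```

A client that does not recognize a specific code can still react sensibly to its class, which would be harder to arrange with word-like response strings.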
Localization
BIOS
Many personal computers have a BIOS chip, displaying text in English during boot time.
Keyboard shortcut
Keyboard shortcuts are usually defined in terms of English keywords such as CTRL+F for find.
English on the World Wide Web
English is the largest language on the World Wide Web, with 27% of internet users.
English speakers
Web user percentages usually focus on raw comparisons of the first language of those who access the web. Just as important is a consideration of second- and foreign-language users; i.e., the first language of a user does not necessarily reflect which language he or she regularly employs when using the web.
Native speakers
English-language users appear to be a plurality of web users, consistently cited as around one-third of the overall (near one billion). This reflects the relative affluence of English-speaking countries and high Internet penetration rates in them. This lead may be eroding due mainly to a rapid increase of Chinese users.[14]
First-language users among other relatively affluent countries appear generally stable, the two largest being German and Japanese, which each have between 5% and 10% of the overall share.
World Wide Web content
One widely quoted figure for the amount of web content in English is 80%.[15] Other sources show figures five to fifteen points lower, though still well over 50%.[16][17][18] There are two notable facts about these percentages:
- The English web content is greater than the number of first-language English users by as much as 2 to 1.[citation needed]
- Given the enormous lead it already enjoys and its increasing use as a lingua franca in other spheres, English web content may continue to dominate even as English first-language Internet users decline. This is a classic positive feedback loop: new Internet users find it helpful to learn English and employ it online, thus reinforcing the language's prestige and forcing subsequent new users to learn English as well.
Certain other factors (some predating the medium's appearance) have propelled English into a majority web-content position. Most notable in this regard is the tendency for researchers and professionals to publish in English to ensure maximum exposure. The largest database of medical bibliographical information, for example, shows English was the majority language choice for the past forty years and its share has continually increased over the same period.[19]
The fact that non-Anglophones regularly publish in English only reinforces the language's dominance. English has a rich technical vocabulary (largely because native and non-native speakers alike use it to communicate technical ideas) and many IT and technical professionals use English regardless of country of origin (Linus Torvalds, for instance, comments his code in English, despite being from Finland and having Swedish as his first language).
Notes
- ^ Mohan, Ram, German IDN, German Language Table Archived 2006-12-18 at the Wayback Machine, March 2003
- ^ Chabert, Jean-Luc (1994). A History of Algorithms. Paris: Springer. p. 7.
- ^ O'Regan, Gerard (2021). A Brief History of Computing. Cham, Switzerland: Springer. p. 29.
- ^ O'Regan, Gerard (2021). A Brief History of Computing. Cham, Switzerland: Springer. p. 38.
- ^ Weber, George (2003). "Top Languages". www.andaman.org. Archived from the original on March 12, 2008.
- ^ O'Regan, Gerard (2021). A Brief History of Computing. Cham, Switzerland: Springer. p. 36.
- ^ O'Regan, Gerard (2021). A Brief History of Computing. Cham, Switzerland: Springer. pp. 35–88.
- ^ Kaplan, Robert (2001). The Dominance of English as a Language of Science. Berlin, New York: Mouton De Gruyter. p. 9.
- ^ "List of Faroese-English-Danish IT words". Archived from the original on May 31, 2013. Retrieved June 29, 2010.
- ^ "Questions de langue" on the Académie Française's website
- ^ "Romanization systems". www.hadamitzky.de. Retrieved May 15, 2019.
- ^ "dżojstik". Słownik języka polskiego. Polish Scientific Publishers PWN. Retrieved September 26, 2012.
- ^ Ben Idris, Mrwan; Ammar, Hany (March 2018). "The Correlation between Arabic Student's English Proficiency and Their Computer Programming Ability at the University Level". International Journal of Managing Public Sector Information and Communication Technologies. 9: 01–10. doi:10.5121/ijmpict.2018.9101.
- ^ Johnson, Bobbie (August 16, 2005). "English grip on internet being eroded". Onlineblog. Guardian Unlimited. Archived from the original on September 10, 2005.
- ^ "What percentage of the internet is in English?". English English. Archived from the original on January 28, 2020.
- ^ "Usage of content languages for websites". W3Techs. Archived from the original on May 30, 2017. Retrieved December 30, 2011.
- ^ "VeriSign Announces Plan to Further Enhance .com and .net Global Internet Constellation Sites with Regional Resolution Servers". VeriSign. April 6, 2005. Archived from the original on December 24, 2005. Retrieved January 31, 2014.
- ^ Bowen, Ted Smalley (November 21, 2001). "English could snowball on Net". Technology Research News. Archived from the original on September 24, 2023.
- ^ Loria, Alvar; Arroyo, Pedro (July 2005). "Language and country preponderance trends in MEDLINE and its causes". J Med Libr Assoc. 93 (3): 381–385. PMC 1175804. PMID 16059428.