Wikipedia:Naming conventions (Indic)
The following is a proposed Wikipedia policy, guideline, or process. The proposal may still be in development, under discussion, or in the process of gathering consensus for adoption. |
These conventions are used to write and edit Indic-related articles, as well as to provide consistent naming of people, places, and historical terms of the Indian subcontinent.
Policy
In naming Indic-related topics and article titles in the English language Wikipedia, please follow these conventions. The term Indic is used here to refer to names and words that originated in one of the languages of the Indian subcontinent, and that are written in Brahmi derived scripts.
Scope
This applies to any articles related to Indic subjects, or words derived from languages written in an Indic script. It also applies to most of the religious works of the Indian religions.
Languages of origin
This convention should be applied to any language spoken in the Indian subcontinent that is written in an Indic script. The major languages are: Assamese, Bengali, Gujarati, Hindi, Kannada, Konkani (when written in Kannada or Devanagari scripts), Malayalam, Marathi, Nepali, Oriya, Pali, Punjabi (when written in Gurmukhi script), Sanskrit, Sinhala, Tamil, Telugu and Tulu (written in Kannada script).
The following languages are of Indic origin, but will usually be written in non-Indic scripts, usually derived from Arabic (see Wikipedia:Naming conventions (Arabic). This convention will normally apply to them only when transliterating from writings in an Indic script: Urdu, Kashmiri, Punjabi (western), Sindhi.
Several languages may be written in Indic scripts, but are not themselves Indic languages. Some aspects of this convention may apply to them, but they may have their own conventions. They include Burmese, Javanese, Khmer, Lao, Thai, Tibetan.
Subject matter covered
This standard is recommended for use in articles in the following fields;
- Religions originating in South Asia, including Hinduism, Sikhism, Jainism. Articles on Buddhism may follow this convention, or that of other languages, depending on the language of origin.
- Mythology of South Asia.
- Historical articles and historical place names of South Asia, including especially history prior to 1800.
Modern names and terms
Personal, organisation, and company names in current and recent usage should generally be romanized according to the nameholder's preference, if that can be established. However, this convention may be appropriately applied to them in certain contexts. These include;
- when it is necessary to accurately or unambiguously transliterate from original text, or to indicate original pronunciation,
- when it is necessary to maintain consistency in the article
Titles and honorifics
In keeping with the neutral nature of Wikipedia and wider policies, honorifics should generally not be included with names, unless they are part of an external quotation.
It is common in devotional texts to precede the name of deities (gods), and sometimes revered individuals with the English term 'lord', the Sanskrit 'sri' or other appellations. These should be omitted in Wikipedia articles and the simple name should be used; e.g. 'Krishna', not 'Sri Krishna', 'Lord Krishna' or other forms.
Sometimes postfixes are found, most commonly the appending of the term 'ji'. This should also be omitted.
Exceptions may be made in cases where the subject is not known except with titles or other honorifics (i.e. when the honorific becomes part of the name), or where they become the best means of disambiguation. See Wikipedia:Manual of Style/Biographies.
Below is a list of commonly used honorifics which should be questioned;
- Lord (English)
- Sri
- Swami
- Sankaracharya
- Raja, Rani (Maharaja, Maharani)
- Guru, Jagadguru
- Deva, Devi (where used as a postfix for deities)
- Ji (postfix)
Redirects should be used for other forms of an individual's name.
Naming and transliteration
Each Indic article should be named according to its primary transliteration, if this can be clearly established. If a primary transliteration cannot be clearly established, then the article should be labelled with a simplified transliteration.
Informal transliterations should not be used for article titles, unless they can be justified as being the primary transliteration used in English. If a transliteration's status as primary cannot be justified or verified, then it should be corrected and replaced by a simplified transliteration. A redirect should then be left in its place. Redirects should also be created for the more common alternative spelling forms.
Where alternative spellings or informal transliterations are in widespread use, a limited number of the most common ones should be listed at the top of the main article.
If the formal transliteration differs from the article title, it should be included once in the introductory section of the article for reference. The name in the original script may also be included, for further reference. See below for advice on formatting these.
Primary transliteration
A word has a primary transliteration if at least 75% of all references in wider English usage have the same transliteration. Primary transliterations may sometimes be less accurate than other transliterations.
Formal transliteration
A formal transliteration preserves all phonetic information in the original script, and should be used where unambiguous transliteration is needed. The preferred formal transliteration is the standardised ISO 15919 transliteration scheme for Indic scripts. This system uses the Latin alphabet (the Roman script) with additional diacritic symbols to represent Indic scripts accurately. For Sanskrit and Pali, IAST is preferred, which differs from ISO 15919 only in a very few points.
See this concise table for aid in transliteration.
Simplified transliteration
A simplified transliteration scheme is used to produce an easy-to-read version of the formal transliteration. The simplified transliteration converts the formal transliteration into a Latin form that complies with Latin phonemic rules. This means that Indic transliteration symbols that are not part of the standard Latin alphabet are merged with the closest Latin equivalent. The simplified transliteration is also included with the formal transliteration at the transliteration transliteration table .
The formal transliteration can be used for all Wikis written in Latin based scripts. The simplified transliteration however may be suitable only for the English Wikipedia.
Formatting
Use the {{lang}} tag to mark non-English strings. When giving a term in its native script, provide the ISO 639-2 code (if unavailable, ISO 639-3) to identify the language. Example:
- {{lang|ta|தமிழ்}}, {{lang|hi|हिन्दी}}
- தமிழ், हिन्दी
Giving a term in ISO 15919 transliteration, simply add "-Latn" (the ISO 15924 code for the Latin alphabet) to the language code:
- ''{{lang|ta-Latn|tamiḻ}}, {{lang|hi-Latn|hindī}}''
- tamiḻ, hindī
Use {{IAST}} only for Sanskrit terms:
- {{lang|sa|संस्कृतम्}} ''{{IAST|saṃskṛtam}}''
- संस्कृतम् saṃskṛtam
Preferred format for introducing the article subject
Ideally, when introducing an article that is covered by this policy, IPA transcriptions (with audio file if possible) and transliterations of the Indic script should be included. The format is:
- Simplified Transliteration (English IPA text; Language: [Indic Text] Error: {{Lang}}: unrecognized language tag: lang (help), [ISO Transliteration] Error: {{Transliteration}}: unrecognized language / script code: lang (help), Template:IPA-lang) ...
A special {{indic}} template has been made to take care of formatting issues.
Use this format when you have the original script text, transliteration, IPA and audio pronunciation file.
'''Simplified Transliteration''' ({{indic | lang=Language code | defaultipa=English IPA text | defaultaudio=Audio file.ogg | indic=Indic Text | trans=ISO Transliteration | indicipa=Indic IPA text | indicaudio=Audio file.ogg }}) ...
- Example
'''Sikhism''' ({{indic|lang=pa|defaultipa='siːkɪz(ə)m|defaultaudio=Seekism.ogg|indic=ਸਿੱਖੀ|trans=sikkhī|indicipa='sɪk.kʰiː| indicaudio=Sikkhi.ogg}}) is a...
- Sikhism ('siːkɪz(ə)m; Punjabi: ਸਿੱਖੀ, sikkhī, IPA: ['sɪk.kʰiː] ) is a...
Without audio
Use this when you have the original script text, transliteration and IPA but do not have the audio pronunciation. This is likely to be the most used format.
'''Simplified Transliteration''' ({{indic | lang=Language code | defaultipa=English IPA text | indic=Indic Text | trans=ISO Transliteration | indicipa=Indic IPA text }}) ...
- Example
'''Mumbai''' ({{indic|lang=mr|defaultipa=mumbəi|indic=मुंबई|trans=mumbaī}}) is a...
- Mumbai (mumbəi; Marathi: मुंबई, mumbaī) is a...
Other versions
If both the English and Indic pronunciation are the same (likely if the Indic word isn't used in English) then ignore the indicipa
parameter. If you don't have audio files, you can simply leave those parameters out. For full details of what the template can do, see Template:Indic.