Jump to content

Sino-Tibetan languages

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 212.95.8.241 (talk) at 14:04, 28 July 2018 (External classification: included theory from japanese article (translated)). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Sino-Tibetan
Trans-Himalayan
Geographic
distribution
East Asia, Southeast Asia, South Asia, Northeast India
Linguistic classificationOne of the world's primary language families
SubdivisionsSome 40 well-established subgroups, of which those with the most speakers are:
Language codes
ISO 639-2 / 5sit
Linguasphere79- (phylozone)
Glottologsino1245
Major branches of Sino-Tibetan:

The Sino-Tibetan languages, in a few sources also known as Trans-Himalayan, are a family of more than 400 languages spoken in East Asia, Southeast Asia and South Asia. The family is second only to Indo-European in terms of the number of native speakers. The Sino-Tibetan languages with the most native speakers are the varieties of Chinese (1.3 billion), Burmese (33 million), and the Tibetic languages (6 million), but many Sino-Tibetan languages are spoken by small communities in remote mountain areas and as such are poorly documented. Unlike Western linguists, Chinese linguists generally include Kra–Dai and Hmong-Mien languages within Sino-Tibetan.

Several low-level subgroups have been securely reconstructed, but reconstruction of a proto-language for the family as a whole is still at an early stage, so the higher-level structure of Sino-Tibetan remains unclear. Although the family is traditionally presented as divided into Sinitic (i.e. Chinese) and Tibeto-Burman branches, a common origin of the non-Sinitic languages has never been demonstrated. Several links to other language families have been proposed, but none has broad acceptance.

History

A genetic relationship between Chinese, Tibetan, Burmese and other languages was first proposed in the early 19th century and is now broadly accepted. The initial focus on languages of civilizations with long literary traditions has been broadened to include less widely spoken languages, some of which have only recently, or never, been written. However, the reconstruction of the family is much less developed than for families such as Indo-European or Austroasiatic. Difficulties have included the great diversity of the languages, the lack of inflection in many of them, and the effects of language contact. In addition, many of the smaller languages are spoken in mountainous areas that are difficult to access, and are often also sensitive border zones.[1]

Early work

During the 18th century, several scholars had noticed parallels between Tibetan and Burmese, both languages with extensive literary traditions. Early in the following century, Brian Houghton Hodgson and others noted that many non-literary languages of the highlands of northeast India and Southeast Asia were also related to these. The name "Tibeto-Burman" was first applied to this group in 1856 by James Richardson Logan, who added Karen in 1858.[2][3] The third volume of the Linguistic Survey of India, edited by Sten Konow, was devoted to the Tibeto-Burman languages of British India.[4]

Studies of the "Indo-Chinese" languages of Southeast Asia from the mid-19th century by Logan and others revealed that they comprised four families: Tibeto-Burman, Tai, Mon–Khmer and Malayo-Polynesian. Julius Klaproth had noted in 1823 that Burmese, Tibetan and Chinese all shared common basic vocabulary but that Thai, Mon, and Vietnamese were quite different.[5][6] Ernst Kuhn envisaged a group with two branches, Chinese-Siamese and Tibeto-Burman.[a] August Conrady called this group Indo-Chinese in his influential 1896 classification, though he had doubts about Karen. Conrady's terminology was widely used, but there was uncertainty regarding his exclusion of Vietnamese. Franz Nikolaus Finck in 1909 placed Karen as a third branch of Chinese-Siamese.[7][8]

Jean Przyluski introduced the French term sino-tibétain as the title of his chapter on the group in Meillet and Cohen's Les langues du monde in 1924.[9][10] He divided them into three groups: Tibeto-Burman, Chinese and Tai,[9] and was uncertain about the affinity of Karen and Hmong–Mien.[11] The English translation "Sino-Tibetan" first appeared in a short note by Przyluski and Luce in 1931.[12]

Shafer and Benedict

In 1935, the anthropologist Alfred Kroeber started the Sino-Tibetan Philology Project, funded by the Works Project Administration and based at the University of California, Berkeley. The project was supervised by Robert Shafer until late 1938, and then by Paul K. Benedict. Under their direction, the staff of 30 non-linguists collated all the available documentation of Sino-Tibetan languages. The result was eight copies of a 15-volume typescript entitled Sino-Tibetan Linguistics.[4][b] This work was never published, but furnished the data for a series of papers by Shafer, as well as Shafer's five-volume Introduction to Sino-Tibetan and Benedict's Sino-Tibetan, a Conspectus.[14]

Benedict completed the manuscript of his work in 1941, but it was not published until 1972.[15] Instead of building the entire family tree, he set out to reconstruct a Proto-Tibeto-Burman language by comparing five major languages, with occasional comparisons with other languages.[16] He reconstructed a two-way distinction on initial consonants based on voicing, with aspiration conditioned by pre-initial consonants that had been retained in Tibetic but lost in many other languages.[17] Thus, Benedict reconstructed the following initials:[18]

TB Tibetan Jingpho Burmese Garo Mizo S'gaw Karen Old Chinese[c]
*k k(h) k(h) ~ g k(h) k(h) ~ g k(h) k(h) *k(h)
*g g g ~ k(h) k g ~ k(h) k k(h) *gh
ŋ ŋ ŋ ŋ ŋ y
*t t(h) t(h) ~ d t(h) t(h) ~ d t(h) t(h) *t(h)
*d d d ~ t(h) t d ~ t(h) d d *dh
*n n n n n n n *n ~ *ń
*p p(h) p(h) ~ b p(h) p(h) ~ b p(h) p(h) *p(h)
*b b b ~ p(h) p b ~ p(h) b b *bh
*m m m m m m m *m
*ts ts(h) ts ~ dz ts(h) s ~ tś(h) s s(h) *ts(h)
*dz dz dz ~ ts ~ ś ts tś(h) f s(h) ?
*s s s s th th θ *s
*z z z ~ ś s s f θ ?
*r r r r r r γ *l
*l l l l l l l *l
*h h h h h *x
*w w w w w w *gjw
*y y y y tś ~ dź z y *dj ~ *zj

Although the initial consonants of cognates tend to have the same place and manner of articulation, voicing and aspiration is often unpredictable.[19] This irregularity was attacked by Roy Andrew Miller,[20] though Benedict's supporters attribute it to the effects of prefixes that have been lost and are often unrecoverable.[21] The issue remains unsolved today.[19] It was cited together with the lack of reconstructable shared morphology, and evidence that much shared lexical material has been borrowed from Chinese into Tibeto-Burman, by Christopher Beckwith, one of the few scholars still arguing that Chinese is not related to Tibeto-Burman.[22][23]

Study of literary languages

Ancient Chinese text on bamboo strips

Old Chinese is by far the oldest recorded Sino-Tibetan language, with inscriptions dating from 1200 BC and a huge body of literature from the first millennium BC, but the Chinese script is not alphabetic. Scholars have sought to reconstruct the phonology of Old Chinese by comparing the obscure descriptions of the sounds of Middle Chinese in medieval dictionaries with phonetic elements in Chinese characters and the rhyming patterns of early poetry. The first complete reconstruction, the Grammata Serica Recensa of Bernard Karlgren, was used by Benedict and Shafer.[24]

Karlgren's reconstruction was somewhat unwieldy, with many sounds having a highly non-uniform distribution. Later scholars have revised it by drawing on a range of other sources.[25] Some proposals were based on cognates in other Sino-Tibetan languages, though workers have also found solely Chinese evidence for them.[26] For example, recent reconstructions of Old Chinese have reduced Karlgren's 15 vowels to a six-vowel system originally suggested by Nicholas Bodman on the basis of comparisons with Tibetic.[27] Similarly, Karlgren's *l has been recast as *r, with a different initial interpreted as *l, matching Tibeto-Burman cognates, but also supported by Chinese transcriptions of foreign names.[28] A growing number of scholars believe that Old Chinese did not use tones, and that the tones of Middle Chinese developed from final consonants. One of these, *-s, is believed to be a suffix, with cognates in other Sino-Tibetan languages.[29]

Old Tibetan text found at Turfan

Tibetic has extensive written records from the adoption of writing by the Tibetan Empire in the mid-7th century. The earliest records of Burmese (such as the 12th-century Myazedi inscription) are more limited, but later an extensive literature developed. Both languages are recorded in alphabetic scripts ultimately derived from the Brahmi script of Ancient India. Most comparative work has used the conservative written forms of these languages, following the dictionaries of Jäschke (Tibetan) and Judson (Burmese), though both contain entries from a wide range of periods.[30]

There are also extensive records in Tangut, the language of the Western Xia (1038–1227). Tangut is recorded in a Chinese-inspired logographic script, whose interpretation presents many difficulties, even though multilingual dictionaries have been found.[31]

Gong Hwang-cherng has compared Old Chinese, Tibetic, Burmese and Tangut in an effort to establish sound correspondences between those languages.[16][32] He found that Tibetic and Burmese /a/ correspond to two Old Chinese vowels, *a and *ə.[33] While this has been considered evidence for a separate Tibeto-Burman subgroup, Hill (2014) finds that Burmese has distinct correspondences for Old Chinese rhymes -ay : *-aj and -i : *-əj, and hence argues that the development *ə > *a occurred independently in Tibetan and Burmese.[34]

Fieldwork

The descriptions of non-literary languages used by Shafer and Benedict were often produced by missionaries and colonial administrators of varying linguistic skill.[35][36] Most of the smaller Sino-Tibetan languages are spoken in inaccessible mountainous areas, many of which are politically or militarily sensitive and thus closed to investigators. Until the 1980s, the best-studied areas were Nepal and northern Thailand.[37] In the 1980s and 1990s, new surveys were published from the Himalayas and southwestern China. Of particular interest was the discovery of a new branch of the family, the Qiangic languages of western Sichuan and adjacent areas.[38][39]

Distribution

Most of the current spread of Sino-Tibetan languages is the result of historical expansions of the three groups with the most speakers – Chinese, Burmese and Tibetic – replacing an unknown number of earlier languages. These groups also have the longest literary traditions of the family. The remaining languages are spoken in mountainous areas, along the southern slopes of the Himalayas, the Southeast Asian Massif and the eastern edge of the Tibetan Plateau.

Contemporary languages

Proportion of first-language speakers of larger branches of Sino-Tibetan[40]

  Chinese (94.28%)
  Lolo-Burmese (3.39%)
  Tibetic (0.44%)
  Karen (0.30%)
  others (1.59%)

By far the largest group are the 1.3 billion speakers of varieties of Chinese, most of whom live in the eastern half of China. The first records of Chinese are oracle bone inscriptions from c. 1200 BC, when Old Chinese was spoken around the middle reaches of the Yellow River.[41] Chinese has since expanded throughout China, forming a family whose diversity has been compared with the Romance languages. Diversity is greater in the rugged terrain of southeast China than in the North China Plain.[42]

Burmese is the national language of Myanmar, and the first language of some 33 million people. Burmese speakers first entered the northern Irrawaddy basin from what is now western Yunnan in the early 9th century, when the Pyu city-states had been weakened by an invasion by Nanzhao.[43] Other Burmish languages are still spoken in Dehong Prefecture in the far west of Yunnan.[44] By the 11th century their Pagan Kingdom had expanded over the whole basin.[43] The oldest texts, such as the Myazedi inscription, date from the early 12th century.[44]

The Tibetic languages are spoken by some 6 million people on the Tibetan Plateau and neighbouring areas in the Himalayas and western Sichuan.[45] They are descended from Old Tibetan, which was originally spoken in the Yarlung Valley before it was spread by the expansion of the Tibetan Empire in the 7th century.[46] Although the empire collapsed in the 9th century, Classical Tibetan remained influential as the liturgical language of Tibetan Buddhism.[47]

The remaining languages are spoken in upland areas. Southernmost are the Karen languages, spoken by 4 million people in the hill country along the Myanmar–Thailand border, with the greatest diversity in the Karen Hills, which are believed to be the homeland of the group.[48] The highlands stretching from northeast India to northern Myanmar contain over 100 high-diverse Sino-Tibetan languages. Other Sino-Tibetan languages are found along the southern slopes of the Himalayas, southwest China and northern Thailand.[49]

Homeland

There have been a range of proposals for the Sino-Tibetan urheimat, reflecting the uncertainty about the classification of the family and its time depth.[50] James Matisoff (1991) places it in the eastern part of the Tibetan plateau around 4000 BC, with the various groups migrating out down the Yellow, Yangtze, Mekong, Salween and Brahmaputra rivers.[51] George van Driem (2005) proposes that Sino-Tibetan originated the Sichuan Basin before 7000 BC, with an early migration into northeast India, and a later migration north of the predecessors of Chinese and Tibetic.[52] Roger Blench and Mark Post (2014) have proposed that the Sino-Tibetan homeland is northeast India, the area of greatest diversity, around 7000 BC.[53]

Roger Blench (2009)[54] proposes that the earliest speakers of Sino-Tibetan were not agriculturalists, since agriculture cannot be reconstructed for Proto-Sino-Tibetan. Rather, early Sino-Tibetan speakers were highly diverse foragers who were living in northeast India up to 10,000 years ago. Tibetic, Sinitic, and Burmic were all the results of later expansions.

Classification

Several low-level branches of the family, particularly Lolo-Burmese, have been securely reconstructed, but in the absence of a secure reconstruction of a Sino-Tibetan proto-language, the higher-level structure of the family remains unclear.[55][56] Thus, a conservative classification of Sino-Tibetan/Tibeto-Burman would posit several dozen small coordinate families and isolates; attempts at subgrouping are either geographic conveniences or hypotheses for further research.

Li (1937)

In a survey in the 1937 Chinese Yearbook, Li Fang-Kuei described the family as consisting of four branches:[57][58]

Indo-Chinese (Sino-Tibetan)

Tai and Miao–Yao were included because they shared isolating typology, tone systems and some vocabulary with Chinese. At the time, tone was considered so fundamental to language that tonal typology could be used as the basis for classification. In the Western scholarly community, these languages are no longer included in Sino-Tibetan, with the similarities attributed to diffusion across the Mainland Southeast Asia linguistic area, especially since Benedict (1942).[58] The exclusions of Vietnamese by Kuhn and of Tai and Miao–Yao by Benedict were vindicated in 1954 when André-Georges Haudricourt demonstrated that the tones of Vietnamese were reflexes of final consonants from Proto-Mon–Khmer.[59]

Many Chinese linguists continue to follow Li's classification.[d][58] However, this arrangement remains problematic. For example, there is disagreement over whether to include the entire Kra–Dai family or just Kam–Tai (Zhuang–Dong excludes the Kra languages), because the Chinese cognates that form the basis of the putative relationship are not found in all branches of the family and have not been reconstructed for the family as a whole. In addition, Kam–Tai itself no longer appears to be a valid node within Kra–Dai.

Benedict (1942)

Benedict overtly excluded Vietnamese (placing it in Mon–Khmer) as well as Hmong–Mien and Kra–Dai (placing them in Austro-Tai). He otherwise retained the outlines of Conrady's Indo-Chinese classification, though putting Karen in an intermediate position:[60][61]

Sino-Tibetan
  • Chinese
  • Tibeto-Karen
    • Karen
    • Tibeto-Burman

Shafer (1955)

Shafer criticized the division of the family into Tibeto-Burman and Sino-Daic branches, which he attributed to the different groups of languages studied by Konow and other scholars in British India on the one hand and by Henri Maspero and other French linguists on the other.[62] He proposed a detailed classification, with six top-level divisions:[63][64][e]

Sino-Tibetan
  • Sinitic
  • Daic
  • Bodic
  • Burmic
  • Baric
  • Karenic

Shafer was sceptical of the inclusion of Daic, but after meeting Maspero in Paris decided to retain it pending a definitive resolution of the question.[65][66]

Matisoff (1978, 2015)

James Matisoff abandoned Benedict's Tibeto-Karen hypothesis:

Sino-Tibetan
  • Chinese
  • Tibeto-Burman

Some more-recent Western scholars, such as Bradley (1997) and La Polla (2003), have retained Matisoff's two primary branches, though differing in the details of Tibeto-Burman. However, Jacques (2006) notes, "comparative work has never been able to put forth evidence for common innovations to all the Tibeto-Burman languages (the Sino-Tibetan languages to the exclusion of Chinese)"[f] and that "it no longer seems justified to treat Chinese as the first branching of the Sino-Tibetan family,"[g] because the morphological divide between Chinese and Tibeto-Burman has been bridged by recent reconstructions of Old Chinese.

The internal structure of Sino-Tibetan has been tentatively revised as the following Stammbaum by Matisoff (2015: xxxii, 1123-1127) in the final print release of the Sino-Tibetan Etymological Dictionary and Thesaurus (STEDT).[67][68] Matisoff (2015: xxxi) acknowledges that the position of Chinese as either a sister branch of Tibeto-Burman or a branch within Tibeto-Burman remains an open question.

Sino-Tibetan

Starostin (1996)

Sergei Starostin proposed that both the Kiranti languages and Chinese are divergent from a "core" Tibeto-Burman of at least Bodish, Lolo-Burmese, Tamangic, Jinghpaw, Kukish, and Karen (other families were not analysed) in a hypothesis called Sino-Kiranti. The proposal takes two forms: that Sinitic and Kiranti are themselves a valid node or that the two are not demonstrably close, so that Sino-Tibetan has three primary branches:

Sino-Tibetan (version 1)
  • Sino-Kiranti
  • Tibeto-Burman
Sino-Tibetan (version 2)
  • Chinese
  • Kiranti
  • Tibeto-Burman

van Driem (1997, 2001)

van Driem, like Shafer, rejects a primary split between Chinese and the rest, suggesting that Chinese owes its traditional privileged place in Sino-Tibetan to historical, typological, and cultural, rather than linguistic, criteria. He calls the entire family "Tibeto-Burman", a name he says has historical primacy,[69] but other linguists who reject a privileged position for Chinese nevertheless continue to call the resulting family "Sino-Tibetan", including Roger Blench.

Like Matisoff, van Driem acknowledges that the relationships of the "Kuki–Naga" languages (Kuki, Mizo, Meitei, etc.), both amongst each other and to the other languages of the family, remain unclear. However, rather than placing them in a geographic grouping, as Matisoff does, van Driem leaves them unclassified. He has proposed several hypotheses, including the reclassification of Chinese to a Sino-Bodic subgroup:

Tibeto-Burman

Van Driem points to two main pieces of evidence establishing a special relationship between Sinitic and Bodic and thus placing Chinese within the Tibeto-Burman family. First, there are a number of parallels between the morphology of Old Chinese and the modern Bodic languages. Second, there is an impressive body of lexical cognates between the Chinese and Bodic languages, represented by the Kirantic language Limbu.[70]

In response, Matisoff notes that the existence of shared lexical material only serves to establish an absolute relationship between two language families, not their relative relationship to one another. Although some cognate sets presented by van Driem are confined to Chinese and Bodic, many others are found in Sino-Tibetan languages generally and thus do not serve as evidence for a special relationship between Chinese and Bodic.[71]

van Driem (2001, 2014)

George van Driem (2001) has also proposed a "fallen leaves" model that lists dozens of well-established low-level groups while remaining agnostic about intermediate groupings of these.[72] In the most recent version (van Driem 2014), 42 groups are identified (with individual languages highlighted in italics):[73]

van Driem (2007) also suggested that the Sino-Tibetan language family be renamed "Trans-Himalayan", which he considers to be more neutral.[74]

Blench and Post (2014)

Roger Blench and Mark W. Post have criticized the applicability of conventional Sino-Tibetan classification schemes to minor languages lacking an extensive written history (unlike Chinese, Tibetic, and Burmese). They find that the evidence for the subclassification or even ST affiliation at all of several minor languages of northeastern India, in particular, is either poor or absent altogether.

While relatively little has been known about the languages of this region up to and including the present time, this has not stopped scholars from proposing that these languages either constitute or fall within some other Tibeto-Burman subgroup. However, in absence of any sort of systematic comparison – whether the data are thought reliable or not – such "subgroupings" are essentially vacuous. The use of pseudo-genetic labels such as "Himalayish" and "Kamarupan" inevitably give an impression of coherence which is at best misleading.

In their view, many such languages would for now be best considered unclassified, or "internal isolates" within the family. They propose a provisional classification of the remaining languages:

Sino-Tibetan

Following that, because they propose that the three best-known branches may actually be much closer related to each other than they are to "minor" Sino-Tibetan languages, Blench and Post argue that "Sino-Tibetan" or "Tibeto-Burman" are inappropriate names for a family whose earliest divergences led to different languages altogether. They support the proposed name "Trans-Himalayan".

Development of dialects and languages

Change in word structure

The phenomenon of drift, proposed by American linguist Edward Sapir, occurred in many languages and dialects in the Sino-Tibetan family. Proto-Chinese and Proto-Tibeto-Burman are both agglutinative languages. The change in Proto-Chinese to Old Chinese around the Shang Dynasty could be found in the Book of songs when the classifications of the noun, verbs, and modifier were all dependent on prefixes such as *s-, *p-, *-k.[75] After the Warring State Period in China, Old Chinese developed and started to use tones as the classification of words.[76] The suffix *-s also presented in the new classification system. The characteristics of Old Chinese were maintained in most dialects of southern China.[77]

The Chinese dialects of Min and Wu which were mainly spoken in southern parts of China, had similarities in pronunciation with reptiles and birds as seen in the Old Tai-Kadai language according to You Rujie's research. The prefixes used for differentiating reptiles and birds in Chinese dialects showed similar features with the Old Tai-Kadai language.[78] The old Tai-Kadai language was mainly used in the Xiangxi and Guizhou areas of China. You believed that these unique prefixes maintained by both the local dialects and the old Tai-Kadai language could be a product of local environmental influence.[79]

Dialects in the Tibeto-Burman language area developed more conservatively; they keep the rules for pronunciation and word structure the same compared to Proto-Tibeto-Burman. The Tibetic languages are classified between fusional and analytic language; the Lolo-Burmese languages are mostly analytic languages, and the Jingpho languages are a mix of an agglutinative and fusional language.[80] The Bodo–Garo and the Kuki-Chin–Naga languages possibly kept some particular characteristics of the putative Proto-Tibeto-Burman language such as agglutination and vowel prefixes. This phenomenon could be that the two language groups were separated early from the Proto-Tibeto-Burman language therefore did not undergo much development. The same happened to Sinitic, where its agglutinative property was kept even when it developed into an analytic language. Old Tibetan and the Qiangic both exhibit consonant clusters caused by the dropping of vowel prefixes, which is believed to be the same structure Proto-Sino-Tibetan had.[81]

Old Burmese and Old Tibetan dropped the vowel prefixes during the dialect acquisition, leaving only Tibeto-Burmese, Jingpho, the Bodo–Garo and Kuki-Chin–Naga languages that kept the vowel form of prefixes. The Lolo-Burmese languages and other languages from the Bodish-Himalayish language group preferred a suffix structure which they inherited from the Tibetan-Qiangic-Lolo-Burmese group. Their similarities could be proven by example like the phonetics of the Tibetic language for "sun": ŋi ma; Achang for "sun": ni3131; the Hakun language for "sun": nɔ55 ma33; and Naxi for "sun": ŋi33 me33. These inherited suffixes were later retained in these languages and became widespread in dialects of Old Tibetan, which caused the usage of the prefix in the modern language to decrease.[82]

According to Dai Qingxia, half of the vocabulary in the Jingpho language are disyllabic as well as most of the nouns of Jingpho. This significant amount of disyllabic words came from the consonant cluster in monosyllabic words and compound words mainly found in the Proto-Tibeto-Burman language.[83] The development of the Sino-Tibetan language had been focused on solving the problem of phoneme rhyme, as well as coordinate the crucial point between monosyllabic morpheme and disyllabic word. Because the Sino-Tibetan language consists of a monosyllabic root, a prefix and suffix are needed for classifying word meaning and point of view (aspect?). The prefix *a- appeared in many Sino-Tibetan dialects to coordinate different morpheme structures. The repetition of a syllable has the same coordination effect.[84]

Change in tone

Chinese and the Hmong-Mien and Kra–Dai languages are analytic languages that have similar grammar, pronunciation, and syllable structure. They all started with four tones, soon afterward developed into different phonological tones such as checked tone because of the voiced and voiceless properties of the initial. The aspiration of the initial and the length of the vowel in checked tone led to further tone development of dialects in these languages. Cantonese in Jiangyang area for Chinese developed eight different tones because of the length of the vowel. The aspiration property also determined the tone development of Kra-Dai, of which the tone eventually developed into sixteen types of tone.[85] Zongdi dialect of Hmong-Mien had also experienced the change in tone because of the aspiration property.[86]

Typology

Word order

Except for the Chinese, Bai, Karenic, and Mruic languages, the usual word order in Sino-Tibetan languages is object–verb. Most scholars believe this to be the original order, with Chinese, Karen and Bai having acquired subject–verb–object order due to the influence of neighbouring languages in the Mainland Southeast Asia linguistic area.[87] However, Chinese and Bai differ from almost all other VO languages in the world in placing relative clauses before the nouns they modify.[88]

Morphology

Hodgson had in 1849 noted a dichotomy between "pronominalized" (inflecting) languages, stretching across the Himalayas from Himachal Pradesh to eastern Nepal, and "non-pronominalized" (isolating) languages. Konow (1909) explained the pronominalized languages as due to a Munda substratum, with the idea that Indo-Chinese languages were essentially isolating as well as tonal. Maspero later attributed the putative substratum to Indo-Aryan. It was not until Benedict that the inflectional systems of these languages were recognized as (partially) native to the family. Scholars disagree over the extent to which the agreement system in the various languages can be reconstructed for the proto-language.[89][90]

In morphosyntactic alignment, many Tibeto-Burman languages have ergative and/or anti-ergative (an argument that is not an actor) case marking. However, the anti-ergative case markings can not be reconstructed at higher levels in the family and are thought to be innovations.[91]

Classifiers and definite marking

There is no language originally in the Sino-Tibetan family that had classifiers, but some subgroups did develop some properties of classifier,[92][93] such as the Lolo-Burmese languages which had cognate nouns as classifiers.[94] Tibeto-Burman and Sinitic languages also developed classifiers that are used more commonly in South East Asia and are mainly use without numerals, such as in Rawang lègā tiq bok [book one classifier] meaning "one book", lègā bok meaning "the book"; in Cantonese yat55 ga33 che55 [one classifier vehicle] meaning "one car", ga33 che55 meaning "the car" (verbally).[95] Some other classifiers in Tibeto-Burman and Sinitic languages developed the same use as definite or specific marking. Definite marking did not appear in the Proto-Sino-Tibetan language either, but there is some use of it in Qiangic of the Tibeto-Burmese languages,[96] where the markings seem to evolve from demonstratives.[97]

Vocabulary

Sino-Tibetan numerals
gloss Old Chinese[98] Old Tibetan[99] Old Burmese[99] Jingpho[100] Garo[100] Limbu[101] Kanauri[102]
"one" *ʔjit ac id
*tjek "single" gcig tac thik
"two" *njijs gnyis nhac gin-i nɛtchi niš
"three" *sum gsum sumḥ mə̀sūm git-tam sumsi sum
"four" *sjijs bzhi liy mə̀lī bri lisi pə:
"five" *ŋaʔ lnga ṅāḥ mə̀ŋā boŋ-a nasi ṅa
"six" *C-rjuk drug khrok krúʔ dok tuksi țuk
"seven" *tsʰjit khu-nac sə̀nìt sin-i nusi štiš
"eight" *pret brgyad rhac mə̀tshát cet yɛtchi rəy
"nine" *kjuʔ dgu kuiḥ cə̀khù sku sgui
"ten" *gjəp kip[103] gip
bcu chay shī ci-kuŋ səy

External classification

Beyond the traditionally recognized families of Southeast Asia, a number of possible broader relationships have been suggested.

One of these is the "Sino-Caucasian" hypothesis of Sergei Starostin, which posits that the Yeniseian languages and North Caucasian languages form a clade with Sino-Tibetan. The Sino-Caucasian hypothesis has been expanded by others to "Dené–Caucasian" to include the Na-Dené languages of North America, Burushaski, Basque and, occasionally, Etruscan. Edward Sapir had commented on a connection between Na-Dené and Sino-Tibetan.[104] A narrower binary Dené–Yeniseian family has recently been well-received, though not conclusively demonstrated.

In contrast, Laurent Sagart proposes a Sino-Austronesian family with Sino-Tibetan and Austronesian (including Kra–Dai as a subbranch) as primary brances.[105] Stanley Starosta has extended this proposal with a further branch called "Yangzian" joining Hmong–Mien and Austroasiatic.[106]

Another theory was raised by the Japanese linguist Īno Mutsumi. He suggested after his analysis of proto-Sino-Tibetan that Japanese is related to the proto-form of Sino-Tibetan, especially to the Burmese language. Because of similar grammer rules (SOV, syntax), similar non-loan basic-vocabulary and the fact that early Sino-Tibetan was non-tonal like still today some small languages, he proposed the Sinitic origin theory.[107][108]

The linguist Juha Janhunen found during his analysis of asian languages strong similarities between proto-Japanese and Sino-Tibetan languages. He says, similar as Vovin, that proto-Japanese originated somewhere in southeast China but was a Sino-Tibetan(possibly burmanese) language and got later under Austronesian influence. After political or economical problems the proto-Japanese people started to migrate into Japan over the Korean Peninsula. There they possibly got under the influence of Altaic peoples.[109]

This table shows some similarities between proto-Japanese and proto-Chinese:

Similarities between ancient Japanese and ancient Chinese languages
Kanji Heian-Period Kun-reading/Romaji Proto-Chinese Modern Chinese (Pinyin) Cantonese Heian-Period/Romaji reconstruction of the Heian-Period/Romaji
Sen zeni tsean qian2 chin3 sen, zen sen
Mutsumi mutu, mu muiu mu4 muk6 moku boku
Keikoku kafi ɣeap xia2 haap6 gefu kafu
Chu tu-gu tugio zhu4 jyu3 su shyu
Maki maki miək mu4 muk6 moku boku
Tono tono, domu tyən dian4 din6 den ten
Kuni(guni) kuni, (kofori) giuən jun4 gwan6 gun kun
To to, tomu tɕiə zhi3 ji2 shi shi
Uma muma, nma xan ma3 ma5 me ba, ma
Ume mume, nme mei2 mui4 mai, me bai
Mugi mugi meək mai4 mak6 myaku baku
Ga a, aga, ware, waga ŋai wo3 ngo2 ga ga
ware ware ŋea wu2 ng4 gu go

Notes

  1. ^ Kuhn (1889), p. 189: "wir das Tibetisch-Barmanische einerseits, das Chinesisch-Siamesische anderseits als deutlich geschiedene und doch wieder verwandte Gruppen einer einheitlichen Sprachfamilie anzuerkennen haben." (also quoted in van Driem (2001), p. 264.)
  2. ^ The volumes were: 1. Introduction and bibliography, 2. Bhotish, 3. West Himalayish, 4. West Central Himalayish, 5. East Himalayish, 6. Digarish, 7. Nungish, 8. Dzorgaish, 9. Hruso, 10. Dhimalish, 11. Baric, 12. Burmish–Lolish, 13. Kachinish, 14. Kukish, 15. Mruish.[13]
  3. ^ Karlgren's reconstruction, with aspiration as 'h' and 'i̯' as 'j' to aid comparison.
  4. ^ See, for example, the "Sino-Tibetan" (汉藏语系 Hàn-Zàng yǔxì) entry in the "languages" (語言文字, Yǔyán-Wénzì) volume of the Encyclopedia of China (1988).
  5. ^ For Shafer, the suffix "-ic" denoted a primary division of the family, whereas the suffix "-ish" denoted a sub-division of one of those.
  6. ^ les travaux de comparatisme n’ont jamais pu mettre en évidence l’existence d’innovations communes à toutes les langues « tibéto-birmanes » (les langues sino-tibétaines à l’exclusion du chinois)
  7. ^ il ne semble plus justifié de traiter le chinois comme le premier embranchement primaire de la famille sino-tibétaine

References

  1. ^ Handel (2008), pp. 422, 434–436.
  2. ^ Logan (1856), p. 31.
  3. ^ Logan (1858).
  4. ^ a b Hale (1982), p. 4.
  5. ^ van Driem (2001), p. 334.
  6. ^ Klaproth (1823), pp. 346, 363–365.
  7. ^ van Driem (2001), p. 344.
  8. ^ Finck (1909), p. 57.
  9. ^ a b Przyluski (1924), p. 361.
  10. ^ Sapir (1925), p. 373.
  11. ^ Przyluski (1924), p. 380.
  12. ^ Przyluski & Luce (1931).
  13. ^ Miller (1974), p. 195.
  14. ^ Miller (1974), pp. 195–196.
  15. ^ Matisoff (1991), p. 473.
  16. ^ a b Handel (2008), p. 434.
  17. ^ Benedict (1972), pp. 20–21.
  18. ^ Benedict (1972), pp. 17–18, 133–139, 164–171.
  19. ^ a b Handel (2008), pp. 425–426.
  20. ^ Miller (1974), p. 197.
  21. ^ Matisoff (2003), p. 16.
  22. ^ Beckwith (1996).
  23. ^ Beckwith (2002b).
  24. ^ Matisoff (1991), pp. 471–472.
  25. ^ Norman (1988), p. 45.
  26. ^ Baxter (1992), pp. 25–26.
  27. ^ Bodman (1980), p. 47.
  28. ^ Baxter (1992), pp. 197, 199–202.
  29. ^ Baxter (1992), pp. 315–317.
  30. ^ Beckwith (2002a), pp. xiii–xiv.
  31. ^ Thurgood (2003), p. 17.
  32. ^ Gong (1980).
  33. ^ Handel (2008), p. 431.
  34. ^ Hill (2014), pp. 97–104.
  35. ^ Matisoff (1991), pp. 472–473.
  36. ^ Hale (1982), pp. 4–5.
  37. ^ Matisoff (1991), pp. 470, 476–478.
  38. ^ Handel (2008), p. 435.
  39. ^ Matisoff (1991), p. 482.
  40. ^ Lewis, Simons & Fennig (2015).
  41. ^ Norman (1988), p. 4.
  42. ^ Norman (1988), pp. 187–188.
  43. ^ a b Taylor (1992), p. 165.
  44. ^ a b Wheatley (2003), p. 195.
  45. ^ Tournadre (2014), p. 117.
  46. ^ Tournadre (2014), p. 107.
  47. ^ Tournadre (2014), p. 120.
  48. ^ Thurgood (2003), p. 18.
  49. ^ Handel (2008), pp. 424–425.
  50. ^ Handel (2008), p. 423.
  51. ^ Matisoff (1991), pp. 470–471.
  52. ^ van Driem (2005), pp. 91–95.
  53. ^ Blench & Post (2014), p. 89.
  54. ^ Blench, Roger. 2009. If agriculture cannot be reconstructed for Proto-Sino-Tibetan, what are the consequences?. Paper presented at the 42nd International Conference on Sino-Tibetan Language and Linguistics (ICSTLL 42), Chiang Mai, November 2-4, 2009. (slides)
  55. ^ Handel (2008), p. 426.
  56. ^ DeLancey (2009), p. 695.
  57. ^ Li (1937), pp. 60–63.
  58. ^ a b c Handel (2008), p. 424.
  59. ^ Matisoff (1991), p. 487.
  60. ^ Benedict (1942), p. 600.
  61. ^ Benedict (1972), pp. 2–4.
  62. ^ Shafer (1955), pp. 94–96.
  63. ^ Shafer (1955), pp. 99–108.
  64. ^ Shafer (1966), p. 1.
  65. ^ Shafer (1955), pp. 97–99.
  66. ^ van Driem (2001), pp. 343–344.
  67. ^ Matisoff, James A. 2015. The Sino-Tibetan Etymological Dictionary and Thesaurus. Berkeley: University of California. (PDF)
  68. ^ Bruhn, Daniel; Lowe, John; Mortensen, David; Yu, Dominic (2015). Sino-Tibetan Etymological Dictionary and Thesaurus Database Software. Software, UC Berkeley Dash. doi:10.6078/D1159Q
  69. ^ van Driem (2001), p. 383.
  70. ^ van Driem (1997).
  71. ^ Matisoff (2000).
  72. ^ van Driem (2001), p. 403.
  73. ^ van Driem (2014), p. 19.
  74. ^ van Driem (2007), p. 226.
  75. ^ Wu (1987).
  76. ^ Wang (1980), p. 221.
  77. ^ Wu (2002), pp. 9–10.
  78. ^ You (1982).
  79. ^ Wu (2002), p. 12.
  80. ^ Sun (1996).
  81. ^ Wu (2002), pp. 10, 12.
  82. ^ Wu (2002), p. 10.
  83. ^ Dai (1997).
  84. ^ Wu (2002), p. 11.
  85. ^ Shi (1991).
  86. ^ Wu (2002).
  87. ^ Dryer (2003), pp. 43–45.
  88. ^ Dryer (2003), pp. 50.
  89. ^ Handel (2008), p. 430.
  90. ^ LaPolla (2003), pp. 29–32.
  91. ^ LaPolla (2003), pp. 34–35.
  92. ^ Xu (1989), pp. 15–23.
  93. ^ Dai (1994), pp. 166–181.
  94. ^ Bradley (2012), pp. 171–192.
  95. ^ Baron (1973).
  96. ^ LaPolla & Huang (2003).
  97. ^ Thurgood & LaPolla (2017), p. 46.
  98. ^ Baxter (1992).
  99. ^ a b Hill (2012).
  100. ^ a b Burling (1983), p. 28.
  101. ^ van Driem (1987), pp. 32–33.
  102. ^ Sharma (1988), p. 116.
  103. ^ Yanson (2006), p. 106.
  104. ^ Shafer (1952).
  105. ^ Sagart (2005).
  106. ^ Starosta (2005).
  107. ^ 飯野睦毅 (1994)『奈良時代の日本語を解読する』東陽出版
  108. ^ Taw Sein Ko 1924, p. viii.
  109. ^ ユハ・ヤンフネン 「A Framework for the Study of Japanese Language Origins」『日本語系統論の現在』(pdf) 国際日本文化センター、京都、2003年、477-490頁。

Works cited

General