Victor A. Friedman
The Balkan languages were the first group of languages whose similarities were explained in modern linguistic terms as a result of language contact rather than as a result of descent from a common ancestor. Nikolai Trubetzkoy coined the term Sprachbund ‘linguistic league’ (as opposed to Sprachfamilie ‘language family’) to describe this relationship. Balkan linguistics, as both a subset of and precursor to contact linguistics, is, at its base, an historical linguistic discipline. It seeks to explain similarities among the relevant languages as the result of diffusion rather than of either transmission or of putative universal, typological properties of human language (which latter assumes parallel developments whose causation is ahistorical, i.e., unconnected with either contact or ancestry). The relevant languages are, with the exception of Turkic, all part of the Indo-European language family, but they belong to five distinct groups that are known to have been separated for a significant length of time (presumably millennia). Moreover, for four out of five Indo-European groups as well as for Turkic, there exists documentation that goes back more than a millennium, and in some cases several millennia. The Balkan languages are thus the oldest example of a well-documented and still living Sprachbund.
The primary questions that Balkan linguistics seeks to answer are these: What are the results of language contact in the Balkan languages, and how did they come about? The Balkan languages are traditionally defined as Albanian, Modern Greek, Balkan Romance (Romanian, Aromanian, and Meglenoromanian), and Balkan Slavic (Bulgarian, Macedonian, and the southernmost dialects of the former Serbo-Croatian). In recent decades, it has been recognized that the relevant dialects of Romani, Judezmo, and Turkish and Gagauz also participate in at least some of the convergent processes that are taken as definitive of the Balkan linguistic league. While the language family is defined by regular sound correspondences, which in turn help define shared morphology and a core lexicon, the Balkan linguistic league is defined principally by shared morphosyntactic developments and a shared lexicon of borrowings often called “cultural.” In the Balkan linguistic league, phonological developments are sometimes shared among different languages at the dialectal level, but there are no such features that characterize the Balkan languages as a group. Just as in the language family not every diagnostic item is represented in every branch, so, too, in the Balkan linguistic league not every feature is equally represented in all languages and dialects.
Among the most characteristic morphosyntactic features are the following: (1) replacement of infinitives by analytic subjunctives, (2) the use of a particle derived from etymological ‘want’ to mark the future, (3) replacement of synthetic gradation of adjectives with analytic constructions, (4) replacement of conditionals by anterior futures, (5) resumptive clitic pronouns for certain direct and indirect objects, (6) various simplifications in the declensional system, (7) postposed definite articles (for Balkan Slavic, Balkan Romance, and Albanian), (8) grammaticalized evidentials (Balkan Slavic, Albanian, Turkic, and to some extent Balkan Romance and Romani). While some of these convergences began in the ancient or medieval periods, the Balkan linguistic league took its definitive modern shape during the centuries of the Ottoman Empire (14th to early 20th centuries).
Cynthia L. Allen
Middle English is the name given to the English of the period from approximately 1100 to approximately 1450. This period is marked by substantial developments in all areas of English grammar. It is also the period of English when different dialects are the most fully attested in the texts. At the beginning of the Middle English period, the sociolinguistic status of English was low due to the Norman Invasion, and although religious texts of Old English composition continued to be copied and updated, few original compositions are extant. By the end of the period, English had regained its status as the language of government, law, and literature generally.
Although some notable changes to the phonemic inventory of consonants date from the Middle English period, the most dramatic phonological developments of the period involve vowels. The reduction of the vowels of unstressed syllables, one of the changes that marks the beginning of the Middle English period, is a phonological change with substantial morphological effects, as it substantially reduced the number of distinctive inflectional forms. Constituent order replaced case marking as the primary means of signaling grammatical relations. By the end of the Middle English period, subject-verb-object order had become established as the norm.
The lexicon of English was transformed in this period by an enormous influx of French words. The role of derivational morphology declined as its functions were to some extent replaced by the adoption of French words. Most Scandinavian loans in English first appear in the texts of this period. The Scandinavian loans are typically everyday words, while the words adopted from French are more often in areas of government, law, and higher culture, reflecting the nature of the contact between English speakers and the speakers of these languages.
The density of the Scandinavian population in the northern part of England is generally held to be responsible for the earlier appearance of changes in the north than in the south. The replacement of the third person plural personal pronoun hie by the Scandinavian they is an example of a development which is apparent only in the north early in Middle English but became general in English by the end of this period.
An important phonological development of later Middle English is the beginning of the Great Vowel Shift, which affected long vowels and involved successive changes and was implemented differently in different dialects, the north-south divide being the most evident.
Early Middle English is a language that cannot be understood by Modern English readers without special study, while the language of the late Middle English period, especially that coming from the London area, can be understood with the heavy use of explanatory notes.
George van Driem
Several language families and a few language isolates are represented in the Himalayas, the world’s greatest massif, running a length of over 3,600 km. The most well-represented language family in this region happens to be the Trans-Himalayan language family, whose very centre of gravity and phylogenetic diversity is situated within the Eastern Himalaya. This most populous language family on our planet in terms of numbers of speakers used to be known as Tibeto-Burman but, in some circles, the family formerly also went by the names “Indo-Chinese” or “Sino-Tibetan”, the latter two labels actually designating empirically unsupported and now obsolete models of language relationship. The study of Trans-Himalayan historical grammar began with Brian Houghton Hodgson in the 1830s, who during this time served at Kathmandu as the British Resident to the Kingdom of Nepal. Periodically, minor studies devoted attention to several of the more salient morphosyntactic phenomena of Trans-Himalayan historical grammar, but Stuart Wolfenden contributed the first major monograph to the subject in the 1920s. Finally, the historical morphosyntax of the Trans-Himalayan language family came to be the focus of numerous linguistic studies from the 1970s onward, and since that time our understanding of the historical grammar of the language family has changed drastically.
As ever more languages out of the hundreds of previously undocumented Trans-Himalayan tongues came to be described and analysed in great detail, it came to be understood that the flamboyant verbal agreement morphology observed in languages such as the Kiranti languages of eastern Nepal and the rGyalrongic languages of southwestern China were neither grammatically innovative nor represented typological flukes, but instead represented the most grammatically conservative languages within the entire language family. Subsequently, cognate inflectional systems or vestiges of cognate conjugational morphology were discovered in most other branches of the language family as well. The geographical centre, as well as the centre of phylogenetic diversity of the Trans-Himalayan language family, was identified as the highland arc of the Eastern Himalaya. Sinitic languages, although representing by far the most populous single branch of the Trans-Himalayan family, were now understood as constituting just one out of many subgroups, not more divergent from other branches than any one of the four dozen other subgroups making up the language family. The various types of epistemic marking systems observed sporadically throughout the region were shown to be secondary innovations, reflecting a great variety of semantically distinct language-specific grammatical categories. Particularly, languages showing the typology of the Loloish or Sinitic type were shown to be innovative in their grammar, having lost much of the original Trans-Himalayan morphosyntax.
Jack B. Martin
Old and Middle Japanese are the pre-modern periods of the attested history of the Japanese language. Old Japanese (OJ) is largely the language of the 8th century, with a modest, but still significant number of written sources, most of which is poetry. Middle Japanese is divided into two distinct periods, Early Middle Japanese (EMJ, 800–1200) and Late Middle Japanese (LMJ, 1200–1600). EMJ saw most of the significant sound changes that took place in the language, as well as profound influence from Chinese, whereas most grammatical changes took place between the end of EMJ and the end of LMJ. By the end of LMJ, the Japanese language had reached a form that is not significantly different from present-day Japanese.
OJ phonology was simple, both in terms of phoneme inventory and syllable structure, with a total of only 88 different syllables. In EMJ, the language became quantity sensitive, with the introduction of a long versus short syllables. OJ and EMJ had obligatory verb inflection for a number of modal and syntactic categories (including an important distinction between a conclusive and an (ad)nominalizing form), whereas the expression of aspect and tense was optional. Through late EMJ and LMJ this system changed completely to one without nominalizing inflection, but obligatory inflection for tense.
The morphological pronominal system of OJ was lost in EMJ, which developed a range of lexical and lexically based terms of speaker and hearer reference. OJ had a two-way (speaker–nonspeaker) demonstrative system, which in EMJ was replaced by a three-way (proximal–mesial–distal) system.
OJ had a system of differential object marking, based on specificity, as well as a word order rule that placed accusative marked objects before most subjects; both of these features were lost in EMJ. OJ and EMJ had genitive subject marking in subordinate clauses and in focused, interrogative and exclamative main clauses, but no case marking of subjects in declarative, optative, or imperative main clauses and no nominative marker. Through LMJ genitive subject marking was gradually circumscribed and a nominative case particle was acquired which could mark subjects in all types of clauses.
OJ had a well-developed system of complex predicates, in which two verbs jointly formed the predicate of a single clause, which is the source of the LMJ and NJ (Modern Japanese) verb–verb compound complex predicates. OJ and EMJ also had mono-clausal focus constructions that functionally were similar to clefts in English; these constructions were lost in LMJ.
The Northeast Asia is one of the unique points on the globe where there are many language isolates and portmanteau families. From a conservative point of view, the Japanese language is a member of such a portmanteau family that has recently and increasingly been called Japonic in the Western literature. While Japanese is unquestionably a member of this Japonic language family, which consists of two Japanese languages (Japanese itself and the moribund Hachijō language) and four or five relatively closely related Ryūkyūan languages (Amami, Okinawan, Miyako, Yaeyama, and possibly Yonaguni), attempts have also been made to establish a genetic relationship between Japanese and various other language families. Most of these attempts have been amateurish, a major exception being the Koreo-Japonic hypothesis, which still remains unproven as well. It is also quite likely that the Japonic language family (or, more precisely, Insular Japonic) is the only linguistic grouping whose genetic relationship can be established beyond any doubt. A genetic relationship is also likely to exist between Japonic and a number of fragmentarily attested languages that once flourished in the south and center of the Korean Peninsula, but that died out no later than 9th century A.D. The paucity of material available does not allow one to establish solid predictive-productive regular correspondences in many cases, but intuitively the genetic relationship seems to be a matter of fact. Anything beyond intuition, however, lies in the realm of conjecture and speculation. The alleged Koreo-Japonic relationship is best explained by a centuries-long contact relationship rather than by common origin, given such factors as the virtual absence of any kind of shared paradigmatic morphology, as well as by multiple problems in establishing the real (and not imaginable or made-to-fit) regular correspondences. The Japanese-“Altaic” hypothesis is even more speculative and far-fetched. Consequently, the conclusion is that the Japanese language or the Japonic language family has no demonstrable relationship with any other language family or language isolate on the planet.
Pidgin languages sometimes form in contact situations where a means of communication is urgently needed between groups lacking a common code. They are typically less elaborate than any of the languages involved in their formation, and in comparison to those, reduction characterizes all linguistic levels.
The process is relatively uncommon, and the life span of pidgins is usually short – most disappear when the contact situation changes, or when another medium of intergroup communication becomes available. In some rare cases, however, they expand (both socially and structurally), and may even nativize, i. e. become mother tongues to their speakers (when they may be re-labelled “creoles”).
Pidgins are severely understudied, and while they are often mentioned as precursors to creoles, few linguists have shown a serious interest in them. As a result, many generalizations have been based on extremely limited amounts of data or even on intuition. Some frequently occurring ones is that pidginization is a case of second language acquisition, that power and prestige are important factors, and that most structures are derived from the input languages. My work with pidgins has led me to believe the opposite to be true in these cases: pidgins form through a trial-and-error process, where anything that is understood by the other party is sanctioned, this process is one of collaborative language creation (rather than one involving one group of teachers and one group of learners), and much of what finds its way in the resultant contact language do so independently of what the creators spoke prior to their encounter.
As for theoretical implications, pidgins may shed light on which features in traditional languages are necessary for communication, and which are superfluous from the point of view of pure information transmission.
Chiyuki Ito and Michael J. Kenstowicz
Typologically, pitch-accent languages stand between stress languages like Spanish and tone languages like Shona, and share properties of both. In a stress language, typically just one syllable per word is accented and bears the major stress (cf. Spanish sábana ‘sheet,’ sabána ‘plain,’ panamá ‘Panama’). In a tone language, the number of distinctions grows geometrically with the size of the word. So in Shona, which contrasts high versus low tone, trisyllabic words have eight possible pitch patterns. In a canonical pitch-accent language such as Japanese, just one syllable (or mora) per word is singled out as distinctive, as in Spanish. Each syllable in the word is assigned a high or low tone (as in Shona); however, this assignment is predictable based on the location of the accented syllable.
The Korean dialects spoken in the southeast Kyengsang and northeast Hamkyeng regions retain the pitch-accent distinctions that developed by the period of Middle Korean (15th–16th centuries). For example, in Hamkyeng a three-syllable word can have one of four possible pitch patterns, which are assigned by rules that refer to the accented syllable. The accented syllable has a high tone, and following syllables have low tones. Then the high tone of the accented syllable spreads up to the initial syllable, which is low. Thus, /MUcike/ ‘rainbow’ is realized as high-low-low, /aCImi/ ‘aunt’ is realized as low-high-low, and /menaRI/ ‘parsley’ is realized as low-high-high. An atonic word such as /cintallɛ/ ‘azalea’ has the same low-high-high pitch pattern as ‘parsley’ when realized alone. But the two types are distinguished when combined with a particle such as /MAN/ ‘only’ that bears an underlying accent: /menaRI+MAN/ ‘only parsely’ is realized as low-high-high-low while /cintallɛ+MAN/ ‘only azelea’ is realized as low-high-high-high. This difference can be explained by saying that the underlying accent on the particle is deleted if the stem bears an accent. The result is that only one syllable per word may bear an accent (similar to Spanish). On the other hand, since the accent is realized with pitch distinctions, tonal assimilation rules are prevalent in pitch-accent languages.
This article begins with a description of the Middle Korean pitch-accent system and its evolution into the modern dialects, with a focus on Kyengsang. Alternative synchronic analyses of the accentual alternations that arise when a stem is combined with inflectional particles are then considered. The discussion proceeds to the phonetic realization of the contrasting accents, their realizations in compounds and phrases, and the adaptation of loanwords. The final sections treat the lexical restructuring and variable distribution of the pitch accents and their emergence from predictable word-final accent in an earlier stage of Proto-Korean.
Polysynthesis is informally understood as the packing of a large number of morphemes into single words, as in (1) from Bininj Gun-wok (Evans, in press).
'I cooked the wrong meat for them again.'
Its status as a distinct typological category into which some of the world’s languages fall, on a par with isolating, agglutinating, or fusional languages, has been controversial from the start. Nevertheless, researchers working with these languages are seldom in doubt as to their status as distinct from these other morphological types. This has been complicated by the fact that the speakers of such languages are largely limited to hunter-gatherers—or were so in the not too distant past—so the temptation is to link the phenomenon directly to way of life. This proves to be oversimplified, although it is certainly true that languages qualifying as polysynthetic are almost everywhere spoken in peripheral regions and are on the decline in the modern world—few children are learning them today.
Perhaps the most pervasive of the traits that give these languages the impression of a “special” status is that of holophrasis, which can be defined as the (possible) expression of what in less synthetic languages would be whole sentences in single complex (usually verbal) words. It turns out, however, that there is much greater variety among polysynthetic languages than is generally thought: there are few other traits that they all share, although distinct subtypes can in fact be distinguished, notably the affixing as opposed to the incorporating type.
These languages have considerable importance for the investigation of the diachronic complexification of languages in general and of language acquisition by children, as well as for theories of language universals. The sociolinguistic factors behind their development have only recently begun to be studied in depth. All polysynthetic languages today are to some degree endangered (they are dying off at an alarming rate), and many have been poorly studied if at all, which makes their investigation before it is too late a prime goal for linguistics.