The word accent system of Tokyo Japanese might look quite complex with a number of accent patterns and rules. However, recent research has shown that it is not as complex as has been assumed if one incorporates the notion of markedness into the analysis: nouns have only two productive accent patterns, the antepenultimate and the unaccented pattern, and different accent rules can be generalized if one focuses on these two productive accent patterns.
The word accent system raises some new interesting issues. One of them concerns the fact that a majority of nouns are ‘unaccented,’ that is, they are pronounced with a rather flat pitch pattern, apparently violating the principle of obligatoriness. A careful analysis of noun accentuation reveals that this strange accent pattern occurs in some linguistically predictable structures. In morphologically simplex nouns, it typically tends to emerge in four-mora nouns ending in a sequence of light syllables. In compound nouns, on the other hand, it emerges due to multiple factors, such as compound-final deaccenting morphemes, deaccenting pseudo-morphemes, and some types of prosodic configurations.
Japanese pitch accent exhibits an interesting aspect in its interactions with other phonological and linguistic structures. For example, the accent of compound nouns is closely related with rendaku, or sequential voicing; the choice between the accented and unaccented patterns in certain types of compound nouns correlates with the presence or absence of the sequential voicing. Moreover, whether the compound accent rule applies to a certain compound depends on its internal morphosyntactic configuration as well as its meaning; alternatively, the compound accent rule is blocked in certain types of morphosyntactic and semantic structures.
Finally, careful analysis of word accent sheds new light on the syllable structure of the language, notably on two interrelated questions about diphthong-hood and super-heavy syllables. It provides crucial insight into ‘diphthongs,’ or the question of which vowel sequence constitutes a diphthong, against a vowel sequence across a syllable boundary. It also presents new evidence against trimoraic syllables in the language.
“Altaic” is a common term applied by linguists to a number of language families, spread across Central Asia and the Far East and sharing a large, most likely non-coincidental, number of structural and morphemic similarities. At the onset of Altaic studies, these similarities were ascribed to the one-time existence of an ancestral language—“Proto-Altaic,” from which all these families are descended; circumstantial evidence and glottochronological calculations tentatively date this language to some time around the 6th–7th millennium
The debate over the nature of the relationship between the various units that constitute “Altaic,” sometimes referred to as “the Altaic controversy,” has been one of the most hotly debated topics in 20th-century historical linguistics and a major focal point of studies dealing with the prehistory of Central and East Eurasia. Supporters of “Proto-Altaic,” commonly known as “(pro-)Altaicists,” claim that only divergence from an original common ancestor can account for the observed regular phonetic correspondences and other structural similarities, whereas “anti-Altaicists,” without denying the existence of such similarities, insist that they do not belong to the “core” layers of the respective languages and are therefore better explained as results of lexical borrowing and other forms of areal linguistic contact.
As a rule, “pro-Altaicists” claim that “Proto-Altaic” is as reconstructible by means of the classic comparative method as any uncontroversial linguistic family; in support of this view, they have produced several attempts to assemble large bodies of etymological evidence for the hypothesis, backed by systems of regular phonetic correspondences between compared languages. All of these, however, have been heavily criticized by “anti-Altaicists” for lack of methodological rigor, implausibility of proposed phonetic and/or semantic changes, and confusion of recent borrowings with items allegedly inherited from a common ancestor. Despite the validity of many of these objections, it remains unclear whether they are sufficient to completely discredit the hypothesis of a genetic connection between the various branches of “Altaic,” which continues to be actively supported by a small, but stable scholarly minority.
K. A. Jayaseelan
The Dravidian languages have a long-distance reflexive anaphor taan. (It is taan in Tamil and Malayalam, taanu in Kannada and tanu in Telugu.) As is the case with other long-distance anaphors, it is subject-oriented; it is also [+human] and third person. Interestingly, it is infelicitous if bound within the minimal clause when it is an argument of the verb. (That is, it seems to obey Principle B of the binding theory.) Although it is subject-oriented in the normal case, it can be bound by a non-subject if the verb is a “psych predicate,” that is, a predicate that denotes a feeling; in this case, it can be bound by the experiencer of the feeling. Again, in a discourse that depicts the thoughts, feelings, or point of view of a protagonist—the so-called “logophoric contexts”—it can be coreferential with the protagonist even if the latter is mentioned only in the preceding discourse (not within the sentence). These latter facts suggest that the anaphor is in fact coindexed with the perspective of the clause (rather than with the subject per se). In cases where this anaphor needs to be coindexed with the minimal subject (to express a meaning like ‘John loves himself’), the Dravidian languages exhibit two strategies to circumvent the Principle B effect. Malayalam adds an emphasis marker tanne to the anaphor; taan tanne can corefer with the minimal subject. This strategy parallels the strategy of European languages and East Asian languages (cf. Scandinavian seg selv). The three other major Dravidian languages—Tamil, Telugu, and Kannada—use a verbal reflexive: they add a light verb koL- (lit. ‘take’) to the verbal complex, which has the effect of reflexivizing the transitive predicate. (It either makes the verb intransitive or gives it a self-benefactive meaning.)
The Dravidian languages also have reciprocal and distributive anaphors. These have bipartite structures. An example of a Malayalam reciprocal anaphor is oral … matte aaL (‘one person … other person’). The distributive anaphor in Malayalam has the form awar-awar (‘they-they’); it is a reduplicated pronoun. The reciprocals and distributives are strict anaphors in the sense that they apparently obey Principle A; they must be bound in the domain of the minimal subject. They are not subject-oriented.
A noteworthy fact about the pronominal system of Dravidian is that the third person pronouns come in proximal-distal pairs, the proximal pronoun being used to refer to something nearby and the distal pronoun being used elsewhere.
Japanese is a language where the grammatical status of arguments and adjuncts is marked exclusively by postnominal case markers, and various argument realization patterns can be assessed by their case marking. Since Japanese is categorized as a language of the nominative-accusative type typologically, the unmarked case-marking frame obtained for transitive predicates of the non-stative (or eventive) type is ‘nominative-accusative’. Nevertheless, transitive predicates falling into the stative class often have other case-marking alignments, such as ‘nominative-nominative’ and ‘dative-nominative’. Consequently, Japanese provides much more varying argument realization patterns than those expected from its typological character as a nominative-accusative language.
In point of fact, argument marking can actually be much more elastic and variable, the variations being motivated by several linguistic factors. Arguments often have the option of receiving either syntactic or semantic case, with no difference in the logical or cognitive meaning (as in plural agent and source agent alternations) or depending on the meanings their predicate carry (as in locative alternation). The type of case marking that is not normally available in main clauses can sometimes be obtained in embedded contexts (i.e., in exceptional case marking and small-clause constructions). In complex predicates, including causative and indirect passive predicates, arguments are case-marked differently from their base clauses by virtue of suffixation, and their case patterns follow the mono-clausal case array, despite the fact that they have multi-clausal structures.
Various case marking options are also made available for arguments by grammatical operations. Some processes instantiate a change on the grammatical relations and case marking of arguments with no affixation or embedding. Japanese has the grammatical process of subjectivization, creating extra (non-thematic) major subjects, many of which are identified as instances of ‘possessor raising’ (or argument ascension). There is another type of grammatical process, which reduces the number of arguments by virtue of incorporating a noun into the predicate, as found in the light verb constructions with suru ‘do’ and the complex adjective constructions formed on the negative adjective nai ‘non-existent.’
Languages from at least five genetically unrelated families are spoken in the Caucasus, but there are only three endemic linguistic families belonging to the region: Kartvelian, West Caucasian, and Northeast Caucasian. These families are rather heterogeneous in terms of the number of languages and the distribution of the speakers across them. The Caucasus represents a situation where languages with millions of speakers have coexisted with one-village languages for hundreds of years, and where multilingualism has always been the norm. The richness of Caucasian languages on every linguistic stratum is dazzling: here we find some of the largest consonant inventories, inflectional systems where the mere number of word forms strains credibility (one of the Caucasian languages, Archi, is claimed to have over a million and a half word forms), and challenging syntactic structures. The typological interest of the Caucasian languages and the challenges they present to linguistic theory lie in different areas. Thus, for Kartvelian languages, the number of factors at play in the verbal system make the task of the production of a correct verbal form far from trivial. West Caucasian languages represent an instance of polysynthetic polypersonal verb inflection, which is unusual not only for Caucasus but for Eurasia in general. East Caucasian languages have large systems of non-finite forms which, unusually, retain the ability to realize agreement in gender and number while their non-finite nature is determined by the inability to head an independent clause and to express certain morpho-syntactic categories such as illocutionary force and evidentiality. Finally, all Caucasian languages are ergative to some extent.
Haihua Pan and Yuli Feng
Cross-linguistic data can add new insights to the development of semantic theories or even induce the shift of the research paradigm. The major topics in semantic studies such as bare noun denotation, quantification, degree semantics, polarity items, donkey anaphora and binding principles, long-distance reflexives, negation, tense and aspects, eventuality are all discussed by semanticists working on the Chinese language. The issues which are of particular interest include and are not limited to: (i) the denotation of Chinese bare nouns; (ii) categorization and quantificational mapping strategies of Chinese quantifier expressions (i.e., whether the behaviors of Chinese quantifier expressions fit into the dichotomy of A-Quantification and D-quantification); (iii) multiple uses of quantifier expressions (e.g., dou) and their implication on the inter-relation of semantic concepts like distributivity, scalarity, exclusiveness, exhaustivity, maximality, etc.; (iv) the interaction among universal adverbials and that between universal adverbials and various types of noun phrases, which may pose a challenge to the Principle of Compositionality; (v) the semantics of degree expressions in Chinese; (vi) the non-interrogative uses of wh-phrases in Chinese and their influence on the theories of polarity items, free choice items, and epistemic indefinites; (vii) how the concepts of E-type pronouns and D-type pronouns are manifested in the Chinese language and whether such pronoun interpretations correspond to specific sentence types; (viii) what devices Chinese adopts to locate time (i.e., does tense interpretation correspond to certain syntactic projections or it is solely determined by semantic information and pragmatic reasoning); (ix) how the interpretation of Chinese aspect markers can be captured by event structures, possible world semantics, and quantification; (x) how the long-distance binding of Chinese ziji ‘self’ and the blocking effect by first and second person pronouns can be accounted for by the existing theories of beliefs, attitude reports, and logophoricity; (xi) the distribution of various negation markers and their correspondence to the semantic properties of predicates with which they are combined; and (xii) whether Chinese topic-comment structures are constrained by both semantic and pragmatic factors or syntactic factors only.
Dene-Yeniseian is a proposed genealogical link between the widespread North American language family Na-Dene (Athabaskan, Eyak, Tlingit) and Yeniseian in central Siberia, represented today by the critically endangered Ket and several documented extinct relatives. The Dene-Yeniseian hypothesis is an old idea, but since 2006 new evidence supporting it has been published in the form of shared morphological systems and a modest number of lexical cognates showing interlocking sound correspondences. Recent data from human genetics and folklore studies also increasingly indicate the plausibility of a prehistoric (probably Late Pleistocene) connection between populations in northwestern North America and the traditionally Yeniseian-speaking areas of south-central Siberia. At present, Dene-Yeniseian cannot be accepted as a proven language family until the purported evidence supporting the lexical and morphological correspondences between Yeniseian and Na-Dene is expanded and tested by further critical analysis and their relationship to Old World families such as Sino-Tibetan and Caucasian, as well as the isolate Burushaski (all earlier proposed as relatives of Yeniseian, and sometimes also of Na-Dene), becomes clearer.
The Eskimo-Aleut language family consists of two quite different branches, Aleut and Eskimo. The latter consists of Yupik and Inuit languages. It is spoken from the eastern coast of Russia to Greenland. The family is thought to have developed and diverged in Alaska between 4,000 and 6,000 years ago, although recent findings in a variety of fields suggest a more complex prehistory than previously assumed. The language family shares certain characteristics, including polysynthetic word formation, an originally ergative-absolutive case system (now substantially modified in Aleut), SOV word order, and more or less similar phonological systems across the language family, involving voiceless stop and voiced fricative consonant series often in alternation, and an originally four-vowel system frequently reduced to three. The languages in the family have undergone substantial postcolonial contact effects, especially evident in (although not restricted to) loanwords from the respective colonial languages. There is extensive language documentation for all languages, although not necessarily all dialects. Most languages and dialects are severely endangered today, with the exception of Eastern Canadian Inuit and Greenlandic (Kalaallisut). There are also theoretical studies of the languages in many linguistic fields, although the languages are unevenly covered, and there are still many more studies of the phonologies and syntaxes of the respective languages than other aspects of grammar.
Eva Buchi and Steven N. Dworkin
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
Within the field of linguistics, etymology is the only subdiscipline that is uniquely historical in its study of the relevant linguistic data. It is one of the oldest fields in Romance linguistics. The scholar credited with establishing Romance linguistics as a scholarly discipline, Friedrich Diez (1794–1876) authored both the first comparative Romance historical grammar (his three-volume Grammatik der Romanischen Sprachen [1836–1844]) and the first pan-Romance etymological dictionary (his Etymologisches Wörterbuch der Romanischen Sprachen ). A similar combination, illustrating the indissoluble link between etymology and historical grammar (especially the study of sound change), can be seen in the work of Wilhelm Meyer-Lübke (1861–1936), author of a four-volume Grammatik der Romanischen Sprachen (1890–1902) and of the last complete pan-Romance etymological dictionary, the Romanisches Etymologisches Wörterbuch (3d definitive edition, 1935).
The concept of etymology as practiced by Romanists has changed over the last 100 years. At the outset, Romance etymologists took as their brief the search for and identification of individual word origins. Starting in the early 20th century, various specialists began to view etymology as the preparation of the complete history of all facets of the evolution over time and space of the words or lexical families under study. Identification of the underlying base was only the first step in the process. From this perspective, etymology constitutes an essential element of diachronic lexicology, which covers all formal, semantic, and syntactic facets of a word’s evolution, including, if appropriate, the circumstances leading to its demise and replacement.
Practitioners of Romance etymology tend to study the history of individual words or word families in specific Romance languages rather than across the entire family. Almost every Romance language and many of their regional varieties have at least one etymological dictionary devoted to the history of its vocabulary (or at least to the identification of relevant word origins), the most notable being such multi-volumed works as the Französisches Etymologisches Wörterbuch (1922–2002), the Lessico Etimilogico Italiano (1979–), the Diccionario crítico etimológico castellano e hispánico (1980–1991), and the Diccionari etimològic i complimenari de la llengua catalana (1980–2001). The last complete pan-Romance dictionary remains the afore-cited third edition of Meyer-Lübke’s Romanisches etymologisches Wörterbuch.
Although originally coined as a riposte to the Neogrammarian view of sound change, Jules Gilliéron’s (1854–1926) dictum, “each word has its own history,” applies equally well to etymology. Yakov Malkiel (1914–1998), one of the leading writers on questions of method and practice in Romance etymology, has discussed the unique and complex nature of etymological solutions. As a result of the emphasis on individual problems and solutions, Romance etymology has not lent itself to the formulation of theories on the nature of lexical change, although there was in the past no shortage of literature on questions of methodology.
Although specialists continue to work on language-specific etymological questions, etymology is not currently at the forefront of work in Romance historical linguistics, a situation that may result, in part, from its lack of engagement with broad theoretical issues. Most studies still appear in the form of journal articles or Festschrift contributions. There is currently underway a new pan-Romance project, the Dictionnaire étymologique Roman (DéRom), with a new (and controversial) methodological underpinning, namely the rigorous application to the Romance data of comparative reconstruction to capture more accurately the phonological and morphological reality of proto-Romance (in essence a register of spoken Latin) and the semantic scope of the etymological base. This project has reawakened an interest in Romance etymology among a new generation of Romanists. Indeed, to remain vital and relevant within the framework of Romance linguistics, etymology must go beyond the details of individual lexical histories and make an effort to link its findings to our understanding of the nature and processes of language change.
D. Gary Miller
Apart from runic inscriptions, Gothic is the earliest attested language of the Germanic family, dating to the 4th century. Along with Crimean Gothic, it belongs to the branch known as East Germanic. The bulk of the extant Gothic corpus is a translation of the Bible, of which only a portion remains. The translation is traditionally ascribed to Wulfila, who is credited with inventing the Gothic alphabet. The many Greek conventions both help and hinder interpretation of the Gothic phonological system. As in Greek, letters of the alphabet functioned as numerals, but the late letter names were from runic.
Gothic inflectional categories include nouns, adjectives, and verbs. Nouns are inflected for three genders, two numbers, and four cases. Various stem types inherited from Indo-European constitute different form classes in Gothic. Adjectives have the same properties and are also inflected according to so-called weak and strong forms, as are Gothic verbs. Verbs are inflected for three persons and numbers, an indicative and a nonindicative mood (here called “optative”), past and nonpast tense, and voice. The mediopassive survives in Gothic morphologically as a synthetic passive and syntactically in innovated periphrastic formations; middle and anticausative functions were taken over by reflexive-type structures. Nonfinite forms are the infinitive, the imperative, and two participles.
In syntax, Gothic had null subjects as an option, mostly in the third person singular. Aspect was effected primarily by prefixes, which have many other functions, and aspect is not consistently indicated. Absolute constructions with a participle occurred in various cases with functional differences. Relativization was effected primarily by relative pronouns built on demonstratives plus a complementizer. Complementizers could be used with subordinate clause verbs in the indicative or optative. The switch to the optative was triggered by irrealis, matrix verbs that do not permit a full range of subordinate tenses, expression of a hope or wish, potentiality, and several other conditions. Many of these are also relevant to matrix clauses (independent optatives).
Essentials of linearization include prepositional phrases, default postposed genitives and possessive adjectives, and preposed demonstratives. Verb-object order predominates, but there is much Greek influence. Verb-auxiliary order is native Gothic.