This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
As far as personal nouns are concerned, an agent noun is a derived noun whose general meaning is “person who usually/typically does . . .”. It is thus characterized by the feature [+ Human], irrespective of the fact that the person involved actually performs an action (e.g., French nageur “swimmer,” i.e., “a person who swims”), carries on a profession (e.g., Spanish carpintero “carpenter,” i.e., “a person who builds or repairs wooden structures”), adheres to a certain ideology or group (e.g., Italian femminista “feminist,” i.e., “a person who supports or follows the feminist movement”), etc.
Agent nouns can be both denominal and deverbal. Latin denominal agent nouns were mainly formed with –ARIUS, though the Latin agentive suffix par excellence was –TOR, which derived nouns from verbs. The reflexes of both –ARIUS and –TOR are widespread and highly productive in Romance: For the former, see Portuguese –eiro / –ario, Spanish –ero / –ario, Catalan –er, French –ier / –aire, Italian –aio / –aro / –ario, Romanian –ar, etc.; for the latter, Portuguese, Spanish, Catalan –dor, French –eur, Italian –tore, Romanian –tor, etc. At any rate, the distinction between denominal and deverbal agent nouns is not always straightforward, as is shown by the Romance forms connected with Latin present particle –NTE: Whereas most of them display a verbal base (see, e.g., Italian cantante “singer” ← cantare “to sing”), some others do not (see, e.g., Italian bracciante “hired hand” ← braccio “arm”), allowing them to be regarded as denominal derivations.
The distribution of agentive suffixes in Romance is sometimes conditioned by borrowing phenomena. For example, Italian –iere is not a direct outcome of Latin –ARIUS, but rather a borrowing from French –ier; Sardinian –eri is of Spanish/Catalan origin, whereas Piedmontese –aire and –eur are loans from Occitan and French, respectively.
A theoretical issue that is related to agent nouns in Romance is the so-called agent-instrument-locative polysemy. While traditionally attributed to the result of metaphorical and/or semantic extensions, this polysemy is better explained by such factors as homonymization, ellipsis, analogy, borrowing/calque, though, at least in some cases, we cannot exclude that a combination of both explanation types has taken place.
Pidgin languages sometimes form in contact situations where a means of communication is urgently needed between groups lacking a common code. They are typically less elaborate than any of the languages involved in their formation, and in comparison to those, reduction characterizes all linguistic levels.
The process is relatively uncommon, and the life span of pidgins is usually short – most disappear when the contact situation changes, or when another medium of intergroup communication becomes available. In some rare cases, however, they expand (both socially and structurally), and may even nativize, i. e. become mother tongues to their speakers (when they may be re-labelled “creoles”).
Pidgins are severely understudied, and while they are often mentioned as precursors to creoles, few linguists have shown a serious interest in them. As a result, many generalizations have been based on extremely limited amounts of data or even on intuition. Some frequently occurring ones is that pidginization is a case of second language acquisition, that power and prestige are important factors, and that most structures are derived from the input languages. My work with pidgins has led me to believe the opposite to be true in these cases: pidgins form through a trial-and-error process, where anything that is understood by the other party is sanctioned, this process is one of collaborative language creation (rather than one involving one group of teachers and one group of learners), and much of what finds its way in the resultant contact language do so independently of what the creators spoke prior to their encounter.
As for theoretical implications, pidgins may shed light on which features in traditional languages are necessary for communication, and which are superfluous from the point of view of pure information transmission.
Polysynthesis is informally understood as the packing of a large number of morphemes into single words, as in (1) from Bininj Gun-wok (Evans, in press).
'I cooked the wrong meat for them again.'
Its status as a distinct typological category into which some of the world’s languages fall, on a par with isolating, agglutinating, or fusional languages, has been controversial from the start. Nevertheless, researchers working with these languages are seldom in doubt as to their status as distinct from these other morphological types. This has been complicated by the fact that the speakers of such languages are largely limited to hunter-gatherers—or were so in the not too distant past—so the temptation is to link the phenomenon directly to way of life. This proves to be oversimplified, although it is certainly true that languages qualifying as polysynthetic are almost everywhere spoken in peripheral regions and are on the decline in the modern world—few children are learning them today.
Perhaps the most pervasive of the traits that give these languages the impression of a “special” status is that of holophrasis, which can be defined as the (possible) expression of what in less synthetic languages would be whole sentences in single complex (usually verbal) words. It turns out, however, that there is much greater variety among polysynthetic languages than is generally thought: there are few other traits that they all share, although distinct subtypes can in fact be distinguished, notably the affixing as opposed to the incorporating type.
These languages have considerable importance for the investigation of the diachronic complexification of languages in general and of language acquisition by children, as well as for theories of language universals. The sociolinguistic factors behind their development have only recently begun to be studied in depth. All polysynthetic languages today are to some degree endangered (they are dying off at an alarming rate), and many have been poorly studied if at all, which makes their investigation before it is too late a prime goal for linguistics.
Timothy J. Vance
The term rendaku, sometimes translated as sequential voicing, denotes a morphophonemic phenomenon in Japanese. In a prototypical case, an alternating morpheme appears with an initial voiceless obstruent as a word on its own or as the initial element (E1) in a compound but with an initial voiced obstruent as the second element (E2) in a two-element compound. For example, the simplex word /take/ ‘bamboo’ and the compound /take+yabu/ ‘bamboo grove’ (cf. /yabu/ ‘grove’) begin with voiceless /t/, but this morpheme meaning ‘bamboo’ begins with voiced /d/ in /sao+dake/ ‘bamboo (made into a) pole’ (cf. /sao/ ‘pole’). Rendaku was already firmly established in 8th-century Old Japanese (OJ), the earliest variety for which extensive written records exist, and subsequent sound changes have made the alternations phonetically heterogeneous. Many OJ compounds with eligible E2s did not undergo rendaku, and the phenomenon remains pervasively irregular in modern Japanese. There are, however, many factors that promote or inhibit rendaku, and some of these appear to influence native-speaker behavior on experimental tasks. The best known phonological factor is Lyman’s Law, according to which rendaku does not apply to E2s that contain a non-initial voiced obstruent. Many theoretical phonologists endorse the idea that Lyman’s Law is a sub-case of the Obligatory Contour Principle, which rules out identical or similar units if they would be adjacent in some domain. Other well-known factors involve vocabulary stratum (e.g., the resistance to rendaku of recently borrowed E2s) or the morphological/semantic relationship between E2 and E1 (e.g., the resistance to rendaku of coordinate compounds). Some morphemes are idiosyncratically immune to rendaku. Other morphemes alternate but undergo rendaku in some compounds while failing to undergo it in others, even though no known factor is relevant. In addition, many individual compounds vary between a form with rendaku and a form without, and this variability is often not reflected in dictionary entries. Despite its irregularity, rendaku is productive in the sense that it often applies to newly created compounds. Many compounds, of course, are stored (with or without rendaku) in a speaker’s lexicon, but fact that native speakers can apply rendaku not just to existing E2s in novel compounds but even to made-up E2s shows that rendaku as an active process is somehow incorporated into the grammar.
Empirical and theoretical research on language has recently experienced a period of extensive growth. Unfortunately, however, in the case of the Japanese language, far fewer studies—particularly those written in English—have been presented on adult second language (L2) learners and bilingual children. As the field develops, it is increasingly important to integrate theoretical concepts and empirical research findings in second language acquisition (SLA) of Japanese, so that the concepts and research can be eventually applied to educational practice. This article attempts to: (a) address at least some of the gaps currently existing in the literature, (b) deal with important topics to the extent possible, and (c) discuss various problems with regard to adult learners of Japanese as an L2 and English–Japanese bilingual children. Specifically, the article first examines the characteristics of the Japanese language. Tracing the history of SLA studies, this article then deliberately touches on a wide spectrum of domains of linguistic knowledge (e.g., phonology and phonetics, morphology, lexicon, semantics, syntax, discourse), context of language use (e.g., interactive conversation, narrative), research orientations (e.g., formal linguistics, psycholinguistics, social psychology, sociolinguistics), and age groups (e.g., children, adults). Finally, by connecting past SLA research findings in English and recent/present concerns in Japanese as SLA with a focus on the past 10 years including corpus linguistics, this article provides the reader with an overview of the field of Japanese linguistics and its critical issues.
Erich R. Round
The non–Pama-Nyugan, Tangkic languages were spoken until recently in the southern Gulf of Carpentaria, Australia. The most extensively documented are Lardil, Kayardild, and Yukulta. Their phonology is notable for its opaque, word-final deletion rules and extensive word-internal sandhi processes. The morphology contains complex relationships between sets of forms and sets of functions, due in part to major historical refunctionalizations, which have converted case markers into markers of tense and complementization and verbal suffixes into case markers. Syntactic constituency is often marked by inflectional concord, resulting frequently in affix stacking. Yukulta in particular possesses a rich set of inflection-marking possibilities for core arguments, including detransitivized configurations and an inverse system. These relate in interesting ways historically to argument marking in Lardil and Kayardild. Subordinate clauses are marked for tense across most constituents other than the subject, and such tense marking is also found in main clauses in Lardil and Kayardild, which have lost the agreement and tense-marking second-position clitic of Yukulta. Under specific conditions of co-reference between matrix and subordinate arguments, and under certain discourse conditions, clauses may be marked, on all or almost all words, by complementization markers, in addition to inflection for case and tense.
Rosemary G. Beam de Azcona
Zapotecan languages belong to the Otomanguean stock and consist of two major subgroups: Zapotec and Chatino. They are primarily spoken in Oaxaca, Mexico, and elsewhere in diaspora, particularly in California but also in other parts of the United States and Mexico. Zapotecan languages are spoken in a contiguous area and although all are related genetically, many languages exist in regional diffusion zones such that new changes spread areally. Similarly, individual Zapotecan “languages” often consist of dialect continua.
Zapotecan languages are tonal and also have contrastive phonation types, such as a contrast between modal (V), checked (VɁ), and rearticulated (VɁV) vowels. Some Valley Zapotec languages also have breathy voice, partially due to contact with Mixe. Vowel nasalization is a prominent feature of Chatino and a marginal feature of some Zapotec languages. Consonants usually fall into two contrastive series in Zapotec, commonly termed “fortis” and “lenis,” though the phonetic realizations of these vary from language to language. The historical loss of unstressed vowels is common in many Zapotecan languages, though there are individual Zapotec and Chatino languages that retain them. A stress shift from the final syllable (retained in Chatino) to the first (usually penultimate) syllable of the root (in Zapotec) makes the languages with vowel loss more dissimilar from each other, since a different syllable survives in Chatino versus Zapotec.
Zapotecan languages are head-initial languages with VSO order and are typically head-marking. Common morphology includes pre-posed TAM markers and post-posed person markers on the verb, and derivational prefixes on nouns. An emergent class of prepositions is developing out of what were historically relational nouns. Stative forms of verbs are more common than true adjectives, while numerals have many verbal properties.
Like other Meso-American languages, Zapotecan languages are currently experiencing both a golden age and a moment of unprecedented peril. There is an ever-increasing number of linguists who are native speakers of these languages, and the community of language activists, including students and educators, is growing stronger in Oaxaca and Mexico at large, and indeed worldwide, including where Zapotecan languages are spoken by immigrants. At the same time, the intense political and socioeconomic pressure on communities to shift to Spanish is greater than ever before, and the number of communities where children speak Zapotecan languages is ever shrinking. Children in communities where children speak Zapotecan languages at home are often chastised for doing so in school, which poses a continual threat. Zapotecan languages historically have been in contact with other Meso-American languages such as Nahuatl, Mixtec, Chontal, Mixe, Huave and Chinantec, among others. Today the vast majority of speakers of Zapotecan languages are at least bilingual in their language and Spanish, and many also speak English and/or other Meso-American languages. Zapotecan languages mostly show lexical borrowings from these other languages, and occasional grammatical borrowings. Regional varieties of Spanish show a Zapotecan substrate with numerous calques and interference on every level of the language from phonetics to pragmatics.