How Language Evolved

2 downloads 0 Views 716KB Size Report
weaving of fibers into clothing, nets, bags, and ropes, dating from some 29,000 years ago ..... Hunt, G. R., Corballis, M. C., & Gray, R.D. (2001). Laterality in tool.
心 理 学 报 2007,39(3):415~430 Acta Psychologica Sinica

How Language Evolved Michael C. Corballis University of Auckland

Language, with its complex recursive structure, is almost certainly a uniquely human capacity. I argue that it evolved over the past 2 million years during the Pleistocene epoch, as part of a cognitive adaptation to deforestation and predation from dangerous killer animals on the African savanna. Rather than postulate any specific genes for language, I suggest that there was systematic selection for increase in brain size, allowing for more complex social cognition, including the “grammaticalization” of communication through learning and cultural pressures. Parallel to this development, the medium of communication changed gradually from a manual mode to a facial and then vocal mode, culmination in a mutation of the FOXP2 gene that gave our own species, Homo sapiens, the capacity for autonomous speech. This final switch may explain the so-called “human revolution,” leading to the dominance of humans on the planet, and the demise of other species of the genus Homo. Keywords: language, grammaticalization, the medium of communication.

语言如何进化 人类语言具有复杂多变的递归结构,漫长的物种进化过程中唯独人类精通语言。语言的进化始于大约两 百万年前的“更新世时期”,语言在当时作为一种认知适应对于人类应对自然界带给人类的挑战(如动 物掠食与森林毁坏)有很大帮助。人类进化过程中学习与文化因素形成一种选择压力促使人际交流语法 化,人际交流语法化引发大脑容量增加,然而,最初的语言进化与基因无关。学习与文化压力也使交流 的媒介依次变为手语模式、表情模式与语言模式。交流媒介的逐渐变化最终导致了 FOXP2 基因突变, FOXP2 基因突变让智人具有了自主的言语能力。与地球上其它的人科动物相比,人类的语言能力使人类 在进化中具有明显的优势。 关键词:语言,语法化,沟通媒介。 分类号:B84-069 quickly made clear by Huxley (1863/2001): Humans had evolved through natural selection from the African apes. The Oxford philologist Friedrich Max Muller immediately took up the Cartesian challenge, declaring that language was critical proof of the gap between humans and “brutes” (Muller, 1861/1880). Darwin replied by suggesting that language emerged from the inarticulate cries of animals, which Muller in turn scornfully derided. In view of such vituperative exchanges, it is perhaps not surprising that in 1866 the Linguistic Society of Paris banned all discussion of the evolution of language. The London Philological Society followed suit in 1872. Curiously, that ban seems to have persisted until late in the 20th century, and the approach to language has been for the most part Cartesian rather than Darwinian. This was reinforced by the dominant linguist of the second half of the 20th century, Noam Chomsky. An avowed Cartesian, Chomsky has argued that human language is unrelated to any form of animal communication: “Modern studies of animal communication,” he once wrote, so far offer no counterevidence to the Cartesian assumption that human language is based on an entirely different

Introduction The Cartesian background The evolution of language has long been contentious. The 17th-century philosopher Réné Descartes (1647/1985) set the stage for much of the controversy when he contrasted humans with animals, suggesting that language was one attribute unique to humans. The distinctive characteristic of language was its open-ended quality, since there seemed to be no limits to what humans, even human “imbeciles,” could express. This freedom of expression, apparently denied to all other species, seemed to be inexplicable in terms of any mechanical principles, leading Descartes to declare that it must have been bestowed by God. Any such notions, though, were challenged by Darwin’s momentous book, On the Origin of Species, published in 1859. Although Darwin did not at first deal with the question of human evolution, or even with the evolution of language, his message was Received 2006-06-30 Correspondence should be addressed to Michael C. Corballis, Department of Psychology, University of Auckland, Private Bag 92019, Auckland, New Zealand; e-mail: [email protected]. 415

416





principle” (Chomsky, 1966, p. 78). If this is so, one may question whether an evolutionary account is even possible. Not surprisingly, then, Chomsky (1975) suggested that language could not be explained in terms of natural selection, but may have arisen simply as a consequence of possessing an enlarged brain, without the assistance of natural selection: We know very little about what happens when 1010 neurons are crammed into something the size of a basketball, with further conditions imposed by the specific manner in which this system developed over time. It would be a serious error to suppose that all properties, or the interesting structures that evolved, can be ‘explained’ in terms of natural selection. Yet it is surely vacuous to appeal simply to a large brain, especially when the structures of language seem to imply intricate and dedicated programming. Some authors have proposed instead that language must have emerged as a result of some mutation, an idea sometimes referred to as the “big bang” theory of language evolution. Pinker (1994, p. 297) wrote of “the grammar gene,” with the implication that it was uniquely human. Bickerton (1995) asserted that “… true language, via the emergence of syntax, was a catastrophic event, occurring within the first few generations of Homo sapiens sapiens (p. 69).” Even more radically, Crow (2002) has proposed that a genetic mutation gave rise to the speciation of Homo sapiens, along with such uniquely human attributes as language, cerebral asymmetry, theory of mind, and a vulnerability to psychosis. Again, though, appeal to a single fortuitous event, such as a genetic mutation, smacks of the miraculous, and is little more enlightening than Descartes’ appeal to God. One of the arguments against an evolutionary account of language is that it is too complex to have evolved through natural selection in the period of some 6 million years since the hominid lineage split from that leading to the present-day chimpanzee and bonobo, and that human language is in any case too powerful to have been the product of natural selection. Premack (1985) put it like this: Human language is an embarrassment for evolutionary theory because it is vastly more powerful than one can account for in terms of selective fitness. A semantic language with simple mapping rules, of a kind one might suppose that the chimpanzee would have, appears to confer all the advantages one normally associates with discussion of mastodon hunting or the like. For discussions of that kind, syntactic classes, structure-dependent rules, recursion and the rest, are overly powerful devices, absurdly so (p. 282). This is the so-called “argument from incredulity.” It was also used in the 19th century to argue that the





39 卷

eye could not have evolved from natural selection, but was soon dismissed (see, e.g., Chapter 5 of Dawkins, 1995). Indeed such arguments have never been persuasive to evolutionary biologists, since the gradual accumulation of small changes can lead to considerable complexity. Evolutionary revival With respect to language, the argument from incredulity was challenged by Pinker and Bloom (1990). Contrary to Chomsky, they argue that, like the eye, human language evolved incrementally, in what they call a “cognitive arms race.” In this view, cognition is conceived primarily as a “social tool,” shaped by the complexities of social relationships, and involving such capacities as language and theory of mind (TOM) (e.g., Alexander, 1979; Flinn, Geary & Ward, 2005; Geary, 2005). Much of this arms race has to do with the competing principles of cooperation and the necessity to detect and remove freeloaders, who capitalize on the sacrifices made by others. This leads to ever more sophisticated means of cheating, and of detecting those who cheat. Language clearly plays a critical role, as anyone buying (or selling) a used car knows full well. Pinker and Bloom write: The ability to frame an offer so that it appears to present maximum benefit and minimum cost to the buyer, and the ability to see through such attempts and to formulate persuasive counterproposals, would have been a skill of inestimable value in primitive negotiations, as it is today (p. 725). Not all subsequent theorists have accepted Pinker and Bloom’s analysis, but their seminal article led to a revival, over the past dozen years or so, of speculation as to how and when language might have evolved. The first of a biennial series of conferences on the evolution of language was established in Edinburgh in 1996, and it was perhaps fitting that the third such conference, in 2000, was held in Paris, site of the original ban. One of the critical questions is whether language is indeed a uniquely human capacity, as held by Chomsky, Bickerton, and others in the Cartesian tradition. What is unique about language? Although arguing that language evolved incrementally through natural selection, Pinker and Bloom were otherwise still fundamentally Chomskyan in their approach, accepting that syntax, perhaps the most distinctive quality of human language, is indeed uniquely human. Nevertheless there has been an erosion of the Chomskyan belief that human language is based on “an entirely different principle” from animal communication. Even Chomsky seems to have given ground; he was

3期

Michael C. Corballis. How Language Evolved

recently co-author of an influential paper in which it was argued that human language and animal communication shared many features in common (Hauser, Chomsky, & Fitch, 2002). Indeed the one characteristic that seemed to stand out as uniquely human was the use of recursion to extend sentences, at least in principle, to any desired length or complexity. This is exemplified by the well-known children’s story The House that Jack Built. As the first few lines show, one can add ever-increasingly complexity: This is the house that Jack built. This is the malt that lay in the house that Jack built. This is the rat that ate the malt that lay in the house that Jack built. This is the cat that worried the rat that ate the malt that lay in the house that Jack built. … and so on. As Hauser et al. (2002) recognize, recursion may not be restricted even to language. For a start, recursive sentences presumably refer to recursive ideas, and it is unlikely that recursive language came first. Hauser et al. suggest that recursion might have evolved in the context of navigation or social relationships. Bickerton, who once advocated a “big bang” theory of language evolution, has more recently suggested that the structure of language might derive from reciprocal altruism, evident in primate behavior (Bickerton, 2003; Calvin & Bickerton, 2000). Similarly, Tomasello (2003) has proposed that language is part of a broader capacity to understand others as intentional agent. Our social lives are governed by the attribution of mental states, and these attributions are used recursively, as in I know that she thinks I’m crazy, or I know that she thinks he thinks she’s crazy. Recursion is not even restricted to social settings. We have developed highly complex, recursive ways of manufacturing things, using the same elements in different environments or at different levels. The classic example is the invention of the wheel, which has come to feature in a vast array of mechanical contrivances, including wheels within wheels. This does not prove the existence of a manufacturing gene, but is generally seen as evidence of human inventiveness. I have myself argued that humans are blessed with a generative assembling device (GAD) that underlies not only language, but also other recursive activities such as manufacture and music (Corballis, 1991). The question of whether nonhuman species are capable of recursion, whether in thought, behavior, or language, is also a matter of contention. The work of Tomasello and colleagues suggests that primates (e.g., Hare, Call, Agnetta, & Tomasello, 2000), and perhaps other mammals such as dogs (e.g., Hare and Tomasello, 1999), may be capable of taking the visual perspective of others, which implies at least first-

417

order recursion (“I know what you can see”). Other research suggests that the great apes, but perhaps not other primates, can use their knowledge of what others can see to accomplish what has been called tactical deception (Whiten & Byrne, 1988). A more negative view, though, has been expressed by Povinelli and his colleagues, who suggest that much of the evidence on perspective-taking in chimpanzees can be explained more simply. For example, although chimpanzees can follow the gaze of another (Hare et al., 2000), Povinelli and Bering (2002) suggest that this is really an instinctive behaviour with little understanding of what the observed animal can see. Gaze following may simply be an adaptive response that alerts other animals to danger or reward, but we humans have intellectualized it, often after the fact. In a more general vein, Povinelli and Bering (2002) invoke the spirit of Descartes, decrying the overzealous attempts “to dismantle arguments of human uniqueness,” while nevertheless reaffirming the importance of comparative psychology. “A true comparative science of animal minds,” they go on to write, “ … will recognize the complex diversity of the animal kingdom, and will thus view Homo sapiens as one more species with a unique set of adaptive skills crying out to be identified and understood” (p. 115). There is little evidence that any nonhuman species is capable of understanding recursion in language-like tasks. Fitch and Hauser (2004) found that tamarind monkeys could easily learn sequences of syllables that followed the rules of finite-state grammar, but could not learn a grammar that involved phrase structure, with the embedding of phrases (syllable pairs) within pairs. This implies an inability to understand recursion. * Nevertheless nonhuman primates may be capable of combining symbolic elements, although at a level that Bickerton (1995) characterized as “protolanguage,” which is essentially language without grammar. Captive great apes taught artificial systems do make use of combinations of two or three symbols to create novel meanings (e.g., Gardner & Gardner, 1969; Savage-Rumbaugh, Shanker, & Taylor, 1998), and there is now evidence that Taï chimpanzees in the wild combine different calls, usually in pairs, but in rare cases in triplets and quartets, which in some cases seem to signal novel meanings (Crockford & Boesch, 2005). At a simpler level still, free-ranging puttynosed monkeys apparently use call paired combinations of calls to signal urgency (Arnold & Zuberbühler, 2006). Although it now seems clear that nonhuman species, including great apes and some birds, can communicate by stringing meaningful elements * Gentner, Fenn, Margoliash, and Nusbaum (2006) claim to have taught starlings to recognize recursive patterns of sounds, but a much simpler explanation is likely.

418





together, this capacity still falls far short of human syntax. There are no function words, no tenses, no recursion, and nothing resembling human conversation. The emergence of language as we humans know it must therefore have evolved at some time since the split, some 6 or 7 million years ago, between the hominid lineage and the lineage that led to present-day chimpanzees and bonobos. Language, Cognition, and the Pleistocene One characteristic that distinguishes the hominids from the apes is that the apes are quadrupedal knuckle-walkers, whereas the hominids evolved a bipedal stance and method of locomotion. On the basis of overall brain size, there is little evidence that the early hominids differed substantially, at least in cognitive terms, from their ape predecessors, or indeed from present-day apes. The epoch that most clearly shows the transition from ape-like to humanlike behavior is the Pleistocene, some 4 million years after the split between the hominid line and that leading to present-day chimpanzees and bonobos. The Pleistocene is formally dated from 1.81 million years ago to 11,500 years ago, though some have argued that it should be dated from as early as 2.58 million years ago (e.g., Suc, Bertini, Leroy, & Suballyova, 1997). It was characterized by a series of ice ages that reduced the forested areas in which the earlier primates, and probably the earlier hominids, had found safety and adequate sources of food. The forested terrain was replaced by more open savanna, and the hominids were forced into a hunter-gatherer mode of existence, initially scavenging for food but also gradually developing hunting techniques for the slaughter of animals. The Pleistocene also corresponds at least roughly with the emergence of the genus Homo. Evolutionary psychologists treat the Pleistocene as the cradle of humankind, relating present-day cognition to selective pressures arising from hunter-gatherer modes of subsistence (e.g., Barkow, Cosmides, & Tooby, 1992; Pinker, 1997). These pressures were climatic, ecological and social, and it was perhaps the ecological pressures that were at first instrumental in selecting for social skills, and then competition among conspecific that honed them further in the “cognitive arms race” referred to earlier (Flinn et al., 2005). Baumeister (2005) has also usefully distinguished between the social and the cultural, suggesting that it was culture that shaped human evolution. Other primates are social, with interrelationships, dominance patterns, and the like, but culture creates communities of like minds, with networks for the maintenance, transmission, and accumulation of information. Such networks imply at least some level of language. Tomasello et al. (2005) argue somewhat similarly that the critical element in





39 卷

human evolution may have been not simply interpersonal understanding, demonstrable to a limited degree in other primates, but the further capacity to share intentions. The instinctive sharing of goals and ideals has no doubt been critical to the survival of our species in the face of hardship and predation, starting with the African savanna, but continuing in other regions as some members of early Homo migrated out of Africa into Europe and Asia. More generally, hominids adapted to their new conditions by inhabiting what has been called a “cognitive niche,” which Tooby and DeVore (1987) define as “conceptually abstracting from a situation a model of what manipulations are necessary to achieve proximate goals that correlate with fitness” (p. 209). For example, hunter-gatherers do not simply kill their prey, as other animals do, but precede each hunt with a plan based on earlier experiences and communication among band members, and follow each hunt with campfire debriefings (Lee, 1979). Tool manufacture One characteristic of the genus Homo is the manufacture and use of tools. The earliest stone tool industry, known as the Oldowan industry, has been dated from around 2.5 million years ago, and associated with the earliest member of our genus, Homo rudolfensis (Semaw et al., 1997). This industry, known as the Oldowan, was primitive compared with the later Acheulian tool industry associated with the larger-brained Homo erectus around 1.8 million years ago (Foley & Lahr, 1997). Although there was something of a rise in manufacturing sophistication from around 300,000 years ago (Ambrose, 2001), the Acheulian industry remained fairly static for over a million years, and even persisted into the culture of early Homo sapiens some 125,000 years ago (Walter et al., 2000). Indeed, the most significant advances did not occur until within the past 100,000 years in what is known as the “human revolution” (Mellars & Stringer, 1989), which is discussed in more detail in a later section. While the manufacture of stone tools clearly marks a cognitive advance, the static nature of tool development throughout most of the Pleistocene suggests that tools do not really tell us much about how the human mind evolved through that period. It is of course possible that there were more sophisticated tools made from perishable materials. Indeed other primates use sticks and stones as tools, and capuchin monkeys in particular are especially well known for tool use, and occasionally modify twigs for more efficient use as diggers (Moura & Lee, 2004), and it would be surprising if the early hominids who preceded the genus Homo did not also make use of sticks and stones as tools. Even crows

3期

Michael C. Corballis. How Language Evolved

have a capacity to fashion tools out of leaves and twigs to forage for grubs in holes (Hunt, 1996; Hunt, Corballis, & Gray, 2001). Consequently, recent accounts of how cognition and language might have evolved over the Pleistocene have placed less emphasis on tools than on other marks of cognitive development. The expanding brain Perhaps the clearest evidence that cognitive capacity grew during the Pleistocene is the dramatic increase in brain size associated with the genus Homo. The earlier hominids had brains that differed little in size from those of present day apes, at least if body size is taken into account. For example, according to estimates based on fossil skulls, the average brain size of Australopithecus afarensis was around 433 cc, compared with averages of 393 cc for the chimpanzee, 418 for the orangutan, and 465 for the gorilla (Martin, 1990). It rose to some 612 cc in Homo habilis, 854 cc in early Homo erectus (also known as Homo ergaster), 1016 cc in later Homo erectus. After a period of stasis, there appears to have been a secondary increase from about 500,000 years ago, reaching about 1552 cc in the Neanderthals (Homo neanderthalensis), and a slightly smaller 1355 cc in Homo sapiens (Wood & Collard, 1999). Brain size depends partly on body size, which probably explains why the Neanderthals, being slightly larger than modern humans, also had slightly larger brains. One way to take body size into account is to use an index called the encephalization quotient (EQ), which is based on the regression of brain size on body size (Martin, 1981), and it has been estimated that the EQ was slightly smaller in Neanderthals than in early humans (Ruff, Trinkaus, & Holliday, 1997). This mercifully restores humans to the top of the pile. The increase in brain size, rather than the emergence of one or more so-called grammar genes, may well have been the vehicle for the evolution of language, and perhaps of more general cognitive developments. Nevertheless the increase in brain size itself was surely dependent on genetic changes. One gene that is a specific regulator of brain size is the abnormal spindle-like microcephaly associated (ASPM) gene, and phylogenetic analysis suggests strong positive selection of this gene in the lineage leading to Homo sapiens (Evans, et al., 2004). Indeed, a selection sweep appears to have occurred as recently as 5,800 years ago, suggesting that the human brain is still undergoing rapid evolution (Mekel-Bobrov, et al., 2005). Interestingly, two other genes appear to have resulted in increased brain size through inactivation rather than positive selection. One of these encodes the enzyme CMP-Nacetylneuraminic acid (CMP-Neu5Ac) hydroxylase (CMAH). An inactivating mutation of this gene has resulted in a deficiency in humans of the mammalian

419

sialic acid N-glycolylneuraminic acid (Neu5Gc). This appears to have been the end result of a process of down-regulation throughout mammalian evolution, since the acid is absent in Neanderthal fossils as well as in humans, and is only weakly present in chimpanzees relative to other primates and mammals. Chou et al. (2002) speculate that inactivation of the CMAH gene may have removed a constraint on brain growth in human ancestry. Molecular-clock analysis indicated that the inactivating mutation probably occurred some 2.7 million years ago, leading up to the expansion in brain size from around 2.1 million years ago. The other inactivating mutation that may also have contributed to the increase in brain size occurred on a gene that encodes for the myosene heavy chain MYH16. This chain is responsible for the heavy masticatory muscles in most primates, including chimpanzees and gorillas, as well as the early hominids. Molecular-clock analysis suggests that the inactivation dates from around 2.4 million years ago, leading to speculation that the diminution of jaw muscles and their supporting bone structure removed a further constraint on brain growth (Stedman et al., 2004). It is a matter of further speculation as to why this seemingly deleterious mutation became fixed in the ancestral human population. It may have had to do with the change from a predominantly vegetable diet to a meat-eating one, or it may have had to do with the increasing use of the hands rather than the jaws to prepare food (Currie, 2004). As a spin-off, though, it may have allowed brain size to increase in the face of selective pressures that favored more complex cognition, such as theory of mind and recursive language. Curiously, though, it is not entirely clear that the increase in the size of particular areas of the brain was driven by selection for the particular functions subserved by those areas. Rather, the increases in subareas of the brain are tightly constrained by the fact that the brain grows as a covarying whole. Different regions do grow at different rates, but according to a fairly inflexible rule, with structures that emerge late in development increasing more than structures that emerge early. The disproportionate growth in the cortex relative to other parts of the brain might therefore have arisen, not because the cortex is intrinsically specialized for “higher-order” functions, but because the cortex is the last area to undergo neurogenesis. According to this view, structure preceded function, and cognitive traits like language found representation in the cortex because the cortex grew disproportionately relative to subcortical structures (Finlay, Darlington, & Nicastro, 2001). This notion is somewhat controversial, since there is also some evidence for selective pressures on the sizes of different brain regions, leading to the notion

420





of cerebrotypes (see Geary, 2005, p. 104, for further discussion), including evidence for a genetic influence over the size of the frontal cortex (Piao et al., 2004). Nevertheless, Christiansen and Dale (2004) have suggested that the structure of the brain did not adapt to accommodate language, as generally supposed, but rather that language adapted to biological constraints. In some respects, the emerging view is somewhat reminiscent of Chomsky’s notion that language was a consequence of increased brain size. The likely scenario, though, is that the increased brain size was driven by the cognitive demands of the Pleistocene, and that language was one of the complex functions that were able to emerge as a consequence of increased neural capacity. Although there are plausible scenarios as to how universal grammar might have evolved (e.g., Nowak, 2001; Nowak, Plotkin, & Jansen, 2000), there is growing recognition that learning may have played an important role, with the help of cultural transmission. This view is supported by the development of network simulations that mimic the acquisition of language, without the necessity to postulate any pre-existing, innately determined universal grammar (e.g., Chang, Dell, & Bock, 2006; Christiansen & Dale, 2004). Grammaticalization A related view is that language assumes its shape through a gradual process, known as “grammaticalization” (Hopper & Traugott, 2003) rather than through any abrupt genetic change or the expression of an innate universal grammar, although natural selection no doubt played a critical role. There is no direct evidence as to when this first occurred, although it seems likely that the opportunity for grammaticalization evolved during the Pleistocene, and was associated with the increase in brain size. According to the 18th-century English philologist John Horne Tooke (1857), the earliest “language” consisted only of nouns and verbs (“necessary words”), while other word classes, such as adjectives, adverbs, prepositions, and conjunctions arose from the abbreviation or “mutilation” of these necessary words. Adjectives, for example, may derive from nouns, as in heavenly or manly, where the suffix –ly is presumably a contraction of –like. This idea is endorsed by Hurford (2003), who gives an example from the emergence of Tok Pisin, a creole that derived from pidgin in Papua New Guinea. Pidgins are makeshift languages created as a means of communication between speakers of different languages and have little or no grammar. They often develop over the course of a few generations into creoles, which have more sophisticated grammars. The Papua New Guinean pidgin consisted only of nouns and verbs, but in Tok Pisin adjectives were signaled by the addition of the suffix -fela (or –pela), itself derived from the





39 卷

English noun fellow. It also seems reasonable to suppose that phrase structure preceded the ability to combine phrases into single utterances. Christiansen and Kirby (2004) give the example of the phrases My dad/He plays tennis/He plays tennis with his colleagues, which can be combined into the more compact form My dad plays tennis with his colleagues. The idea that grammar may have evolved in response to cultural and environmental pressures is reinforced by increasing evidence for extreme variability between present-day languages. Tomasello (2003), for example, writes as follows: As we investigate more and more of the world’s 6000+ languages, this hypothesis [of universal grammar] is proving more and more difficult to maintain. Of course we can take the grammar of Standard Average European and impose it on other languages. But when we look at, for example, Austronesian languages, on their own terms, we find that they work in quite unexpected ways; they simply do not have some of the categories and constructions that appear in European languages, and they of course have some of their own categories and constructions as well (p. 101). An example of a language that is grammatically impoverished can be found among the Pirahã, the tribe of some 200 people in Brazil (Everett, 2005). It has no numbers or system of counting, no color terms, no perfect tense and only a very primitive way of talking about relative time. The grammar itself is simple; for example, there is no embedding of phrases, and the system of pronouns is the simplest yet recorded. The Pirahã seem to live in the present, with no creation myths, no art or drawing, no individual or collective memory for more than two generations past. They have remained monolingual despite more than 200 years of trading with Portuguese-speaking Brazilians and speakers of other native languages. One might be tempted to believe that they suffer from some genetic defect, but this idea is rejected by Everett, who describes them anecdotally as “some of the brightest, pleasantest, and most fun-loving people that I know” (p. 621). It seems entirely likely, then, that the emergence of grammar was the outcome of pressures toward economy of expression in the face of increasing complexity. Up to a certain level of complexity, single calls are sufficient to communicate all that needs to be communicated. For example, vervet monkeys give different warning cries to distinguish between a number of different threats, such as snakes, hawks, eagles, or leopards. When a monkey makes one of these cries, the troop acts appropriately, clambering up trees in response to a leopard call or running into the bushes in response to an eagle call (Cheney & Seyfarth, 1990). Life may have grown

3期

Michael C. Corballis. How Language Evolved

more complex in the Pleistocene, with numerous activities to be labeled and perhaps communicated. Suppose, for example, that there were some ten meaningful objects (such as lion, tree, stone chopper, child, etc) and six actions (such as run, climb, throw, carry, etc). This gives rise to 60 possible combinations (some admittedly unlikely), and to attach a single label to each would therefore require 60 distinct utterances. There is clear benefit in attaching distinct symbols, which we can now call “words,” to each object and to each action, requiring only 16 words. Hence the first step toward grammar may have the distinction between different classes of words, and the discovery that meanings could be generated by combining them, resulting in a considerable gain in economy (Nowak et al., 2000) A real-life example of the “discovery” of a combinatorial principle is provided by Nicaraguan Sign Language (NSL), which first emerged some 25 years ago when a school was established for deaf children. In an experimental study, users of NSL were asked to describe the action of rolling down a slope. Those from the first cohort mimicked both the rolling and the down motions in a single gesture. The majority of those in the second and third cohorts indicated the motion in two gestures, one to indicate a rolling motion and the other to indicate downward motion (Senghas, Kita, & Özyürek, 2004). This is a living example of grammaticalization. Nowak (2001) has extended this approach theoretically to indicate how universal grammar might have evolved. In their computational modelling, Nowak and his collaborators assume an evolutionary perspective, but it is perhaps equally plausible to suppose that grammaticalization was shaped by culture and communicative demands rather than, or as well as, by genetic mutation and natural selection. A combinatorial structure is efficient in the sense that it cuts down the number of elements from which to build a message. Speech, for example, is built from a relatively small number of phonemes—44 in American English—that are combined hierarchically into morphemes, words, phrases, and sentences, and thence into stories, epistles, contracts, and the like. Even so, the generation of complex structures requires extensive working memory, and it may have been the demands on processing capacity rather than the emergence of combinatorial rules, or any kind of universal grammar, that drove the increase in brain size. Indeed, Baddeley, Gathercole, and Papagno (1998) have argued that the phonological loop—a component of working memory—evolved primarily as a device for learning new words. One of the final stages in the emergence of a fully syntactic language may have been the incorporation of recursive, embedded structures, which imposes heavy demands on working memory. This may have coincided with

421

the secondary spurt in brain size that occurred from about 500,000 years ago (Wood & Collard, 1999). Striedler (2006) points out that an increase in brain size of itself provides for better neural connectivity, which he calls the rule of “large equals wellconnected” (p. 6). This may have been the final change that provided fore the precision and intricate programming necessary for syntactic language, although again there is likely to have been some regional selectivity within the brain (Geary, 2005). In summary, the hierarchical, combinatorial structure of language probably evolved during the Pleistocene, as an adaptation toward increased efficiency of communication in the face of increased complexity of social life. Language probably did not evolve as an isolated skill, but was rather linked to other capacities involving recursive thought, such as enhanced theory of mind, and perhaps mental time travel—the ability to project one’s self mentally forward and backward in time (Suddendorf & Corballis, 1997). The emergence of speech Attempts to communicate with the great apes have taught us at least one thing—these animals are a long way from being able to speak. For example Viki, a chimpanzee raised from infancy in a human household, could never utter more than about four indistinct words (Hayes, 1952). The capacity to speak must itself have evolved relatively slowly, since considerable anatomical and neural modifications were necessary in order to make articulate speech possible. These modifications, which were probably largely independent of increases in brain size, probably also took place during the Pleistocene, but may not have reached the stage at which autonomous, articulate speech was possible until the emergence of our own species. Articulate speech required radical change in the neural control of vocalization. The species-specific and largely involuntary calls of primates depend on an evolutionarily ancient system that originates in the limbic system, but in humans this is augmented by a separate neocortical system operating through the pyramidal tract, and synapsing directly with the brainstem nuclei for the vocal cords and tongue (Ploog, 2002). The evidence suggests that voluntary control of vocalization in the chimpanzee is extremely limited, at best (e.g., Goodall, 1986). The development of cortical control must surely have occurred gradually, rather than in all-or-none fashion, and perhaps reached its final level of development only in anatomically modern humans. An adaptation unique to H. sapiens is neurocranial globularity, defined as the roundness of the cranial vault in the sagittal, coronal, and transverse planes, which is likely to have increased the relative size of the

422





temporal and/or frontal lobes relative to other parts of the brain (D. E. Lieberman, McBratney, & Krovitz, 2002). These changes may reflect more refined control of articulation and/or more accurate perceptual discrimination of articulated sounds. Speech also required anatomical changes to the vocal tract. While this too must have been gradual, P. Lieberman (1998; Lieberman, Crelin, & Klatt, 1972) has argued that the lowering of the larynx, an adaptation that increased the range of speech sounds, was incomplete even in the Neanderthals of 30,000 years ago. Perhaps, then, it was this, rather than the absence of language itself, that kept them separate from H. sapiens, leading to their eventual extinction. Lieberman’s work remains controversial (e.g., Gibson & Jessee, 1999), but there is other evidence that the cranial structure underwent critical changes subsequent to the split between anatomically modern and earlier “archaic” Homo, such as the Neanderthals, Homo heidelbergensis, and Homo rhodesiensis. One such change is the shortening of the sphenoid, the central bone of the cranial base from which the face grows forward, resulting in a flattened face (D. E. Lieberman, 1998). D. E. Lieberman speculates that this is an adaptation for speech, contributing to the unique proportions of the human vocal tract, in which the horizontal and vertical components are roughly equal in length—a configuration, he argues, that improves the ability to produce acoustically distinct speech sounds. Also critical to articulate speech was an increase in the innervation of the tongue. The hypoglossal nerve is much larger in humans than in great apes, probably because of the important role of the tongue in speech. Fossil evidence suggests that the size of the hypoglossal canal in early australopithecines, and perhaps in Homo habilis, was within the range of that in modern great apes, whereas that of the Neanderthal and early H. sapiens skulls contained was well within the modern human range (Kay, Cartmill, & Barlow, 1998), although this has been disputed (DeGusta, Gilbert, & Turner, 1999). Changes in the control of breathing were also important for speech, and this is at least partly reflected in the fact that the thoracic region of the spinal cord is larger in humans than in nonhuman primates, probably because breathing during speech involves extra muscles of the thorax and abdomen. Fossil evidence indicates that this enlargement was not present in the early hominids or even in Homo ergaster, dating from about 1.6 million years ago, but was present in several Neanderthal fossils (MacLarnon & Hewitt, 1999, 2004). The culmination of changes required for articulate speech may well have occurred very late in the evolution of Homo, perhaps even with the arrival of our own species. Some have taken this as evidence that language itself emerged only in Homo sapiens.





39 卷

Yet such radical changes must have taken place slowly, at least over the duration of the Pleistocene. This suggests that there must have been a prior form of communication that was shaped in two parallel ways, toward more sophisticated syntax, and both toward a vocal form. There are compelling reasons to suppose that this communication was initially based on manual gestures, but increasingly incorporated movements of the face, and finally articulate vocalization. The gestural origins of language The hypothesis that language evolved from manual gestures has a long pedigree, dating back at least to the 18th century philosopher Condillac (1971/1746). It was revived by Hewes (1973), and has more recently been revised and extended by several authors (e.g., Arbib, 2005; Armstrong, 1999; Armstrong et al., 1995; Corballis, 1992, 1999, 2002, 2003a; Givón, 1995; Place, 2000; Rizzolatti & Arbib, 1998; Skoyles, 2000). As we have seen, it has become abundantly clear that great apes, our closest relatives, cannot acquire speech, but they have achieved a moderate level of success using manual gestures (e.g., Gardner & Gardner, 1969; Savage-Rumbaugh et al., 1998). This suggests that the common ancestor of Homo sapiens and the chimpanzee and bonobo would not have been equipped for a vocal form of language, but might well have begun to develop a form of communication based on manual gestures. This development may well have been enhanced in the hominid lineage by the emergence of bipedalism, which would have freed the hands from any involvement in locomotion. The idea that language may have evolved from manual gestures is further supported by other lines of evidence, some of them recent. Signed languages First, there is no question that true language can be accomplished using manual and facial gesture, without voicing. It is now well established that the signed languages of the deaf display all of the essential linguistic properties of spoken language (Emmorey, 2002; Neidle, Kegl, MacLaughlin, Bahan & Lee, 2000; Stokoe, 1960). Signs are fundamentally different from gestures of the sort that occur in everyday life, independently of any linguistic function, and which tend to be iconic (i.e., pictorial or mimed) rather than symbolic. Signs, in contrast, tend to be symbolic, although there is also an analogue component, suggesting a link to a more pictorial form of communication. In the course of evolution, then, pantomimes of actions might have incorporated gestures that are analogue representations of objects or actions (Donald, 1991), but through time these gestures may have lost the analogue features and become abstract. In modern American Sign Language,

3期

Michael C. Corballis. How Language Evolved

for example, the sign for home was once a combination of the signs for eat, which is a bunched hand touching the mouth, and the sign for sleep, which is a flat hand on the cheek. Now it consists of two quick touches on the cheek, both with a bunched handshape, so the original iconic components are effectively lost. The shift over time from iconic gestures to arbitrary symbols is termed conventionalization, and appears to be common to both human and animal communication systems, and is probably driven by increased economy of reference (Burling, 1999). Although the nature of the differences between signed and spoken languages remains somewhat controversial, it now seems reasonably clear that they share the same underlying structure, even to the level of phonology. * Emmorey (2002) summarizes as follows: The research strategy of comparing signed and spoken languages makes it possible to tease apart which phonological entities arise from the modality of articulation and perception and which properties arise from the nature of the expression system of human language, regardless of modality. The results thus far suggest that basic phonological entities such as distinctive features, segments, and syllables do not arise because language is spoken; that is, they do not arise from the nature of speech. Although the detailed structure of these entities differs (e.g., distinctive features in signed language are based on manual, rather than oral, articulation) they appear to play the same organizational role for both signed and spoken languages (pp. 41-42). It is also increasingly recognized that signed languages involve movements of the face as well as of the hands (e.g., Emmorey, 2002). Facial expressions and head movements can turn an affirmative sentence into a negation, or a question. Mouth gestures are especially important, and have been linked to the equivalent of phonology, especially in European signed languages. Explicit schemes for the phonological composition of mouth movements have been proposed for a number of European Sign languages, including Swedish, English, and Italian (Sutton-Spence & Boyes-Braem, 2001). Mouth gestures can serve to disambiguate hand gestures, and as part of more general facial gestures provide the equivalent of prosody in speech (Emmorey, 2002). This work is still in its infancy, but suggests an evolutionary scenario in which mouth movements gradually assume dominance over hand movements, * At one time the sign-language equivalent of phonemes were referred to as cheremes, and the equivalent of phonology as cherology. However most linguists now use the terms phoneme and phonology to apply to signed languages, even though there is no acoustic component (Emmorey, 2002).

423

and were eventually accompanied by voicing and movements of the tongue and vocal tract. Thus, perhaps, speech was born. Of course the fact that signed languages have the same basic structure as spoken languages does not prove that spoken language evolved from a manuofacial language. Nevertheless, the fact that manual and facial control is superior to vocal control in our nearest primate relatives makes it at least plausible that vocal language grew out of a system that controlled movements of the hands and face. Speech itself as gesture The argument for continuity between manual and vocal language is further supported by arguments that speech itself is fundamentally gestural. This idea is captured by the motor theory of speech perception (Liberman, Cooper, Shankweiler, & StuddertKennedy, 1967), and by what has more recently become known as articulatory phonology (Browman & Goldstein, 1995). In this view speech is regarded, not as a system for producing sounds, but rather as a system for producing articulatory gestures, through the independent action of the six articulatory organs— namely, the lips, the velum, the larynx, and the blade, body, and root of the tongue. This approach is based largely on the fact that the basic units of speech, known as phonemes, do not exist as discrete units in the acoustic signal (Joos, 1948), nor are they discretely discernible in mechanical recordings of sound, as in a sound spectrograph (Liberman et al., 1967). One reason for this is that the acoustic signals corresponding to individual phonemes vary widely, depending on the contexts in which they are embedded. In particular, the sound patterns for a particular phoneme can be quite different, depending on the neighboring phonemes. For example, the acoustic signal corresponding to the phoneme /d/ is quite different depending on whether it is followed by /i/, as in “dig”, or /u/, as in “do” (see Liberman et al., 1967, Fig. 1), yet in both cases we hear the same sound. Despite the complexity of mapping speech sounds onto our perceptions, we can perceive speech at remarkably high rates, up to at least 10–15 phonemes per second, which seems at odds with the idea that some complex, context-dependent transformation is necessary. The conceptualization of speech as gesture overcomes these difficulties, at least to some extent, since the articulatory gestures that give rise to speech partially overlap in time (coarticulation), which makes possible the high rates of production and perception (Studdert-Kennedy, 2005). Mirror neurons More direct evidence for the links between manual, facial, and vocal gestures has been documented through the discovery in the primate brain of so-called

424





“mirror neurons,” which respond both when the animal makes a particular reaching and grasping movement and when it observes the same movement made by others (Rizzolatti, Fadiga, Gallese & Fogassi, 1996). This implies a motor theory of grasp perception. Mirror neurons are located in area F5 of the prefrontal cortex, which is the homolog of Broca’s area, an area critical to the production of speech in humans (Rizzolatti & Arbib, 1998). Primates have little if any cortical control over vocalization (Ploog, 2002), but it appears that Broca’s area, now considered part of a more general “mirror system” involving the understanding of biological action (Rizzolatti, Fogassi, & Gallese, 2001), eventually incorporated vocalization. As we shall see below, this may not have been fully accomplished until the emergence of Homo sapiens. Even in primates, there are neural links between hand and mouth. In monkeys, the mirror system responds to movements of the mouth as well as to movements of the hands (Ferrari, Gallese, Rizzolatti & Fogassi, 2003). In humans, the link between hand and mouth can be demonstrated behaviourally as well. Gentilucci, Benuzzi, Gangitano, and Grimaldi (2001) showed that when subjects were instructed to open their mouths while grasping objects, the size of the mouth opening increased with the size of the grasped object, and conversely, when they open their hands while grasping objects with their mouths, the size of the hand opening also increased with the size of the object. Grasping movements of the hand also affect the kinematics of speech itself. Grasping larger objects (Gentilucci et al., 2001) and bringing them to the mouth (Gentilucci, Santunione, Roy & Stefanini, 2004) induces selective increases in parameters of lip kinematics and voice spectra of syllables pronounced simultaneously with action execution. Even observing another individual grasping or bringing to the mouth larger objects affects the lip kinematics and the voice spectra of syllables simultaneously pronounced by the viewer (Gentilucci, 2003). In the course of evolution, this mechanism of double command to hand and mouth could have been instrumental in the transfer of a communication system, based on the mirror system, from movements of the hand to movements of the mouth (Gentilucci & Corballis,, 2006). This evolutionary sequence of events may be paralleled by those in the development of language in children, in which gestures are intimately tied to vocalizations (e.g., Bates & Dick, 2002). For example, canonical babbling in children aged from 6-8 months is accompanied by rhythmic hand movements (Masataka, 2001). Manual gestures predate early development of speech in children, and predict later success even up to the two-word level (Iverson & Goldin-Meadow, 2005). Word comprehension in children between 8 and 10 months and word





39 卷

productions between 11-13 months are accompanied by gestures of pointing and showing and gestures indicating recognition, respectively (Bates & Snyder, 1987). Even in adults, it is well known that manual gestures accompany speech, to form a single integrated system (McNeill, 1985, 1992). This means, of course, that the transition from manual gesture to speech was not complete, although it was nevertheless sufficient to enable effective communication through an acoustic signal alone, as when we communicate by telephone or radio. The visual component is nevertheless informative. Many people with impaired hearing develop lipreading as an effective alternative, and watching people’s lips while they talk can influence what they report hearing. This is illustrated by the McGurk effect, in which dubbing a syllable (e.g., “ba”) onto a mouth that is saying something different (e.g., “ga”) shifts the perception to some intermediate syllable (e.g., “da”) (McGurk & MacDonald, 1976). Preuss, Qi, and Kaas (1999) have also documented greater differentiation in the visual cortex in humans than in apes (including the chimpanzee), which may conceivably relate to the emergence of lipreading in hominid evolution. The connections between hand and mouth may have been established initially in the context ingestion, and the acts of grasping and bringing food to the mouth, but adapted later for communication. MacNeilage (1998) has suggested that speech itself originated from repetitive ingestive movements of the mouth. This may well be correct, but it is perhaps only half the story, since it neglects the important role, in primates at least, of hand and arm movements in eating. Clicks before vocal articulation? MacNeilage (1998) also drew attention to the similarity between human speech and primate soundproducing facial gestures such as lip smacks, tongue smacks, and teeth chatters. Ferrari et al. (2003) recorded discharge both from mirror neurons in monkeys during the lip smack, which is the most common facial gesture in monkeys, and from other mirror neurons in the same area during mouth movements related to eating. These observations raise the possibility that the earliest audible language was composed of nonvocalized sounds. This idea receives some support from click languages. Aside from a now extinct click language in Australia, click languages are confined to Africa. Two of the many groups that make extensive use of click sounds are the Hadzabe and San, who are separated geographically by some 2000 kilometers, and genetic evidence suggests that the most recent common ancestor of these groups goes back to the root of present-day mitochondrial DNA lineages, perhaps as early as 100,000 years ago (Knight et al., 2003).

3期

Michael C. Corballis. How Language Evolved

This is not to say that click languages are in any way primitive or linguistically deficient. Moreover modern click languages do incorporate vocalizations. It is also likely that the earliest click languages, and indeed gestural languages generally, included vocalization, perhaps initially in the form of grunts, but with gradual shaping toward articulate sounds. Nevertheless the critical event that resulted in fully articulate vocalization may have emerged after the earliest click languages were formed, and may have depended on a genetic mutation. The FOXP2 gene Evidence for a genetic component underlying articulate vocal speech came initially from studies of a speech disorder afflicting an extended family, known as the KE family, in England. Over three generations, half of the members of this family have been affected by the disorder, which persists from the affected child’s first attempts to speak through adulthood (Vargha-Khadem, Watkins, Alcock, Fletcher, & Passingham, 1995). Some have argued that the deficit is primarily linguistic, mainly (but not exclusively) affecting the ability to use inflectional morphosyntactic rules, such as changing the endings of words to mark tense or number (Gopnik, 1990); indeed Pinker (1994) explicitly identified the deficit as a loss of the “grammar gene.” Other, more recent work suggests, though, that the core deficit is one of articulation rather than syntax, with morphosyntax a secondary casualty (Alcock, Passingham, Watkins & Vargha-Khadem, 2000; Vargha-Khadem et al., 1998; Watkins, Dronkers, & Vargha-Khadem, 2002). The disorder is now known to be due to a point mutation on the FOXP2 gene (forkhead box P2) on chromosome 7, and for normal speech to be acquired, two functional copies of this gene seem to be necessary (Fisher, Vargha-Khadem, Watkins, Monaco, & Pembrey, 1998). FOXP2 has been sequenced in humans, chimpanzees, gorillas, orangutans, rhesus monkeys, and mice (Enard et al., 2002). The sequences reveal changes in amino-acid encoding and the pattern of nucleotide polymorphism that emerged after the split between human and chimpanzee lineages, and were therefore probably selected for their beneficial effect on vocal communication. The FOXP2 gene is involved in the development of several structures, including the lungs, intestinal system, and cardiovascular system (Shu, Yang, Zhang, Lu, & Morrisey (2001), as well as several brain areas. Nevertheless the mutation of the gene in the speechaffected members of the KE family may have specifically influenced the functioning of the mirror system. Liégeois et al. (2003) used fMRI to record brain activity in both affected and unaffected members of the KE family while they covertly generated verbs in response to nouns. Whereas

425

unaffected members showed the expected activity concentrated in Broca’s area in the left hemisphere, affected members showed relative underactivation in both Broca’s area and its right-hemisphere homologue, as well as in other cortical language areas, such as Wernicke’s area and the left supramarginal gyrus. They also showed overactivation bilaterally in regions not associated with language. However, there was bilateral activation in the posterior superior temporal gyrus; on the left, this area overlaps Wernicke’s area, important in the comprehension of language. This suggests that affected members may have generated words in terms of their sounds, rather than in terms of articulatory patterns. Their deficits were not attributable to any difficulty with verb generation itself, since affected and unaffected members did not differ in their ability to generate verbs overtly, and the patterns of brain activity were similar to those recorded during covert verb generation. Another study based on structural MRI showed morphological abnormalities in affected members in the same areas (Watkins, Vargha-Khadem et al., 2002). Enard et al. (2002) have estimated the date of the most recent mutation as occurring within the past 100,000 years, although the standard error was sufficiently large to make it conceivable that it was a defining event in the speciation of Homo sapiens as long as 200,000 years ago. Either way, it suggests that the mutation may have been the critical event that allowed language to become autonomously vocal. This may have had profound consequences for the subsequent development of our species. Homo sapiens and the “human revolution” Human society underwent a profound transformation some time since the emergence of Homo sapiens. In what has been terms a “human revolution” (Mellars & Stringer, 1989), a sudden flowering of art and technology took place in Europe around 30,000 to 40,000 years ago, and included a dramatic expansion of manufactured objects to include projectiles, harpoons, awls, buttons, needles, and ornaments (Ambrose, 2001). Cave drawings in France and Northern Italy, depicting a menagerie of horses, rhinos, bears, lions, and horses, date from the same period (Knecht, Pike-Tay, & White, 1993). The first unequivocal musical instruments are bird-bone flutes from the early Upper Paleolithic in Germany (Hahn & Münzel, 1995), and there is widespread evidence across Russia, France, and Germany for the weaving of fibers into clothing, nets, bags, and ropes, dating from some 29,000 years ago (Soffer, Adovasio, Illingworth, Amirkhanov, Praslov, & Street, 2000). Further, the Neanderthals, who had adapted to the glacial climate of northwestern Eurasian for at least 200,000 years, abruptly disappeared between 30,000

426





and 40,000 years ago, coincident with the arrival of Homo sapiens. Some have argued that the “human revolution,” usually described in terms of European artefacts, is fundamentally Eurocentric. McBrearty and Brooks (2000) refer to “the revolution that wasn’t” (p. 453), and suggest a continuity in the development of technology over the past 300,000 years or so. Mellars (2004) suggests that the revolution probably began in Africa rather than in Europe, and the dramatic events in Europe were due to the arrival of fully modern humans. He dates the dispersal from Africa to Europe via southwestern Asia as occurring within the past 50,000 years. This is consistent with evidence from analysis of mitochondrial DNA that non-Africans share a most recent common ancestor with Africans dated at an estimated 52,000 years ago, while the origins of Homo sapiens in Africa goes back some 170,000 years (Ingman, Kaessmann, Pääbo, & Gyllensten, 2000). An alternative model is developed by Oppenheimer (2003), who has re-evaluated the evidence from mitochondrial DNA and dated the dispersal from Africa at around 83,000 years ago. He traces a migration pattern around the coasts to India, with one stream heading further east and north to Asia and Australia, and another heading northwest to Europe. This model may make more sense of the evidence for the early arrival of Homo sapiens in Australia (see also Macauley et al., 2005), and the disappearance of Homo erectus in southeast Asia at around the same time that the Neanderthals disappeared from Europe. Whatever the migratory route out of Africa, it seems likely that those migrants possessed some quality that enabled them eventually to spread over the globe, displacing (to put it euphemistically) those hominids who had migrated earlier. Not surprisingly, many have speculated that the critical quality was language. Mellars (2004), for example, speculates that it was “the emergence of more complex language and other forms of symbolic communication that gave the adaptive advantage to fully modern humans that led to their subsequent dispersal across Asia and the demise of the European Neanderthals” (pp. 464-465). We have also seen that authors such as Bickerton (1995) and Crow (2002) have similarly associated the emergence of language with the arrival of our own species. Such a scenario, of course, is at odds with the idea that language evolved gradually over the past two million years, and was associated with the increase in brain size. Moreover, as noted earlier, the Neanderthals had brains that were as large as those of modern humans (although possibly lower EQ, as noted earlier), and yet they too seem to have succumbed to the invasion of modern Homo sapiens. If not language, then, what was the quality that led to human occupation of the planet?





39 卷

The power of speech My contention is that it was the emergence of autonomous speech, and not of language itself, that was critical to the human revolution, and the fateful dominance of humans over their now-extinct cousins (Corballis, 2004). In dating the mutation of the FOXP2 gene at somewhere in the past 100,000 years, Enard et al. (2002) write that the date “is compatible with a model in which the expansion of modern humans was driven by the appearance of a moreproficient spoken language” (p. 871). This is not to say that the FOXP2 gene was the only gene involved in the switch to an autonomously vocal system; rather, it was probably just the final step in a series of progressive changes. What, then, might have been the selective pressures that drove the communications system away from the hands and toward the face, and eventually toward a form in which messages could be conveyed by voice alone? One factor may have been greater energy requirements associated with manual gesture; anecdotal evidence from courses in sign language suggests that the instructors require regular massages in order to meet the sheer physical demands of signlanguage expression. In contrast, the physiological costs of speech, in particular, are so low as to be nearly unmeasurable (Russell, Cerny, & Stathopoulos, 1998); in terms of the expenditure of energy, speech adds little to the cost of breathing, which we must do anyway to sustain life. Increasing involvement of the face would have had the further advantage of freeing the hands for other activities, such as carrying things, grooming, or the use or manufacture of tools. The addition of vocalization, even if initially only in the form of grunts, would have served to attract attention. Vocal communication is more efficient not only in terms of the expenditure of energy, but also in terms of simply getting the message across. Visual communication requires that the sender of a message be visible to the receiver, whereas vocal communication can take place at night, or around objects, or when the receiver’s eyes are closed or engaged elsewhere. There is some evidence that in hunter-gatherer societies that the telling of stories, critical for the transmission of culture, may have been accomplished at night. In describing life among the San, a modern hunter-gatherer society, Konner (1982) notes that conflicts within the group were resolved by discussions that began at dusk, and often continued right through the night. The use of the face and eventually the voice would have enhanced pedagogy, so that people could explain skills while at the same time as demonstrating them with their hands, as in modern TV cooking shows. Konner notes that pedagogy, too, tends to be a night-time activity among the San:

3期

Michael C. Corballis. How Language Evolved

Not only stories, but great stores of knowledge are exchanged around the fire among the !Kung and the dramatizations—perhaps best of all— bear knowledge critical to survival. A way of life that is difficult enough would, without such knowledge, become simply impossible. (p. 171) Pinker and Bloom (1990) suggest that vocal oratory (especially at night, one is tempted to think) might have been subject to sexual selection, citing Symons’s (1979) observation that tribal chiefs are often both gifted orators and highly polygynous. That is, women may have found men who speak a lot to be sexually attractive, leading to the propagation of genes favoring oratory—although this scarcely accords with the alternative romantic ideal of the strong silent male. The possible role of sexual selection in shaping the human mind, including language, is discussed by Miller (2000). The switch from a manuo-visual to a vocal-auditory form of language was probably gradual, and language has probably always been a combination of the two modes. Even today, we gesture manually and facially as we speak. But was the final achievement of a vocal mode that could carry a verbal message more or less autonomously really sufficient to explain the human revolution, leading to the extinction of other extant hominids? Even small changes in the efficiency of communication can have momentous effects. This is illustrated by the later emergence of writing and literacy, and more recently by the emergence of computer technology, leading to the Internet. Communication systems also permit the accumulation of culture, so that advances feed on advances, in a ratchet-like way. It may therefore not be too farfetched to suppose that we did talk our way into the human revolution, and our hominid cousins out of existence. In other words, so to speak, language aided emerging humans to organize their groups and coordinate their behaviour (e.g., hunts, warfare) in ways that enable them to outcompete related species. Conclusion The evolution of language remains a controversial topic, no less so perhaps than it was when the Linguistic Society of Paris imposed the ban in 1866. Although few would now maintain that language was a gift from God, or is somehow not amenable to scientific study, fundamental issues remain. One issue has to do with the extent to which language is shaped by genes or by culture. I have argued that the cultural influence is more important than implied by Chomsky’s notion of a biologically determined universal grammar, although the fact that no other species has demonstrated anything resembling true human language indicates that natural selection must have played a role. A related issue is whether language evolved as a distinct system, as implied by

427

the notion of universal grammar, or whether it coevolved with other sociocultural capacities, such as theory of mind or mental time travel. Although these capacities depend in part on different brain regions, they may possess properties in common. I have argued that the most salient of these may be recursion—the ability to embed phrases in phases, ideas in ideas, scenarios in scenarios. Indeed, recursion may be the principle feature that distinguishes the human mind from that of other animals (Corballis, 2003b). Issues about the evolution of language can be clarified somewhat if a clear distinction is made between language and speech. Although language itself may be partly determined by culture, speech itself clearly depended on biological changes, including alterations to the vocal tract, mechanism of breathing, and neural control of vocalization. These changes were no doubt the result of natural selection. The distinction between language and speech is supported by the fact that signed languages have all of the essential properties of true language, and I have argued further that language itself evolved first as a visuomanual system, only gradually incorporating movements of the face and voicing. Indeed speech may not have become autonomous until the appearance of our own species, Homo sapiens, within the past 200,000 years, and possibly even later, as suggested earlier. The notion that language evolved from manual gestures is itself controversial, although increasingly supported by developmental and neurophysiological evidence. It is also beginning to make sense of some of the archaeological evidence. There are strong reasons to believe that the essentials of the human mind evolved during the Pleistocene, from around 1.8 million years ago, and it is difficult to imagine that these essential would not have included language. Other evidence, including the so-called “human revolution,” has led a number of authors to argue that language did not evolve until the emergence of our own species, perhaps even within the past 100,000 years. According to the gestural-origins theory, it was not language, but rather speech, that emerged as an autonomous system, and that explains the dramatic developments in culture and manufacture that have taken place over the past 100,000 years. This is not to say that the switch from manual gesture to speech was sudden; rather, facial gesture and vocalization were probably gradually introduced, with vocalization eventually assuming dominance. We should not forget that manual gestures are still a prominent accompaniment of speech. In summary, then, I have argued that the unique properties of human language probably evolved over the past 2 million years, along with other unique characteristics of the human mind. I have proposed

428





that two developments occurred in parallel. One was the grammaticalization of communication, made possible through an increase in brain size, and made necessary by the pressures toward more effective social cognition. This probably took place gradually throughout the Pleistocene. The other was the gradual shift from a form of communication dominated by manual gestures, toward a form of communication focused on the face, with the eventual incorporation of vocalization. By freeing the hands from communication, by allowing communication at night, and by removing much of the metabolic load, this final adjustment may have been at least partially responsible for the human revolution, leading to the dominance of Homo sapiens on the planet, and the extinction of all other hominid species. References Alcock, K. J., Passingham, R. E., Watkins, K. E. & Vargha-Khadem, F. (2000). Oral dyspraxia in inherited speech and language impairment and acquired dysphasia. Brain & Language, 75, 17-33. Alexander, R. D. (1979). Darwinism and human affairs. Seattle, WA: University of Washington Press. Ambrose, S. H. (2001). Paleolithic technology and human evolution. Science, 291, 1748-1752. Arbib, M.A. (2005). From monkey-like action recognition to human language: An evolutionary framework for neurolinguistics. Behavioral & Brain Sciences, 28, 105-168. Armstrong, D. F. (1999). Original signs: gesture, sign, and the source of language. Washington, DC: Gallaudet University Press. Armstrong, D. F., Stokoe, W. C., & Wilcox, S. E. (1995). Gesture and the nature of language. Cambridge, UK: Cambridge University Press. Arnold, K. & Zuberbühler, K. (2006). Semantic combinations in primate calls. Nature, 441, 303. Baddeley, A., Gathercole, S., & Papagno, C. (1998). The phonological loop as a language learning device. Psychological Review, 105, 158173. Barkow, J., Cosmides, L., & Tooby, J. (Eds.) (1992). The adapted mind: Evolutionary psychology and the generation of culture. New York: Oxford University Press. Bates, E. & Dick, F. (2002). Language, gesture, and the developing brain. Developmental Psychobiology, 40, 293-310. Bates, E. & Snyder, L.S. (1987). The cognitive hypothesis in language development. In E. Ina, C. Uzgiris, & E. J. McVicker Hunt (Eds.), Infant performance and experience: New findings with the ordinal scales (pp. 168-204). Urbana, IL: University of Illinois Press. Baumeister, R. F. (2005). The cultural animal: Human nature, meaning, and social life. New York: Oxford University Press. Bickerton, D. (1995). Language and human behavior. Seattle, WA: University of Washington Press. Bickerton, D. (2003). Symbol and structure: A comprehensive framework for language evolution. In M. H. Christiansen, & S. Browman, C.P., & Goldstein, L.F. (1995). Dynamics and articulatory phonology. In T. van Gelder & R. F. Port (Eds.), Mind as motion (pp. 175-193). Cambridge, MA: MIT Press. Burling, R. (1999). Motivation, conventionalization, and arbitrariness in the origin of language. In B.J. King (Ed.), The origins of language: What nonhuman primates can tell us (pp. 307-350). Santa Fe, NM: School of American Research Press, Calvin, W. H., & Bickerton, D. (2000). Lingua ex machina: Reconciling Darwin with the human brain. Cambridge, MA: MIT Press. Chang, F., Dell, G. S., & Bock, K. (2006). Becoming syntactic. Psychological Review, 113, 234-272. Cheney, D. L., & Seyfarth, R. M. (1990). How monkeys see the world. Chicago: University of Chicago Press. Chomsky, N. (1966). Cartesian linguistics: A chapter in the history of rationalist thought. New York: Harper & Row. Chomsky, N. (1975). Reflections on language. New York: Pantheon.





39 卷

Chou, H.-H., Hakayama, T., Diaz, S., Krings, M., Indriati, E., Leakey, M., et al. (2002). Inactivation of CMP-N-acetylneuraminic acid hydroxylase occurred prior to brain expansion during human evolution. Proceedings of the National Academy of Sciences, USA 99, 11736-11741. Christiansen, M. H., & Kirby, S. (2004). Language evolution: The hardest problem in science? In M. H. Christiansen, & S. Kirby (Eds.), Language evolution (pp. 1-15). Oxford: Oxford University Press.

Christiansen, M. H., & Dale, R. (2004). The role of learning and development in language evolution: A connectionist perspective. In D. K. Oller & U. Griebel (Eds.), Evolution of communication systems (pp. 91-110). Cambridge, MA: MIT Press. Condillac, E.B. de. (1971). An essay on the origin of human knowledge. T. Nugent (Tr.), Gainesville, FL: Scholars Facsimiles and Reprints. (Originally published 1746). Corballis, M. C. (1991). The lopsided ape: Evolution of the generative mind. New York: Oxford University Press. Corballis, M. C. (1992). On the evolution of language and generativity. Cognition, 44, 197-226. Corballis, M. C. (1999). The gestural origins of language. American Scientist, 87, 138- 45. Corballis, M. C. (2002). From hand to mouth: The gestural origins of language. Princeton, NJ: Princeton University Press. Corballis, M. C. (2003a). From hand to mouth: The gestural origins of language. In M. H. Christiansen & S. Kirby (Eds.), Language evolution (pp. 201-218). Oxford: Oxford University Press. Corballis, M. C. (2003b). Recursion as the key to the human mind. In Sterelny, K. & Fitness, J. (Eds.), From mating to mentality: Evaluating evolutionary psychology (pp. 155-171). New York: Psychology Press. Corballis, M. C. (2004). The origins of modernity: Was autonomous speech the critical factor? Psychological Review, 111, 543-552. Crockfort, C. & Boesch, C. (2005). Call combinations in wild chimpanzees. Behaviour, 142, 397-421. Crow, T. J. (2002). Sexual selection, timing, and an X-Y homologous gene: Did Homo sapiens speciate on the Y chromosome? In T. J. Crow (Ed.), The speciation of modern Homo sapiens (pp. 197-216). Oxford, UK: Oxford University Press. Currie, P. (2004). Muscling in on hominid evolution. Nature, 428, 373374. Darwin, C. (1859). The origin of species. London: John Murray. Dawkins, R. (1996). Climbing mount improbable. New York: W.W. Norton. DeGusta, D., Gilbert, W. H., & Turner, S. P. (1999). Hypoglossal canal size and hominid speech. Proceedings of the National Academy of Sciences, 96, 1800-1804. Descartes, R. (1985). Discourse on method. In Cottingham, J., Stootfoff, R., & Murdock, D. (ed. & tr.), The philosophical writings of Descartes. Cambridge: Cambridge University Press. (Originally published 1647). Donald, M. (1991). Origins of the modern mind. Cambridge, MA: Harvard University Press. Emmorey, K. (2002). Language, cognition, and brain: Insights from sign language research. Hillsdale, NJ: Erlbaum. Enard, W., Przeworski, M., Fisher, S. E., Lai, C. S., Wiebe, V., Kitano, T., et al. (2002). Molecular evolution of FOXP2, a gene involved in speech and language. Nature, 418, 869-872. Evans, P. D., Anderson, J. R., Vallender, E. J., Gilbert, S. L., Malcom, C. M., Dorus, S., et al. (2004). Adaptive evolution of ASPM, a major determinant of cerebral cortical size in humans. Human Molecular Genetics, 13, 489-494. Everett, D. L. (2005). Cultural constraints on grammar and cognition in Pirahã. Current Anthropology, 46, 621-646. Ferrari, P. F., Gallese, V., Rizzolatti, G., Fogassi, L. (2003). Mirror neurons responding to the observation of ingestive and communicative mouth actions in the monkey ventral premotor cortex. European Journal of Neuroscience, 17, 1703-1714. Finlay, B. L., Darlington, R. B., & Nicastro, N. (2001). Developmental structure in brain evolution. Behavioral & Brain Sciences, 24, 263308. Fisher, S. E., Vargha-Khadem, F., Watkins, K. E., Monaco, A. P., & Pembrey, M. E. (1998). Localization of a gene implicated in a severe speech and language disorder. Nature Genetics, 18, 168-170. Fitch, W. T., & Hauser, M. D. (2004). Computational constraints on syntactic processing in a nonhuman primate. Science, 303, 377-380.

3期

Michael C. Corballis. How Language Evolved

Flinn, M. V., Geary, D. C., & Ward, C. V. (2005). Ecological dominance, social competition, and the coalitionary arms race: Why humans evolved such extraordinary intelligence. Evolution & Human Behavior, 26, 10-46. Foley, R., & Lahr, M. M. (1997). Mode 3 technologies and the evolution of modern humans. Cambridge Archaeological Journal, 7, 3-36. Gardner, R. A., & Gardner, B. T. (1969). Teaching sign language to a chimpanzee. Science, 165, 664-672. Geary, D. C. (2005). The origin of mind: Evolution of brain, cognition, and general intelligence. Washington, DC: American Psychological Association. Gentilucci, M. (2003). Grasp observation influences speech production. European Journal of Neuroscience, 17, 179-184. Gentilucci, M. & Corballis, M. C. (in press). From manual gesture to speech: A gradual transition. Neuroscience & Biobehavioral Reviews. Gentilucci, M., Benuzzi, F., Gangitano, M. & Grimaldi, S. (2001). Grasp with hand and mouth: A kinematic study on healthy subjects. Journal of Neurophysiology, 86, 1685-1699. Gentilucci, M., Santunione, P., Roy, A. C. & Stefanini (2004). Execution and observation of bringing a fruit to the mouth affect syllable pronunciation. European Journal of Neuroscience, 19, 190202. Gentner, T. Q., Fenn, K. M., Margoliash, D., & Nusbaum, H. C. (2006). Recursive syntactic pattern learning by songbirds. Nature, 440, 1204-1207. Gibson, K. R. & Jessee, S. (1999). Language evolution and expansions of multiple neurological processing areas. In B. J. King (Ed.), The origins of language: What nonhuman primates can tell us. Santa Fe, NM: School of American Research Press. Givón, T. (1995). Functionalism and grammar. Philadeplphia, PA: Benjamins. Goodall, J. (1986). The chimpanzees of Gombe: Patterns of behavior. Harvard University Press. Gopnik, M. (1990). Feature-blind grammar and dysphasia. Nature, 344, 715. Hahn, J. & Münzel, S. (1995). Knochenflöten aus dem Aurignacien des Geissenklösterle bei Blaubeuren, Alb-Donau-Kreis. Fundberichte Aus Baden-Wurttemberg, 20, 1-12. Hare, B., Call, J., Agnetta, B., & Tomasello, M. (2000). Chimpanzees know what conspecifics do and do not see. Animal Behaviour, 59, 771-785. Hare, B., & Tomasello, M. (1999). Domestic dogs (Canis familiaris) use human and conspecific cues to locate hidden food. Journal of Comparative Psychology, 113, 173-177. Hauser, M. D., Chomsky, N., & Fitch, W. T. (2002). The faculty of language: What is it, who has it, and how did it evolve? Science, 298, 1569-1579. Hayes, C. (1952). The ape in our house. London: Gollancz. Hewes, G. W. (1973). Primate communication and the gestural origins of language. Current Anthropology, 14, 5-24. Hopper, P. J., & Traugott, E. C. (2003). Grammaticalization (2nd Edition). Cambridge: Cambridge University Press. Horne Tooke, J. (1857). Epea pteroenta or the diversions of Purley. London. Hunt, G. R. (1996). Manufacture and use of hook-tools by New Caledonian crows. Nature, 379, 249-251. Hunt, G. R., Corballis, M. C., & Gray, R.D. (2001). Laterality in tool manufacture by crows. Nature, 414, 707. Hurford, J. R. (2003). The language mosaic and its evolution. In M. H. Christiansen, & S.Huxley, T. H. (2001). Evidence as to man’s place in nature. New York: Modern Library (First published 1863) Ingman, M., Kaessmann, H., Pääbo, S., & Gyllensten, U. (2000). Mitochondrial genome variation and the origin of modern humans. Nature, 408, 708-713. Iverson, J. M. & Goldin-Meadow, S. (2005). Gesture paves the way for language development. Psychological Science, 16, 367-371. Joos, M. (1948). Acoustic phonetics. Language Monograph No. 23. Baltimore, MD: Linguistic Society of America. Kay, R. F., Cartmill, M., & Barlow, M. (1998). The hypoglossal canal and the origin of human vocal behavior. Proceedings of the National Academy of Sciences (USA), 95, 5417-5419.

429

Knecht, H., Pike-Tay, A., & White, R. (1993). Before Lascaux: The complex record of the early Upper Paleolithic. Boca Raton: CRC Press. Knight, A., Underhill, P.A., Mortensen, H.M., Zhivotovsky, L.A., Lin, A.A., Henn, B.M., et al. (2003). African Y chromosome and mtDNA divergence provides insight into the history of click languages. Current Biology, 13, 464-473. Konner, M. (1982). The tangled wing: Biological constraints on the human spirit. New York: Harper. Lee, R. B. (1979). The !Kung San: Men, women, and work in a foraging society. Cambridge, UK: Cambridge University Press. Liberman A. M., Cooper F. S., Shankweiler, D. P., & StuddertKennedy, M.. (1967). Perception of the speech code. Psychological Review, 74, 431-461. Lieberman, D. E. (1998). Sphenoid shortening and the evolution of modern cranial shape. Nature, 393, 158-162. Lieberman, D. E., McBratney, B. M., & Krovitz, G. (2002). The evolution and development of cranial form in Homo sapiens. Proceedings of the National Academy of Sciences, 99, 1134-1139. Lieberman, P. (1998). Eve spoke: Human language and human evolution. New York: W.W. Norton. Lieberman, P., Crelin, E. S., & Klatt, D. H. (1972). Phonetic ability and related anatomy of the new-born, adult human, Neanderthal man, and the chimpanzee. American Anthropologist, 74, 287-307. Liégeois, F., Baldeweg, T., Connelly, A., Gadian, D. G., Mishkin, M., & Vargha-Khadem, F. (2003). Language fMRI abnormalities associated with FOXP2 gene mutation. Nature Neuroscience, 6, 1230-1237. MacLarnon, A. & Hewitt, G. (1999). The evolution of human speech: The role of enhanced breathing control. American Journal of Physical Anthropology, 109, 341-363. MacLarnon, A., & Hewitt, G. (2004). Increased breathing control: Another factor in the evolution of human language. Evolutionary Anthropology, 13, 181-197. MacNeilage, P. F. (1998). The frame/content theory of evolution of speech. Behavioral & Brain Sciences, 21, 499-546. Macauley, V., Hill, C., Achilli, A., Rengo, C., Clarke, D., Meehan, W., et al. (2005). Single, rapid coastal settlement of Asia revealed by analysis of complete mitochondrial genomes. Science, 308, 10341036. Martin, R. D. (1981). Relative brain size and basal metabolic rate in terrestrial vertebrates. Nature, 293, 57-60. Martin, R. D. (1990). Primate origins and evolution: A phylogenetic reconstruction. Princeton, NJ: Princeton University Press. Masataka, N. (2001). Why early linguistic milestones are delayed in children with Williams syndrome: Late onset of hand banging as a possible rate-limiting constraint on the emergence of canonical babbling. Developmental Science, 4, 158-164. McBrearty, S. & Brooks, A. S. (2000). The revolution that wasn’t: A new interpretation of the origin of modern human behavior. Journal of Human Evolution, 39, 453-563. McGurk, H., & MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 264, 746-748. McNeill, D. (1985). So you think gestures are nonverbal?’ Psychological Review, 92, 350-371. McNeill, D. (1992). Hand and mind: What gestures reveal about thought. Chicago: University of Chicago Press. Mekel-Bobrov, N., Gilbert, S. L., Evans, P. D., Vallender, E. J., Anderson, J. R., Hudson, R. R., et al. (2005). On-going adaptive evolution of ASPM, a brain size determinant in Homo sapiens. Science, 309, 1720-1722. Mellars, P. (2004). Neanderthals and the modern human colonization of Europe. Nature, 432, 461-465. Mellars, P. A. & Stringer, C. B. (Eds.) (1989). The human revolution: Behavioural and biological perspectives on the origins of modern humans. Edinburgh: Edinburgh University Press. Miller, G. F. (2000). The mating mind: How sexual choice shaped the evolution of human nature. London: Doubleday. Moura, A. C. de A. & Lee, P. C. (2004). Capuchin stone tool use in Caatinga dry forest. Nature, 305, 1909. Muller, F. M. (1880). Lectures on the science of language. Vol. 1. London: Longmans, Green. (Originally published in German, 1861) Neidle, C., Kegl, J., MacLaughlin, D., Bahan, B. & Lee, R. G. (2000). The syntax of American Sign Language. Cambridge, MA: MIT Press.

430





Nowak, M. A. (2001). Evolution of universal grammar. Science, 291, 114-118. Nowak, M. A., Plotkin, J. B., & Jansen, V. A. A. (2000). The evolution of syntactic communication. Nature, 404, 495-498. Oppenheimer, S. (2003). Out of Eden: The peopling of the world. London: Robinson. Piao, X. H., Hill, R.S., Bodell, A., Chang, B. S., Basel-Vanagaite, L., Straussberg, R., et al. (2004). G protein-coupled receptor-dependent development of human frontal cortex. Science, 303, 2033-2036. Pinker, S. (1994). The language instinct. New York: Morrow. Pinker, S. (1997). How the mind works. New York: W. W. Norton. Pinker, S. & Bloom, P. (1990). Natural language and natural selection. Behavioral & Brain Sciences, 13, 707-784. Place, U. T. (2000). The role of the hand in the evolution of language. Psycoloquy, 11, No. 7. Ploog, D. (2002). Is the neural basis of vocalization different in nonhuman primates and Homo sapiens? In T. J. Crow (Ed.), The speciation of Modern Homo Sapiens (pp. 121-135). Oxford: Oxford University Press. Povinelli, D. J. & Bering, J. M. (2002). The mentality of apes revisited. Current Directions in Psychological Science, 11, 115-119. Premack, D. (1985). Gavagai! or the future history of the animal language controversy. Cognition, 19, 207-296. Preuss, T. M., Qi, H., & Kaas, J. H. (1999). Distinctive compartmental organization of human primary visual cortex. Proceedings of the National Academy of Sciences (USA), 96, 11601-11606. Rizzolatti, G. & Arbib, M. A. (1998). Language within our grasp. Trends in Neuroscience, 21, 188-194. Rizzolatti, G., Fadiga, L., Gallese, V., & Fogassi, L. (1996). Premotor cortex and the recognition of motor actions. Cognitive Brain Research, 3, 131-141. Rizzolatti, G., Fogassi, L., & Gallese, V. (2001). Neurophysiological mechanisms underlying the understanding and imitation of action. Nature Reviews, 2, 661-670. Ruff, C. B, Trinkaus, E., & Holliday, T. W. (1997). Body mass and encephalization in Pleistocene Homo. Nature, 387, 173-176. Russell, B. A., Cerny, F. J., & Stathopoulos, E. T. (1998). Effects of varied vocal intensity on ventilation and energy expenditure in women and men. Journal of Speech, Language, & Hearing Research, 41, 239-248. Savage-Rumbaugh, S., Shanker, S. G. & Taylor, T. J. (1998). Apes, language, and the human mind. New York: Oxford University Press. Semaw, S. P., Renne, P., Harris, J. W. K., Feibel, C. S., Bernor, R. L., Fessweha, N., et al. (1997). 2.5-million-year-old stone tools from Gona, Ethiopia. Nature, 385, 333-336. Senghas, A., Kita, S., & Özyürek, A. (2004). Children creating core properties of language: Evidence from an emerging sign language in Nicaragua. Science, 305, 1779-1782. Shu, W., Yang, H., Zhang, L., Lu, M. M., & Morrisey, E. E. (2001). Characterization of a new subfamily of winged-helix/forkhead (Fox) genes that are expressed in the lung and act as transcriptional repressors. Journal of Biological Chemistry, 276, 27488-27497. Skoyles, J. R. (2000). Gesture, language origins, and right handedness. Psycoloquy, 11, No. 24. Soffer, O. J. M., Adovasio, J. S., Illingworth, H. A., Amirkhanov, N. D., Praslov, N. D., & Street, M. (2000). Palaeolithic perishables made permanent. Antiquity, 74, 812-821.





39 卷

Stedman, H. H., Kozyak, B. W., Nelson, A., Thesier, D. M., Su, L. T., Low, D. W., et al. (2004). Myosin gene mutation correlates with anatomical changes in the human lineage. Nature, 428, 415-418. Stokoe, W. C. (1960). Sign language structure: an outline of the communicative systems of the American deaf. Silver Spring, MD: Linstock Press. Striedler, G. F. (2006). Précis of Principles of Brain Evolution. Behavioral &Brain Sciences, 29, 1-36. Studdert-Kennedy, M. (2005). How did language go discrete? In M. Tallerman (Ed.), Language origins: Perspectives on evolution (pp. 48-67). Oxford: Oxford University Press. Suddendorf, T. & Corballis, M. C. (1997). Mental time travel and the evolution of the human mind. Genetic, Social, & General Psychology Monographs, 123, 133-167. Suc, J.-P., Bertini, A., Leroy, S. A. G., & Suballyova, D. (1997). Towards the lowering of the Pliocene/Pleistocene boundary to the Gauss-Matuyama reversal. Quaternary International, 40, 37-42. Sutton-Spence, R. & Boyes-Braem P. (Eds.) (2001). The hands are the head of the mouth: The mouth as articulator in sign language. Hamburg: Signum-Verlag. Symons, D. (1979). The evolution of human sexuality. Oxford: Oxford University Press. Tomasello, M. (2003). On the different origins of symbols and grammar. In M. H. Christiansen, & S. Tomasello, M., Carpenter, M., Call, J., Behne, T., & H. Moll (2005). Understanding and sharing intentions: The origins of cultural cognition. Behavioral and Brain Sciences, 28, 675-735. Tooby, J., & DeVore, I. (1987). The reconstruction of hominid evolution through strategic modeling. In W. G. Kinzey (Ed.), The evolution of human behavior: Primate models. Albany, NY: SUNY Press. Vargha-Khadem, F., Watkins, K. E., Alcock, K. J., Fletcher, P., & Passingham, R. (1995). Praxic and nonverbal cognitive deficits in a large family with a genetically transmitted speech and language disorder. Proceedings of the National Academy of Sciences (USA), 92, 930-933. Vargha-Khadem, F., Watkins, K. E., Price, C. J., Ashburner, J., Alcock, K. J., Connelly, A., et al. (1998). Neural basis of an inherited speech and language disorder. Proceedings of the National Academy of Sciences (USA), 95, 12695-12700. Walter, R. C., Buffler, R. T., Bruggemann, J. H., Guillaume, M. M. M., Berhe, S. M., Negassi, B., et al. (2001). Early human occupation of the Red Sea coast of Eritrea during the last interglacial. Nature, 405, 65-69. Watkins, K. E., Dronkers, N. F., & Vargha-Khadem, F. (2002). Behavioural analysis of an inherited speech and language disorder: comparison with acquired aphasia. Brain, 125, 452-464. Watkins, K.E., Vargha-Khadem, F., Ashburner, J., Passingham, R.E., Connelly, A., Friston, K.J., et al. (2002). MRI analysis of an inherited speech and language disorder: structural brain abnormalities. Brain, 125, 465-478. Whiten, A. & Byrne, R. W. (1988). Tactical deception in primates. Behavioral & Brain Sciences, 11, 233-244. Wood, B., & Collard, M. (1999). The human genus. Science, 284, 6571.