Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2016
Last year, my colleague Ian Howard and I published a paper in the Journal of Phonetics (Messum & Howard 2015) that discussed the mechanism by which young children learn to pronounce the speech sounds of their mother tongue (L1). The longstanding assumption has been that they do this by some form of imitation. We argued that on current evidence it is more likely that they do this through a mirroring process; with their caregivers as the 'mirror' in which infants and young children discover the linguistic significance of their vocal actions. This matters for the learning of second language (L2) pronunciation because many of our teaching practices are implicitly based on the idea that learning to produce sounds by listening first and then trying to copy what we have heard is 'natural' (or even that it is the only possible way for the production of new sounds to be learnt). If it is not natural, then we might want to reconsider our use of 'listen first' approaches for teaching speech sounds. These approaches are not notably successful and there is at least one well-developed and proven alternative. This article summarises the 2015 paper, concentrating on the parts of it that will be of most interest to Speak Out! readers. The paper was written for a special issue of the journal which was examining how speech is represented in the brain, hence the paper's title: Creating the cognitive form of phonological units: the speech sound correspondence problem in infancy could be solved by mirrored vocal interactions in infancy rather than by the imitation of speech sounds. In a second article, Roslyn Young and I will examine the nature of L2 speech sound learning and the different approaches taken to teaching sounds.
It is generally assumed that children learn to pronounce speech sounds by imitation from adult models. This requires that a child creates an underlying representation for a speech sound which he uses for both perception and production. This is usually imagined to be auditory/acoustic, but arguments can also be made for underlying representations to be motor/gestural and recent neuroimaging evidence has added strength to the alternative views. But whether auditory or motor, it is supposed that the representation is abstracted from examples of the speech sound that the child hears, and then used to guide his own production of the corresponding speech sound.
CODEN - PBLDF5 , Date revised - 2003-10-01 , DOI - LLBA-308591476; 200308045; 1080-692X; PBLDF5 , Last updated - 2011-01-11 , SubjectsTermNotLitGenreText - *English as a Second Language Learning (22130); *Vowels (95650); *Word Frequency (97450); *Familiarity (23800); *Adults (00600); *Elementary School Students (21520); Korean (40950) , The following values have no corresponding Zotero field: Author Address: U Illinois A3 - Trofimovich, Pavel CY - United States
British journal of psychology (London, England : 1953), 2016
Phonological development is sometimes seen as a process of learning sounds, or forming phonological categories, and then combining sounds to build words, with the evidence taken largely from studies demonstrating 'perceptual narrowing' in infant speech perception over the first year of life. In contrast, studies of early word production have long provided evidence that holistic word learning may precede the formation of phonological categories. In that account, children begin by matching their existing vocal patterns to adult words, with knowledge of the phonological system emerging from the network of related word forms. Here I review evidence from production and then consider how the implicit and explicit learning mechanisms assumed by the complementary memory systems model might be understood as reconciling the two approaches.
CamLing, 2005
How is it that an English-speaking 5-year-old comes to pronounce the vowel of seat to be shorter than that of seed but longer than that of sit; to say a multi-word phrase with ‘stress-timed’ rhythm; or to aspirate the /p/’s of pin, polite and spin to different degrees? These phenomena - pre-fortis clipping, tense and lax vowel classes, ‘stress-timing’ and long/short/no lag voice onset times - are systematic features of English pronunciation. Most people believe that children acquire them by imitation. There are reasons, however, to doubt this. A non-imitative account of the replication of these phenomena sees them appearing as the result of articulatory activity being conditioned by the breath stream dynamics of child speech.
2015
Theories about the cognitive nature of phonological units have been constrained by the assumption that young children solve the correspondence problem for speech sounds by imitation, whether by an auditory- or gesture-based matching to target process. Imitation on the part of the child implies that he makes a comparison within one of these domains, which is presumed to be the modality of the underlying representation of speech sounds. However, there is no evidence that the correspondence problem is solved in this way. Instead we argue that the child can solve it through the mirroring behaviour of his caregivers within imitative interactions and that this mechanism is more consistent with the developmental data. The underlying representation formed by mirroring is intrinsically perceptuo-motor. It is created by the association of a vocal action performed by the child and the reformulation of this into an L1 speech token that he hears in return. Our account of how production and perception develop incorporating this mechanism explains some longstanding problems in speech and reconciles data from psychology and neuroscience.
Early Childhood Research Quarterly, 2001
As a researcher in the area of early communication and language I found How Language Comes to Children a thorough and thought-provoking description of early language development and the research related to how infants learn to talk as well as how infants respond to speech. Or, to be more precise, responses to speech from before birth until the end of the second year. Particularly interesting to me was the cross-cultural focus. De Boysson-Bardies compares research from different parts of the world to determine similarities and differences in development as well as the variations in how parents view and respond to the early speech of their children. The author is the Director of Research in the Experimental Psychology Laboratory at the Center Nationale de la Recherche Scientifique in Paris, and the book is translated from French. However, the examples of the children's earlier word productions are in French. As a mono-lingual English speaker who does not know French pronunciation, the French examples required a bit more effort than they would have in English. However, the depth and detail of the information make the book well worth the effort. The book is organized into 8 chapters. Generally the information follows the development of the child with the first chapter focused on infants and early perceptions related to speech, and the last chapter focused on the language of 18-24 month olds. In addition to an in-depth look at speech perception and vocal development, deBoysson-Bardies discusses the communication environment of the young child, cultural variations in development, and individual variations among children. For example, one of the early discussions in the book was about the speech perception of young infants. For the past 30 years or so researchers have been devising creative ways to answer questions about infant perception of different dimensions of speech. Because infants are not yet able to answer questions or to directly indicate preferences researchers have identified something infants can do, suck, and built studies around that ability. The premise is that infants suck harder to indicate preference or novelty. Results of these studies indicate that infants even a few days old show a preference for their mother's speech when compared to that of another woman. They also prefer the prosody (the rhythms and tonal variation, sometimes referred to as the music of the language) of their mother's voice compared to another woman's voice when listening to voices with the phonetic information screened out. However, if the tape of their mother is played backwards, making the prosody "unnatural" the infant no longer prefers the voice of the mother. These early studies also show that infants
Zonas de contacto: culturas, lenguas y educación, 2016
Research in second language speech has seen an exponential growth during the past decades. A number of theoretical frameworks and models of acquisition have been posited to account for the development in L2 phonological acquisition, as apparent in perception and production of L2 speech sounds and suprasegmentals. A preliminary qualitative study of L2 forms produced by speakers of River Plate Spanish studying at English Teacher Training Colleges was carried out to observe the type and frequency of pronunciation errors in the light of Flege’s Speech Learning Model (1995). Sources of error were considered in the light of the models discussed. The subjects were given ear training with explicit instructions as to the cues to pay attention to. The perceptual training was based on the use of synthesized sample stimuli with enhanced phonetic cues. Pre-test and a post- test were carried out to see if the training had any effect on the subject’s perceptual and production skills. Improvements were observed in these preliminary studies, but further research, quantitative in nature, is needed in order to explore the feasibility of perceptual training as a teaching tool.
The Journal of the Acoustical Society of America, 1978
In order to investigate the nature of some processes in speech acquisition, synthetic speechlike stimuli were played to groups of English and French children between two and fourteen years of age. The acoustic parameters varied were voice onset time and first-formant transition. Three stages were observed in the development of children's labeling behavior. These were called scattered labeling, progressive labeling, and categorical labeling, respectively. Individual response patterns were examined. The first stage (scattered labeling) includes mostly children of two to three years of age for the English and up to about four for the French. Children label most confidently those stimuli closest in physical terms to those of their natural speech environment. All stimuli with intermediate VOT values are labeled quasirandomly. Progressive labeling behavior is found mostly amongst children aged three and four for the English, up to about seven for the French. Children's response curves go progressively-almost linearly-from one type of label (voiced) to the other (voiceless): response follows stimulus continuum. Categorical labeling becomes the dominant pattern only at the age of five to six for the English, one or two years later for the French. This development was found to be highly significant (p smaller than 0.003 for both English and French, using Kendall's tau measure of correlation). English children learn to make use of the F1 transition feature around five years, whereas French children never use it as a voicing cue. French children will have fewer features than English children at their disposal: This may account for the later age at which French children, as a group, reach the various labeling behavior stages, and for labeling curves being less sharply categorical for French than for English children. These findings indicate that categorical labeling for speech sounds is not innate but learned through a relatively slow process which is to a certain extent language specific. The implications of the results are discussed in the light of previous work in the field.
2008
Timing patterns and the qualities of speech sounds are two important aspects of pronunciation. It is generally believed that imitation from adult models is the mechanism by which a child replicates them. However, this account is unsatisfactory, both for theoretical reasons and because it leaves the developmental data difficult to explain. I describe two alternative mechanisms. The first explains some timing patterns (vowel length changes, 'rhythm', etc.) as emerging because a child's production apparatus is small, immature and still being trained. As a result, both the aerodynamics of his speech and his style of speech breathing differ markedly from the adult model. Under their constraints the child modifies his segmental output in various ways which have effects on speech timing but these effects are epiphenomenal rather than the result of being modelled directly. The second mechanism accounts for how children learn to pronounce speech sounds. The common, but actually p...
Journal of Chemical Physics, 2006
We consider two theories of laryngeal representation, one using a single feature [voice] generalizing across prevoicing languages and aspiration languages, and the other using multiple features: [voice] for pre-voicing languages and [spread glottis] for aspiration languages. We derive predictions for children's early productions, and test these for three Germanic languages. Children acquiring Dutch, a prevoicing language, show de-voicing of stops, while available data from German, an aspiration language, show de-aspiration. Although the difference might simply reflect intrinsic properties of children's early production and perception systems, we argue that a representational account is in order, based on multiples features. The case is made for English, an aspiration language, based on the early productions of a single child. A laryngeal harmony pattern is found which spreads voicelessness from coda to onset, which is argued to involve activity of [spread glottis]. This is interpreted as evidence for a laryngeal representation involving multiple features.
Language, 1994
Adults and older children learning the sound system of a second language (L2) differ from young children acquiring the sound system of their native language (LI) in two important respects. The L2 learners are better able to control their speech apparatus than the young children ilcquiring an Ll, ilnd they alreildy possess a phonetic systcm for producing speech. As il result, far more errors in production ilre likely to arise from the inilppropriilte use of !Jrc(liollsly Ilcqllired structllres in L2 learning than in Ll acquisition.' It is well known that adult learners are mrely, if ever, completely successful almaslering the sound system of iln L2. A widely held view is Ihilt when ildults encounter iln L2 word, they attempt to "decompose" it into the phonemic units of the Ll, ilnd then produce the L2 word as if it consisted of phonic elements (illiophones, phonemes) from the LI (I'o/ivilnov 19:11). This chapter provides a brief overvicw of n.'seiHch Ihill hilS de-'We use the term L2tCll/llil/,'i r.lther than /1c'1l1isili"l/b,'C.HISI• 01 till' view th ••t I'honelic systel11s, e\'l'n those of adults, undergo constant change in till' laC(' 01 new phonetic input. Thus, speech is never fully /1cqllired. The terl11 sl't't'dllcnmi'J,'i as used here refers to all aspects 01 learning that affect the production and I,,'reeption of the sounds 111.1 king up words. II is used in preference to 1,11I11/(1/o,'ii((/III'/1/11ill,'i because lI1ueh 01 our res(',uch to date has focused on phonetic-level processes. The preparation of this chal'tl'r was supported by NIII grant IJC00257. TI1<' author thanks wlkagues (0.-5. Bohn, W. Eplting, M. Munro). graduate students (K. Fletcher, S. Jang, C. Wang), and research assistants (L. Cueva, C. 1\1l'na, L. Skelton) who h'l\'e p,utieipated in work reported here. Thanks arc also extended to O.-S. lIohn. C. Ferguson, and 1\. Walle)' lor editorial C0Il11111'nts.
De Gruyter eBooks, 2009
In this article, I discuss a linguistic model for explaining second-language (L2) sound perception, which is a phenomenon that has commonly been modelled within the disciplines of phonetics and psycholinguistics. This linguistic model will be applied to the learning of SIMILAR L2 sounds. This L2 learning scenario refers to the acquisition of the knowledge involved in the perception of L2 sounds that are phonologically equivalent but yet phonetically different from the acoustically closest sounds in the learner's first language (L1). In the introduction, I argue, based on phonetic and psycholinguistic grounds, that speech perception is a language-specific phenomenon that should be brought into the domain of phonological modelling. Additionally, I propose a number of characteristics which incorporate phonological, phonetic, and psycholinguistic modelling and which should be found in a comprehensive and explanatory adequate model for sound perception. In § 2, I demonstrate that the Linguistic Perception (LP) model complies with these criteria. Crucially, I show that the L1 acquisition component of the LP model is shown to constitute a successful proposal for the mechanisms involved in learning to perceive L1 sounds. In § 3, I show how the L2 version of the LP model successfully describes, explains, and predicts the learning of SIMILAR L2 sounds. Specifically, the model predicts that listeners are optimal perceivers of their native language and that beginning L2 learners start with a copy of their L1 optimal perception. These two predictions are confirmed by the perception of /ae/ and // by monolingual Canadian English (CE) and Canadian French (CF) listeners and by the L2 perception of beginning CE learners of CF. Further, the model predicts that learners will adjust their initial L2 perception by means of the same mechanism used by L1 learners. This developmental prediction is confirmed by the gradual shifting of the category boundary between /ae/ and // in CE learners of CF. Finally, the L2LP model hypothesizes that both L1 and L2 can be optimal because they are handled by two separate grammars. The data demonstrate that CE learners of CF have differential perception systems for their L1 and L2. In sum, it is shown that this model provides the currently most comprehensive description, explanation, and prediction of L2 sound perception. It successfully incorporates an L2 phenomenon which was commonly regarded as phonetic or psycholinguistic within the domain of phonology, a modelling proposal which follows the tradition started by .
Proceedings of the 26th Annual Boston …, 2002
2021
The acquisition of a sound system is an integral component of second language (L2) communication, yet it is one of the most difficult skills to teach and is therefore largely ignored in L2 classrooms (Derwing, 2010). In laboratory settings, phonetic training studies have typically examined syllables, rather than words, with no referential meaning. Support for this decontextualization of the stimuli and the subsequent distributional learning that takes place has come from findings that word learning impedes phonetic learning, especially when involving minimal pairs (Feldman, Griffiths, Goldwater, & Morgan, 2013; Hayes-Harb & Masuda, 2008). This advantage for distributional learning over minimal pair learning, however, has not been demonstrated with regard to generalizing to untrained, analogous contexts. To investigate the role of the lexicon in L2 phonetic learning, the experiment in this dissertation trained participants on an artificial language with a different VOT category boundary than that of their first language, English. The experiment featured a between-subjects design in which participants were exposed to one of two training conditions. In the explicit, minimal pairbased condition, participants learned fine-grained VOT differences through voiced-voiceless minimal pairs (bilsu/pilsu) that illustrated a VOT category boundary of around 0 ms, providing explicit evidence for the phones' contrasting phonological statuses. The implicit condition served to determine whether distributional learning could take place without explicit information from minimal pairs. The implicit condition's lexicon contained no minimal pairs to illustrate a direct meaningful relationship between voiced/voiceless pairs of phones (binsu/pilsu). Lexical learning
Frontiers in psychology, 2014
1970
This paper takes issue with the position that children's phoneme acquisition schedule is dictated primarily by auditory perceptual factors and suggests the alternative position that ease of production accounts for age of acquisition. It is felt that perceptual theory cannot adequately explain phonological development, e.g. three-year-olds produce certain sounds which they will not accurately perceive until much later and vice versa. Three psychological scaling tests are described in which adults were asked to designate certain phonemes as harder or easier to produce. These judgments were compared with the phonemes acquired by three-year-olds. A highly significant correlation between the adult ratings and children's phoneme acquisition was found. Featural analysis tended to support the hypothesis, and further confirmation was seen in the patterning of articulatory errors among children. The author stresses that these findings do not negate the importance of perceptual factors, emphasizing that he is trying to explain not the dynamics of phoneme acquisition but rather the schedule which it follows. It is, however, emphasized that there is no compelling evidence for perceptual theory, and certain observations which either militate against perceptual theory or encourage alternative speculation are discussed. (FWB)
Perspectives on Behavior Science, 2023
Language researchers have historically either dismissed or ignored completely behavioral accounts of language acquisition while at the same time acknowledging the important role of experience in language learning. Many language researchers have also moved away from theories based on an innate generative universal gram- mar and promoted experience-dependent and usage-based theories of language. These theories suggest that hearing and using language in its context is critical for learning language. However, rather than appealing to empirically derived princi- ples to explain the learning, these theories appeal to inferred cognitive mechanisms. In this article, I describe a usage-based theory of language acquisition as a recent example of a more general cognitive linguistic theory and note both logical and methodological problems. I then present a behavior-analytic theory of speech per- ception and production and contrast it with cognitive theories. Even though some researchers acknowledge the role of social feedback (they rarely call it reinforce- ment) in vocal learning, they omit the important role played by automatic reinforce- ment. I conclude by describing automatic reinforcement as the missing link in a par- simonious account of vocal development in human infants and making comparisons to vocal development in songbirds.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.