Imitation In Language Learning: Prerequisites and Myths

The widespread common-sense view holds that children learn their language from their parents as a direct product of interacting with them. Yet it remains to be clarified how this happens, and specifically which cognitive mechanisms are involved in the learning process. Common knowledge would suggest that children learn their language from their parents by means of imitation alone. However, before imitation can even take place, children’s ability to scan the communicative context must be considered. Moreover, relatively little effort has been made to investigate the role of imitation in children’s language learning. Imitation in language is discussed in some older works, such as Tiedemann (1897), who observed the attempts of his six-month-old child to follow his mother’s gaze and to pronounce the syllable “ma”. More recent literature on the topic shows that infants who perform better in gaze-following tasks appear to acquire vocabulary at a surprisingly rapid pace. ‘Gaze following’ is defined as the ability to follow the attentional focus of another person; it emerges during the second half of the first year of life, as confirmed by several studies such as Butterworth and Jarrett (1991) and Gredebäck, Fikke and Melinder (2010).

Children gain language knowledge from actual usage in a social context.

During the 1970s, the child’s linguistic environment was a major focus of language acquisition research. However, this interest subsequently decreased and the focus shifted to the child himself. One main reason for this loss of interest concerns Child Directed Speech (CDS). According to Dominey et al. (2004, p. 125): “In this CDS, ‘motherese’, or ‘parentese’, the segmental structure is deformed in favour of the exaggerated prosodic structure”. In other words, CDS is a distinct register compared with Adult Directed Speech (ADS) because of its exaggerated intonation, which produces great swooping curves of sound over an extended pitch range (Saxton, 2017, p. 88). While the segmental structure pertains to the articulatory characteristics of an utterance, the prosodic structure serves the function of grouping and giving prominence to the elements which make up the speech signal (Harrington et al., 2014). For instance, mothers seem rather prone to raise their pitch when children are emotionally engaged. CDS therefore enables caregivers to emphasise the information conveyed by prosodic cues, i.e. the information carried by the musicality of speech, and to adapt it to the child’s perceptual capabilities. Some syllables in speech are in fact produced with more energy (they are usually referred to as “stressed”) owing to differences in prosodic cues such as amplitude, duration or pitch. Nonetheless, the validity of CDS, which is both a phonological and a lexical register, was undermined by the assumption that it is the privilege of a minority of Western mothers. Indeed, the frequency and quality of CDS have been shown to be closely related to family socio-economic status (Schwab et al., 2016), which is on average higher in Western families. However, the cross-cultural research presented in Saxton (2017) suggests otherwise, attributing to CDS the pivotal role of obtaining the child’s attention.

Therefore, a child’s environment still deserves attention as far as language learning mechanisms are concerned. One of the most recent and prominent non-nativist accounts of syntax acquisition, known as ‘usage-based’ theory, treats as critical factors in language development the use of pointing and the emergence of collaborative engagement in a shared goal. By contrast, the nativist approach reduces the child’s environment to a matter of limited exposure to key linguistic forms, which then triggers language acquisition. The principle which underpins the usage-based theory is that our knowledge of language is obtained from the use of language itself, as stated in Langacker (1987). In his Cognitive Grammar (2008, p. 16), Langacker states that

Automatization is the process observed in learning to tie a shoe or recite the alphabet: through repetition or rehearsal, a complex structure is thoroughly mastered, to the point that using it is virtually automatic and requires little conscious monitoring.

The process of automatization is thus seen as one of many basic phenomena that are evident in many facets of cognition. Therefore, phenomena such as association, schematization, categorization and automatization are seen as independent cognitive processes recruited for language usage.

Children are naturally prone to recognise the interlocutor’s communicative intentions.

Hence, the usage-based theory of language development is firmly rooted in the prominence of the social act of communication. One main reason for this position is that abilities such as reading and sharing the intentions of others are in fact uniquely human. Infants’ communication skills go through a flourishing period of development over the first year of life. The child’s interest in attending to the mother’s face and voice is immediately followed by an increase in smiling and cooing. Moreover, the adult’s role in capturing and maintaining the child’s attention is crucial for the language development process. Between 11 and 14 months of age, typically developing children usually attain a sophisticated ability to follow changes in the direction of an adult’s gaze, as first confirmed by Scaife and Bruner (1975). This ability is just one feature of shared attention. In addition, the importance of social cognition in general is underlined by Tomasello et al. (2005, p. 683), where the authors state that

conversation is an inherently collaborative activity in which the joint goal is to reorient the listener’s intentions and attention so that they align with those of the speaker.

Pointing is one crucial predictor of a child’s language development.

Consequently, face scanning assumes a crucial role in providing important information for language learning. A study by Young et al. (2009) demonstrates that children who attend to their mothers’ mouths during communication (as observed by means of eye-tracking methodologies) tend to have a larger vocabulary in toddlerhood, together with more successful final language attainment. Arguably, information around the speaker’s mouth can foster the child’s ability to associate mouth shapes with speech sounds, e.g. a round-shaped mouth with the [o] sound. Although abilities such as gaze following and attention to the mouth during face scanning have both been found to be relevant to later language attainment, they could be considered different developmental functions. According to Tenenbaum et al. (2015, p. 3),

face scanning could reflect an infant’s sensitivity to linguistically relevant information during speaking, whereas gaze following could reflect aspects of social cognition that also relate to language.

Therefore, face scanning and gaze following could either be considered as having a common underlying factor — thus being capable of predicting language outcome — or could reflect different processes — thus having unique relations to language outcomes.

Children learn from early life how to reproduce the adult’s behaviour.

In conclusion, face scanning and gaze following are useful predictors of language; however, they also give information about the child’s aptitude for scanning the communicative scene. Therefore, even if the importance of imitation is often minimised in theories of language development, the abilities which underlie imitation itself have been found to be relevant. Imitation alone does not provide a complete explanation for the acquisition of grammar. Nonetheless, imitation can be defined, as in Saxton (2017, p. 96), as “the reproduction of another person’s behaviour”, and infants have been described as “more prolific imitators than the young of every species” (Marshall and Meltzoff, 2014, p. 1). Moreover, since every use by a child of a new word first uttered by an adult is a case of deferred imitation, at least a partial role for this mechanism in lexical development can be accepted. Lastly, children quite frequently incorporate parts of an adult’s utterances in their responses, which demonstrates a capacity for selective imitation (Snow, 1981).

Bibliographical references

