Perceiving Reference through Dynamic Gesture during Early Lexical Developent
Patricia Zukow-Goldring, University of California, Los Angeles
ÊNancy de Villiers Rader and Theresa Cain, Ithaca College


How do infants unlock the puzzle of reference, the mundane fact that words refer to objects, actions, attributes, and more? That is, how do infants learn the relation between the stream of speech and the unceasing flow of events? While this sounds like a simple question, its answer remains elusive.

Overview of Other Views.

The preponderance of research investigating lexical development has tested hypotheses proposing that innate predispositions (Markman, 1992), cognitive prerequisites (Clark, 1993; Golinkoff, Mervis, & Hirsh-Pasek, 1994), and/or social abilities (Tomasello, 1988, 2001) underlie lexical development, usually from 15 months onward. Some six months earlier, however, infants begin to display comprehension and production of their first words. A large body of work documents which types of words and their tokens emerge when and in what order (Fenson et al., 1994). However, little experimental work examining underlying development has focused on the time frame from 9 through 14 months in which infants learn these first words. Our research seeks to add to knowledge of this developmental process by testing a hypothesis derived from a perceptually based theory of early lexical development.

Our Perceptually-Based, Social Interactive Approach

We propose that attention to movement and synchrony facilitates infants' detection of the correspondence between two fundamentally different kinds of things: words and aspects of ongoing events (Zukow-Goldring, 1997; Zukow-Goldring & Rader, 2001). This research focused on early language development, documenting that caregivers' gestures educate their children's attention to engender a shared perspective of the world.Ê Understanding the perceptual abilities and caregiver practices underlying the emergence of infants' first words is key, because language development without words cannot go forward.

Attention is a topic that permeates the early lexical development literature. However, the relationship found between attentional style and early lexical development brings us no closer to understanding how lexical development does, in fact, occur. We believe that J. Gibsonâs theory of direct perception (1966, 1979) provides a basis for explaining why perceptual processes hold the key to understanding early lexical development. Gibsonâs theory describes perceptual input in terms of invariant information in a dynamic array over time and argues that these invariants specify meaningful aspects of the environment.Ê Attention is embodied as an individual orients to and explores the environment. During cycles of perceiving and acting, a person detects the perceptual structure in light, sound, odor, touch, and taste; such discovery informs further action. According to E. Gibson's (1969, 1997) view of perceptual learning, what changes with learning is the detection of this information. With practice, perceiving becomes more differentiated and appropriate to the task at hand (Gibson & Rader, 1979).

Theory of Amodal Referring through Gesture and Speech

We briefly present four postulates that delineate developmentally important processes and caregivers' methods of educating attention. (For a more complete account, see Zukow-Goldring & Rader, 2001).

Postulate 1: A general perceptual process (namely, detecting amodal invariants) allows infants to perceive the correspondence between word and referent.Ê Amodal invariants perceivable in the synchronous and dynamic coupling of gesture and speech may clip for the infant the relevant segment of the speech stream and of the ongoing event, making detectable the relation between a word and what it represents.

Postulate 2: Infants can and do detect amodal regularities prior to the comprehension of speech.
S
pecifically, we propose that infants learn to comprehend words initially through the pick-up of the amodal invariance expressed in vision, sound, touch, and so on by the caregiver while gesturing and speaking.

Postulate 3: Caregivers present amodal regularities as they gesture and speak to infants from 6 months. "An equivalence in gestural and prosodic contour suggests a means by which infants might detect an invariant relation between object and word" (Zukow, 1990: 718). She argued that "caregivers make amodal invariant relations (timing, tempo, intensity, and rhythmicity) prominent as they gather and direct attention to animate beings, objects, and actions. Infants pick up these regularities in their speech and action" (Zukow-Goldring, 1997: 240).

Postulate 4: Temporal synchrony of action and word, not temporal contiguity, is of crucial importance to lexical development. Naturalistic evidence supports this position: asynchronous, but temporally contiguous, presentation of word and action did not lead to word learning, whereas temporal synchrony did (Zukow-Goldring, 1997, pp. 237-239).

Research Hypothesis

In this study we focus on a hypothesis derived from the first postulate by comparing the efficacy of dynamic show versus static held gestures for the learning of new lexical items. show gestures, in which objects are loomed and waggled, occur most frequently during the prelinguistic period when caregivers display objects. (Zukow-Goldring & Ferko, 1994; Zukow-Goldring, 1997; Gogate et al., 2000). In waggling, caregivers routinely display the entire object by turning it back and forth. held gestures display a much smaller proportion of the object's surface and do not dynamically set off the object from the background. We hypothesize that dynamic show gestures accompanied by attention-directing speech will facilitate early lexical development more than held gestures correlated with the same verbal script.

Method

Participants

In this within-in subject study, participants were ten infants, four males and six females, between the ages of 10 and 14 months with a mean age of 12 months. A criterion for participation was the ability to look towards mommy or daddy when asked to do so.Ê Each infant participated in both gesture conditions, but was randomly assigned to word-object pairings and orderings.

Materials

Nonce words. The monosyllabic stimuli (hork, smiff) adhered to the phonological rules of English (Ladefoged, 1975).

Objects. The two objects we constructed were distinct in color and form, mildly attractive to infants and novel for any infants tested.Ê (Object 1: blue and yellow jumbo Lego blocks arranged in a tower format with cantilevered segments, Object 2: green and organgeorange circular modules.)

Learning and Testing Trials

Apparatus. Infants were seated in an adjustable infant high chair positioned 21” away from a 20” television monitor. To avoid subtle influences from the caregiver, he/she was seated behind the infant, but was not in physical contact with her/him. A small video camera mounted on the wall recorded the infants' behavior. ÊThe VCRs for presenting the stimuli during training trials and for recording the infant throughout training and testing were located in an observation room to avoid distraction.

Video Learning Trials

 

 


 

 

 

 

To insure consistency, videotaped scenarios were used. Utilizing infant-directed speech, the presenter said, This is a hork/smiff. Look it's a hork/smiff.Ê See the hork/smiff.Ê Look at the hork/smiff.Ê See, hork/smiff. A complete repetition of the script followed for a total of ten utterances. For the dynamic show gesture, the objects were loomed and waggled. This gesturing occurred synchronously with word articulation. In the static held gesture scenario, the object was held in front of the speaker but was not moved as she talked.The particular combination of word and gesture was randomly determined for each infant.

Comprehension Trial.

 

 

 

 

 

 

 


Following the learning phase of the testing, word comprehension was assessed, using an adaptation of the preferential looking task (Hirsh-Pasek & Golinkoff, 1996). Two videotaped scenarios were created, one for each object, in which a different female speaker asked the infant to look at the hork or the smiff. The speakerâs face appeared in a central monitor while the referent choice objects sat on small tables to the right and left of the infant. The speaker said, Do you see the hork/smiff?Ê Look at the hork/smiff.Ê Are you looking at the hork/smiff?Ê I want you to look at the hork/smiff.Ê Look at the hork/smiff.Ê The order of the requests and the positioning of the choices were counterbalanced.

Results

Because we were interested in the degree of attention during the learning trials, we coded for looking duration for each condition during this phase of the study. Coding was done without knowledge of the gesture condition.Ê Inter-rater reliabilities were r=.99 for the static-gesture scenarios and r=.98 for the dynamic-gesture scenarios. Infants did not spend more time looking at the dynamic show gesture scenario than at the static held gesture scenario, F(1,7)=.09, p=.77.Ê

For the comprehension trials, we were interested in whether the infant had learned the relationship between the nonce word and the objects.Ê Therefore, for this phase of the study the dependent variable was the proportion of correct looks to the total number of looks. Inter-rater reliability was r=1.0 for the condition in which the object had been presented with a dynamic gesture and r=.96 for the condition in which the object had been presented with a static gesture. The effect of gesture was significant, F(1,9)=5.73, p=.04, with infants making more correct looks to the object presented using the dynamic SHOW gesture.

Discussion

This experimental study was designed to test the effects of dynamic show gestures caregivers have been observed to use during early word learning.Ê The results support the hypothesis that a dynamic show gesture leads to better comprehension than a static held gesture.Ê We suggest that amodal invariants make word and referent "stand in" relation to each other and "stand out" from other possibilities. Furthermore, these practices/methods may be more important in the visually crowded environments of daily life than in the very spare lab environment that has few distractions

Trochaic stress patterns in word final position (strong-weak) may assist the child to detect a word in the speech stream and enhance comprehension. However, trochaic stress alone cannot explain our results, because trochaic stress occurs in both our experimental conditions. In the condition in which movement was synchronous with this pattern, we found improved comprehension.

References

Clark, E. V. (1993). The lexicon in acquisition. Cambridge, UK: Cambridge.

Fenson, L., Dale, P. S., Reznick, J. S., Bates, E., Thal, D. J., & Pethick, S. J. (1994). Variability in early communicative development. Monographs of the Society for Research in Child Development, 59 (5, Serial No. 242).

Gibson, E. J. (1969). Principles of perceptual learning and development. New York: Appleton-Century-Crofts.

Gibson, E. J. (1997). An ecological psychologist's prolegomena for perceptual development: A functional approach. In C. Dent-Read & P. Zukow-Goldring (Eds.), Evolving explanations of development: Ecological approaches to organism-environment systems (pp. 23-45). Washington, D. C.: American Psychological Association.

Gibson, E. J., & Rader, N. (1979). Attention: The perceiver as performer. In G. Hale & M. Lewis (Eds.), Attention and cognitive development (pp. 1-21). New York: Plenum.

Gibson, J. J. (1966). The senses considered as perceptual systems. Boston: Houghton Mifflin.

Gibson, J. J. (1979). The ecological approach to visual perception. Boston: Houghton Mifflin.

Gogate, L. J., Bahrick, L. E., & Watson, J. S. (2000). A study of multimodal motherese: The role of temporal synchrony between verbal labels and gestures. Child Development, 71, 878-894.

Golinkoff, R., Mervis, C., & Hirsh-Pasek, K. (1994). Early object labels: The case for a developmental lexical principles framework. Journal of Child Language, 21, 125-155.

Hirsh-Pasek,, K., & Golinkoff, R. M. (1996). The origins of grammar: Evidence from early language comprehension. Cambridge, MA: The MIT Press.

Ladefoged, P. (1975). A course in phonetics. New York: Harcourt Brace Jovanovich.

Markman, E. M. (1992). Constraints on word learning: Speculations about their nature, origins, and word specificity. In M. Gunnar & M. Maratsos (Eds.). Modularity and constraints in language and cognition. Hillsdale, NJ: Lawremce Erlbaum Associates.

Tomasello, M. (1988). The role of joint attentional processes in early language development. Language Sciences, 10, 69-88.

Tomasello, M. (2001). Perceiving intentions and learning words in the second year. In M.Bowerman & S. Levinson (Eds.), Language acquisition and conceptual development. Cambridge University Press.

Zukow, P. G. (1990). Socio‑perceptual bases for the emergence of language: An alternative to innatist approaches. In C. Dent, & P. G. Zukow (Eds.), The idea of innateness: Effects on language and communication research. Developmental Psychobiology, 23, 705‑726.

Zukow-Goldring, P. (1997). A social ecological realist approach to the emergence of the lexicon: Educating attention to amodal invariants in gesture and speech. In C. Dent-Read & P. Zukow-Goldring (Eds.), Evolving explanations of development: Ecological approaches to organism-environment systems (199-250). Washington, D. C.: American Psychological Association.

Zukow-Goldring, P., &Ê Rader, N. (2001). Perceiving referring actions: A commentary on Gogate, Walker-Andrews, & Bahrick. Developmental Science, 4. 28-30.