Individual differences |
Methods | Statistics | Clinical | Educational | Industrial | Professional items | World psychology |
The cocktail party effectis an example of selective attention and is the phenomenon of being able to focus one's auditory attention on a particular stimulus while filtering out a range of other stimuli, much the same way that a partygoer can focus on a single conversation in a noisy room. This effect is what allows most people to "tune in" to a single voice and "tune out" all others. It may also describe a similar phenomenon that occurs when one may immediately detect words of importance originating from unattended stimuli, for instance hearing one's name in another conversation.
Binaural processing[edit | edit source]
The cocktail party effect works best as a binaural effect, which requires hearing with both ears. People with only one functional ear seems much more disturbed by interfering noise than people with two healthy ears.. However, even without binaural location information, people can, even if with greater difficulty, selectively attend to one particular speaker if the pitch of their voice or the topic of their speech is sufficiently distinctive.
The binaural aspect of the cocktail party effect is related to the localization of sound sources. Experiments have shown that the auditory system is able to localize at least two sound sources simultaneously and assign the correct sound source characteristics to these sound sources simultaneously too. In other words, as soon as the auditory system has localized a sound source, it can extract the signals of this sound source out of a mixture of interfering sound sources.
It is assumed[attribution needed] that the auditory system performs a kind of cross-correlation function between both ear signals. A cross correlation function projects signals onto an axis, which corresponds to the time difference between both ear signals. For example, sound with an inter-aural time difference of 0.3 ms is projected onto the 0.3 ms position of the correlation axis. If multiple sound sources are present, then complex correlation patterns appear. The statistical parameters of these patterns, like mean value and variance, depend on the directions and levels of the sound sources. The auditory system is obviously able to analyze these patterns and determine the signals of a dedicated sound source.
Attempts have been made to simulate the cocktail party effect by technical means. Cocktail party processors have been constructed which can extract the signal of a single sound source out of a mixture of sound sources. There are cocktail party processors, which are based on correlation functions, evaluating inter-aural time differences, but there are also cocktail party processors for inter-aural level differences. However, the principles of the human cocktail party effect are not yet fully investigated. Technical cocktail party processors do not yet reach the capabilities of the human auditory system. Nima Mesgarani and Edward Chang were able to follow which speaker each volunteer was listening to, just by monitoring their brain activity - the first time this has been done. After the key word was spoken, the spectrogram showed that the volunteer's auditory cortex was responding only to a single voice rather than a combination of the two, also the algorithm also enabled the team to tell when listeners mistakenly focused on the wrong speaker, as the translated brain activity in the spectrogram represented a sentence spoken by the other voice.
Monaural processing[edit | edit source]
The auditory system does not only use methods for a direction specific signal processing, it also uses monaural effects for noise reduction. If the characteristics of a desired signal are known (like the characteristics of speech) or can be estimated (like expected phonemes at observed mouth movements), then all signal components which do not match the expected characteristics can be suppressed and the disturbing effect of this noise can be reduced.
The human pinna (the external flap of skin and cartilage of the ear) is a directionally-dependent filter that selectively removes particular frequencies, based on the direction from which sound comes. This filter can distinguish sounds from above vs. below, and from front vs. back, even when only a single ear is used.
Control of the direction of attention[edit | edit source]
In the early 1950s much of the early work in this area can be traced to problems faced by air traffic controllers. At that time, controllers received messages from pilots over loudspeakers in the control tower. Hearing the intermixed voices of many pilots over a single loudspeaker made the controller's task very difficult. The effect was first defined and named "the cocktail party problem" by Colin Cherry in 1953. Cherry conducted attention experiments in which subjects were asked to listen to two different messages from a single loudspeaker at the same time and try to separate them. His work reveals that our ability to separate sounds from background noise is affected by many variables, such as the gender of the speaker, the direction from which the sound is coming, the pitch, and the rate of speech.
Models of attention[edit | edit source]
Some of the earliest work in exploring mechanisms of selective attention was performed by Donald Broadbent, who proposed a theory that came to be known as the filter model. This model was established using the dichotic listening task. In this type of experiment, a participant wears a pair of headphones and listens to two different auditory streams, one in each ear. The participant then pays attention to one stream while ignoring the other. After listening, the participant is asked to recall information from both the attended and unattended channels. Broadbent's research using the dichotic listening task showed that most participants were accurate in recalling information that they actively attended to, but were far less accurate in recalling information that they had not attended to. This led Broadbent to the conclusion that there must be a "filter" mechanism in the brain that could block out information that was not selectively attended to. The filter model was hypothesized to work in the following way: as information enters the brain through sensory organs (in this case, the ears) it is stored in sensory memory. Before information is processed further, the filter mechanism allows only attended information to pass through. The selected attention is then passed into working memory, where it can be operated on and eventually transferred into long-term memory. In this model, auditory information can be selectively attended to on the basis of its physical characteristics, such as location and volume. Others suggest that information can be attended to on the basis of Gestalt features, including continuity and closure. For Broadbent, this explained the mechanism by which we can choose to attend to only one source of information at a time while excluding others. However, Broadbent's model failed to account for the observation that words of semantic importance, for example one's own name, can be instantly attended to despite having been in an unattended channel. Shortly after Broadbent's experiments, Oxford undergraduates Gray and Wedderburn repeated his dichotic listening tasks, altered with monosyllabic words that could form meaningful phrases, except that the words were divided across ears. For example the words, "Dear, one, Jane," were sometimes presented in sequence to the right ear, while the words, "three, Aunt, six," were presented in a simultaneous, competing sequence to the left ear. Participants were more likely to remember, "Dear Aunt Jane," than to remember the numbers; they were also more likely to remember the words in the phrase order than to remember the numbers in the order they were presented.
In a later addition to this existing theory of selective attention, Anne Treisman developed the attenuation model. In this model, information, when processed through a filter mechanism, is not completely blocked out as Broadbent might suggest. Instead, the information is weakened (attenuated), allowing it to pass through all stages of processing at an unconscious level. Treisman also suggested a threshold mechanism whereby some words, on the basis of semantic importance, may grab one's attention from the unattended stream. One's own name, according to Treisman, has a low threshold value (i.e. it has a high level of meaning) and thus is recognized more easily. The same principle applies to words like fire, directing our attention to situations that may immediately require it. The only way this can happen, Treisman argued, is if information was being processed continuously in the unattended stream.
In order to explain in more detail how words can be attended to on the basis of semantic importance, Deutsch & Deutsch and Norman later proposed a model of attention which includes a second selection mechanism based on meaning. In what came to be known as the Deutsch-Norman model, information in the unattended stream is not processed all the way into working memory, as Treisman's model would imply. Instead, information on the unattended stream is passed through a secondary filter after pattern recognition. If the unattended information is recognized and deemed unimportant by the secondary filter, it is prevented from entering working memory. In this way, only immediately important information from the unattended channel can come to awareness.
Daniel Kahneman also proposed a model of attention, but it differs from previous models in that he describes attention not in terms of selection, but in terms of capacity. For Kahneman, attention is a resource to be distributed among various stimuli, a proposition which has received some support. This model describes not when attention is focused, but how it is focused. According to Kahneman, attention is generally determined by arousal; a general state of physiological activity. The Yerkes-Dodson law predicts that arousal will be optimal at moderate levels - performance will be poor when one is over- or under-arousedTemplate:Refn. Thus, arousal determines our available capacity for attention. Then, an allocation policy acts to distribute our available attention among a variety of possible activities. Those deemed most important by the allocation policy will have the most attention given to them. The allocation policy is affected by enduring dispositions (automatic influences on attention) and momentary intentions (a conscious decision to attend to something). Momentary intentions requiring a focused direction of attention rely on substantially more attention resources than enduring dispositions. Additionally, there is an ongoing evaluation of the particular demands of certain activities on attention capacity. That is to say, activities that are particularly taxing on attention resources will lower attention capacity and will influence the allocation policy - in this case, if an activity is too draining on capacity, the allocation policy will likely cease directing resources to it and instead focus on less taxing tasks. Kahneman's model explains the cocktail party phenomenon in that momentary intentions might allow one to expressly focus on a particular auditory stimulus, but that enduring dispositions (which can include new events, and perhaps words of particular semantic importance) can capture our attention. It is important to note that Kahneman's model doesn't necessarily contradict selection models, and thus can be used to supplement them.
Visual correlates[edit | edit source]
Some research has demonstrated that the cocktail party effect may not be simply an auditory phenomenon, and that relevant effects can be obtained when testing visual information as well. For example, Shapiro et al. were able to demonstrate an "own name effect" with visual tasks, where subjects were able to easily recognize their own names when presented as unattended stimuli. They adopted a position in line with late selection models of attention such as the Treisman or Deutsch-Normal models, suggesting that early selection would not account for such a phenomenon. The mechanisms by which this effect might occur were left unexplained. It has been suggested in brain imaging studies using PET that a variety of brain areas may be involved in selectively processing visual linguistic material (i.e. word form), including the inferior prefrontal and posterior insular cortices, the amygdala, caudate nucleus, and several areas of temporal cortex. It is currently unknown if these same brain areas are implicated in focusing attention for other visual or auditory stimuli.
This phenomenon is still very much a subject of research, in humans as well as in computer implementations (where it is typically referred to as source separation or blind source separation). The neural mechanism in human brains is not yet fully clear.
Notes[edit | edit source]
See also[edit | edit source]
- Attenuation theory
- Auditory processing disorder
- Auditory masking
- Auditory scene analysis
- Bottleneck theory
- Cognitive science
- Echoic memory
- Filter theory
- Language processing
- King–Kopetzky syndrome
- Dream speech
- Spatial hearing loss
- Franssen effect
References[edit | edit source]
- Bronkhorst, Adelbert W. (2000). The Cocktail Party Phenomenon: A Review on Speech Intelligibility in Multiple-Talker Conditions. Acta Acustica united with Acustica 86: 117–128.
- Wood, Noelle, Cowan, Nelson (1 June 1995). The cocktail party phenomenon revisited: How frequent are attention shifts to one's name in an irrelevant auditory channel?. Journal of Experimental Psychology: Learning, Memory, and Cognition 21 (1): 255-260.
- Conway, Andrew, R. A., Cowan, Nelson; Bunting, Michael F. (15 August 2001). The cocktail party phenomenon revisited: The importance of working memory capacity.. Psychonomic Bulletin & Review 8 (2): 331-335.
- Slatky, Harald (1992): Algorithms for direction specific Processing of Sound Signals - the Realization of a binaural Cocktail-Party-Processor-System, Dissertation, Ruhr-University Bochum, Germany
- Jeffress, L. A., 1948 A place theory of sound localization, J. Comp. Physiol Psychol, vol. 41, pp. 35-39.
- Hamzelou, Jessica [www.newscientist.com/article/mg21428613.800-cocktail-party-effect-identified-in-the-brain.html 'Cocktail party effect' identified in the brain]. URL accessed on September 16, 2012.
- Sorkin, Robert D.; Kantowitz, Barry H. (1983). Human factors: understanding people-system relationships, New York: Wiley.
- Cherry, E. Colin (1953-09). Some Experiments on the Recognition of Speech, with One and with Two Ears. Journal of Acoustic Society of America 25 (5): 975–979.
- Broadbent, D.E. (1954). The role of auditory localization in attention and memory span. Journal of Experimental Psychology 47 (3): 191–196.
- Scharf, Bertram (1990). On hearing what you listen for: The effects of attention and expectancy. Canadian Psychology 31 (4): 386-387.
- Brungart, Douglas S., Simpson, Brian D. (2007). Cocktail party listening in a dynamic multitalker environment. Attention, Perception and Psychophysics 69 (1): 79-91.
- Haykin, Simon, Chen, Zhe (17 Oct 2005). The Cocktail Party Problem.. Neural Computation 17 (9): 1875-1902.
- Gray J.A., Wedderburn A.A.I. (1960). Coping strategies with simultaneous stimuli. Quarterly Journal of Experimental Psychology 12 (3): 180–184.
- Treisman, Anne M. (1969). Strategies and models of selective attention.. Psychological Review 76 (3): 282-299.
- Deutsch, J.A., Deutsch, D. (1963). Attention: Some Theoretical Considerations.. Psychological Review 70 (I): 80-90.
- Norman, Donald A. (1968). Toward a theory of memory and attention.. Psychological Review 75 (6): 522-536.
- Kahneman, D. (1973). Attention and effort. Englewood Cliffs, NJ: Prentice-Hall.
- Narayan, Rajiv, Best, Virginia; Ozmeral, Erol; McClaine, Elizabeth; Dent, Micheal; Shinn-Cunningham, Barbara; Sen, Kamal (2007). Cortical interference effects in the cocktail party problem.. Nature Neuroscience 10 (12): 1601-1607.
- Dalton, Polly, Santangelo, Valerio; Spence, Charles (2009). The role of working memory in auditory selective attention.. The Quarterly Journal of Experimental Psychology 62 (11): 2126-2132.
- Koch, Iring, Lawo, Vera; Fels, Janina; Vorländer, Michael (9 May 2011). Switching in the cocktail party: Exploring intentional control of auditory selective attention.. Journal of Experimental Psychology: Human Perception and Performance 37 (4): 1140-1147.
- Shapiro, Kimron L., Caldwell, Judy; Sorensen, Robyn E. (1 January 1997). Personal names and the attentional blink: A visual "cocktail party" effect.. Journal of Experimental Psychology: Human Perception and Performance 23 (2): 504-514.
- Vorobyev, Victor A., Alho, Kimmo; Medvedev, Svyatoslav V.; Pakhomov, Sergey V.; Roudas, Marina S.; Rutkovskaya, Julia M.; Tervaniemi, Mari; van Zuijen, Titia L.; Näätänen, Risto (2004). Linguistic processing in visual and modality-nonspecific brain areas: PET recordings during selective attention.. Cognitive Brain Research 20 (2): 309-322.
Further reading[edit | edit source]
- Potts GF, Wood SM, Kothmann D, Martin LE (October 2008). Parallel perceptual enhancement and hierarchic relevance evaluation in an audiovisual conjunction task. Brain Res. 1236: 126–39.
- McLachlan N, Wilson S (January 2010). The central role of recognition in auditory perception: a neurobiological model. Psychol Rev 117 (1): 175–96.
[edit | edit source]
- Audio examples of a cocktail party processor
- Handbook for acoustic ecology
- Dichotic Listening Experiment
- Early and Late Selection Theories of Attention by Nick Milton (May 1994)
- Neurofunctional studies in birds and bats
- A Primer on 'The cocktail party problem' by Prof. Josh McDermott in Current Biology