Language Enlightenment

Saturday, December 10, 2016

August 12, 2015

Written by Maximus Peperkamp, M.S. Verbal Engineer

Dear Reader,

This writing is my twelfth response to “Talker-specific learning in speech perception” by Nygaard and Pisoni (1998). These researchers focus on something which has been apparent to me for a long time. It has often bothered me that what seems obvious to me is not accepted, let alone understood, by others. With certain people and under certain circumstances I am able to talk, think, feel and function coherently, but with other people and other circumstances I stumble over my words, I cannot think, I cannot remember and I only seem to make mistakes.

I recognize myself in what these authors write about. “In our use of language, we are often aware that through exposure to and learning of a novel talker’s voice, for example, we become increasingly able to recover the linguistic aspects of an utterance that seemed difficult to understand only moments earlier.” This positive occurrence is one which I would call Sound Verbal Behavior (SVB), in which the speaker’s voice has a positive effect on the listener. The influence of this “novel talker’s voice” is in stark contrast to the Noxious Verbal Behavior (NVB) speaker’s voice, which has a negative effect on the listener. The more “exposure to and learning of” a SVB voice can occur, the more capable and confident we seem to become.

“Perceptual learning involves an increase in the ability to extract information from the environment, as a result of experience and practice with stimulation coming from it. Gibson (1969) has identified two types of perceptual learning.” One type suggests “that perceptual sensitivity can be enhanced by pre-exposure to a set of stimuli.” In this type “Mere experience of the stimulus domain increases perceivers’ sensitivity.” If we consider the sound of the speaker’s voice as “stimulus domain”, we find that exposure to both SVB as well as NVB increases the listener’s “perceptual sensitivity.” However, if the listener is more exposed to NVB than to SVB, a different kind of sensitivity begins to occur in the listener, who will become biased to whatever he or she has been most often exposed to.

“In the second type, explicit experience in categorizing or identifying stimuli allows perceivers to become attuned to specific diagnostic physical features.” Thus, the listener’s “experience in categorizing or identifying stimuli” as belonging to the SVB or NVB category depends on the ways in which he or she was conditioned. Authors describe this learning process “which allows the perceivers to become attuned to specific diagnostic physical features.” However, such becoming “attuned” of course only applies to SVB, because NVB is only about coercion and obedience.

“For this type of learning, the organization of stimuli into categories has been shown to have an important influence on subsequent perceptual sensitivity.” The listener’s “perceptual sensitivity” is shaped by the extent to which he or she is more often exposed to SVB or NVB. I was never able to learn much from speakers who had a lot of NVB and little or no SVB. From an early age I favored SVB speakers, because with them was I able to learn and do something right. With NVB speakers, such as my father, I couldn’t do anything right.

“In the case of talker learning, categorizing or identifying talker’s voices may lead to increased distinctiveness of the perceptual dimensions of talker identity.” Although I was affected by the rejection of my father, luckily there were plenty of SVB speakers in my family, such as my mother, my grandmothers and my uncle, who supported and encouraged me. However, since they did not have any education, they couldn’t play a significant role in my academic development. Their primarily emotional support allowed me to listen to myself and figure out that I needed SVB to succeed in life.

“If a benefit of perceptual learning of voice can be demonstrated for linguistic processing as well, it would suggest that the same underlying dimensions subserve both perceptual abilities.” Such a “benefit of perceptual learning” can be demonstrated with SVB. Everything I have achieved is in my opinion due to SVB. In the studies reviewed by these authors “it has been found that a number of factors, such as the a priori distinctiveness of the set of voices to be learned, the number of talkers to be identified or discriminated, and the length or duration of the utterances used during training (i.e., syllables, words, phrases, passages), can mediate learning of voices.” Another way of summarizing these results is to state that in SVB we really listen to each other, because we acknowledge that it takes time to have a conversation. In NVB, on the other hand, we are always in a rush and stressed, as, supposedly, there is not enough time.

“Not surprisingly, listeners learn to recognize talkers’ voices most readily when utterances of long duration from a few highly distinct talkers are used.” In NVB, the speakers dominate and intimidate the listeners and talkers struggle to get the attention from other talkers, by forcing them to remain listeners. Moreover, in NVB communicators don’t give each other the time to speak and cut each other off whenever they can. “These results suggest that a period of perceptual learning is required for listeners to become sensitive to talker-specific information in the speech signal.” Only SVB has such “a period of perceptual learning” for the listener. In NVB no such learning period is needed as the speaker coerces the listener.

The author’s conclusion that “Listeners do not appear to acquire expertise in talker recognition effortlessly, but rather learn over time to attend explicitly to the unique, acoustically distinct properties of each talker’s voice” is clearly based on the ubiquity of NVB. Talking and listening is perceived as effortful only during NVB, but during SVB these two behaviors occur effortlessly. The fact that learning occurs “over time” does not have to mean that learning involves any effort. However, given the common lack of time which is experienced when we have NVB, the authors equate the lack of time with effort. In SVB we take more time to talk, but it takes no effort.

Tuesday, November 22, 2016

August 11, 2015

Written by Maximus Peperkamp, M.S. Verbal Engineer

Dear Reader,

This writing is my eleventh response to “Talker-specific learning in speech perception” by Nygaard and Pisoni (1998). Findings suggest “the effects of talker variability on perception and memory are a consequence of the additional processing time and resources that are devoted to encoding talker-specific information when the talker’s voice changes from item to item in these tasks.” However, these authors don’t mention that what they call “additional processing time” has to do with the sensitivity of the speaker for how the listener is affected by his or her voice.

The speaker's sensitivity to the listener occurs only there during Sound Verbal Behavior (SVB). However, the effects of the SVB speaker’s voice on the perception and memory of the listener have nothing to do with time, but whether it is perceived as an appetitive stimulus. The listener, who was conditioned by Noxious Verbal Behavior (NVB), may actually experience the SVB speaker’s voice as an aversive stimulus, as it doesn’t sound like anything he or she is used to. This is not to say that this cannot be changed, it can, but to build up more SVB repertoire requires a decrease and ideally the extinction of NVB responses.

These authors reify (make processes into things) when they write about “talker-specific information” which presumably is “retained in memory and can be used as a cue, in addition to linguistic content, to retrieve specific linguistic events from memory.” Certainly, the nervous system of speakers and listeners is altered, that is, conditioned, by spoken communication, due to which they are more likely to respond in a particular way, but “talker-specific information” is an inference which doesn’t explain anything.

No wonder that “the question still remains, however, as to the relationship between the processing of talker information and the processing of linguistic content.” That question can only be answered if we rephrase it in functional terms. I suggest: is what the speaker says affected by how he or she is saying it? And, could this perhaps be troubling the listener?

Instead of ‘mentalist’ inferences about “processing of talker-information” and “processing of linguistic information”, we should ask and answer why SVB produces better outcomes than NVB. If what we say is distracted from by how we say it, then we must prevent NVB and enhance SVB. If reducing NVB and increasing SVB leads to better results this is because how we say things determines whether what we say can or will be understood. How the speaker sounds and whether the speaker engages in SVB or NVB, either prevents or distracts the listener from paying attention to what the speaker is saying or it supports and stimulates the listener to pay attention to what the speaker is saying and to remember it.

The researcher’s question: “are the perceptual analyses that extract both types of information [talker-identity and linguistic content] integrally linked? (words between brackets added) is coming close to mine. During SVB what we say is congruent with how we say it, but during NVB the speaker produces contradicting messages with what he or she says and how he or she says it. The speaker's congruence also pertains to his or her verbal and nonverbal expression. Furthermore, the SVB speaker's speaking and listening behaviors are joined, that is, they occur at the same rate. Another way of describing this is that during SVB the speaker is conscious of his or her sound. The SVB speaker's voice is produced and listened to in the here and now. In NVB, on the other hand, the speaker is not listening to him or herself and is only busy trying to get others to listen to him or to her.

Thus, NVB is mechanical, unconscious and uncomfortable speech, which doesn't stimulate the speaker-as-own-listener. Consequently, the NVB speaker separates the speaker from the listener and in doing so separates public speech from private speech. During NVB what we really think and feel is kept out of public speech. We cannot express it as the sensitivity and awareness that is needed to do this is missing. Moreover, as the NVB speaker is not listening to his or her own sound, he or she gets carried away by what he or she is saying without ever realizing how he or she is saying it. In other words, the NVB speaker is heady. He or she is verbally fixated and he or she speaks in a disembodied, dissociated and dis-regulated fashion.

August 10, 2015

Written by Maximus Peperkamp, M.S. Verbal Engineer

Dear Reader,

This writing is my tenth response to “Talker-specific learning in speech perception” by Nygaard and Pisoni (1998). "Serial recall of spoken word lists produced by multiple talkers was poorer than recall of lists produced by a single talker; but the result was found only in the primacy portion of the serial recall curve.” These results need to be analyzed in terms of whether the speaker produced Sound Verbal Behavior (SVB) or Noxious Verbal Behavior (NVB) and positively influenced the listener with an appetitive sounding voice or an negatively influenced the listener with an aversive sounding voice.

“The primacy portion of the serial curve” is hypothesized to be absent in SVB and is believed to be mainly be a function of NVB. It was suggested that "variation in a talker’s voice from word to word in a list competes for processing resources in the recall task.” This interpretation doesn’t answer the question why this “competition for processing resources” occurs. The SVB/NVB distinction, however, makes us realize that this “competition” occurs only due to NVB.

In SVB the verbal and nonverbal expressions of the speaker are aligned, but in NVB they are two different messages as they are disjointed. Moreover, in SVB the speaker is his or her own listener. This always positively effects the feelings of the listener, but in NVB the speaker is not listening to him or herself, which always negatively effects the listener. What is recalled by the listener from an aversive-sounding NVB speaker is mainly that he or she sounds aversive.

Stated differently, because lexical information dominates and hinders the linguistic analysis of what the speaker says, the listener who listens to a NVB speaker is believed to remember less than the listener who listens to a SVB speaker. Whether it is possible or not, the listener who listens to a NVB speaker will try to move away from the aversive stimulation of the speaker and consequently remember less of the lexical information.

The “analysis of talker information during a memory task appears to be both time- and resource-demanding,” but only when the listener is dealing with a NVB speaker. Reasoned from the SVB/NVB distinction, we find that it is not the “talker variability” which “increases the capacity demands of the working memory system”, but it is SVB which increases this capacity and, by contrast, it is NVB, which decreases this working memory capacity.

The researchers noted that recall is also affected by presentation rates. They don’t mention that these, in turn, are determined by the kind of vocal verbal behavior of the speakers, that is, by their SVB or NVB. A SVB speaker's speech episode contains more instances of SVB than NVB, while a NVB speaker's presentation contains more NVB instances than SVB instances.

The SVB presentation occurs at more relaxed pace and slower rate than the anxiety and stress provoking NVB presentation. Leaving out the influence of the talker’s voice on the listener, the researchers overlook what may be the most important independent variable, which is unspecified in the catch-all-phrase “talker variability.”

Authors unaware of the SVB/NVB distinction will maintain ‘mentalistic’ definitions, which are useless in any behavioral account. 'Talker variability' is a useless term if it doesn't address SVB and NVB. “This interaction between presentation rate and serial recall for the multiple- and single-talker word lists suggests that at fast presentation rates, when processing is constrained by time, talker variability affects both the perceptual encoding and the rehearsal of items in the serial recall task” (words underline by me).

To consider the influence of the speaker’s sound, we should do away with constructs that represent verbal bias. Conclusions are drawn which prevent us from finding out what is happening. “At slower presentation rates, when listeners have more time and resources to encode and rehearse talker information, they are able to use that information to aid them in the encoding of item and order information.” With a SVB speaker the listener is at ease and better able to pay attention to what he or she is saying. The SVB/NVB distinction is a more parsimonious explanation than inferences about “encoding” and “rehearsing” of the “item and order information”.

Based on my knowledge about SVB and NVB, I object to the researcher’s conclusion. “These memory findings suggest that talker information may not be discarded in the process of spoken word recognition, but rather is retained in memory along with the more abstract, symbolic linguistic content of the utterance.” They seem to think that nebulous cognitive processes explain how “talker information” is “retained in memory.”

What is left out by these authors is the fact that the listener’s neural behavior is altered by the sound of the speaker's voice, leading one listener to supposedly have better memory than the other. What actually happens is that the body of the listener who ‘remembers’ what the speaker has said was positively affected by the tone of the speaker’s voice. The stress that is produced by the NVB speaker always has an adverse effect on memory.

If we don’t discard constructs as “information”, we continue to misrepresent classical and operant conditioning effects – in speakers and listeners – of how the speaker sounds. Presumably “Talker-specific information is retained in memory along with lexical information" and "this information can facilitate listeners’ recognition memory.” SVB and NVB can be heard, but “talker-specific information” and “lexical information”cannot.

Ironically, the researchers, who found that “Words repeated in the same voice were recognized better than words repeated in a different voice”, didn’t realize that fixation on words, a characteristic of NVB, distracts them from paying attention to how this “same voice” actually sounds.

Sunday, November 20, 2016

August 9, 2015

Written by Maximus Peperkamp, M.S. Verbal Engineer

Dear Reader,

This writing is my ninth response to “Talker-specific learning in speech perception” by Nygaard and Pisoni (1998). It amazes me that the relation between “the indexical properties of the speech signal” and “the more abstract linguistic content of an utterance” needs to be pointed out. My reasoning is based on Sound Verbal Behavior (SVB) in which what we say is as important as how we say it. Reasoning which is based on what I call Noxious Verbal Behavior (NVB) creates a split between what we say and how we say it. The former is more important than the latter in NVB.

We must realize that “the problem” created by this split only occurs in NVB and never in SVB. There is no problem that in SVB these two are conveyed simultaneously. Surely, most researchers are unaware of the great difference between SVB and NVB. That is why they write that “The essence of the problem is that both types of information are conveyed simultaneously along the same acoustic dimensions within the speech signal."

Actually, they are unknowingly saying that NVB is problem. Only in NVB “the information about the talker must be disentangled from information about the linguistic content of the utterance.” What they call “perceptual normalization” I call SVB, as SVB includes “an account of the processing and representation of both the linguistic and the indexical information that are carried in parallel in the speech signal.” This is a sophisticated way of describing SVB. Moreover, while SVB normalizes our perception, NVB can be said to distort our perception. It is only a small step from “talker variability” to a different way of talking, that is to SVB and NVB.

“Several studies have shown that talker variability has a significant impact both on the perceptual processing of spoken utterances and on the memory representations constructed during the perception of spoken language.” The interpretation of such studies begins to make much more sense when we identify such impact as the positive or negative emotions in the listener.

The two subclasses of vocal verbal behavior, SVB and NVB, refer to how the listener’s affective experiences interact with “perceptual processing” and “memory representations constructed during the perception of spoken language.” A reinterpretation of the research makes clear that because of a certain way of talking we perceive reality as it is, as we embody that reality during our spoken language.

“Talker variability has been shown to affect both vowel perception." Also, it was found that "perceptual identification of words presented in noise was significantly poorer when the words were produced by multiple talkers than when they were produced by a single talker." Once we are familiar with the SVB/NVB distinction it is quite clear that only SVB can improve vowel perception and spoken word recognition, while NVB will always impair it.

Perceptual identification of words presented in noise will only be better if this single talker has SVB, but not if he or she has NVB. If among multiple talkers there would be a couple of SVB talkers and if the single talker would be a NVB talker, then perceptual identification of words uttered by multiple talkers is hypothesized to be higher than for the single talker. Also, the “difficulty ignoring irrelevant variation in the talker’s voice when asked to classify syllables by initial phoneme” is hypothesized to only occur with a NVB speaker, but not a SVB speaker. To the contrary, with a SVB speaker’s variability is believed to enhance perception.

“Aspects of the speech signal related to classifying talker identity seem to be integrally linked to attributes related to the processing of the linguistic content of the signal.” They are, but we can only acknowledge this during SVB, whereas during NVB we deny this. Thus, we understand each other better during SVB in which the speaker talks with, not at the listener. In SVB there is no need to classify talker’s identity as the listener is safe, but in NVB talker’s identity is important as the talker threatens the listener.