Interplay of informational content and energetic masking in speech perception in noise

Vincent Aubanel, Chris Wayne Davis, Jeesun Kim

Research output: Contribution to conferencePaper

Abstract

It seems plausible that different regions of the speech signal convey different amounts of information. Understanding which aspects of the signal convey information is important for understanding speech perception, particularly when this occurs in noisy environments. The so called cochlea-scaled entropy (CSE) measure is an index of spoken information based on the distribution of spectral energy over consonant/vowel time scales that is defined independently of potential noise corruption. In speech in noise, however, energetic masking distorts information, because it suppresses certain spectro-temporal regions. This study explored the interplay of informational content (defined by CSE) and energetic masking in explaining the listeners ability to understand speech in noise. Using a priming paradigm, mixtures of speech and speech-shape noise were presented to listeners in an identification task. Sentences were preceded by previews consisting of either low or high informational content. Both types yielded a similar performance increase of around 19%. Although the low information preview transmitted less target information it had a greater overlap with the more energetic regions of the target sentence (i.e., those that were less masked). This could explain why both preview types were effective and calls for a consideration of both measures in understanding speech recognition in noise.

Original languageEnglish
Pages2046-2049
Number of pages4
Publication statusPublished - 1 Jan 2014
Event15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014 -
Duration: 14 Sep 2014 → …

Conference

Conference15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014
Period14/09/14 → …

Keywords

  • Speech in noise

Fingerprint Dive into the research topics of 'Interplay of informational content and energetic masking in speech perception in noise'. Together they form a unique fingerprint.

  • Cite this

    Aubanel, V., Davis, C. W., & Kim, J. (2014). Interplay of informational content and energetic masking in speech perception in noise. 2046-2049. Paper presented at 15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014, .