Recently it has been argued that speakers use conventionalized forms to express different prosodic attitudes . We examined this by looking at across speaker consistency in the expression of auditory and visual (head and face motion) prosodic attitudes produced on multiple different occasions. Specifically, we examined acoustic and motion profiles of a female and a male speaker expressing six different prosodic attitudes for four within-session repetitions across four different sessions. We used the same acoustic features as  and visual prosody was assessed by examining patterns of speaker's mouth, eyebrow and head movements. There was considerable variation in how prosody was realized across speakers, with the productions of one speaker more discriminable than the other. Within-session variation for both the acoustic and movement data was smaller than acrosssession variation, suggesting that short-term memory plays a role in consistency. The expression of some attitudes was less variable than others and better discrimination was found with the acoustic compared to the visual data, although certain visual features (e.g., eyebrow brow motion) provided better discrimination than others.
|Number of pages
|Published - 2016
|17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016 -
Duration: 8 Sept 2016 → …
|17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016
|8/09/16 → …