Building an audio-visual corpus of Australian English: large corpus collection with an economical portable and replicable Black Box

Denis Burnham, Dominique Estival, Steven Fazio, Jette Viethen, Felicity Cox, Robert Dale, Stephen Cassidy, Julien Epps, Roberto Togneri, Michael Wagner, Yuko Kinoshita, Roland Goecke, Joanne Arciuli, Mark Onslow, Trent Lewis, Andrew Butcher, John Hajek

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    28 Citations (Scopus)

    Abstract

    The Big Australian Speech Corpus project incorporates the strategic goals of 30 Chief Investigators from various speech science areas. Speech from 1000 geographically and socially diverse speakers is being recorded using a uniform and automated protocol plus standardized hardware and software to produce a widely applicable and extensible database - AusTalk. Here we describe the project's major components and organization; share the lessons learnt from difficulties and challenges; and present the results achieved so far.

    Original languageEnglish
    Title of host publicationINTERSPEECH 2011
    Pages841-844
    Number of pages4
    Publication statusPublished - 1 Dec 2011
    Event12th Annual Conference of the International Speech Communication Association -
    Duration: 28 Aug 2011 → …

    Conference

    Conference12th Annual Conference of the International Speech Communication Association
    Period28/08/11 → …

    Keywords

    • Australian English
    • AV data
    • Speech corpus

    Fingerprint Dive into the research topics of 'Building an audio-visual corpus of Australian English: large corpus collection with an economical portable and replicable Black Box'. Together they form a unique fingerprint.

    Cite this