Assessing the quality of TTS audio in the LARA learning-by-reading platform

Elham Akhlaghi, Anna Bączkowska, Harald Berthelsen, Branislav Bédi, Cathy Chua, Catia Cucchiarini, Hanieh Habibi, Ivana Horváthová, Pernille Hvalsøe, Roy Lotz, Christele Maizonniaux, Neasa Ní Chiaráin, Manny Rayner, Nikos Tsourakis, Chunlin Yao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Downloads (Pure)

Abstract

A popular idea in Computer Assisted Language Learning (CALL) is to use multimodal annotated texts, with annotations typically including embedded audio and translations, to support L2 learning through reading. An important question is how to create the audio, which can be done either through human recording or by a Text-To-Speech (TTS) synthesis engine. We may reasonably expect TTS to be quicker and easier, but humans to be of higher quality. Here, we report a study using the open-source LARA platform and ten languages. Samples of LARA audio totaling about three and a half minutes were provided for each language in both human and TTS form; subjects used a web form to compare different versions of the same item and rate the voices as a whole. Although human voice was more often preferred, TTS achieved higher ratings in some languages and was close in others.
Original languageEnglish
Title of host publicationCALL and professionalisation
Subtitle of host publicationshort papers from EUROCALL 2021
EditorsNaouel Zoghlami, Cédric Brudermann, Cedric Sarré, Muriel Grosbois, Linda Bradley, Sylvie Thouësny
Place of PublicationFrance
PublisherResearch-publishing.net
Pages1-5
Number of pages5
ISBN (Electronic)9782490057979
DOIs
Publication statusPublished - Dec 2021
EventEUROCALL 2021: CALL & Professionalisation - Online conference
Duration: 26 Aug 202127 Aug 2021
https://research-publishing.net/book?10.14705/rpnet.2021.54.9782490057979

Conference

ConferenceEUROCALL 2021
CityOnline conference
Period26/08/2127/08/21
Internet address

Keywords

  • Reading
  • Multimodality
  • Text-To-Speech (TTS)
  • Evaluation

Fingerprint

Dive into the research topics of 'Assessing the quality of TTS audio in the LARA learning-by-reading platform'. Together they form a unique fingerprint.

Cite this