Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers

Reitmaier, Thomas; Wallington, Electra; Klejch, Ondrej; Markl, Nina; Lam-Yee-Mui, Léa-Marie; Pearson, Jen; Jones, Matt; Bell, Peter; Robinson, Simon

doi:10.1145/3544548.3581385

Conference Paper/Proceeding/Abstract 1867 views 451 downloads

Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers

Thomas Reitmaier

, Electra Wallington, Ondrej Klejch, Nina Markl, Léa-Marie Lam-Yee-Mui, Jen Pearson

, Matt Jones

, Peter Bell, Simon Robinson

ACM CHI Conference on Human Factors in Computing Systems: CHI' 23, Pages: 1 - 17

Swansea University Authors: Thomas Reitmaier , Jen Pearson , Matt Jones , Simon Robinson

PDF | Version of Record
Download (2MB)

DOI (Published version): 10.1145/3544548.3581385

Abstract

In this paper we develop approaches to automatic speech recognition (ASR) development that suit the needs and functions of underheard language speakers. Our novel contribution to HCI is to show how community-engagement can surface key technical and social issues and opportunities for more efective s...

Full description

Published in:	ACM CHI Conference on Human Factors in Computing Systems: CHI' 23
ISBN:	978-1-4503-9421-5/23/04
Published:	ACM 2023
URI:	https://cronfa.swan.ac.uk/Record/cronfa62395

first_indexed	2023-01-23T10:21:39Z
last_indexed	2024-11-14T12:20:53Z
id	cronfa62395
recordtype	SURis
fullrecord	<?xml version="1.0"?><rfc1807><datestamp>2024-10-18T16:35:20.8337574</datestamp><bib-version>v2</bib-version><id>62395</id><entry>2023-01-23</entry><title>Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers</title><swanseaauthors><author><sid>ccd66b64d11d76b9cd8b28e9d42a0ff0</sid><ORCID>0000-0003-2078-6699</ORCID><firstname>Thomas</firstname><surname>Reitmaier</surname><name>Thomas Reitmaier</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>6d662d9e2151b302ed384b243e2a802f</sid><ORCID>0000-0002-1960-1012</ORCID><firstname>Jen</firstname><surname>Pearson</surname><name>Jen Pearson</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>10b46d7843c2ba53d116ca2ed9abb56e</sid><ORCID>0000-0001-7657-7373</ORCID><firstname>Matt</firstname><surname>Jones</surname><name>Matt Jones</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>cb3b57a21fa4e48ec633d6ba46455e91</sid><ORCID>0000-0001-9228-006X</ORCID><firstname>Simon</firstname><surname>Robinson</surname><name>Simon Robinson</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2023-01-23</date><deptcode>MACS</deptcode><abstract>In this paper we develop approaches to automatic speech recognition (ASR) development that suit the needs and functions of underheard language speakers. Our novel contribution to HCI is to show how community-engagement can surface key technical and social issues and opportunities for more efective speech-based systems. We introduce a bespoke toolkit of technologies and showcase how we utilised the toolkit to engage communities of under-heard language speakers; and, through that engagement process, situate key aspects of ASR development in community contexts. The toolkit consists of (1) an information appliance to facilitate spoken-data collection on topics of community interest, (2) a mobile app to create crowdsourced transcripts of collected data, and (3) demonstrator systems to showcase ASR capabilities and to feed back research results to community members. Drawing on the sensibilities we cultivated through this research, we present a series of challenges to the orthodoxy of state-of-the-art approaches to ASR development.</abstract><type>Conference Paper/Proceeding/Abstract</type><journal>ACM CHI Conference on Human Factors in Computing Systems: CHI' 23</journal><volume/><journalNumber/><paginationStart>1</paginationStart><paginationEnd>17</paginationEnd><publisher>ACM</publisher><placeOfPublication/><isbnPrint/><isbnElectronic>978-1-4503-9421-5/23/04</isbnElectronic><issnPrint/><issnElectronic/><keywords>Text/speech/language, automatic speech recognition, mobile devices: phones/tablets</keywords><publishedDay>23</publishedDay><publishedMonth>4</publishedMonth><publishedYear>2023</publishedYear><publishedDate>2023-04-23</publishedDate><doi>10.1145/3544548.3581385</doi><url/><notes/><college>COLLEGE NANME</college><department>Mathematics and Computer Science School</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>MACS</DepartmentCode><institution>Swansea University</institution><apcterm>External research funder(s) paid the OA fee (includes OA grants disbursed by the Library)</apcterm><funders>UKRI (EP/T024976/1)</funders><projectreference/><lastEdited>2024-10-18T16:35:20.8337574</lastEdited><Created>2023-01-23T10:20:07.8635873</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Mathematics and Computer Science - Computer Science</level></path><authors><author><firstname>Thomas</firstname><surname>Reitmaier</surname><orcid>0000-0003-2078-6699</orcid><order>1</order></author><author><firstname>Electra</firstname><surname>Wallington</surname><order>2</order></author><author><firstname>Ondrej</firstname><surname>Klejch</surname><order>3</order></author><author><firstname>Nina</firstname><surname>Markl</surname><order>4</order></author><author><firstname>Léa-Marie</firstname><surname>Lam-Yee-Mui</surname><order>5</order></author><author><firstname>Jen</firstname><surname>Pearson</surname><orcid>0000-0002-1960-1012</orcid><order>6</order></author><author><firstname>Matt</firstname><surname>Jones</surname><orcid>0000-0001-7657-7373</orcid><order>7</order></author><author><firstname>Peter</firstname><surname>Bell</surname><order>8</order></author><author><firstname>Simon</firstname><surname>Robinson</surname><orcid>0000-0001-9228-006X</orcid><order>9</order></author></authors><documents><document><filename>62395__26881__e485e0f343b448309b02477abfd5d603.pdf</filename><originalFilename>Situating-Automatic-Speech-Recognition.pdf</originalFilename><uploaded>2023-03-17T14:07:34.1310898</uploaded><type>Output</type><contentLength>2092884</contentLength><contentType>application/pdf</contentType><version>Version of Record</version><cronfaStatus>true</cronfaStatus><copyrightCorrect>true</copyrightCorrect><language>eng</language></document></documents><OutputDurs/></rfc1807>
spelling	2024-10-18T16:35:20.8337574 v2 62395 2023-01-23 Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers ccd66b64d11d76b9cd8b28e9d42a0ff0 0000-0003-2078-6699 Thomas Reitmaier Thomas Reitmaier true false 6d662d9e2151b302ed384b243e2a802f 0000-0002-1960-1012 Jen Pearson Jen Pearson true false 10b46d7843c2ba53d116ca2ed9abb56e 0000-0001-7657-7373 Matt Jones Matt Jones true false cb3b57a21fa4e48ec633d6ba46455e91 0000-0001-9228-006X Simon Robinson Simon Robinson true false 2023-01-23 MACS In this paper we develop approaches to automatic speech recognition (ASR) development that suit the needs and functions of underheard language speakers. Our novel contribution to HCI is to show how community-engagement can surface key technical and social issues and opportunities for more efective speech-based systems. We introduce a bespoke toolkit of technologies and showcase how we utilised the toolkit to engage communities of under-heard language speakers; and, through that engagement process, situate key aspects of ASR development in community contexts. The toolkit consists of (1) an information appliance to facilitate spoken-data collection on topics of community interest, (2) a mobile app to create crowdsourced transcripts of collected data, and (3) demonstrator systems to showcase ASR capabilities and to feed back research results to community members. Drawing on the sensibilities we cultivated through this research, we present a series of challenges to the orthodoxy of state-of-the-art approaches to ASR development. Conference Paper/Proceeding/Abstract ACM CHI Conference on Human Factors in Computing Systems: CHI' 23 1 17 ACM 978-1-4503-9421-5/23/04 Text/speech/language, automatic speech recognition, mobile devices: phones/tablets 23 4 2023 2023-04-23 10.1145/3544548.3581385 COLLEGE NANME Mathematics and Computer Science School COLLEGE CODE MACS Swansea University External research funder(s) paid the OA fee (includes OA grants disbursed by the Library) UKRI (EP/T024976/1) 2024-10-18T16:35:20.8337574 2023-01-23T10:20:07.8635873 Faculty of Science and Engineering School of Mathematics and Computer Science - Computer Science Thomas Reitmaier 0000-0003-2078-6699 1 Electra Wallington 2 Ondrej Klejch 3 Nina Markl 4 Léa-Marie Lam-Yee-Mui 5 Jen Pearson 0000-0002-1960-1012 6 Matt Jones 0000-0001-7657-7373 7 Peter Bell 8 Simon Robinson 0000-0001-9228-006X 9 62395__26881__e485e0f343b448309b02477abfd5d603.pdf Situating-Automatic-Speech-Recognition.pdf 2023-03-17T14:07:34.1310898 Output 2092884 application/pdf Version of Record true true eng
title	Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers
spellingShingle	Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers Thomas Reitmaier Jen Pearson Matt Jones Simon Robinson
title_short	Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers
title_full	Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers
title_fullStr	Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers
title_full_unstemmed	Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers
title_sort	Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers
author_id_str_mv	ccd66b64d11d76b9cd8b28e9d42a0ff0 6d662d9e2151b302ed384b243e2a802f 10b46d7843c2ba53d116ca2ed9abb56e cb3b57a21fa4e48ec633d6ba46455e91
author_id_fullname_str_mv	ccd66b64d11d76b9cd8b28e9d42a0ff0_*_Thomas Reitmaier 6d662d9e2151b302ed384b243e2a802f__Jen Pearson 10b46d7843c2ba53d116ca2ed9abb56e__Matt Jones cb3b57a21fa4e48ec633d6ba46455e91_*_Simon Robinson
author	Thomas Reitmaier Jen Pearson Matt Jones Simon Robinson
author2	Thomas Reitmaier Electra Wallington Ondrej Klejch Nina Markl Léa-Marie Lam-Yee-Mui Jen Pearson Matt Jones Peter Bell Simon Robinson
format	Conference Paper/Proceeding/Abstract
container_title	ACM CHI Conference on Human Factors in Computing Systems: CHI' 23
container_start_page	1
publishDate	2023
institution	Swansea University
isbn	978-1-4503-9421-5/23/04
doi_str_mv	10.1145/3544548.3581385
publisher	ACM
college_str	Faculty of Science and Engineering
hierarchytype
hierarchy_top_id	facultyofscienceandengineering
hierarchy_top_title	Faculty of Science and Engineering
hierarchy_parent_id	facultyofscienceandengineering
hierarchy_parent_title	Faculty of Science and Engineering
department_str	School of Mathematics and Computer Science - Computer Science{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Mathematics and Computer Science - Computer Science
document_store_str	1
active_str	0
description	In this paper we develop approaches to automatic speech recognition (ASR) development that suit the needs and functions of underheard language speakers. Our novel contribution to HCI is to show how community-engagement can surface key technical and social issues and opportunities for more efective speech-based systems. We introduce a bespoke toolkit of technologies and showcase how we utilised the toolkit to engage communities of under-heard language speakers; and, through that engagement process, situate key aspects of ASR development in community contexts. The toolkit consists of (1) an information appliance to facilitate spoken-data collection on topics of community interest, (2) a mobile app to create crowdsourced transcripts of collected data, and (3) demonstrator systems to showcase ASR capabilities and to feed back research results to community members. Drawing on the sensibilities we cultivated through this research, we present a series of challenges to the orthodoxy of state-of-the-art approaches to ASR development.
published_date	2023-04-23T13:53:51Z
_version_	1867795746814689280
score	11.108671

Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers

Similar Items