No Cover Image

Conference Paper/Proceeding/Abstract 1350 views

Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language

Briony Williams, Rhys Jones Orcid Logo

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Swansea University Author: Rhys Jones Orcid Logo

Abstract

We describe a procedure for generating pronunciations for a place-name dictionary for Welsh. For many such less-resourced languages, phoneticians with expertise in the language do not exist, so direct editing of pronunciations cannot occur. We describe a method where, if a native speaker and a text-...

Full description

Published in: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Published: Marrakech, Morocco European Language Resources Association (ELRA) 2008
Online Access: http://www.lrec-conf.org/proceedings/lrec2008/pdf/55_paper.pdf
URI: https://cronfa.swan.ac.uk/Record/cronfa11776
Tags: Add Tag
No Tags, Be the first to tag this record!
first_indexed 2014-01-30T02:30:01Z
last_indexed 2018-02-09T04:41:36Z
id cronfa11776
recordtype SURis
fullrecord <?xml version="1.0"?><rfc1807><datestamp>2014-03-20T10:39:16.8888984</datestamp><bib-version>v2</bib-version><id>11776</id><entry>2012-06-25</entry><title>Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language</title><swanseaauthors><author><sid>896a6aacfd217fb099481697a43bfe80</sid><ORCID>0000-0003-3928-4701</ORCID><firstname>Rhys</firstname><surname>Jones</surname><name>Rhys Jones</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2012-06-25</date><deptcode>AMED</deptcode><abstract>We describe a procedure for generating pronunciations for a place-name dictionary for Welsh. For many such less-resourced languages, phoneticians with expertise in the language do not exist, so direct editing of pronunciations cannot occur. We describe a method where, if a native speaker and a text-to-speech synthesiser is available, pronunciations can be edited without detailed knowledge of the language's phonology. Our key development is the use of 're-spelling' to indicate, in a linguistically-na&#xEF;ve manner, the required pronunciation. These drive a set of specially-adapted letter-to-sound rules, thus generating the desired pronunciation. A speech synthesiser gives audio feedback to the native speaker and verifies that the pronunciation is correct, and a graphical user interface links the database backend, the speech synthesiser and the native speaker editor. This method has been used to generate a pronunciation database for Welsh placenames.</abstract><type>Conference Paper/Proceeding/Abstract</type><journal>Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)</journal><publisher>European Language Resources Association (ELRA)</publisher><placeOfPublication>Marrakech, Morocco</placeOfPublication><issnPrint/><issnElectronic/><keywords>Lexicon, lexical database, Speech synthesis, Text-to-speech systems, Tools, systems, applications</keywords><publishedDay>31</publishedDay><publishedMonth>5</publishedMonth><publishedYear>2008</publishedYear><publishedDate>2008-05-31</publishedDate><doi/><url>http://www.lrec-conf.org/proceedings/lrec2008/pdf/55_paper.pdf</url><notes/><college>COLLEGE NANME</college><department>Media</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>AMED</DepartmentCode><institution>Swansea University</institution><apcterm/><lastEdited>2014-03-20T10:39:16.8888984</lastEdited><Created>2012-06-25T13:12:27.0081945</Created><path><level id="1">Faculty of Humanities and Social Sciences</level><level id="2">School of Culture and Communication - Media, Communications, Journalism and PR</level></path><authors><author><firstname>Briony</firstname><surname>Williams</surname><order>1</order></author><author><firstname>Rhys</firstname><surname>Jones</surname><orcid>0000-0003-3928-4701</orcid><order>2</order></author></authors><documents/><OutputDurs/></rfc1807>
spelling 2014-03-20T10:39:16.8888984 v2 11776 2012-06-25 Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language 896a6aacfd217fb099481697a43bfe80 0000-0003-3928-4701 Rhys Jones Rhys Jones true false 2012-06-25 AMED We describe a procedure for generating pronunciations for a place-name dictionary for Welsh. For many such less-resourced languages, phoneticians with expertise in the language do not exist, so direct editing of pronunciations cannot occur. We describe a method where, if a native speaker and a text-to-speech synthesiser is available, pronunciations can be edited without detailed knowledge of the language's phonology. Our key development is the use of 're-spelling' to indicate, in a linguistically-naïve manner, the required pronunciation. These drive a set of specially-adapted letter-to-sound rules, thus generating the desired pronunciation. A speech synthesiser gives audio feedback to the native speaker and verifies that the pronunciation is correct, and a graphical user interface links the database backend, the speech synthesiser and the native speaker editor. This method has been used to generate a pronunciation database for Welsh placenames. Conference Paper/Proceeding/Abstract Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08) European Language Resources Association (ELRA) Marrakech, Morocco Lexicon, lexical database, Speech synthesis, Text-to-speech systems, Tools, systems, applications 31 5 2008 2008-05-31 http://www.lrec-conf.org/proceedings/lrec2008/pdf/55_paper.pdf COLLEGE NANME Media COLLEGE CODE AMED Swansea University 2014-03-20T10:39:16.8888984 2012-06-25T13:12:27.0081945 Faculty of Humanities and Social Sciences School of Culture and Communication - Media, Communications, Journalism and PR Briony Williams 1 Rhys Jones 0000-0003-3928-4701 2
title Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
spellingShingle Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
Rhys Jones
title_short Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
title_full Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
title_fullStr Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
title_full_unstemmed Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
title_sort Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
author_id_str_mv 896a6aacfd217fb099481697a43bfe80
author_id_fullname_str_mv 896a6aacfd217fb099481697a43bfe80_***_Rhys Jones
author Rhys Jones
author2 Briony Williams
Rhys Jones
format Conference Paper/Proceeding/Abstract
container_title Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
publishDate 2008
institution Swansea University
publisher European Language Resources Association (ELRA)
college_str Faculty of Humanities and Social Sciences
hierarchytype
hierarchy_top_id facultyofhumanitiesandsocialsciences
hierarchy_top_title Faculty of Humanities and Social Sciences
hierarchy_parent_id facultyofhumanitiesandsocialsciences
hierarchy_parent_title Faculty of Humanities and Social Sciences
department_str School of Culture and Communication - Media, Communications, Journalism and PR{{{_:::_}}}Faculty of Humanities and Social Sciences{{{_:::_}}}School of Culture and Communication - Media, Communications, Journalism and PR
url http://www.lrec-conf.org/proceedings/lrec2008/pdf/55_paper.pdf
document_store_str 0
active_str 0
description We describe a procedure for generating pronunciations for a place-name dictionary for Welsh. For many such less-resourced languages, phoneticians with expertise in the language do not exist, so direct editing of pronunciations cannot occur. We describe a method where, if a native speaker and a text-to-speech synthesiser is available, pronunciations can be edited without detailed knowledge of the language's phonology. Our key development is the use of 're-spelling' to indicate, in a linguistically-naïve manner, the required pronunciation. These drive a set of specially-adapted letter-to-sound rules, thus generating the desired pronunciation. A speech synthesiser gives audio feedback to the native speaker and verifies that the pronunciation is correct, and a graphical user interface links the database backend, the speech synthesiser and the native speaker editor. This method has been used to generate a pronunciation database for Welsh placenames.
published_date 2008-05-31T03:13:38Z
_version_ 1763750150285557760
score 10.988012