No Cover Image

Book chapter 1433 views

A Computational Lexicography approach to phraseologisms

Cornelia Tschichold Orcid Logo

Phraseology: An interdisciplinary perspective, Pages: 361 - 376

Swansea University Author: Cornelia Tschichold Orcid Logo

Abstract

The cycle of lexicographic and linguistic work involved in compiling a computationalphraseological database is divided into three phases and described inrelation to the specific challenges multi-word expressions (MWEs) pose for alexical database. Data collection is a process that is far from complet...

Full description

Published in: Phraseology: An interdisciplinary perspective
Published: John Benjamins 2008
URI: https://cronfa.swan.ac.uk/Record/cronfa11545
Tags: Add Tag
No Tags, Be the first to tag this record!
first_indexed 2016-02-04T01:30:04Z
last_indexed 2018-02-09T04:41:15Z
id cronfa11545
recordtype SURis
fullrecord <?xml version="1.0"?><rfc1807><datestamp>2016-02-03T16:24:05.3654646</datestamp><bib-version>v2</bib-version><id>11545</id><entry>2012-06-14</entry><title>A Computational Lexicography approach to phraseologisms</title><swanseaauthors><author><sid>7ab58ba7c36c98911ed94a11fc7e5cb2</sid><ORCID>0000-0001-8487-2209</ORCID><firstname>Cornelia</firstname><surname>Tschichold</surname><name>Cornelia Tschichold</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2012-06-14</date><deptcode>APLI</deptcode><abstract>The cycle of lexicographic and linguistic work involved in compiling a computationalphraseological database is divided into three phases and described inrelation to the specific challenges multi-word expressions (MWEs) pose for alexical database. Data collection is a process that is far from complete for theMWEs found in English, with the variability of some phrases making identificationof all occurrences in large corpora a major challenge. Formalization of theform and variability ofMWEs is an interrelated process which can improve toolsfor data collection and other applications. Increased use of the phraseologicallexical database in NLP applications can ultimately lead to further insights intothe nature of MWEs and to improvements in the database. Due to the volumeof lexicographic data on MWEs that still needs to be collected, analysed and formalized,and the cyclical nature of the work, the resulting lexical database shouldbe reusable in as many applications as possible. WordManager-PhraseManager,the lexical resource described in the second part of the chapter, can capture thevariability ofMWEs in a way that allows for maximum reusability of lexical data.</abstract><type>Book chapter</type><journal>Phraseology: An interdisciplinary perspective</journal><paginationStart>361</paginationStart><paginationEnd>376</paginationEnd><publisher>John Benjamins</publisher><issnPrint/><issnElectronic/><keywords>computational lexicography, phraseology, reusable lexical database</keywords><publishedDay>31</publishedDay><publishedMonth>12</publishedMonth><publishedYear>2008</publishedYear><publishedDate>2008-12-31</publishedDate><doi/><url/><notes></notes><college>COLLEGE NANME</college><department>Applied Linguistics</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>APLI</DepartmentCode><institution>Swansea University</institution><apcterm/><lastEdited>2016-02-03T16:24:05.3654646</lastEdited><Created>2012-06-14T15:38:37.2046153</Created><path><level id="1">Faculty of Humanities and Social Sciences</level><level id="2">School of Culture and Communication - English Language, Tesol, Applied Linguistics</level></path><authors><author><firstname>Cornelia</firstname><surname>Tschichold</surname><orcid>0000-0001-8487-2209</orcid><order>1</order></author></authors><documents/><OutputDurs/></rfc1807>
spelling 2016-02-03T16:24:05.3654646 v2 11545 2012-06-14 A Computational Lexicography approach to phraseologisms 7ab58ba7c36c98911ed94a11fc7e5cb2 0000-0001-8487-2209 Cornelia Tschichold Cornelia Tschichold true false 2012-06-14 APLI The cycle of lexicographic and linguistic work involved in compiling a computationalphraseological database is divided into three phases and described inrelation to the specific challenges multi-word expressions (MWEs) pose for alexical database. Data collection is a process that is far from complete for theMWEs found in English, with the variability of some phrases making identificationof all occurrences in large corpora a major challenge. Formalization of theform and variability ofMWEs is an interrelated process which can improve toolsfor data collection and other applications. Increased use of the phraseologicallexical database in NLP applications can ultimately lead to further insights intothe nature of MWEs and to improvements in the database. Due to the volumeof lexicographic data on MWEs that still needs to be collected, analysed and formalized,and the cyclical nature of the work, the resulting lexical database shouldbe reusable in as many applications as possible. WordManager-PhraseManager,the lexical resource described in the second part of the chapter, can capture thevariability ofMWEs in a way that allows for maximum reusability of lexical data. Book chapter Phraseology: An interdisciplinary perspective 361 376 John Benjamins computational lexicography, phraseology, reusable lexical database 31 12 2008 2008-12-31 COLLEGE NANME Applied Linguistics COLLEGE CODE APLI Swansea University 2016-02-03T16:24:05.3654646 2012-06-14T15:38:37.2046153 Faculty of Humanities and Social Sciences School of Culture and Communication - English Language, Tesol, Applied Linguistics Cornelia Tschichold 0000-0001-8487-2209 1
title A Computational Lexicography approach to phraseologisms
spellingShingle A Computational Lexicography approach to phraseologisms
Cornelia Tschichold
title_short A Computational Lexicography approach to phraseologisms
title_full A Computational Lexicography approach to phraseologisms
title_fullStr A Computational Lexicography approach to phraseologisms
title_full_unstemmed A Computational Lexicography approach to phraseologisms
title_sort A Computational Lexicography approach to phraseologisms
author_id_str_mv 7ab58ba7c36c98911ed94a11fc7e5cb2
author_id_fullname_str_mv 7ab58ba7c36c98911ed94a11fc7e5cb2_***_Cornelia Tschichold
author Cornelia Tschichold
author2 Cornelia Tschichold
format Book chapter
container_title Phraseology: An interdisciplinary perspective
container_start_page 361
publishDate 2008
institution Swansea University
publisher John Benjamins
college_str Faculty of Humanities and Social Sciences
hierarchytype
hierarchy_top_id facultyofhumanitiesandsocialsciences
hierarchy_top_title Faculty of Humanities and Social Sciences
hierarchy_parent_id facultyofhumanitiesandsocialsciences
hierarchy_parent_title Faculty of Humanities and Social Sciences
department_str School of Culture and Communication - English Language, Tesol, Applied Linguistics{{{_:::_}}}Faculty of Humanities and Social Sciences{{{_:::_}}}School of Culture and Communication - English Language, Tesol, Applied Linguistics
document_store_str 0
active_str 0
description The cycle of lexicographic and linguistic work involved in compiling a computationalphraseological database is divided into three phases and described inrelation to the specific challenges multi-word expressions (MWEs) pose for alexical database. Data collection is a process that is far from complete for theMWEs found in English, with the variability of some phrases making identificationof all occurrences in large corpora a major challenge. Formalization of theform and variability ofMWEs is an interrelated process which can improve toolsfor data collection and other applications. Increased use of the phraseologicallexical database in NLP applications can ultimately lead to further insights intothe nature of MWEs and to improvements in the database. Due to the volumeof lexicographic data on MWEs that still needs to be collected, analysed and formalized,and the cyclical nature of the work, the resulting lexical database shouldbe reusable in as many applications as possible. WordManager-PhraseManager,the lexical resource described in the second part of the chapter, can capture thevariability ofMWEs in a way that allows for maximum reusability of lexical data.
published_date 2008-12-31T03:13:22Z
_version_ 1763750133485273088
score 11.035349