

Design, compilation and applications of an English-Polish-Belarusian Parallel Literary Corpus / ANGELIKA PELJAK-LAPINSKA

Swansea University Author: ANGELIKA PELJAK-LAPINSKA

  • Peljak-Lapinska_Angelika_PhD_Thesis_Final_Redacted_Signature.pdf

    PDF | E-Thesis – open access

    Copyright: The author, Angelika Peljak-Łapińska, 2021.


DOI (Published version): 10.23889/SUthesis.62708


Published: Swansea 2021
Institution: Swansea University
Degree level: Doctoral
Degree name: Ph.D
Supervisor: Cheesman, Tom ; Rothwell, Andrew
URI: https://cronfa.swan.ac.uk/Record/cronfa62708
Abstract: The main goal of the project is to create the English-Polish-Belarusian Literary Parallel Corpus (EPB corpus) and present its applications in several linguistic disciplines, including translation studies and discourse analysis. The thesis provides an outline of corpus linguistics research and corpus linguistics as a methodology, then addresses the problem of the differences in the development of corpus linguistics in the three languages: English (as a lingua franca), Polish (a statutory national language) and Belarusian (a minority language). The analysis of available tools and resources for each of these languages proves the need for the EPB corpus in order to develop useful new resources for Belarusian in particular. A substantial part of the thesis presents the documentation of the process of creating the corpus. Various aspects of corpus design, text collection and text encoding are discussed in the context of the availability and usability of numerous tools. Special attention is paid to the tools specifically designed for each language and to the solutions that enable the data processed by these tools to be merged. Using corpus linguistics techniques (e.g. linguistic distribution, lexical density, vector-based semantic similarity measures) the thesis goes on to explore the application of the EPB corpus in investigating translation universals, in exploring the dependency between the author’s and the translator’s style, in supporting translation students and professionals, and in analysis of gender discourse. These case studies clearly show the practical value of the resource. Finally, the thesis provides a detailed overview of the plans and possibilities for further development of the project in the broader context of the evolution of Polish and Belarusian corpus linguistics.
Item Description: ORCiD identifier: https://orcid.org/0000-0001-6102-1815
Keywords: Polish language, Belarusian language, corpus linguistics, translation studies, English-Polish translation, English-Belarusian translation
College: Faculty of Humanities and Social Sciences