Applying Comparable Corpora to Machine Translation

Name: Applying Comparable Corpora to Machine Translation
Author: Krzysztof Wolk, Agnieszka Wolk
ISBN: 9783659762864

Publisher LAP LAMBERT Academic Publishing

Book Details

Author(s): Krzysztof Wolk, Agnieszka Wolk

Publisher: LAP LAMBERT Academic Publishing

ISBN / ASIN: 3659762865

ISBN-13: 9783659762864


Description

This study investigates how to improve statistical machine translation of speech between Polish and English. While excellent translation systems exist for many popular languages, the development of such systems for the Polish-English pair has been neglected. The most popular methodologies are not well suited to Polish and require adaptation, and Polish lacks sufficient parallel and monolingual data. The main objective was therefore to develop an automatic and robust Polish-to-English translation system that meets specific translation requirements, and to build bilingual textual resources by mining comparable corpora. Experiments were conducted mostly on casual human speech, drawn from lectures, movie subtitles, European Parliament proceedings, and European Medicines Agency texts. The aims were to analyze the various problems rigorously and to improve the quality of baseline systems by adapting techniques and training parameters to maximize the Bilingual Evaluation Understudy (BLEU) score.