Recycling Translations: Extraction of Lexical Data from Parallel Corpora & Their Application in Natural Language Processing (Studia Linguistica Upsaliensia, 1)

Author: Jörg Tiedemann
ISBN: 9789155458157

Publisher: Uppsala Universitet

Book Details

ISBN / ASIN 9155458157

Description

This Ph.D. dissertation focuses on reusing translations for applications in natural language processing. It presents five parallel corpora of documents and their translations, containing over 35 million words and covering 60 languages. The thesis also proposes an innovative approach to word alignment that combines statistical and linguistic clues, which can be used to extract bilingual lexical data automatically. These data consist of words and phrases linked to their translations and can be applied in computational lexicography and machine translation. The thesis discusses four example applications.