Improved Word Alignments Using the Web as a Corpus (RANLP'2007)

Book cover
Scientific article: We propose a novel method for improving word alignments in a parallel sentence-aligned bilingual corpus based on the idea that if two words are translations of each other then so should be many words in their local contexts. The idea is formalised using the Web as a corpus, a glossary of known word translations (dynamically augmented from the Web using bootstrapping), the vector space model, linguistically motivated weighted minimum edit distance, competitive linking, and the IBM models. Evaluation results on a Bulgarian-Russian corpus show a sizable improvement both in word alignment and in translation quality.
add to favoritesadd

Users who have this book

Users who want this book

What readers are saying

What do you think? Write your own comment on this book!

write a comment

What do you think? Write your own comment on this book

Info about the book

Series:

Unknown

ISBN:

1118179803

Rating:

3/5 (3)

Your rating:

0/5

Languge:

English

Do you want to exchange books? It’s EASY!

Get registered and find other users who want to give their favourite books to good hands!