Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment
Abstract
We propose a method for joint unsupervised discovery of multiword expressions (MWEs) and their translations from parallel corpora. First, we apply independent monolingual MWE extraction in source and target languages simultaneously. Then, we calculate translation probability, association score and distributional similarity of co-occurring pairs. Finally, we rank all translations of a given MWE using a linear combination of these features. Preliminary experiments on light verb constructions show promising results.
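The final ranking step can be illustrated with a minimal sketch. It assumes the three features have already been computed and scaled to a comparable range; the abstract does not give the weights, the feature definitions, or any data, so the `Candidate` class, the function names, the weights and the feature values below are all hypothetical and serve only to show what ranking by a linear combination looks like.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class Candidate:
    """A candidate translation of one source MWE (hypothetical structure)."""
    target: str
    translation_prob: float  # assumed to lie in [0, 1]
    association: float       # assumed to be rescaled to [0, 1]
    similarity: float        # assumed to be rescaled to [0, 1]


def combined_score(c: Candidate, weights: Tuple[float, float, float]) -> float:
    """Linear combination of the three features named in the abstract."""
    w_trans, w_assoc, w_sim = weights
    return (w_trans * c.translation_prob
            + w_assoc * c.association
            + w_sim * c.similarity)


def rank_translations(
    candidates: Iterable[Candidate],
    weights: Tuple[float, float, float] = (0.5, 0.25, 0.25),  # illustrative only
) -> List[Candidate]:
    """Return candidates sorted from highest to lowest combined score."""
    return sorted(candidates, key=lambda c: combined_score(c, weights), reverse=True)


# Made-up feature values for the English light verb construction "take a decision".
candidates = [
    Candidate("faire un choix", 0.1, 0.3, 0.4),
    Candidate("prendre une décision", 0.6, 0.8, 0.7),
    Candidate("décider", 0.3, 0.4, 0.6),
]

for c in rank_translations(candidates):
    print(c.target)
```

In this sketch the weights are fixed by hand. How the combination is actually weighted or tuned is not stated in the abstract.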
Domains
Computation and Language [cs.CL]
Origin: Publisher files allowed on an open archive