Reranked aligners for interactive transcript correction - Aix-Marseille Université Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Reranked aligners for interactive transcript correction

Résumé

Clarification dialogs can help address ASR errors in speech-to-speech translation systems and other interactive applications. We propose to use variants of Levenshtein alignment for merging an errorful utterance with a targeted rephrase of an error segment. ASR errors that might harm the alignment are addressed through phonetic matching, and a word embedding distance is used to account for the use of synonyms outside targeted segments. These features lead to a relative improvement of 30% of word error rate on ASR output compared to not performing the clarification. Twice as many utterance are completely corrected compared to using basic word alignment. Furthermore, we generate a set of potential merges and train a neural network on crowd-sourced rephrases in order to select the best merger, leading to 24% more instances completely corrected. The system is deployed in the framework of the BOLT project.
Fichier non déposé

Dates et versions

hal-01194237 , version 1 (05-09-2015)

Identifiants

  • HAL Id : hal-01194237 , version 1

Citer

Benoit Favre, Mickael Rouvier, Frédéric Béchet. Reranked aligners for interactive transcript correction. ICASSP2014 - Speech and Language Processing (ICASSP2014 - SLTC), 2014, Florence, Italy. ⟨hal-01194237⟩
133 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More