Retrieving the syntactic structure of erroneous ASR transcriptions for open-domain Spoken Language Understanding

Retrieving the syntactic structure of erroneous ASR transcriptions can be of great interest for open-domain Spoken Language Understanding tasks in order to correct or at least reduce the impact of ASR errors on final applications. Most of the previous works on ASR and syntactic parsing have addressed this problem by using syntactic features during ASR to help reducing Word Error Rate (WER). The improvement obtained is rather small however the structure and the relations between words obtained through parsing can be of great interest for the SLU processes, even without a significant decrease of WER. That is why we adopt another point of view in this paper: considering that ASR transcriptions contain inevitably some errors, we show in this study that it is possible to improve the syntactic analysis of these erroneous transcriptions by performing a joint error detection / syntactic parsing process. The applicative framework used in this study is a speech-to-speech system developed through the DARPA BOLT project.

Mots clés

Automatic Speech Recognition Spoken Language Understanding Dependency Parsing Confidence Measures

Domaines

Informatique et langage [cs.CL]

Benoit Favre : Connectez-vous pour contacter le contributeur

https://amu.hal.science/hal-01194236

Soumis le : samedi 5 septembre 2015-11:12:36

Dernière modification le : vendredi 22 mars 2024-18:24:04

Dates et versions

hal-01194236 , version 1 (05-09-2015)

Identifiants

HAL Id : hal-01194236 , version 1

Citer

Frédéric Béchet, Benoit Favre, Alexis Nasr, Mathieu Morey. Retrieving the syntactic structure of erroneous ASR transcriptions for open-domain Spoken Language Understanding. ICASSP2014 - Speech and Language Processing (ICASSP2014 - SLTC), 2014, Florence, Italy. ⟨hal-01194236⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLN LIF CNRS INRIA UNIV-AMU EC-MARSEILLE UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD LIS-LAB

198 Consultations

0 Téléchargements