Retrieving the syntactic structure of erroneous ASR transcriptions for open-domain Spoken Language Understanding

Abstract : Retrieving the syntactic structure of erroneous ASR transcriptions can be of great interest for open-domain Spoken Language Understanding tasks in order to correct or at least reduce the impact of ASR errors on final applications. Most of the previous works on ASR and syntactic parsing have addressed this problem by using syntactic features during ASR to help reducing Word Error Rate (WER). The improvement obtained is rather small however the structure and the relations between words obtained through parsing can be of great interest for the SLU processes, even without a significant decrease of WER. That is why we adopt another point of view in this paper: considering that ASR transcriptions contain inevitably some errors, we show in this study that it is possible to improve the syntactic analysis of these erroneous transcriptions by performing a joint error detection / syntactic parsing process. The applicative framework used in this study is a speech-to-speech system developed through the DARPA BOLT project.
Document type :
Conference papers
Complete list of metadatas

https://hal-amu.archives-ouvertes.fr/hal-01194236
Contributor : Benoit Favre <>
Submitted on : Saturday, September 5, 2015 - 11:12:36 AM
Last modification on : Wednesday, April 3, 2019 - 1:23:02 AM

Identifiers

  • HAL Id : hal-01194236, version 1

Citation

Frédéric Béchet, Benoit Favre, Alexis Nasr, Mathieu Morey. Retrieving the syntactic structure of erroneous ASR transcriptions for open-domain Spoken Language Understanding. ICASSP2014 - Speech and Language Processing (ICASSP2014 - SLTC), 2014, Florence, Italy. ⟨hal-01194236⟩

Share

Metrics

Record views

260