ASR Error Detection via Audio-Transcript entailment

Meripo, Nimshi Venkat; Konam, Sandeep

Computer Science > Computation and Language

arXiv:2207.10849 (cs)

[Submitted on 22 Jul 2022]

Title:ASR Error Detection via Audio-Transcript entailment

Authors:Nimshi Venkat Meripo, Sandeep Konam

View PDF

Abstract:Despite improved performances of the latest Automatic Speech Recognition (ASR) systems, transcription errors are still unavoidable. These errors can have a considerable impact in critical domains such as healthcare, when used to help with clinical documentation. Therefore, detecting ASR errors is a critical first step in preventing further error propagation to downstream applications. To this end, we propose a novel end-to-end approach for ASR error detection using audio-transcript entailment. To the best of our knowledge, we are the first to frame this problem as an end-to-end entailment task between the audio segment and its corresponding transcript segment. Our intuition is that there should be a bidirectional entailment between audio and transcript when there is no recognition error and vice versa. The proposed model utilizes an acoustic encoder and a linguistic encoder to model the speech and transcript respectively. The encoded representations of both modalities are fused to predict the entailment. Since doctor-patient conversations are used in our experiments, a particular emphasis is placed on medical terms. Our proposed model achieves classification error rates (CER) of 26.2% on all transcription errors and 23% on medical errors specifically, leading to improvements upon a strong baseline by 12% and 15.4%, respectively.

Comments:	Accepted to Interspeech 2022
Subjects:	Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
Cite as:	arXiv:2207.10849 [cs.CL]
	(or arXiv:2207.10849v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2207.10849

Submission history

From: Nimshi Venkat Meripo [view email]
[v1] Fri, 22 Jul 2022 02:47:15 UTC (438 KB)

Computer Science > Computation and Language

Title:ASR Error Detection via Audio-Transcript entailment

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ASR Error Detection via Audio-Transcript entailment

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators