A speech recognition method with enhanced transformer decoder

Abstract: Addressing the issue that the Transformer decoder struggles to capture local features for monotonic alignment in speech recognition, and simultaneously incorporating language model cold fusion training into the decoder, an enhanced decoder-based speech recognition model is investigated. The...

Bibliographic Details
Main Authors:	Hengbo Hu, Tong Niu, Zhenhua He
Format:	Article
Language:	English
Published:	SpringerOpen 2025-02-01
Series:	EURASIP Journal on Audio, Speech, and Music Processing
Subjects:	Cross-attention; Transformer decoder; Language model cold fusion
Online Access:	https://doi.org/10.1186/s13636-025-00394-6

Similar Items