SANE2019 | Gabriel Synnaeve - wav2letter and the Many Meanings of End-to-End ASR

preview_player
Показать описание
Gabriel Synnaeve, research scientist on the Facebook AI Research (FAIR) team, presents his work on end-to-end automatic speech recognition at Columbia University, New York, NY, October 24, 2019.

Abstract: What does it mean for an automatic speech recognition (ASR)system to be end-to-end? Why do we care if it is end-to-end or not? We will present different facets of making a speech recognition system end-to-end, from starting from the waveform instead of speech features, to outputting words directly, differentiating through the decoder, and decoding with or without explicit language models.
Рекомендации по теме