SANE2019 | Gabriel Synnaeve - wav2letter and the Many Meanings of End-to-End ASR

preview_player

Показать описание

Gabriel Synnaeve, research scientist on the Facebook AI Research (FAIR) team, presents his work on end-to-end automatic speech recognition at Columbia University, New York, NY, October 24, 2019.

Abstract: What does it mean for an automatic speech recognition (ASR)system to be end-to-end? Why do we care if it is end-to-end or not? We will present different facets of making a speech recognition system end-to-end, from starting from the waveform instead of speech features, to outputting words directly, differentiating through the decoder, and decoding with or without explicit language models.

Speech and Audio in the Northeast (SANE)

Рекомендации по теме