Multitask Training with Text Data for End-to-End Speech Recognition - (3 minutes introduction)

Показать описание

Title: Multitask Training with Text Data for End-to-End Speech Recognition - (3 minutes introduction)

Authors: Peidong Wang (Google, USA), Tara N. Sainath (Google, USA), Ron J. Weiss (Google, USA)

Category: Neural network training methods for ASR

Abstract: We propose a multitask training method for attention-based end-to-end speech recognition models. We regularize the decoder in a listen, attend, and spell model by multitask training it on both audio-text and text-only data. Trained on the 100-hour subset of LibriSpeech, the proposed method, without requiring an additional language model, leads to an 11% relative performance improvement over the baseline and approaches the performance of language model shallow fusion on the test-clean evaluation set. We observe a similar trend on the whole 960-hour LibriSpeech training set. Analyses of different types of errors and sample output sentences demonstrate that the proposed method can incorporate language level information, suggesting its effectiveness in real-world applications.

d02s28t08trim

INTERSPEECH2021

Рекомендации по теме

Комментарии

Can you share the code with me if you have done it in matlab?

elonmuskfan

Multitask Training with Text Data for End-to-End Speech Recognition - (3 minutes introduction)

Multitask Training with Text Data for End-to-End Speech Recognition - (3 minutes introduction)

Multi-Task Learning | Explained in 5 Minutes

MultiTask Learning with NLP

T0: Multitask Prompted Training Enables Zero-Shot Task Generalization | Paper Explained

GPT-2: Language Models are Unsupervised Multitask Learners

Part 11: simple hierarchical multitask neural entity linking for biomedical text

Multitask Prompted Training Enables Zero-Shot Task Generalization

Richard Socher - The Natural Language Decathlon: Multitask Learning as Question Answering

Multiclass Classification vs Multilabel Classification vs Multitask Learning

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Andrej Karpathy: Tesla Autopilot and Multi-Task Learning for Perception and Prediction

CMU Neural Nets for NLP 2020 (20): Multitask and Multilingual Learning

Zero Shot Text Classification - AI Guild Series

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Personalized Multi-task Learning for Predicting Tomorrow's Mood, Stress, and Health

Crosslingual Generalization through Multitask Finetuning (BLOOMZ & mT0)

What is Prompt Tuning?

Humans and multitasking - How much can we do simultaneously? | DW Documentary

Multitask Prompted Training Enables Zero-shot Task Generalization (Explained)

Neural networks [10.10] : Natural language processing - multitask learning

MLT __init__ Session #13: Multitask Prompted Training Enables Zero-Shot Task Generalization

Gradient Surgery for Multi-Task Learning

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Nicole Carlson, Michael Sugimura: Building one multitask model to rule them all | PyData Global 2020

MLT init Session #13: Multitask Prompted Training Enables Zero-Shot Task Generalization