Whisper Paper Explained: Robust Speech Recognition via Large-Scale Weak Supervision

Показать описание

❤️ Support the channel ❤️

Paid Courses I recommend for learning (affiliate links, no extra cost for you):

✨ Free Resources that are great:

💻 My Deep Learning Setup and Recording Setup:

GitHub Repository:

✅ One-Time Donations:

▶️ You Can Connect with me on:

Timestamps:
0:00 - Introduction
1:27 - Abstract
2:30 - Introduction
8:02 - Dataset collection and processing
10:29 - Model approach
15:25 - Figure of model
20:25 - Experiments and Evaluation
26:17 - Long form transcription, messy :/
28:36 - Model and Dataset scaling
31:25 - Long form transcription (cont), messy :/
32:32 - Ending

Aladdin Persson

Рекомендации по теме

Комментарии

A fine-tuning example would be awesome! btw, I extracted the audio from this YouTube video and ran it through Whisper, it's not bad. Unfortunately I got back one giant blob of text. We need to use another LLM/transformer to rewrite the output into proper paragraphs, and also to summarize and remove extraneous content: "What is going on guys? Welcome back to another video. In this one, we're taking a " :)

vishalgoklani

I have few question and ways to use whisper is there any community which talk about it

aniketsingh

I can reach the point where Whisper transcribes a YouTube vid, but I'm having trouble with the speaker identification or diarisation part. Anyone got a link to a great tutorial?

champagnebulge

New to your channel. I like that you go into the detail of the papers. Really looking forward to the fine tuning video next. Subscribed. 😊

LoneRanger.

Can you do a code and paper walk thru for continuous or binary hopfield pattern storage and recall

gtgs

You are one of the OGs. Thanks Aladdin.

daniellewis

Watching your Unet video. This one will be next in line. So much to learn in AI. I keep getting confused. How do you make a mind map of everything and organize all the info? Should explain in a video.

normalhuman

Whisper Paper Explained: Robust Speech Recognition via Large-Scale Weak Supervision

Whisper Paper Explained: Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper: Robust Speech Recognition via Large-Scale Weak Supervision | Paper and Code

OpenAI Whisper: Robust Speech Recognition via Large Scale Weak Supervision

Whisper: Robust Speech Recognition [It-Jim Paper Review]

NLP Deep Dive, Paper Reading: Robust Speech Recognition via Large-Scale Weak Supervision (Whisper)

[Olewave's Review] OpenAI's Whisper ASR: Robust Speech Recognition via Large-Scale Weak Su...

OpenAI's Whisper Model Explained

OpenAI’s Whisper Learned 680,000 Hours Of Speech!

SeamlessM4T: Andrew Ng, OpenAI Multimodal Whisper - AI Paper Explained

OpenAI's Whisper model - Explanation and demo

Conformer-1: a new large scale/robust speech recognition model

Introduction to Robust Speech Challenge

OpenAI's Whisper Model Explained: What is it and what can it do?

OpenAI Releases 1.6 Billion Parameter Multilingual Speech Recognition AI Whisper

[10 mins] Explain Why OpenAI's Whisper API Isn't As Good As ChatGPT

Open AI’s Whisper is Amazing!

OpenAI Whisper: Convert Speech To Text | OpenAI Whisper Explained in 8 Minutes | Simplilearn

Understanding Speech Recognition using OpenAI's Whisper Model

OpenAI Whisper Demo: Convert Speech to Text in Python

[ML News] OpenAI's Whisper | Meta Reads Brain Waves | AI Wins Art Fair, Annoys Humans

Speech Recognition with OpenAI Whisper

#OpenAI Releases #Whisper - An Automatic Speech Recognition System (ASR)

WHISPER OPEN AI HACKATHON Summary

SDS 620: OpenAI Whisper: General-Purpose Speech Recognition — with @JonKrohnLearns