Understanding Audio Signals for Machine Learning

preview_player
Показать описание
Learn about audio digital signals. I explain the difference between analog and digital signals, and how to convert an analog sound into a digital format that can then be processed for machine learning. I also delve deeper into Audio to Digital Conversion concepts such as sampling, quantization, and aliasing.

Slides:

Join The Sound Of AI Slack community:

Interested in hiring me as a consultant/freelancer?

Follow Valerio on Facebook:

Connect with Valerio on Linkedin:

Follow Valerio on Twitter:
Рекомендации по теме
Комментарии
Автор

I honoustly never take the time to comment or compliment on youtube videos. You, my man, are simply amazing and I truly enjoy listening to you. Going all the way till your last video =)

didismit
Автор

I was just curious about sound processing and found your lecture series. After I started watching, I binge watched the whole series! Absolute piece of art! PS-I started watching with an absolute zero knowledge about the subject.

hydraulicgames
Автор

bro your content is so helpful. very concise and straight to the point.

theaihacker
Автор

i came here for a language classification research, but now im amazed with the music thing

jancooqhedon
Автор

Loves the understanding, clarity in content & excellent examples through applications! love it

adityajindal
Автор

Hey Valerio,
This is just amazing content, i like the depth, the way you explain in so simple terms, you satisfied my curiosity for this whole topic.

mudassirkhan
Автор

Great stuff Valerio, this is amazing content - very educational. When you cover the audio features, can you also cover in depth MFCC's, and how they are typically used? I have yet to seen a good treatment of MFCCs and get an intuitive feel for how they work.

hersheyscoco
Автор

you are awesome! the best ML tutorial for audio signals

joeljoseph
Автор

Thank you so much for creating these video, I am really enjoying them! Always worked with computers, music and sound when I was young, and still am. Have all the basic knowledge of prorgamming, ml and music. But this is so much more depth, didnt knew I like this stuff. Thank you for creating a new passion for me. Ai with music❤️❤️❤️

chriskingston
Автор

At 15:45, the picture that appears seems to have an error. The numbers of amplitude scale are out of order in binary notation.

avidreader
Автор

@9:13 of the video, is it above or below the nyquist frequency?

ramportland
Автор

Thanks a lot for this, It's helping through a project I'm working on. I'm really grateful

fredrickpwol
Автор

Thanks for your wonderful job, beautifully done!

rangiding
Автор

Thank you for this amazing walkthrough; this is going to help me SO much with ML. Also, question for this section 17:37, do you know why we have to divide the bit depth and resolution sampling rate by 1048 window? In other words, why do we divide by 1, 048, 576 and then again by 8 bytes? Is there some resource on why this is default? (I'm assuming this has to do with the way computers work.)

Drew_
Автор

When you resample in Audacity, you are not hearing aliasing. Audacity used a LP filter (as any good downsampler should) to avoid aliasing. What you're hearing in the high frequencies being filtered out

mattdistad
Автор

Thank you for good works. I hava a question. (16x44100x60) / (8x1024x1024) = 5.046844
Why 5.49MB?

heecheolcho
Автор

I’m not sure your aliasing demo actually has aliasing. Usually you would hear nasty artifacts when downsampling so much without an antialiasing filter. Audacity is likely applying an antialiasing filter to reduce the bandwidth of the signal before downsampling it.

phosphoricx
Автор

Thank you very much. It is a very educational video. ( But in the audio there are some short bass bursts. )

AhmetAksoy
Автор

Valerio, thank you for this amazing work. You are helping me a lot, I am studying audio and you are answering all my questions. Do you have any book recommendation for me?

JogosEtudoMais
Автор

I got a liitle confused, Does aliasing mean we can hear frequencys higher than our hearing rang after digitalization of signal?

mohamadqodosi