3 - Audio Feature Extraction using Python

preview_player
Показать описание
In this video, we focus on audio feature extraction in the frequency domain.
The code shown in the video can be found at my Github page:

Helpful Resources to get more technical depth for some of the terms mentioned in the video/code are referenced throughout the jupyter notebook at the link above. Some of those are also mentioned below:

What is a Spectrogram?

What is windowing?

What is Short Time Fourier Transform (STFT)?

What are MelSpectrograms?
Рекомендации по теме
Комментарии
Автор

this is a really helpful video for someone who just starts trying to do signal processing and classifying, thank you for your effort and it really helps me understand better on spectrogram and signal processing!

fuchunhsieh
Автор

Thank you for the video. Very excited for these video series. One of your videos, yolo to coco conversation was very helpful.

soumyadrip
Автор

this was really helpful! thank you very very much!!

lindaflow
Автор

I want to analyze a frequency signal with a fairly large bandwidth. Will this method suit my task?

demkut
Автор

Hello, we are working on an assignment related to gender recognition from voice. However, we want to extract values such as "mean frequency, standaty deviation, spectral flattnes" from a person's voice using the data you use. How can we achieve this?

helios.
Автор

Hello Prabhjot, this is indeed an amazing work. Thank you for taking your time to share knowledge to the world. Could you please guide me on how to save batches of spectrograms? I have created TensorFlow dataset of audio files and pass them through a data pipeline inline with the kind of decoding in accordance to my work. I want to plot and save each spectrogram from the dataset generated. Thank you in anticipation of your kind response. Cheers!

jamilamuhammad
Автор

if you want to increase the resolution on the x axis you can increase the sr. But how do you increase the resolution of the frequency on the y axis?

SvSzYT
Автор

hi. can I have the slide presentation?really nice presentation
in 5:47 you mention spectral leakage, what is exactly?

devisthesis
Автор

Hi Prabhjot Gosal, thank you for your hot video which turned out to be very interesting!
a practical case: if I have to change the bpm of a song to make them constant for its entire duration (avoid drifting tempo) How tight is my library?

drjfilix
Автор

hello, its really helpful but can you please tell me how should i run the code and where? (ik silly question but im new to it)

Bekeyurious
Автор

Hi Prabhjot, can you make a video about LPC algorithm in Feature Extraction please?

KhanhNguyen-dnbm
Автор

Hi,

Good Day. Can we use this audio feature extraction to compare two voice of a same speaker in terms of authentication? Will saving the log mel output as logMel.out and compare the same speaker voice in different time as logMel2.out and compare both these output to authenticate ? Is that possible and result in a good way for this use case?

Regards,
Simhan

simhan
Автор

is it possible to share a link to the 'h_1.wav' file used in your youtube demo please 🙂

JeffT-nu
Автор

Hi
Very interesting
I have a query
How can we find audio abnormalities like
Missing samples specific duration
And glitches in between audio file
Could help me
Thank you adavance

durgaganesh
Автор

Hi Prabhjot, I want to compare audio of person A with audio of person B and get a match percentage. Can you guide how to achieve this?
Just pointing me in the right direction will be great help.

juicetin
Автор

What a waste! You showed literally nothing! Well, you showed you have no clue what you’re doing. Ugh.

kenturkey