3 - Audio Feature Extraction using Python

Показать описание

In this video, we focus on audio feature extraction in the frequency domain.
The code shown in the video can be found at my Github page:

Helpful Resources to get more technical depth for some of the terms mentioned in the video/code are referenced throughout the jupyter notebook at the link above. Some of those are also mentioned below:

What is a Spectrogram?

What is windowing?

What is Short Time Fourier Transform (STFT)?

What are MelSpectrograms?

Рекомендации по теме

Комментарии

this is a really helpful video for someone who just starts trying to do signal processing and classifying, thank you for your effort and it really helps me understand better on spectrogram and signal processing!

fuchunhsieh

Thank you for the video. Very excited for these video series. One of your videos, yolo to coco conversation was very helpful.

soumyadrip

this was really helpful! thank you very very much!!

lindaflow

I want to analyze a frequency signal with a fairly large bandwidth. Will this method suit my task?

demkut

Hello, we are working on an assignment related to gender recognition from voice. However, we want to extract values such as "mean frequency, standaty deviation, spectral flattnes" from a person's voice using the data you use. How can we achieve this?

helios.

Hello Prabhjot, this is indeed an amazing work. Thank you for taking your time to share knowledge to the world. Could you please guide me on how to save batches of spectrograms? I have created TensorFlow dataset of audio files and pass them through a data pipeline inline with the kind of decoding in accordance to my work. I want to plot and save each spectrogram from the dataset generated. Thank you in anticipation of your kind response. Cheers!

jamilamuhammad

if you want to increase the resolution on the x axis you can increase the sr. But how do you increase the resolution of the frequency on the y axis?

SvSzYT

hi. can I have the slide presentation?really nice presentation
in 5:47 you mention spectral leakage, what is exactly?

devisthesis

Hi Prabhjot Gosal, thank you for your hot video which turned out to be very interesting!
a practical case: if I have to change the bpm of a song to make them constant for its entire duration (avoid drifting tempo) How tight is my library?

drjfilix

hello, its really helpful but can you please tell me how should i run the code and where? (ik silly question but im new to it)

Bekeyurious

Hi Prabhjot, can you make a video about LPC algorithm in Feature Extraction please?

KhanhNguyen-dnbm

Hi,

Good Day. Can we use this audio feature extraction to compare two voice of a same speaker in terms of authentication? Will saving the log mel output as logMel.out and compare the same speaker voice in different time as logMel2.out and compare both these output to authenticate ? Is that possible and result in a good way for this use case?

Regards,
Simhan

simhan

is it possible to share a link to the 'h_1.wav' file used in your youtube demo please 🙂

JeffT-nu

Hi
Very interesting
I have a query
How can we find audio abnormalities like
Missing samples specific duration
And glitches in between audio file
Could help me
Thank you adavance

durgaganesh

Hi Prabhjot, I want to compare audio of person A with audio of person B and get a match percentage. Can you guide how to achieve this?
Just pointing me in the right direction will be great help.

juicetin

What a waste! You showed literally nothing! Well, you showed you have no clue what you’re doing. Ugh.

kenturkey

3 - Audio Feature Extraction using Python

3 - Audio Feature Extraction using Python

Introduction to Embedded Machine Learning 3.2.1 - Audio Feature Extraction

Extract Features from Audio File | MFCC | Python

How to Extract Audio Features

Features Extraction in Images, Text, and Audio Data

Audio processing in Python with Feature Extraction for machine learning

Types of Audio Features for Machine Learning

Mel Frequency Cepstral Coefficients (MFCC) Explained

MFCC features to Audio. Will it work?

Mel Spectrograms with Python and Librosa | Audio Feature Extraction

55 - Feature Extraction Introduction

MFCC Feature Extraction using MATLAB

Audio Data Processing in Python

Build a Deep Audio Classifier with Python and Tensorflow

Extracting Mel-Frequency Cepstral Coefficients with Python

Mel-Spectrogram and MFCCs | Lecture 72 (Part 1) | Applied Deep Learning

Hidden Features of Audio Data| Audio Extraction using Python - P1| Coding Using Python| Data Science

UVIc MIR Course - Audio Feature Extraction

Feature Extraction of Audio Signal in Time Domain

Speech features intro 3: Mel-scale spectrogram

64 Hidden Features of Audio Data | Audio Data Extraction using Python | Data Science |

Exploring MFCC Feature Extraction: A Comprehensive Guide | MFCC Tutorials Part 1

Extract Musical Notes from Audio in Python with FFT

Hidden Features of Audio Data | Extraction using Python - Part 2 | Data Science Using Python |