Python Tutorial : Introduction to audio data in Python

Показать описание

---

Hello and welcome to the course! My name is Daniel Bourke and I'll be your instructor. To get started, we're first going to see how speech and audio processing are different from other kinds of data processing.

Much like other data types, audio files come in many different formats, such as, mp3, wav, m4a, and flac. But each of these formats has a standard measure of frequency.

Frequency is measured in kilohertz but is also referred to as kHz or sampling rate. Much like how a movie shows 30 pictures per second which our brains register as moving pictures, the sampling rate of an audio file is a measure of the number of data chunks per second used to represent a digital sound.

With one kilohertz equaling one thousand pieces of information per second.

For example, a song you stream will usually have a 32 kHz sampling rate. This means 32,000 pieces of information per second. Speech and audiobooks are usually between 8 and 16 kHz. We'll look at some of these later.

And as you might've guessed, audio files are different from tabular or text data because you can't immediately see the data you're working with.

To get spoken language audio files into something we can see and manipulate, we first have to open the audio file with Python's built-in wave module.

We can get started with the wave module by running the command import wave.

Now, we have an audio file, good morning dot wav ready to go. It contains a person saying the words good morning.

To import it, we'll use wave's open method.

Now we've saved the good morning dot wav audio file to the variable good_morning in the format of a wave_object. However, in this state it's not very useful to us.

To manipulate it further, we'll use the readframes method to convert the wave_object to bytes. The -1 means we want to read in all of the pieces of information within the wave_object.

Now we've converted the audio file to bytes, what do they look like?

Okay, we can see a snippet of the entire soundwave in byte form.

But remember how kilohertz means thousands of pieces of information per second? The good morning dot wav audio file is 48 kilohertz and 2-seconds long. 48,000 pieces of information per second and 2-seconds long equals 96,000 chunks of data all for only two words.

So if we printed out the entire soundwave in byte form we'd see 96,000 of these combinations of letters and numbers.

Don't worry, if the output looks confusing for now, we'll learn how to convert these bytes into something more useful shortly.

Now you can start to see how working with audio and spoken language files is different from other kinds of data.

First of all, unlike text or tabular data, you can't immediately see what you're working with. So many audio files often require a conversion step before you can begin working with them.

And because of the frequency measure, even a few seconds of audio can contain large amounts of data. Add in background noise, other sounds, more speakers and the number of pieces of information grows even more. We'll look into this later on.

Alright, it's time to get hands-on and practice importing your first audio file!

#DataCamp #PythonTutorial #SpokenLanguageProcessinginPython #SpokenLanguageProcessing #audiodatainPython

Рекомендации по теме

Комментарии

I'm looking to make a program where i can load music in it and it gives me the chord progressions with tabs for guitar. Also i want it to make suggestions for changes in those progressions, all by the laws of music theory.

Is this doable ?
Do you know any open source code already written?

Thanks

strontvliegable

sample rate and frequency are not the same thing. Sample rate is a constant, but frequency is usually changing.

natetolbert

How do u convert an mp3 file into a .wav one?

vincenzorussotto

why the course on the website doesn't appear?

mennatullahabdallah

Hello ! You're video is really helpful, thank you very much for it ! However I have a little question: When you say that you "have your file good-morning.wav ready", where is it? I can not seem to find how to open this file. Where must it be to be opened correctly?

Thank you for your response

thomassouce

Python Tutorial : Introduction to audio data in Python

Python for Beginners - Learn Python in 1 Hour

👩‍💻 Python for Beginners Tutorial

Python Tutorial - Python Full Course for Beginners

Learn Python - Full Course for Beginners [Tutorial]

Introduction to Python Programming | Python for Beginners #lec1

#1 Python Tutorial for Beginners | Introduction to Python

What is Python? Why Python is So Popular?

Python Tutorial for Beginners - Learn Python in 5 Hours [FULL COURSE]

SQL AND - Part 3 - Using AND & OR #sql #programming #w3schools

Python for Beginners – Full Course [Programming Tutorial]

Python Tutorial for Absolute Beginners #1 - What Are Variables?

Introduction To Python -1 | Python For Beginners | Python Tutorial | Python Basics | Simplilearn

PYTHON BASICS (What I Would Learn First)

What is Python? | Python Programming For Beginners | Python Tutorial | Edureka

Python Tutorial 1: Introduction to Python for Absolute Beginners

Learn Python in Less than 10 Minutes for Beginners (Fast & Easy)

Python Programming Tutorial | Introduction | GeeksforGeeks

The complete guide to Python

Python Basics | Python Tutorial For Beginners | Learn Python Programming from Scratch | Edureka

Introduction to For Loops in Python (Python Tutorial #5)

What is Python? (Python Tutorial for Beginners) #1

Introduction to Python 3 Programming Tutorial

Introduction to Programming & Python | Python Tutorial - Day #1

Expert Python Tutorial #1 - Overview of Python & How it Works