CUDA Programming in Python
In this tutorial, I'll show you everything you need to know about CUDA programming so that you can make use of GPU parallelization through simple modifications of your existing code, which currently runs on a boring CPU. The tutorial was recorded on NVIDIA's Jetson Orin. CUDA stands for Compute Unified Device Architecture; it is a parallel computing platform and application programming interface that lets software use certain types of graphics processing units for general-purpose processing, an approach called general-purpose computing on GPUs (GPGPU).
First, I will write a simple function that performs a vector multiplication, running on the CPU. Then we get the same job done on the GPU using CUDA parallelization (see the sketch below). Keep in mind that GPUs have many more cores than CPUs, so when it comes to parallel computation over data, GPUs perform far better, even though they run at lower clock speeds and lack several core-management features that CPUs have. The example reveals that running 64 million multiplications takes about 0.64 seconds on the GPU, as opposed to 31.4 seconds on the CPU. That is roughly a 50x speedup, thanks to the parallelization across such a huge number of cores. Amazing! It means that a complex program that takes about a month on a CPU could be executed in around 14 hours, and even faster given more cores.
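The video's exact code isn't reproduced here, but a minimal sketch of this comparison, assuming Numba (a common way to write CUDA code from Python, and available on Jetson boards), could look like the following; the function names and the data values are illustrative:

```python
import numpy as np
from numba import vectorize

def multiply_cpu(a, b, out):
    # Plain Python loop: one element at a time, serially on the CPU.
    for i in range(a.size):
        out[i] = a[i] * b[i]

@vectorize(['float32(float32, float32)'], target='cuda')
def multiply_gpu(a, b):
    # Numba compiles this scalar function into a CUDA kernel;
    # each GPU thread handles one element of the input arrays.
    return a * b

n = 64_000_000
a = np.ones(n, dtype=np.float32)
b = np.full(n, 2.0, dtype=np.float32)
out = np.empty_like(a)

multiply_cpu(a, b, out)   # seconds in a pure-Python loop on the CPU
c = multiply_gpu(a, b)    # far faster once the data is on the GPU
```

Timing either call (for example with time.time() before and after) is what produces comparisons like the one above; exact numbers depend on your hardware.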
Next, I'll show you the gains from filling arrays in Python on a CPU versus a GPU (sketched below). That example reveals that filling the array takes about 2.58 seconds on the CPU, as opposed to 0.39 seconds on the GPU, a gain of about 6.6x. The last fundamental section of this video shows the gains from rendering images (or videos) in Python, and demonstrates why you see some film producers and movie makers rendering and editing their content on a GPU. GPU rendering does the work on a graphics card rather than a CPU, which can substantially speed up the rendering process, because GPUs are built primarily for fast image rendering; they were developed in response to graphically intensive applications that taxed CPUs and slowed processing speed. I use the Mandelbrot set to compare CPU and GPU power. This example reveals that only 1.4 seconds of execution are needed on the GPU, as opposed to 110 seconds on the CPU, a 78x gain. This means that instead of rendering a 4K-resolution video over a week on a CPU, you could get the same video rendered in 8K in about 2 hours on a GPU using 32 threads. So imagine what happens if you double the threads and blocks involved in the kernel launch.
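For the array-filling comparison, a sketch along the same lines (again assuming Numba; the kernel name, block size, and fill value are my own choices):

```python
import numpy as np
from numba import cuda

@cuda.jit
def fill_kernel(arr, value):
    i = cuda.grid(1)          # global index of this thread
    if i < arr.size:          # guard: the grid may overshoot the array
        arr[i] = value

n = 64_000_000
d_arr = cuda.device_array(n, dtype=np.float32)  # allocate on the GPU
threads_per_block = 256
blocks = (n + threads_per_block - 1) // threads_per_block
fill_kernel[blocks, threads_per_block](d_arr, 3.0)
result = d_arr.copy_to_host()                   # bring the data back
```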
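And for the rendering section, the classic Numba Mandelbrot kernel gives the flavor of the GPU version. This is a sketch, not necessarily the code shown in the video; the image size, iteration count, and 16x16 thread-block shape are assumptions:

```python
import numpy as np
from numba import cuda

@cuda.jit(device=True)
def mandel(x, y, max_iters):
    # Iterate z = z*z + c and count steps until |z| escapes radius 2.
    c = complex(x, y)
    z = 0.0j
    for i in range(max_iters):
        z = z * z + c
        if (z.real * z.real + z.imag * z.imag) >= 4.0:
            return i
    return max_iters

@cuda.jit
def mandel_kernel(min_x, max_x, min_y, max_y, image, iters):
    height, width = image.shape
    pixel_size_x = (max_x - min_x) / width
    pixel_size_y = (max_y - min_y) / height
    x, y = cuda.grid(2)  # global (x, y) index of this thread
    if x < width and y < height:
        real = min_x + x * pixel_size_x
        imag = min_y + y * pixel_size_y
        image[y, x] = mandel(real, imag, iters)

image = np.zeros((1024, 1536), dtype=np.uint32)
d_image = cuda.to_device(image)
threads = (16, 16)                  # 256 threads per block
blocks = (1536 // 16, 1024 // 16)   # grid covering every pixel
mandel_kernel[blocks, threads](-2.0, 1.0, -1.0, 1.0, d_image, 20)
image = d_image.copy_to_host()
```

Since every pixel is independent, each thread computes one pixel; this is exactly the kind of embarrassingly parallel workload where the GPU's core count dominates its lower clock speed.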
⏲Outline⏲
00:00 Introduction
00:33 Multiplication gains on GPUs vs CPUs
08:31 Filling an array on GPUs vs CPUs
11:55 Rendering gains on GPUs vs CPUs
12:35 What is a Mandelbrot set?
13:39 Mandelbrot set rendering on CPU
17:01 Mandelbrot set rendering on GPU
20:54 Outro
📚Related Lectures
🔴 Subscribe for more videos on CUDA programming
👍 Smash that like button if you find this tutorial useful.
👁🗨 Speak up and comment, I am all ears.
💰 Donate to help the channel
BTC wallet - 3KnwXkMZB4v5iMWjhf1c9B9LMTKeUQ5viP
ETH wallet - 0x44F561fE3830321833dFC93FC1B29916005bC23f
DOGE wallet - DEvDM7Pgxg6PaStTtueuzNSfpw556vXSEW
API3 wallet - 0xe447602C3073b77550C65D2372386809ff19515b
DOT wallet - 15tz1fgucf8t1hAdKpUEVy8oSR8QorAkTkDhojhACD3A4ECr
ARPA wallet - 0xf54bEe325b3653Bd5931cEc13b23D58d1dee8Dfd
QNT wallet - 0xDbfe00E5cddb72158069DFaDE8Efe2A4d737BBAC
AAVE wallet - 0xD9Db74ac7feFA7c83479E585d999E356487667c1
AGLD wallet - 0xF203e39cB3EadDfaF3d11fba6dD8597B4B3972Be
AERGO wallet - 0xd847D9a2EE4a25Ff7836eDCd77E5005cc2E76060
AST wallet - 0x296321FB0FE1A4dE9F33c5e4734a13fe437E55Cd
DASH wallet - XtzYFYDPCNfGzJ1z3kG3eudCwdP9fj3fyE
#cuda #cudaprogramming #gpu