NSM Introduction to GPU Programming: L2: CUDA Memory

Comments

At the 50:00 timestamp, for coalescing with strided indexing, are you saying to launch the kernel as dkernel<<<chunksize, number of threads>>>?

I couldn't understand that part.

ajeethkumarm
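
For readers with the same question, a minimal sketch of the two indexing patterns may help. It is not taken from the lecture: the kernel names `chunked` and `strided`, the helper `run_both`, and the reading of `chunksize` as "elements per thread" are all assumptions made for illustration. Under that reading, `chunksize` is a loop bound inside the kernel rather than a launch parameter, and the launch configuration stays `<<<blocks, threads>>>` in both cases.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Chunked indexing (assumed meaning of "chunksize": elements per thread).
// Each thread walks its own contiguous chunk, so at any loop step the
// threads of a warp touch addresses that are chunksize elements apart.
__global__ void chunked(int *a, int n, int chunksize) {
    int id = blockIdx.x * blockDim.x + threadIdx.x;
    int start = id * chunksize;
    int end = min(start + chunksize, n);
    for (int i = start; i < end; ++i) a[i] += 1;
}

// Strided indexing: at every loop step, neighboring threads touch
// neighboring elements, so a warp's accesses can be coalesced.
__global__ void strided(int *a, int n) {
    int nthreads = gridDim.x * blockDim.x;
    for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n; i += nthreads)
        a[i] += 1;
}

// Hypothetical helper: runs both kernels on a zeroed array and returns
// true if every element was incremented exactly twice.
bool run_both(int n, int blocks, int threads) {
    int *d = nullptr;
    cudaMalloc(&d, n * sizeof(int));
    cudaMemset(d, 0, n * sizeof(int));

    int total = blocks * threads;
    int chunksize = (n + total - 1) / total;  // ceil(n / total)
    chunked<<<blocks, threads>>>(d, n, chunksize);
    strided<<<blocks, threads>>>(d, n);

    int *h = new int[n];
    cudaMemcpy(h, d, n * sizeof(int), cudaMemcpyDeviceToHost);
    bool ok = true;
    for (int i = 0; i < n; ++i) ok = ok && (h[i] == 2);
    delete[] h;
    cudaFree(d);
    return ok;
}

int main() {
    printf(run_both(1 << 20, 4, 256) ? "ok\n" : "mismatch\n");
    return 0;
}
```

Both kernels compute the same result; only the mapping from threads to elements differs, which is what determines whether a warp's loads and stores fall on adjacent addresses.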

So texture and constant memory can only be used to store the instructions to be applied to data transferred from CPU to GPU memory, and to store metadata about that data that is available during GPU execution of the instructions?

pronodbharatiya
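
As a point of reference for this question: in the CUDA programming model, constant memory holds read-only data that the host writes before a launch, not instructions. The sketch below is an illustration under assumptions, not something from the lecture; the symbol `coeffs`, the kernel `scale_shift`, and the helper `run_scale_shift` are hypothetical names.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Read-only data in constant memory (hypothetical example values).
__constant__ float coeffs[2];

// Every thread reads the same two constants and applies them to its
// own element, which lives in ordinary global memory.
__global__ void scale_shift(float *x, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] = coeffs[0] * x[i] + coeffs[1];
}

// Hypothetical helper: returns true if x[i] == 2*i + 1 for all i.
bool run_scale_shift() {
    const int n = 256;
    float h[n];
    for (int i = 0; i < n; ++i) h[i] = (float)i;

    // The host fills constant memory with data before the launch.
    float hc[2] = {2.0f, 1.0f};
    cudaMemcpyToSymbol(coeffs, hc, sizeof(hc));

    float *d = nullptr;
    cudaMalloc(&d, sizeof(h));
    cudaMemcpy(d, h, sizeof(h), cudaMemcpyHostToDevice);
    scale_shift<<<1, n>>>(d, n);
    cudaMemcpy(h, d, sizeof(h), cudaMemcpyDeviceToHost);
    cudaFree(d);

    bool ok = true;
    for (int i = 0; i < n; ++i) ok = ok && (h[i] == 2.0f * i + 1.0f);
    return ok;
}

int main() {
    printf(run_scale_shift() ? "ok\n" : "mismatch\n");
    return 0;
}
```

The kernel cannot write to `coeffs`; the values are small, shared by all threads, and fixed for the duration of the launch, which is the usage pattern constant memory is meant for.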

L2 is shared among the SMs. Does that mean that if one SM needs a particular chunk of L2, the programmer is free to assign it? Or are the chunks fixed in advance for each SM, so that the programmer can't assign values beyond that amount of space?

pronodbharatiya

So the atomic instructions for L2 are stored in constant and/or texture memory?

pronodbharatiya