TensorRT Overview


📚 About

During inference, TensorRT-based applications can run up to 40 times faster than CPU-only platforms. You can use TensorRT to optimize neural network models trained in all major frameworks, calibrate them for reduced precision while maintaining high accuracy, and deploy them to hyperscale data centers, embedded systems, or automotive product platforms.
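
A speedup figure like the one above is a ratio of two measured latencies. The sketch below shows one common way to take such a measurement: warm up, time several runs, and compare medians. The names `median_latency_ms`, `speedup`, `slow_backend`, and `fast_backend` are hypothetical, and the two workloads are plain Python stand-ins, not real PyTorch or TensorRT inference calls.

```python
import statistics
import time


def median_latency_ms(fn, warmup=3, runs=20):
    """Return the median wall-clock latency of fn() in milliseconds."""
    # Warm-up runs are discarded so one-time setup costs do not skew the result.
    for _ in range(warmup):
        fn()
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(samples)


def speedup(baseline_ms, optimized_ms):
    """Speedup factor of the optimized path relative to the baseline."""
    return baseline_ms / optimized_ms


# Hypothetical stand-ins for a baseline backend and an optimized backend.
def slow_backend():
    return sum(i * i for i in range(200_000))


def fast_backend():
    return sum(i * i for i in range(20_000))


baseline = median_latency_ms(slow_backend)
optimized = median_latency_ms(fast_backend)
print(f"baseline: {baseline:.2f} ms, optimized: {optimized:.2f} ms")
print(f"speedup: {speedup(baseline, optimized):.1f}x")
```

When timing real GPU inference, the device has to be synchronized before the clock is stopped, because GPU work is launched asynchronously; otherwise the measurement only captures the launch overhead.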

TensorRT is based on CUDA®, NVIDIA's parallel programming model, and allows you to optimize inference using CUDA-X™ libraries, development tools, and technologies for AI, autonomous machines, high-performance computing, and graphics. TensorRT takes advantage of sparse tensor cores on upcoming NVIDIA Ampere Architecture GPUs, delivering an additional performance increase.

For production deployments of deep learning inference applications such as video streaming, speech recognition, recommendation, fraud detection, text generation, and natural language processing, TensorRT provides INT8 optimizations through Quantization Aware Training and Post Training Quantization, as well as FP16 optimizations. Reduced precision inference can significantly reduce application latency, which is essential for many real-time services, as well as autonomous and embedded applications.
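
To make the idea behind INT8 post-training quantization more concrete, here is a minimal sketch in plain Python. It assumes symmetric quantization with simple max calibration; TensorRT's own calibrators are more sophisticated, and none of the function names below (`calibrate_scale`, `quantize`, `dequantize`) come from the TensorRT API. The sample activations are made up for illustration.

```python
def calibrate_scale(values, num_bits=8):
    """Max calibration: map the largest magnitude onto the largest integer level."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for INT8
    max_abs = max(abs(v) for v in values)
    return max_abs / qmax if max_abs > 0 else 1.0


def quantize(values, scale, num_bits=8):
    """Round each value to the nearest integer level and clip to the symmetric range."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def dequantize(q_values, scale):
    """Map integer levels back to approximate floating-point values."""
    return [q * scale for q in q_values]


# Hypothetical calibration data standing in for a layer's activations.
activations = [0.02, -1.27, 0.635, 0.9, -0.31]

scale = calibrate_scale(activations)
q = quantize(activations, scale)
restored = dequantize(q, scale)
max_error = max(abs(a - r) for a, r in zip(activations, restored))

print("scale:", scale)
print("quantized:", q)
print("max round-trip error:", max_error)
```

The round-trip error is bounded by half the scale, which is why a well-chosen calibration range lets INT8 inference keep accuracy close to the original floating-point model.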

🗒️ Chapters:
00:00 Intro to TensorRT
02:20 Prerequisites
03:20 TensorRT Docker Images
06:27 Jupyter Lab within Docker Containers
07:25 Compile TRT OSS
08:26 HuggingFace GPT-2
13:42 PyTorch on CPU/GPU vs TensorRT on GPU
16:42 Outro

#nvidia #tensorRT #pytorch
Comments

I have a feeling this video will be awesome!

thelinuxmann

I just discovered this channel since I have a keen interest in NVIDIA products. Keep up the noble work.

BestShorts._U

What an underrated channel, shame that you have fewer than 1M subscribers :)

gjboys

Hey Ahmad, I saw your poll on hosting paid lectures, don't you dare! Thanks.

official_village

Crystal clear transparent demonstrations :_D

turkeyserver

When is the next TensorRT release? Does it support HuggingFace?

pamholmes

To choose which TensorRT version to install, we need the CUDA version. Is that the one from nvidia-smi or the one from nvcc -V?

albertrg

But TensorRT only runs on GPU, right? Can it run on CPU?

theunknown

My friend, is there any chance I can use TensorRT together with OpenCV?

emrekeles

No background music pls, now it's annoying to speed up the video.

holthuizenoemoet