Scaling CNN Inference for Extreme Throughput | AI/ML IN 5G CHALLENGE

preview_player
Показать описание
IN THIS SESSION...
Performance scaling with traditional computing architectures becomes increasingly challenging as next generation technology nodes provide diminishing benefits. Semiconductor companies aim to unleash new levels of performance through further specialization of computer and memory subsystems for specific application domains. During this talk, we will discuss examples of extreme forms of specialization that help scaling CNN inference to 100s of millions of inputs/second to handle ML workloads in novel applications such as network intrusion detection.
Рекомендации по теме