Branchless and SIMD Programming in Assembly

We will step through multiple versions of a UCase function, including a SIMD version that outperforms all of the others.

0:00 Introduction
1:55 Function Least1 - Uses Jumps
5:09 Function Least3 - Branchless
11:33 Function Least4 - Branchless #2
16:19 Function UCase1 - Standard
31:24 Function UCase2 - Branchless
41:10 Function UCase3 - Data Lookup
44:09 Function UCase4 - Aligned Data Lookup
46:09 Function UCase5 - Clears the bit that has a value of 32
53:16 Function UCase6 - SIMD
1:08:05 Timed Runs
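
For readers who want a feel for the "clear the bit with a value of 32" idea before watching, here is a minimal sketch in C rather than assembly. It is not the code from the video: the function name `ucase_branchless` is hypothetical, and it assumes plain ASCII input. In ASCII, each lowercase letter differs from its uppercase form only in bit 5 (value 32), so clearing that bit for bytes in the range 'a' to 'z' uppercases them without an if statement.

```c
#include <stddef.h>

/* Hypothetical helper, not taken from the video.
   Uppercases the ASCII letters in s[0..n-1] in place. */
void ucase_branchless(char *s, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        unsigned char c = (unsigned char)s[i];

        /* 1 if c is in 'a'..'z', otherwise 0. The unsigned subtraction
           wraps for c < 'a', so a single comparison covers both bounds. */
        unsigned is_lower = (unsigned)(c - 'a') <= (unsigned)('z' - 'a');

        /* Clear bit 5 (value 32) only when is_lower is 1;
           when it is 0 the mask is all ones and c is unchanged. */
        s[i] = (char)(c & ~(is_lower << 5));
    }
}
```

Whether the compiler emits truly branch-free machine code for the loop body depends on the compiler and optimization flags, which is one reason to write it directly in assembly as the video does.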
