AI Compute Stack
Understanding the AI software stack from inference frameworks to hardware ISA
1. AI Compute Stack Overview: From Inference Frameworks to Hardware ISA
   Intermediate #gpu #compute #software-stack #runtime #inference
2. GPU Architecture: From Transistors to Threads
   Intermediate #gpu #architecture #hardware #nvidia
3. Matrix Acceleration Units: Tensor Core and XMX
   Intermediate #gpu #tensor-core #xmx #systolic-array #nvidia #intel
4. CUDA Programming Model: From Code to Hardware
   Intermediate #gpu #cuda #programming #simt #simd #intel #sycl
5. GEMM Optimization: From Naive to Peak Performance
   Advanced #gpu #gemm #cuda #optimization #tensor-core #xmx #intel