#npu
3 articles
Advanced
NPU Architecture and GPU+NPU Co-Inference
#intel
#npu
#openvino
#hetero
#multi-device
#co-inference
Advanced
NPU Execution Model and the Boundaries of Its Programming Model
#intel
#npu
#execution-model
#dma
#tiling
#attention
#programming-model
#cute
Advanced
LLM Inference on NPU: KV Cache and the Software Stack
#intel
#npu
#llm
#kv-cache
#openvino
#npuw
#static-shape