#llm
3 articles
Advanced
Model Routing Landscape: Why One Model Isn't Enough
#model-routing
#llm
#cost-optimization
#system-design
Advanced
LLM Inference on NPU: KV Cache and the Software Stack
#intel
#npu
#llm
#kv-cache
#openvino
#npuw
#static-shape
Intermediate
When RL Meets LLM: From Language Generation to Policy Optimization
#reinforcement-learning
#llm
#post-training
#rlhf
#policy-optimization
#alignment