Content on this site is AI-generated and may contain errors. If you find issues, please report at GitHub Issues .

#llm

3 articles

Model Routing Landscape: Why One Model Isn't Enough

#model-routing #llm #cost-optimization #system-design

LLM Inference on NPU: KV Cache and the Software Stack

#intel #npu #llm #kv-cache #openvino #npuw #static-shape

When RL Meets LLM: From Language Generation to Policy Optimization

#reinforcement-learning #llm #post-training #rlhf #policy-optimization #alignment