Featured
Compare webshops (2)
Pages: 95, Paperback, Independently published
Independently Published
LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's...
Local LLM Inference Optimization: A Comprehensive Guide to Quantization, Hardware Acceleration, and...
AI Performance Engineering: From GPU Kernels to LLM Inference
Back to top