Independently Published

DEEPSPEED IN PRODUCTION: inference OPTIMIZATION and MODEL: Deploy LLMs efficiently with optimized serving, quantization, low latency for real time applications

Brand: Independently Published
SKU: 9ff801375eac530933018ac6fd0f92c2

Amazon

Prices from € 30.05 (2 webshops)

Description

Amazon Pages: 288, Paperback, Independently published

Brand	Independently Published
EAN	9798274507356
MPN	RKC2013009186

Independently Published

LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's...

€ 8.61

Compare 2 stores

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

€ 14.29

Compare 2 stores

Independently Published

VECTOR DATABASE & RAG ENGINEERING: DESIGNING SCALABLE, LOW LATENCY RETRIEVAL SYSTEMS FOR...

€ 24.69

Compare 2 stores

Independently Published

LLM Engineer: Build, Fine-Tune, and Deploy Production-Grade AI Applications with Python Modern LLMs

€ 29.89

Compare 2 stores
