CVE-2026-34756

EUVD-2026-19351

06.04.2026, 16:16

vLLM is an inference and serving engine for large language models (LLMs). In versions from 0.1.0 up to, but not including, 0.19.0, a Denial of Service vulnerability exists in the vLLM OpenAI-compatible API server. Because the ChatCompletionRequest and CompletionRequest Pydantic models do not validate an upper bound on the n parameter, an unauthenticated attacker can send a single HTTP request with an astronomically large n value. The server then allocates millions of copies of the request object on the heap before the request even reaches the scheduling queue, which completely blocks the Python asyncio event loop and causes an immediate Out-Of-Memory crash. This vulnerability is fixed in 0.19.0.
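
The missing check described above can be illustrated with a minimal sketch. It is not the vLLM patch: it uses a standard-library dataclass in place of the Pydantic models so that it runs without dependencies, and the names CompletionRequestSketch, parse_request and the cap MAX_N = 16 are hypothetical choices made only for this example.

```python
from dataclasses import dataclass

# Hypothetical cap for illustration only; the limit chosen by the
# actual fix in vLLM 0.19.0 is not stated in this advisory.
MAX_N = 16


@dataclass
class CompletionRequestSketch:
    """Stand-in for a request model; not the real vLLM class."""

    prompt: str
    n: int = 1

    def __post_init__(self) -> None:
        # Reject non-integers (bool is a subclass of int, so exclude it).
        if isinstance(self.n, bool) or not isinstance(self.n, int):
            raise ValueError("n must be an integer")
        # Enforce both a lower and an upper bound before any per-choice
        # work (such as copying the request n times) can happen.
        if not 1 <= self.n <= MAX_N:
            raise ValueError(f"n must be between 1 and {MAX_N}, got {self.n}")


def parse_request(payload: dict) -> CompletionRequestSketch:
    """Build a validated request from an already-decoded JSON body."""
    return CompletionRequestSketch(
        prompt=payload.get("prompt", ""),
        n=payload.get("n", 1),
    )
```

With a bound of this kind, a request such as {"prompt": "hi", "n": 1000000000} fails validation with a ValueError instead of triggering the allocation described above.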

Provider	Type	Base Score	Atk. Vector	Atk. Complexity	Priv. Required	Vector
NIST	Primary	6.5 MEDIUM	NETWORK	LOW	LOW	CVSS:3.1/AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H
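
The Vector column packs the individual CVSS 3.1 base metrics into one string. As a reading aid, the following sketch splits such a string into its metric abbreviations; the function name parse_cvss_vector is a hypothetical helper, and the code only tokenizes the vector without computing a score.

```python
def parse_cvss_vector(vector: str) -> dict[str, str]:
    """Split a CVSS vector string into a {metric: value} mapping."""
    prefix, *metrics = vector.split("/")
    name, _, version = prefix.partition(":")
    if name != "CVSS" or not version:
        raise ValueError(f"not a CVSS vector: {vector!r}")

    parsed = {"version": version}
    for metric in metrics:
        key, sep, value = metric.partition(":")
        if not (key and sep and value):
            raise ValueError(f"malformed metric: {metric!r}")
        parsed[key] = value
    return parsed


parsed = parse_cvss_vector("CVSS:3.1/AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H")
# AV:N = network attack vector, PR:L = low privileges required,
# A:H = high availability impact; C:N and I:N = no confidentiality
# or integrity impact.
```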

EPSS Score

Percentile: 24%

Affected Products (NVD)

Vendor	Product	Version
vllm	vllm	0.1.0 ≤ 𝑥 < 0.19.0

𝑥 = Vulnerable software versions
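
The affected range 0.1.0 ≤ 𝑥 < 0.19.0 can be checked against an installed version with a short sketch like the one below. It assumes plain numeric release versions such as 0.18.1; pre-release or local suffixes (for example rc1 or +cu121) are not handled, and the helper names parse_version and is_affected are hypothetical.

```python
def parse_version(version: str) -> tuple[int, ...]:
    """Convert 'X.Y.Z' into a tuple of ints, padded to three parts."""
    parts = [int(part) for part in version.split(".")]
    # Pad short versions such as '0.19' so they compare as '0.19.0'.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def is_affected(version: str) -> bool:
    """True if the version falls in the range 0.1.0 <= x < 0.19.0."""
    return (0, 1, 0) <= parse_version(version) < (0, 19, 0)
```

For example, is_affected("0.18.1") is true, while is_affected("0.19.0") is false because the upper bound is exclusive.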

Common Weakness Enumeration

References

https://github.com/vllm-project/vllm/commit/b111f8a61f100fdca08706f41f29ef3548de7380

https://github.com/vllm-project/vllm/pull/37952

https://github.com/vllm-project/vllm/security/advisories/GHSA-3mwp-wvh9-7528

https://access.redhat.com/security/cve/CVE-2026-34756

https://bugzilla.redhat.com/show_bug.cgi?id=2455425

https://security.access.redhat.com/data/csaf/v2/vex/2026/cve-2026-34756.json