vLLM inference performance testing version history

vLLM inference performance testing version history#

2025-06-23

2 min read time

Applies to Linux and Windows

This table lists previous versions of the ROCm vLLM inference Docker image for inference performance testing. For detailed information about available models for benchmarking, see the version-specific documentation. You can find tagged previous releases of the ROCm/vllm Docker image on Docker Hub.

ROCm version

vLLM version

PyTorch version

Resources

6.4.0

0.9.0.1

2.7.0

6.3.1

0.8.5 (0.8.6.dev)

2.7.0

6.3.1

0.8.5

2.7.0

6.3.1

0.8.3

2.7.0

6.3.1

0.7.3

2.7.0

6.3.1

0.6.6

2.7.0

6.2.1

0.6.4

2.5.0

6.2.0

0.4.3

2.4.0