Public GPU and NPU uploads

NVIDIA RTX 5090

21,760 CUDA cores

Type

GPU

VRAM

32 GB

Memory bandwidth

1,792 GB/s

TDP

575 W

Benchmark results

Llama 2 7B
Q4_0 512 10,381.64 49.00 ms 264 llama.cpp Vulkan uploaded 4 weeks ago
AI Hardware Research System
Standardized test

Llama-Bench

Used prompt

llama-bench -p 512 -n 128

Notes

llama-bench / Vulkan scoreboard; Flash Attention disabled
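
As a rough sanity check on the result row, the short sketch below converts throughput into elapsed time for the token counts given in the llama-bench command (-p 512, -n 128). It assumes, without confirmation from the page, that 10,381.64 is prompt-processing throughput in tokens per second and that 264 is token-generation throughput in tokens per second. The constant and function names are illustrative only.

```python
# Values copied from the result row and the llama-bench command.
# ASSUMPTION: the page does not label these columns; the meanings below are guesses.
PROMPT_TOKENS = 512               # from `-p 512`
GEN_TOKENS = 128                  # from `-n 128`
PROMPT_TOKENS_PER_S = 10_381.64   # assumed: prompt-processing throughput (tokens/s)
GEN_TOKENS_PER_S = 264.0          # assumed: token-generation throughput (tokens/s)


def elapsed_ms(tokens: int, tokens_per_second: float) -> float:
    """Return the time in milliseconds needed to process `tokens` at a given rate."""
    return tokens / tokens_per_second * 1000.0


prompt_ms = elapsed_ms(PROMPT_TOKENS, PROMPT_TOKENS_PER_S)
gen_ms = elapsed_ms(GEN_TOKENS, GEN_TOKENS_PER_S)

print(f"prompt phase:     {prompt_ms:.2f} ms")
print(f"generation phase: {gen_ms:.2f} ms")
```

Under these assumptions the prompt phase comes out at a little over 49 ms, which is close to the 49.00 ms shown in the row. That suggests, but does not prove, that the millisecond column reports the time to process the 512-token prompt.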