Public GPU and NPU uploads

NVIDIA RTX 5060 Ti 16 GB

4608 CUDA cores

Type

GPU

VRAM

16 GB

Memory bandwidth

448 GB/s

TDP

180 W

Benchmark results

Llama 2 7B
Q4_0 512 3,460.92 148.00 ms 94 llama.cpp Vulkan uploaded 4 weeks ago
AI Hardware Research System
Standardized test

Llama-Bench

Used prompt

llama-bench -p 512 -n 128

Notes

llama-bench / Vulkan scoreboard; Flash Attention deaktiviert