Public GPU and NPU uploads

NVIDIA RTX 4070 Ti SUPER

8448 CUDA cores

Type

GPU

VRAM

16 GB

Memory bandwidth

672 GB/s

TDP

285 W

Benchmark results

Llama 2 7B
Q4_0 512 6,099.18 84.00 ms 129 llama.cpp Vulkan uploaded 4 weeks ago
AI Hardware Research System
Standardized test

Llama-Bench

Used prompt

llama-bench -p 512 -n 128

Notes

llama-bench / Vulkan scoreboard; Flash Attention deaktiviert