Public GPU and NPU uploads

Find efficient local LLM hardware per watt

Compare GPUs and NPUs by token generation per watt to find local LLM hardware that balances speed and lower power consumption.

Apple M4 10-core GPU 16 GB
3.01 tok/W
Apple M4 10-core GPU 24 GB
3.01 tok/W
Apple M4 10-core GPU 32 GB
3.01 tok/W
Apple M1 Pro 14-core GPU 16 GB
2.37 tok/W
Apple M1 Pro 14-core GPU 32 GB
2.37 tok/W
Apple M4 Pro 16-core GPU 24 GB
1.99 tok/W
Apple M4 Pro 16-core GPU 48 GB
1.99 tok/W
Apple M1 Pro 16-core GPU 16 GB
1.73 tok/W
Apple M1 Pro 16-core GPU 32 GB
1.73 tok/W
Apple M2 10-core GPU 8 GB
1.62 tok/W
Apple M2 10-core GPU 16 GB
1.62 tok/W
Apple M2 10-core GPU 24 GB
1.62 tok/W
Apple M1 Max 24-core GPU 32 GB
1.56 tok/W
Apple M1 Max 24-core GPU 64 GB
1.56 tok/W
Apple M4 Max 32-core GPU 36 GB
1.55 tok/W
Apple M2 Pro 16-core GPU 32 GB
1.52 tok/W
Apple M2 Pro 16-core GPU 16 GB
1.51 tok/W
Apple M2 Max 30-core GPU 32 GB
1.45 tok/W
Apple M2 Max 30-core GPU 64 GB
1.45 tok/W
Apple M3 10-core GPU 8 GB
1.42 tok/W
Apple M3 10-core GPU 16 GB
1.42 tok/W
Apple M3 10-core GPU 24 GB
1.42 tok/W
Apple M1 7-core GPU 8 GB
1.42 tok/W
Apple M1 7-core GPU 16 GB
1.42 tok/W
Apple M1 8-core GPU 8 GB
1.42 tok/W
Apple M1 8-core GPU 16 GB
1.42 tok/W
Apple M2 Pro 19-core GPU 16 GB
1.39 tok/W
Apple M1 Max 32-core GPU 32 GB
1.39 tok/W
Apple M1 Max 32-core GPU 64 GB
1.39 tok/W
Apple M2 Pro 19-core GPU 32 GB
1.39 tok/W
Apple M3 Pro 14-core GPU 18 GB
1.28 tok/W
Apple M3 Pro 14-core GPU 36 GB
1.28 tok/W
Apple M2 Max 38-core GPU 32 GB
1.23 tok/W
Apple M2 Max 38-core GPU 64 GB
1.23 tok/W
Apple M2 Max 38-core GPU 96 GB
1.23 tok/W
Apple M1 Ultra 48-core GPU 64 GB
1.14 tok/W
Apple M1 Ultra 48-core GPU 128 GB
1.14 tok/W
Apple M4 Pro 20-core GPU 24 GB
1.13 tok/W
Apple M4 Pro 20-core GPU 48 GB
1.13 tok/W
Apple M4 Pro 20-core GPU 64 GB
1.13 tok/W
Apple M4 Max 40-core GPU 48 GB
1.11 tok/W
Apple M4 Max 40-core GPU 64 GB
1.11 tok/W
Apple M4 Max 40-core GPU 128 GB
1.11 tok/W
Apple M3 Max 40-core GPU 48 GB
1.11 tok/W
Apple M3 Max 40-core GPU 64 GB
1.11 tok/W
Apple M3 Max 40-core GPU 128 GB
1.11 tok/W
Apple M3 Pro 18-core GPU 18 GB
1.10 tok/W
Apple M3 Pro 18-core GPU 36 GB
1.10 tok/W
Apple M3 Max 30-core GPU 36 GB
1.07 tok/W
Apple M3 Max 30-core GPU 96 GB
1.07 tok/W
Apple M2 Ultra 60-core GPU 64 GB
1.04 tok/W
Apple M2 Ultra 60-core GPU 128 GB
1.04 tok/W
Apple M2 Ultra 60-core GPU 192 GB
1.04 tok/W
Apple M3 Ultra 60-core GPU 96 GB
0.98 tok/W
Apple M1 Ultra 64-core GPU 64 GB
0.95 tok/W
Apple M1 Ultra 64-core GPU 128 GB
0.95 tok/W
Apple M2 Ultra 76-core GPU 64 GB
0.87 tok/W
Apple M2 Ultra 76-core GPU 128 GB
0.87 tok/W
Apple M2 Ultra 76-core GPU 192 GB
0.87 tok/W
Apple M3 Ultra 80-core GPU 256 GB
0.77 tok/W
Apple M3 Ultra 80-core GPU 512 GB
0.77 tok/W
AMD RX 9070 16 GB
0.54 tok/W
AMD RX 7900 XTX 24 GB
0.54 tok/W
NVIDIA RTX 5060 Ti 16 GB 16 GB
0.52 tok/W
NVIDIA RTX 5060 Ti 8 GB 8 GB
0.52 tok/W
NVIDIA RTX 5080 16 GB
0.51 tok/W
AMD RX 9060 XT 8 GB 8 GB
0.47 tok/W
NVIDIA RTX 4070 12 GB
0.46 tok/W
NVIDIA RTX 4080 SUPER 16 GB
0.46 tok/W
NVIDIA RTX 5090 32 GB
0.46 tok/W
NVIDIA RTX 3090 24 GB
0.46 tok/W
NVIDIA RTX 4070 Ti SUPER 16 GB
0.45 tok/W
NVIDIA RTX 5070 Ti 16 GB
0.45 tok/W
AMD RX 9070 XT 16 GB
0.45 tok/W
AMD RX 7800 XT 16 GB
0.45 tok/W
NVIDIA RTX 3060 12 GB 12 GB
0.45 tok/W
NVIDIA RTX 3060 8 GB 8 GB
0.45 tok/W
AMD RX 7900 GRE 16 GB
0.45 tok/W
AMD RX 9060 XT 16 GB 16 GB
0.44 tok/W
NVIDIA RTX 3080 10 GB 10 GB
0.43 tok/W
NVIDIA RTX 4090 24 GB
0.42 tok/W
NVIDIA RTX 3080 12 GB 12 GB
0.40 tok/W
AMD RX 7900 XT 20 GB
0.39 tok/W
NVIDIA RTX 4070 Ti 12 GB
0.39 tok/W
AMD RX 6600 8 GB
0.38 tok/W
AMD RX 6800 16 GB
0.38 tok/W
Intel Arc B580 12 GB
0.37 tok/W
AMD RX 6700 XT 12 GB
0.36 tok/W
AMD RX 6900 XT 16 GB
0.36 tok/W
NVIDIA RTX 3070 8 GB
0.36 tok/W
AMD RX 6650 XT 8 GB
0.35 tok/W
AMD RX 6600 XT 8 GB
0.34 tok/W
AMD RX 6800 XT 16 GB
0.33 tok/W
Intel Arc B570 10 GB
0.33 tok/W
AMD RX 6750 XT 12 GB
0.33 tok/W
AMD RX 7600 XT 16 GB
0.28 tok/W
AMD RX 6500 XT 4 GB
0.25 tok/W
Intel Arc A770 16 GB 16 GB
0.23 tok/W
Intel Arc A770 8 GB 8 GB
0.23 tok/W
Intel Arc A750 8 GB
0.20 tok/W
AMD RX 6400 4 GB
AMD RX 6700 10 GB
AMD RX 6750 GRE 12 GB 12 GB
AMD RX 6750 GRE 10 GB 10 GB
AMD RX 6950 XT 16 GB
AMD RX 7600 8 GB
AMD RX 7700 XT 12 GB
AMD RX 9060 8 GB
Apple M1 7-core GPU 8 GB
Apple M2 8-core GPU 8 GB
Apple M2 8-core GPU 16 GB
Apple M2 8-core GPU 24 GB
Apple M3 8-core GPU 8 GB
Apple M3 8-core GPU 16 GB
Apple M3 8-core GPU 24 GB
Apple M4 8-core GPU 16 GB
Apple M4 8-core GPU 24 GB
Apple M4 8-core GPU 32 GB
Apple M4 9-core GPU 12 GB
Apple M5 8-core GPU 16 GB
Apple M5 8-core GPU 24 GB
Apple M5 8-core GPU 32 GB
Apple M5 10-core GPU 16 GB
Apple M5 10-core GPU 24 GB
Apple M5 10-core GPU 32 GB
Apple M5 Max 32-core GPU 36 GB
Apple M5 Max 40-core GPU 48 GB
Apple M5 Max 40-core GPU 64 GB
Apple M5 Max 40-core GPU 128 GB
Apple M5 Pro 20-core GPU 24 GB
Apple M5 Pro 20-core GPU 48 GB
Apple M5 Pro 20-core GPU 64 GB
Intel Arc A310 4 GB
Intel Arc A380 6 GB
Intel Arc A580 8 GB
NVIDIA RTX 3050 6 GB 6 GB
NVIDIA RTX 3050 8 GB 8 GB
NVIDIA RTX 3060 Ti 448 GB/s 8 GB
NVIDIA RTX 3060 Ti 608 GB/s 8 GB
NVIDIA RTX 3070 Ti 8 GB
NVIDIA RTX 3080 Ti 12 GB
NVIDIA RTX 3090 Ti 24 GB
NVIDIA RTX 4060 8 GB
NVIDIA RTX 4060 Ti 16 GB 16 GB
NVIDIA RTX 4060 Ti 8 GB 8 GB
NVIDIA RTX 4070 SUPER 12 GB
NVIDIA RTX 4080 16 GB
NVIDIA RTX 5050 8 GB
NVIDIA RTX 5060 8 GB
NVIDIA RTX 5070 12 GB