Partial GPU

Best variant: Q4_K_M

Partial GPU offload — 6 GB VRAM is above the 5.5 GB minimum but below the 8 GB recommendation. Some layers will spill to RAM.

GPU VRAM: 6 GB
Min VRAM (best fit): 5.5 GB
Recommended VRAM: 8 GB
Estimated tok/s: ~38.4
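
The verdict above comes from comparing the card's VRAM with the two thresholds listed for the quantization. A minimal sketch of that comparison follows, assuming a simple three-way rule. The `Quant` class, the `verdict` function, and the "Full GPU" and "Too little VRAM" labels are hypothetical names used for illustration; only "Partial GPU" and the numeric thresholds come from this page, and the site's actual compatibility engine may weigh other factors such as system RAM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Quant:
    """Requirements for one quantization (hypothetical structure)."""
    name: str
    min_vram_gb: float
    rec_vram_gb: float


def verdict(gpu_vram_gb: float, quant: Quant) -> str:
    """Classify a GPU against a quantization's VRAM thresholds.

    Assumed rule: at or above the recommendation is a full fit,
    between minimum and recommendation is a partial offload,
    below the minimum does not fit on the GPU.
    """
    if gpu_vram_gb >= quant.rec_vram_gb:
        return "Full GPU"          # hypothetical label
    if gpu_vram_gb >= quant.min_vram_gb:
        return "Partial GPU"       # label used on this page
    return "Too little VRAM"       # hypothetical label


# Values taken from this page: GTX 1660 Super (6 GB) vs. Q4_K_M.
q4_k_m = Quant("Q4_K_M", min_vram_gb=5.5, rec_vram_gb=8.0)
print(verdict(6.0, q4_k_m))  # Partial GPU
```

With 6 GB the card clears the 5.5 GB minimum but not the 8 GB recommendation, so the sketch returns the same "Partial GPU" verdict shown here.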


Every Hermes 3 Llama 3.1 8B quantization on NVIDIA GeForce GTX 1660 Super

Each row runs the compatibility engine against your VRAM, RAM, and the model's requirements.

| Quantization | File Size | Min VRAM | Recommended VRAM | Context | Verdict | Estimated tok/s |
|---|---|---|---|---|---|---|
| Q4_K_M (best fit) | 4.9 GB | 5.5 GB | 8 GB | 8K / 128K | Partial GPU | ~38.4 |
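
For a partial offload, it can help to estimate how many transformer layers fit in VRAM before the rest spills to system RAM. The sketch below is a rough back-of-the-envelope estimate, not the site's method. It assumes the 4.9 GB file is spread evenly across the 32 transformer layers of Llama 3.1 8B (ignoring embedding and output tensors) and that 1.5 GB of VRAM is reserved for the KV cache, runtime context, and scratch buffers. The function name `layers_on_gpu` and the reserve figure are assumptions for illustration.

```python
import math


def layers_on_gpu(file_size_gb: float, n_layers: int,
                  gpu_vram_gb: float, reserve_gb: float) -> int:
    """Estimate how many layers fit in VRAM.

    Assumes every layer has the same size (file size / layer count)
    and that `reserve_gb` of VRAM is unavailable for weights.
    """
    per_layer_gb = file_size_gb / n_layers
    usable_gb = max(gpu_vram_gb - reserve_gb, 0.0)
    return min(n_layers, math.floor(usable_gb / per_layer_gb))


# 4.9 GB Q4_K_M file, 32 layers, 6 GB card, assumed 1.5 GB reserve.
print(layers_on_gpu(4.9, 32, 6.0, 1.5))  # 29
```

Under these assumptions about 29 of the 32 layers would sit on the GPU. A figure like this is what a layer-offload setting would take, for example llama.cpp's `--n-gpu-layers` option; the real number depends on context length and driver overhead, so treat it as a starting point to tune.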

NVIDIA GeForce GTX 1660 Super is a solid pick for Hermes 3 Llama 3.1 8B

