Q4_K_M
File size: 15 GB
Min VRAM: 17.3 GB
Recommended VRAM: 19.5 GB
Min RAM: 23 GB
Context: 8K / 8K
Social proof
34% of 994 scanned PCs run Qwen3 30B A3B fully on GPU.
592 of the 994 keep at least some of the work on GPU. Figures are based on anonymous compatibility checks.
General-purpose local model brief
Quantization tip: Benchmark at least two quantizations and validate with a task-specific eval set before production use.
New to local models? Smaller quantization variants are easier to run, while larger ones can improve quality at the cost of more memory.
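The quantization tip above can be sketched as a small harness that records throughput and accuracy for one quantization at a time. This is a minimal sketch, not a definitive benchmark: `generate` is a hypothetical callable that you would wrap around your own runtime, assumed to return the generated text and the number of tokens produced, and exact-match scoring stands in for whatever task-specific metric your eval set needs.

```python
import time


def benchmark(generate, prompts, expected):
    """Time a hypothetical generate(prompt) -> (text, n_tokens) callable
    and score its answers by exact match against the expected strings."""
    total_tokens = 0
    correct = 0
    start = time.perf_counter()
    for prompt, want in zip(prompts, expected):
        text, n_tokens = generate(prompt)
        total_tokens += n_tokens
        correct += int(text.strip() == want)
    elapsed = time.perf_counter() - start
    return {
        # Tokens generated per second of wall-clock time.
        "tok_per_s": total_tokens / elapsed if elapsed > 0 else float("inf"),
        # Fraction of prompts answered exactly as expected.
        "accuracy": correct / len(prompts) if prompts else 0.0,
    }
```

Running the same prompts through two quantizations (for example Q4_K_M and Q5_K_M) and comparing the two result dictionaries gives a first view of the speed versus quality trade-off on your own task.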
| Quantization | File Size | Min VRAM | Recommended VRAM | Min RAM | Context |
|---|---|---|---|---|---|
| Q4_K_M | 15 GB | 17.3 GB | 19.5 GB | 23 GB | 8K / 8K |
| Q5_K_M | 18.8 GB | 21.6 GB | 24.4 GB | 29 GB | 8K / 8K |
| Q8_0 | 30 GB | 34.5 GB | 39 GB | 45 GB | 8K / 8K |
| FP16 | 60 GB | 69 GB | 78 GB | 90 GB | 8K / 8K |
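As an illustration of how to read the table, the sketch below checks a given amount of GPU memory against the minimum and recommended VRAM columns. The numbers are copied from the table; the function names `classify` and `largest_recommended` are made up for this example, and the check ignores anything the table does not state, such as longer contexts or other processes using the GPU.

```python
# Figures copied from the table above, all in GB.
# (name, file_size, min_vram, recommended_vram, min_ram)
QUANTS = [
    ("Q4_K_M", 15.0, 17.3, 19.5, 23.0),
    ("Q5_K_M", 18.8, 21.6, 24.4, 29.0),
    ("Q8_0", 30.0, 34.5, 39.0, 45.0),
    ("FP16", 60.0, 69.0, 78.0, 90.0),
]


def classify(vram_gb):
    """Map each quantization to 'recommended', 'minimum', or 'does not fit'."""
    result = {}
    for name, _size, min_vram, rec_vram, _ram in QUANTS:
        if vram_gb >= rec_vram:
            result[name] = "recommended"
        elif vram_gb >= min_vram:
            result[name] = "minimum"
        else:
            result[name] = "does not fit"
    return result


def largest_recommended(vram_gb):
    """Return the largest quantization whose recommended VRAM fits, or None."""
    fitting = [q for q in QUANTS if vram_gb >= q[3]]
    return max(fitting, key=lambda q: q[1])[0] if fitting else None


if __name__ == "__main__":
    for vram in (16, 24, 32):
        print(vram, largest_recommended(vram), classify(vram))
```

By these figures, a 24 GB card clears the recommended VRAM for Q4_K_M but only the minimum for Q5_K_M, since 24 GB is just under the 24.4 GB recommendation.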
These GPUs meet the recommended 19.5 GB VRAM for the Q4_K_M quantization. Estimated speeds are approximate and assume full GPU offloading.
| Pick | GPU | VRAM | Estimated speed | Rationale |
|---|---|---|---|---|
| Budget Pick | AMD Radeon RX 7900 XT | 20 GB | ~42.7 tok/s | Lowest cost that meets recommended VRAM |
| Fastest Pick | NVIDIA GeForce RTX 5090 | 32 GB | ~95.6 tok/s | Highest estimated throughput |
| Best Value | NVIDIA GeForce RTX 4090 | 24 GB | ~53.8 tok/s | Best speed per dollar of VRAM |