Q4_K_M
4.9 GBMin VRAM: 5.5 GB
Recommended VRAM: 8 GB
Min RAM: 8 GB
Context: 8K / 128K
Loading model details...
Fetching variants, compatibility details, and metadata.
New to local models? Smaller quantization variants are easier to run, while larger ones can improve quality at the cost of more memory.
Q4_K_M
4.9 GBMin VRAM: 5.5 GB
Recommended VRAM: 8 GB
Min RAM: 8 GB
Context: 8K / 128K
Q8_0
8.5 GBMin VRAM: 9.5 GB
Recommended VRAM: 12 GB
Min RAM: 12 GB
Context: 8K / 128K
| Quantization | File Size | Min VRAM | Recommended VRAM | Min RAM | Context |
|---|---|---|---|---|---|
| Q4_K_M | 4.9 GB | 5.5 GB | 8 GB | 8 GB | 8K / 128K |
| Q8_0 | 8.5 GB | 9.5 GB | 12 GB | 12 GB | 8K / 128K |