Compatibility Check
Can I Run Devstral Small 2 24B on Apple M4 Ultra?
Yes — Apple M4 Ultra runs Devstral Small 2 24B fully on GPU at the FP16 quantization.
Estimated ~22.8 tokens/sec on the FP16 quantization.
Full GPU
Best variant: FP16
Full GPU inference — 256 GB VRAM meets the 62.4 GB recommendation.
- GPU VRAM
- 256 GB
- Min VRAM (best fit)
- 55.2 GB
- Recommended VRAM
- 62.4 GB
- Estimated tok/s
- ~22.8
Share this matchup
Send this page so a friend can see if Apple M4 Ultra fits Devstral Small 2 24B.
Every Devstral Small 2 24B quantization on Apple M4 Ultra
Each row runs the compatibility engine against your VRAM, RAM, and the model's requirements.
| Quantization | File Size | Min VRAM | Rec VRAM | Context | Verdict | Estimated tok/s |
|---|---|---|---|---|---|---|
| Q4_K_M | 12 GB | 13.8 GB | 15.6 GB | 8K / 8K | Full GPU | ~72.8 |
| Q5_K_M | 15 GB | 17.3 GB | 19.5 GB | 8K / 8K | Full GPU | ~63.3 |
| Q8_0 | 24 GB | 27.6 GB | 31.2 GB | 8K / 8K | Full GPU | ~43.3 |
| FP16Best fit | 48 GB | 55.2 GB | 62.4 GB | 8K / 8K | Full GPU | ~22.8 |
Apple M4 Ultra is solid pick for Devstral Small 2 24B
Need second card or fresh build? These links help support site at no extra cost.