Best Local LLMs for the MacBook Air M2 (24 GB)

The best local LLM for a MacBook Air M2 (24 GB) is Qwen 4.1 32B-A3B, at an estimated 10 tok/s. With 24 GB of unified memory, the Air fits 43 of the 79 models we benchmark, from compact 3B options up to the 41B Mistral Medium 4. For everyday chat and coding, Qwen 4.1 32B-A3B is the sweet spot. Full ranking below.

Unified memory: 24 GB
Mem. bandwidth: 100 GB/s
Models that fit: 43 of 79
Top speed: 78 tok/s

Top 3 picks for the MacBook Air M2 (24 GB)

⭐ Best overall: Qwen 4.1 32B-A3B · 32B · Apache 2.0 · cap 46/50 · 10 tok/s est.
⚡ Fastest: SmolLM3 3B · 3B · Apache 2.0 · cap 10/50 · 78 tok/s
🧠 Runner-up: Qwen 4 · 32B · Apache 2.0 · cap 45/50 · 10 tok/s est.

Every model ranked for a MacBook Air M2 (24 GB)

Ranked by LLMCheck suitability, which balances capability against real speed on the M2. Speeds marked "est." are scaled from measured runs by memory bandwidth.

| # | Model | Size | License | Speed | Capability |
|---|---|---|---|---|---|
| 1 | Qwen 4.1 32B-A3B | 32B | Apache 2.0 | 10 tok/s est. | 46/50 |
| 2 | Qwen 4 | 32B | Apache 2.0 | 10 tok/s est. | 45/50 |
| 3 | Qwen 4 Coder | 32B | Apache 2.0 | 10 tok/s est. | 44/50 |
| 4 | Qwen 4 Preview 32B-A3B | 32B | Apache 2.0 | 10 tok/s est. | 42/50 |
| 5 | Gemma 4 31B | 31B | Apache 2.0 | 4 tok/s est. | 40/50 |
| 6 | Qwen 3.6-35B-A3B | 35B | Apache 2.0 | 9 tok/s est. | 38/50 |
| 7 | Phi-5 Large 28B | 28B | MIT | 6 tok/s est. | 36/50 |
| 8 | Gemma 4 26B-A4B | 26B | Apache 2.0 | 8 tok/s est. | 35/50 |
| 9 | Mistral Medium 4 | 41B | Apache 2.0 | 8 tok/s est. | 34/50 |
| 10 | Gemma 4.5 27B | 27B | Gemma | 7 tok/s est. | 32/50 |
| 11 | Gemma 4.5 12B | 12B | Gemma | 35 tok/s | 28/50 |
| 12 | Phi-5 Medium 14B | 14B | MIT | 24 tok/s | 28/50 |

Showing the top 12 of 43 models that fit in 24 GB. See the full leaderboard or all benchmarks.
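Token generation on Apple Silicon is usually limited by memory bandwidth, which is why scaling a measured speed by bandwidth gives a usable estimate. LLMCheck's exact formula isn't published here, so the sketch below assumes the simplest version: a straight proportional scale. The function name and the reference run (40 tok/s on a 400 GB/s chip) are hypothetical, chosen only for illustration.

```python
def scale_speed(measured_tok_s: float, measured_bw_gbs: float, target_bw_gbs: float) -> float:
    """Scale a measured decode speed to another chip by the ratio of memory bandwidths.

    Assumption: decoding is bandwidth-bound, so speed is proportional to GB/s.
    """
    if measured_bw_gbs <= 0 or target_bw_gbs <= 0:
        raise ValueError("bandwidths must be positive")
    return measured_tok_s * (target_bw_gbs / measured_bw_gbs)


M2_BANDWIDTH_GBS = 100.0  # MacBook Air M2 figure from the spec summary

# Hypothetical reference run, not a real LLMCheck measurement.
estimate = scale_speed(measured_tok_s=40.0, measured_bw_gbs=400.0, target_bw_gbs=M2_BANDWIDTH_GBS)
print(f"{estimate:.0f} tok/s est.")
```

Treat the result as a rough ceiling: thermal throttling on a fanless Air and long contexts can pull real speeds below a bandwidth-only estimate.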

Quick start: run Qwen 4.1 32B-A3B on your MacBook Air M2

The fastest way to get started is Ollama. Install it, then pull the top pick for your Mac:

brew install ollama
ollama run qwen-41-32b-a3b

Prefer a GUI? LM Studio gives you a one-click download and chat window. For step-by-step help see our Ollama install guide, or open the Qwen 4.1 32B-A3B on M2 benchmark page for exact settings.
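Once the model is pulled, you can also call it from a script through Ollama's local HTTP API. The sketch below is a minimal example using only the Python standard library. It assumes the Ollama server is running on its default port (11434) and reuses the model tag from the quick start as-is; the helper names `build_request` and `generate` are ours, not part of Ollama.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint
MODEL_TAG = "qwen-41-32b-a3b"  # tag copied from the quick start; adjust if yours differs


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming generate request for the local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = MODEL_TAG, timeout: float = 300.0) -> str:
    """Send the prompt and return the model's full reply as a string."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `print(generate("Explain unified memory in one sentence."))` prints the reply. The generous timeout is deliberate: at around 10 tok/s, a long answer from a 32B model can take a minute or more.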


FAQ: local LLMs on the MacBook Air M2

What is the best local LLM for a MacBook Air M2 (24 GB)?

Qwen 4.1 32B-A3B (32B, Apache 2.0) is the best all-round pick, with a capability score of 46/50 at an estimated 10 tok/s on the M2. If you want maximum speed, SmolLM3 3B hits 78 tok/s; the runner-up, Qwen 4 (45/50 capability), also fits in 24 GB.

How many models can a MacBook Air M2 with 24 GB run?

43 of the 79 models in the LLMCheck leaderboard fit in 24 GB of unified memory, from compact models up to Mistral Medium 4 (41B).

Can a MacBook Air M2 run a 70B model?

Not comfortably. A 70B model in Q4 needs ~40–44 GB; with 24 GB you should stick to models up to ~18 GB, such as Qwen 4.1 32B-A3B. For 70B, look at a 48 GB+ Mac.
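The arithmetic behind those figures can be sketched in a few lines. The example assumes about 4.5 bits per weight, a common figure for 4-bit quantization once per-block scales are counted; real files vary by quantization scheme. The 18 GB budget is the guideline quoted above, and both function names are hypothetical.

```python
MODEL_BUDGET_GB = 18.0  # guideline from the answer above for a 24 GB Mac


def quantized_size_gb(params_billion: float, bits_per_weight: float = 4.5) -> float:
    """Approximate size of the quantized weights in GB (1 GB = 1e9 bytes).

    Assumption: ~4.5 bits per weight for Q4; KV cache and runtime overhead are extra.
    """
    return params_billion * bits_per_weight / 8


def fits_budget(params_billion: float, budget_gb: float = MODEL_BUDGET_GB) -> bool:
    """Check whether the weights alone stay within the memory budget."""
    return quantized_size_gb(params_billion) <= budget_gb


for size in (32, 70):
    print(f"{size}B: {quantized_size_gb(size):.1f} GB, fits: {fits_budget(size)}")
```

Under these assumptions the 70B weights alone come to about 39 GB, and the KV cache plus runtime overhead push the total into the ~40–44 GB range, while a 32B model lands right at the 18 GB guideline.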

Is 24 GB of RAM enough to run LLMs locally?

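Yes, for small-to-mid models: up to ~14B runs comfortably (24–35 tok/s in our ranking). 30B-class models fit in 24 GB but run at roughly 10 tok/s or less, so for more headroom at 30B+ you'll want 32 GB or more. Because Apple Silicon uses unified memory, that figure is both your system RAM and your VRAM.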
