local · offline
Local & offline autocomplete for your Mac
MacAutocomplete runs a real language model directly on your Apple Silicon. No internet, no server, no monthly bill. On a plane, in a tunnel, or with Wi-Fi switched off, it keeps finishing your sentences exactly the same — because the model lives on your Mac, not in someone's datacenter.
Flight notes — wrap the deck before we land in Chicago.
How local inference actually works
On first launch, MacAutocomplete downloads a single local model — IBM Granite 4.1 3B, quantized to 4-bit — about 2GB, once. From then on it runs through Apple's MLXframework, the same machine-learning runtime built for Apple Silicon. When you pause mid-sentence, the text around your caret is read, handed to the model on your Mac's GPU and Neural Engine, and a completion comes back as greyed ghost text right where you're typing. Nothing is bundled into a cloud request, queued behind other users, or shipped off-device. The weights sit on your disk; inference happens inside the app's own process.
Works with zero internet — airplane mode and all
Because every suggestion is generated locally, there is no connection to lose. Put your Mac in Airplane Mode, board a flight, drop into a subway tunnel, or work from a cabin with no signal — the ghost completions keep appearing and ⇥ Tabstill accepts them. Try it: turn off Wi-Fi right now and start typing. Most "AI writing" tools go dark the moment you disconnect. This one doesn't notice. It's the same on-device privacy guarantee, viewed from the other side: if nothing ever leaves your Mac, nothing ever needs a network either.
No round trip means it feels instant
Cloud writing tools have to send your text to an API, wait in line behind everyone else's requests, generate, and stream the answer back over the network. Even on a fast connection that's tens to hundreds of milliseconds of latency you can feel as you type. Local inference skips all of it. There's no DNS lookup, no TLS handshake, no server queue — just the model on your own silicon. Completions surface at the caret fast enough to keep up with your typing, and they don't stutter when your connection does.
Costs nothing to run — no subscription, ever
Once the model is on your Mac, running it is free. Every completion is computed on hardware you already own, so there are no token charges, no API bills, and no usage meter ticking in the background. There's no monthly fee to keep it working and no rate limit to throttle a heavy writing day. You pay once and own it.
| Local (MacAutocomplete) | Cloud writing tools |
|---|---|
| One-time price, then free to run | Monthly subscription + API bills |
| Works fully offline | Dead without a connection |
| No rate limits or usage meters | Throttled past a quota |
| No network round-trip latency | Round trip on every keystroke pause |
Curious how that lines up against a specific competitor? See MacAutocomplete vs. Cotypist, or read the longer case for keeping it on-device in local vs. cloud AI writing.
Frequently asked questions
Does MacAutocomplete need an internet connection?
No. After the one-time model download on first run, completions are generated entirely on your Mac. You can switch off Wi-Fi, enable Airplane Mode, or work somewhere with no signal at all, and the ghost suggestions keep appearing exactly as before.
Are there any API fees or usage limits?
None. Because the model runs on hardware you already own, there are no per-token charges, no API bills, and no rate limits. It's a one-time purchase — you can write all day without a meter running.
How big is the model and where does it run?
It's a 4-bit quantized IBM Granite 4.1 3B model, around 2GB, downloaded once on first launch. It runs through Apple's MLX framework on your Apple Silicon GPU and Neural Engine — never on a remote server.
Is running locally fast enough?
Yes — usually faster than cloud tools. There's no network round-trip, server queue, or handshake, so suggestions surface at the caret quickly enough to keep pace with your typing and don't stutter when your connection does.
get it
One model, on your Mac. Lifetime license.
Lifetime · up to 3 Macs · free updates · 14-day money-back · runs 100% on your Mac.
Lifetime · 3 devices · 14-day refund (95%).