The internet will happily talk you into a dual-RTX-5090 monster. You almost certainly don't need it. A genuinely useful local model — one that drafts emails, summarizes documents, and writes code — runs on hardware that costs less than a mid-range phone if you buy smart.

The one rule: VRAM > everything

For local LLMs, the amount of VRAM on your GPU sets the ceiling on what you can run. Raw speed matters second. A cheap card with 8 GB beats an expensive card with 6 GB for this job. So the cheapest useful LLM PC is really "the cheapest path to 8–12 GB of VRAM."

The budget tiers

Budget tierGPU pickRuns
Rock-bottomUsed RTX 3060 12 GBUp to a 14B model at Q4 — genuinely capable.
ValueRTX 5060 / 4060 8 GB7B–8B models fast, with new-card warranty.
No GPUAny modern CPU + 16 GB RAM1B–4B models on CPU only. Slow but free.
Our pick A used RTX 3060 12 GB is the budget hero of 2026. Twelve gigs of VRAM for a fraction of a new card means you can run a 14B-class reasoning model — the level where local LLMs stop feeling like a toy.

The rest of the build

Around the GPU, keep it boring: any 6-core (or better) CPU from the last few years, 16 GB of RAM minimum (32 GB if you want to offload bigger models), and a small NVMe SSD so model files load quickly. You do not need a top-tier CPU — for inference it mostly sits and waits on the GPU.

Where "cheap" starts to hurt

Two places. First, 6 GB cards: they technically work but box you into 1B–4B models and short contexts. Second, skimping on RAM: with only 8 GB of system RAM you can't offload anything, so the GPU's VRAM becomes a hard wall. Spend the extra few dollars on RAM before anything else.

See what your budget actually runs
Tell the calculator your target model and it shows the minimum GPU VRAM and system RAM — so you buy exactly enough and not a dollar more.
Open the Local AI Calculator

FAQ

What's the absolute minimum to run a local LLM?
A modern CPU with 16 GB of RAM runs small 1B–4B models on CPU alone. Add any 8 GB GPU and you jump to capable 7B–8B models.
Is a used mining GPU a good idea?
It can be great value for VRAM, but check the cooling and fan health. For inference the card runs cooler than mining did, so a healthy used card is usually fine.
AMD or Nvidia on a budget?
Nvidia still has the smoothest software path (CUDA) for local AI. AMD works well with modern runtimes but expect a little more setup.
Related guides
Can I Run a Local LLM on a Laptop? How Much VRAM for DeepSeek-R1? Q4 vs Q8 Quantization Explained

We may partner with companies or groups to affiliate hardware products based on user needs, earning a commission from qualifying purchases. Recommendations are based on VRAM requirements (weights + KV cache + overhead) and may vary by runtime. Data current as of June 2026.