PhyseaWiki How AI actually works Papers physea.ai →

Subject 03 · Builds on Architecture

Models

Frontier and local models in plain language. Families, sizes, licensing, context, pricing, and how to choose one.

22 pages across 6 topics

Frontier vs local

Rented in the cloud versus run on your machine.

Frontier vs local A frontier model lives on a vendor's servers and you rent it per request through an API. A local model is one whose weights you download and run on your own machine. The split shapes every trade-off that follows: capability, cost, and privacy.
The capability gap For everyday tasks like coding, summarizing, and answering questions over your own documents, good local models are now close to frontier ones. Frontier models keep a clear lead on the hardest reasoning, on images and audio, and on staying reliable over very long inputs.
The cost question Frontier models cost nothing to start and then charge for every request, so the bill grows with how much you use them. Local models cost a lot upfront in hardware but almost nothing per request after that. Which is cheaper depends on your volume.
Privacy and control A frontier model sends every prompt to the vendor's servers, which you have to trust with your data. A local model keeps prompts and outputs on hardware you control, which removes a whole category of exposure but makes you responsible for securing that hardware yourself.

Model families

Claude, GPT, Gemini, Llama, Qwen, and the rest.

Sizes & parameters

What 7B, 70B, and MoE actually mean.

Licensing & open weights

Open weights, open source, and what you may do.

Context & pricing

What you pay for, and how to pay less.

Choosing a model

Matching the model to the job.