Open vs. Closed AI Models
Open-weights models let you download and run them yourself; closed models are only accessible through APIs controlled by their creators.
What does "open" mean for AI models?
The AI industry uses "open" more loosely than the traditional open-source software world does. In practice, models fall on a spectrum:
- Closed/Proprietary: GPT-5.1, Claude 4.5. You can only access them through APIs. The weights, training data, and architecture details are secret.
- Open weights: Llama 3.3, Mistral. You can download the trained model and run it yourself. But the training code and data are often not released.
- Fully open: Some research models release weights, training code, and data. Rare at the frontier.
Most "open-source AI" is actually open-weights: you get the result of training, not the recipe to reproduce it.
What can you do with open-weights models?
With Llama or Mistral, you can:
- Run locally: No API calls, no usage fees, complete privacy
- Fine-tune: Adapt the model for your specific domain or task
- Modify: Remove safety filters, change behavior (with all the responsibility that implies)
- Deploy anywhere: Your servers, your cloud, your rules
- Inspect: Study how the model works, run interpretability research
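"Run locally" can be made concrete with Ollama, covered later in this piece, which serves downloaded models over a local HTTP endpoint. The sketch below assumes Ollama is running on its default port (11434) and that a model tagged `llama3.3` has already been pulled; the model name and prompt are just placeholders.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

def build_generate_request(model: str, prompt: str) -> dict:
    """Build the JSON payload for Ollama's /api/generate endpoint."""
    return {"model": model, "prompt": prompt, "stream": False}

def generate(model: str, prompt: str) -> str:
    """Send a prompt to a locally running model; nothing leaves your machine."""
    payload = json.dumps(build_generate_request(model, prompt)).encode()
    req = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With the server up, `generate("llama3.3", "Summarize open weights in one sentence.")` returns a completion without any external API call, usage fee, or data leaving the machine.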
What are the trade-offs?
Open models offer:
- Control over your data (nothing sent to external servers)
- Customization (fine-tune for your use case)
- Cost predictability (hardware costs, not per-token fees)
- Independence (no API changes, no service shutdowns)
Closed models offer:
- State-of-the-art capability (GPT-5.1, Claude 4.5 still lead)
- No hardware management
- Continuous improvements (providers update models behind the API)
- Safety infrastructure (moderation, filtering, monitoring)
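The cost-predictability trade-off comes down to simple arithmetic: metered APIs scale linearly with usage, while self-hosting is roughly flat. The sketch below uses illustrative, assumed numbers (a $2,500 GPU amortized over 36 months, 350 W draw, $0.15/kWh, $5 per million tokens); plug in your own.

```python
def api_cost(tokens_per_month: float, price_per_million: float) -> float:
    """Monthly spend on a metered API: scales linearly with usage."""
    return tokens_per_month / 1_000_000 * price_per_million

def self_host_cost(hardware_price: float, lifetime_months: int,
                   power_watts: float, kwh_price: float,
                   hours_per_month: float = 730) -> float:
    """Monthly spend on owned hardware: amortized purchase plus electricity,
    roughly flat regardless of how many tokens you generate."""
    amortized = hardware_price / lifetime_months
    electricity = power_watts / 1000 * hours_per_month * kwh_price
    return amortized + electricity

# Assumed numbers: $2,500 GPU over 36 months, 350 W, $0.15/kWh -> ~$108/month
fixed = self_host_cost(2500, 36, 350, 0.15)

# Usage level at which a $5-per-million-token API costs the same (~21.5M tokens/month)
break_even_tokens = fixed / 5 * 1_000_000
```

Below the break-even volume, the API is cheaper; above it, the hardware pays for itself. This ignores engineering time and the capability gap, which often dominate the decision in practice.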
The major open model families
- Llama (Meta): The flagship open model family. Llama 3.1 405B approaches frontier closed-model capability, and Llama 3.3 70B delivers similar quality at a fraction of the size. Permissive license for most uses.
- Mistral (Mistral AI): French company, strong models, competitive with larger Llama variants. Some models fully open, some commercial.
- Qwen (Alibaba): Strong multilingual performance, especially Chinese. Various sizes and specializations.
- Gemma (Google): Smaller models for research and development. More restricted license than Llama.
- Phi (Microsoft): Small but capable, designed to prove that smaller models can perform well.
Running open models yourself
The ecosystem for running open models has matured:
- Ollama: One-command setup for running models locally on Mac, Windows, Linux
- llama.cpp: Efficient C++ implementation that runs on consumer hardware
- vLLM: High-performance inference for server deployments
- Text Generation WebUI: Browser interface for local models
With a decent GPU (16GB+ VRAM), you can comfortably run models in the 7-13B parameter range, especially with quantization. For larger models, you need multiple GPUs or cloud instances.
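The VRAM guidance above follows from a back-of-the-envelope rule: memory is roughly parameters times bytes per weight, plus some headroom for the KV cache and activations. The 20% overhead factor below is a crude assumption, not a measured figure.

```python
def vram_gb(params_billion: float, bits_per_weight: int,
            overhead: float = 1.2) -> float:
    """Rough VRAM needed to run a model: parameters x bytes per weight,
    plus ~20% headroom for KV cache and activations (assumed)."""
    weight_bytes = params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes * overhead / 1e9

# A 13B model: ~31 GB at 16-bit precision, but ~8 GB at 4-bit quantization
fp16 = vram_gb(13, 16)  # far beyond a 16GB card
q4 = vram_gb(13, 4)     # fits comfortably in 16GB
```

This is why quantized 7-13B models fit on a single consumer GPU while a 70B+ model, even at 4 bits, forces you onto multiple GPUs or a cloud instance.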