A curated collection of landmark language models · Admission free
Collection · Museum registry v1.0
Labs, companies, and developers that appear as model makers in this museum’s master catalog and on-site exhibits (alphabetical). Each tile shows v1.0 (registry checklist) and the earliest catalog row for that maker, or an exhibit date. Only the five premium provider catalog pages we ship (Google Gemini, OpenAI, Anthropic, xAI, Arcee AI) get a completed badge – click v1.0 on those tiles to open the full-line museum page; everyone else stays registry-only until a catalog exists. Maintainers: sync with MODEL_MAKERS_CHECKLIST.md when adding rows.
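For maintainers, a minimal sketch of that sync check, assuming the checklist simply lists each maker by name in MODEL_MAKERS_CHECKLIST.md (the checklist format is an assumption; the five names are the premium providers above):

```ts
// Hypothetical consistency check: every maker tile in the registry should
// have a matching line in MODEL_MAKERS_CHECKLIST.md. The checklist schema
// (plain maker names, one per row) is assumed, not documented.
import { readFileSync } from "node:fs";

const registryMakers = ["Google Gemini", "OpenAI", "Anthropic", "xAI", "Arcee AI"];

const checklist = readFileSync("MODEL_MAKERS_CHECKLIST.md", "utf8");
const missing = registryMakers.filter((maker) => !checklist.includes(maker));

if (missing.length > 0) {
  console.warn(`Tiles missing a checklist row: ${missing.join(", ")}`);
}
```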
Top-tier frontier models commonly offered as premium picks in early 2026 – including Composer 2 as Cursor's own flagship, plus maximum tiers from Anthropic, OpenAI, xAI, Moonshot AI, Google DeepMind, Zhipu GLM-5, DeepSeek-V3.2, Xiaomi MiMo-V2-Pro, and Alibaba Qwen3-Max class lines.
Cursor's Q1 2026 flagship – the default top-tier Composer for agentic multi-file work, deep codebase understanding, and long-running coding sessions inside the Cursor IDE.
Anthropic's strongest Claude tier for the hardest reasoning, long-horizon coding, and agentic work – the flagship when latency and cost are secondary to quality. Listed as Thinking in Cursor-style menus.
OpenAI's broad flagship for multimodal tasks, tool use, and general intelligence – the default "max quality" GPT line in first-quarter 2026 pickers.
xAI's Grok flagship for this era – strong reasoning with a product focus on live data, speed, and personality-forward assistants.
Moonshot's Kimi line at K2.5 – known for very long effective context and solid coding assistance; a first-class alternative in multi-model IDEs.
Google's Gemini 3.1 Pro – the serious multimodal workhorse for complex prompts, documents, and vision-heavy workflows. Pickers often expose High and Low variants (quality vs. speed) and may badge them as new releases.
Zhipu’s flagship GLM-5 generation – a large MoE stack (≈745B-class public framing) with very long context and a product focus on coding, agents, and Chinese-infrastructure inference paths. Positioned as a top-tier alternative in global and CN IDE pickers (often listed as GLM-5 next to other premium add-ons).
The V3.2 line continues DeepSeek’s MoE frontier with a balanced daily driver and a high-compute Speciale variant for deep reasoning – plus open-weight releases and an emphasis on sparse attention and agentic tool-use training at scale.
MiMo-V2-Pro is Xiaomi’s top MiMo model for agentic workloads – a trillion-scale MoE design with massive context (up to ~1M tokens in public materials), hybrid attention, and strong coding/agent benchmarks – aimed at orchestration, planning, and long-horizon automation.
Alibaba’s Qwen3-Max family apex – the Thinking variant adds test-time scaling and multi-pass refinement for hard math, code, and tool use. Served as a proprietary cloud flagship (e.g. DashScope) alongside open/smaller Qwen tiers in global stacks.
Companion and specialist models from the same generation – Composer 1.5, Codex, Sonnet/Haiku, GPT-5.x variants, Gemini Flash tiers, and more. Snapshots: Qwen Code add-on billing row (its own card), Cursor context/comparison tables, Warp long model selector in the terminal, and other IDEs where noted. (Composer 2 lives in the flagship row above.)
IDE model menu (example snapshot)
Labels as shown in a typical Q1 2026 picker in Cursor and Google Antigravity – New matches the in-app badge; a caution marker indicates models that may show a warning in the UI.
Qwen Code – add-on relative cost (example snapshot)
The add-on row in the Qwen Code UI in Cursor (Q1 2026). Relative cost multipliers are billing weights for these models (not vendor-published specs). List A–Z.
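As a rough sketch of how such billing weights behave (the model keys and multiplier values below are placeholders, not the snapshot's actual figures): billed usage is the base request cost scaled by the model's multiplier.

```ts
// Hypothetical billing-weight math: billed credits = base cost x multiplier.
// Neither the multipliers nor the model keys are vendor-published numbers.
const baseRequestCredits = 1;

const addOnMultipliers: Record<string, number> = {
  "example-lite-model": 0.25, // cheaper than a standard request
  "example-max-model": 2.0,   // double-weight premium row
};

function billedCredits(model: string, requests: number): number {
  const multiplier = addOnMultipliers[model] ?? 1; // unlisted rows bill at 1x
  return baseRequestCredits * multiplier * requests;
}

console.log(billedCredits("example-max-model", 10)); // 20 credits
```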
Warp – long model selector (example snapshot)
Dropdown labels from a Q1 2026 Warp terminal agent session. US-hosted marks regional API endpoints. List A–Z.
Typical labels (A–Z)
Cursor – context & comparison tables (example)
Context and score figures from in-app comparison UIs; they change as products update.
Context windows (Z.AI · MiniMax · OpenAI) – example
| Model | Provider | Context (approx.) |
|---|---|---|
| GLM 4.6 | Z.AI | 200.0K |
| MiniMax M2.5 | MiniMax | 204.8K |
| MiniMax M2.1 | MiniMax | 204.8K |
| MiniMax M2.1 Lightning | MiniMax | 204.8K |
| GLM 4.7 | Z.AI | 200.0K |
| GLM 5 | Z.AI | 202.8K |
| GPT-5.3-Codex | OpenAI | 400.0K |
| GPT-5.2-Codex | OpenAI | 400.0K |
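One practical use of these figures is a pre-flight fit check. The sketch below uses the common ~4-characters-per-token heuristic (real tokenizers vary by model) together with the windows from the table above:

```ts
// Rough context-fit check using the snapshot's window sizes. The 4 chars per
// token ratio is a heuristic, and reservedForOutput is an arbitrary buffer.
const contextWindows: Record<string, number> = {
  "GLM 5": 202_800,
  "MiniMax M2.5": 204_800,
  "GPT-5.3-Codex": 400_000,
};

function fitsInContext(model: string, text: string, reservedForOutput = 4_096): boolean {
  const approxTokens = Math.ceil(text.length / 4);
  const window = contextWindows[model] ?? 0;
  return approxTokens + reservedForOutput <= window;
}

// ~500K estimated tokens will not fit even in a 400K window:
console.log(fitsInContext("GPT-5.3-Codex", "x".repeat(2_000_000))); // false
```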
In-app comparison scores (example)
| Model | Score | Org | Context |
|---|---|---|---|
| Claude Opus 4.6 | 10 | Anthropic | 1.0M |
| Claude Opus 4.5 | 10 | Anthropic | 200.0K |
| Claude Sonnet 4.6 | 10 | Anthropic | 1.0M |
| Claude Sonnet 4.5 | 10 | Anthropic | 1.0M |
| GPT-5.4 | 10 | OpenAI | 1.0M |
| GPT-5.4 Pro | 10 | OpenAI | 1.0M |
| GPT-5.2 | 10 | OpenAI | 400.0K |
| GPT-5.1 | 10 | OpenAI | 400.0K |
| GPT-5.1-Codex-Max | 10 | OpenAI | 400.0K |
| Gemini 3 Pro Preview | 10 | Google | 1.0M |
| Gemini 3.1 Pro Preview | 10 | Google | 1.0M |
| GPT-5 mini | 9.9 | OpenAI | 400.0K |
| Claude Opus 4.1 | 9.8 | Anthropic | 200.0K |
| GPT-5 | 9.8 | OpenAI | 400.0K |
| Gemini 2.5 Pro (partial) | 9.6 | Google | 1.0M |
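If you want to reproduce the ordering such a UI implies, a small sketch (rows transcribed from the snapshot above, so illustrative data rather than live specs) sorts by score and breaks ties on context size:

```ts
// Rank comparison rows: higher score first, larger context breaks ties.
type Row = { model: string; score: number; contextTokens: number };

const rows: Row[] = [
  { model: "Claude Opus 4.6", score: 10, contextTokens: 1_000_000 },
  { model: "GPT-5.2", score: 10, contextTokens: 400_000 },
  { model: "GPT-5 mini", score: 9.9, contextTokens: 400_000 },
  { model: "Claude Opus 4.1", score: 9.8, contextTokens: 200_000 },
];

const ranked = [...rows].sort(
  (a, b) => b.score - a.score || b.contextTokens - a.contextTokens,
);

console.log(ranked.map((r) => r.model).join(" > "));
// Claude Opus 4.6 > GPT-5.2 > GPT-5 mini > Claude Opus 4.1
```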
OpenCode – Select model (example snapshot)
From the OpenCode desktop app’s model picker (Jan 2026). Big Pickle is OpenCode’s free Zen-tier coding model (often the default). Free = bundled no-cost rows; Recommended matches in-app hints for add-on providers. Official products: OpenCode Zen (curated agent models) and OpenCode Go (low-cost subscription).
Free models provided by OpenCode
Add more models from popular providers
Prior-generation Composer still common in menus – balanced speed and reliability.
OpenAI Codex family tuned for IDE workflows – strong inline and task completion.
Fast, capable Claude tier – the usual sweet spot for everyday coding and agents. The same line appears as Thinking in Cursor pickers (extended reasoning budget).
Open-weight GPT-OSS family at 120B parameters – Medium denotes the reasoning/compute tier in the menu, not the parameter count.
Earlier Opus generation – still listed for compatibility and cost-sensitive depth work.
GPT-5 series model – strong generalist when 5.4 is overkill or unavailable.
Smaller 5.4 variant – faster turns and lower cost with shared family behavior.
Nano tier for lint-level fixes, tiny edits, and high-volume completions.
Latency-first Claude – great for quick refactors, summaries, and chat throughput.
Spark variant – optimized for snappy suggestions and lighter workloads.
Previous Sonnet – familiar behavior for teams pinning older stacks.
Earlier Codex generation – still appears in long-running project defaults.
5.1 family at the Max tier – heavier coding sessions before 5.2/5.3.
GPT-5.1 checkpoint – general assistant quality with broad tool compatibility.
Gemini 3 family speed tier – high throughput for chat and multimodal bursts.
Compact Codex 5.1 – ideal for tab completion and small-scope edits.
Sonnet 4 base – predecessor to the 4.5/4.6 feature tiers.
Budget GPT-5-class model – everyday tasks without full 5.4 spend.
Widely deployed fast Gemini – often the default "Flash" in multi-model lists.
HKChat (港話通) – LLM-based assistant for Hong Kong citizens: local life Q&A, bilingual help, and preliminary legal orientation. Not a substitute for professional advice.
Public-facing chatbot positioned as a localized assistant: Cantonese, English, and Mandarin; Hong Kong–specific knowledge; Android, iOS, Windows, and HarmonyOS clients. Described in sources as built on HK-GNN V1 (Hong Kong Generative AI R&D Center) with Hong Kong data and values. Official entry points: hkchat.org, hkchat.org/ai-service, web app chat.hkchat.app.
The latest model from Anthropic โ and the one powering this very page right now. Claude Sonnet 4.6 (Thinking) is a fast, highly capable reasoning model with best-in-class coding, strong vision understanding, and agentic task execution. Built with a deep emphasis on helpfulness, harmlessness, and honesty. If you're reading this through a Cursor chat, hi.
Introduced "extended thinking" – a visible chain-of-thought reasoning mode that lets Claude work through hard problems step by step before answering. Set new coding benchmarks at launch.
Trained on a cluster of 200,000 GPUs. Notable for "DeepSearch" – a deep research mode that browses the web iteratively. Integrated with X (Twitter) for real-time context.
Achieved near-human scores on ARC-AGI, a benchmark designed to resist AI brute-force. The strongest reasoning model OpenAI has released, with compute-scaled thinking at inference time.
Open-source reasoning model that matched o1 on benchmarks at a fraction of the training cost. Caused significant market disruption at launch. Freely downloadable and runnable locally.
Google's fast flagship for 2025. Native multimodal – handles text, images, audio, and video. Designed for agentic use cases and high-throughput applications with a generous free tier.
"Omni" – the first model to handle text, image, and audio natively in one architecture. Introduced real-time voice conversation with human-like response speeds. Became ChatGPT's default model.
OpenAI's first publicly released reasoning model. Introduced the paradigm of spending more compute at inference time to "think" before responding – enabling PhD-level performance on math and science.
Topped coding benchmarks for months at release. Introduced "computer use" – the ability to control a desktop GUI like a human would. Became the go-to model for agentic coding workflows.
A mixture-of-experts model with 671B total parameters (37B active). Trained for $5.6M – shocking efficiency compared to Western frontier models. Matched GPT-4o on benchmarks, fully open source.
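The sparse-MoE arithmetic is worth making explicit: with 671B total and 37B active parameters, only a small fraction of the network runs per token.

```ts
// Worked numbers for the mixture-of-experts card above.
const totalParams = 671e9;
const activeParams = 37e9;
const activeFraction = activeParams / totalParams;
console.log(`${(activeFraction * 100).toFixed(1)}% of parameters active per token`); // ~5.5%
```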
Meta's openly released 70B model that outperformed the 405B Llama 3.1 on several benchmarks. A milestone for on-device and self-hosted AI – runs on consumer hardware with quantization.
Introduced a breakthrough 1M token context window – enough to hold entire codebases, books, or hours of video. Changed what "long context" meant for the industry.
Dense index of publicly notable LLM releases and open-weight checkpoints. It does not cover every private internal checkpoint; it follows the same broad canon as Wikipedia’s List of large language models, with extra gap-fill rows. Figures are often reported ranges or estimates, not audited specs.
| Model | Released | Developer | Parameters | Training data | Compute / cost | License | Notes |
|---|---|---|---|---|---|---|---|
| GPT-1 | Jun 2018 | OpenAI | 0.117B | - | - | MIT | First GPT; decoder-only transformer. |
| BERT | Oct 2018 | Google | 0.34B | 3.3B words | - | Apache-2.0 | Encoder-only; highly influential. |
| GPT-2 | Feb 2019 | OpenAI | 1.5B | ~10B tokens | - | MIT | Scaled LM generation. |
| T5 | Oct 2019 | Google | 11B | 34B tokens | - | Apache-2.0 | Text-to-text transfer transformer. |
| XLNet | Jun 2019 | Google | 0.34B | 33B words | - | Apache-2.0 | Permutation LM. |
| GPT-3 | May 2020 | OpenAI | 175B | 300B tokens | - | Proprietary | Few-shot learning at scale. |
| GPT-Neo | Mar 2021 | EleutherAI | 2.7B | 825 GiB | - | MIT | Open GPT-3-class alternative. |
| GPT-J | Jun 2021 | EleutherAI | 6B | The Pile | - | Apache-2.0 | Open autoregressive model. |
| Megatron-Turing NLG | Oct 2021 | Microsoft / NVIDIA | 530B | 338.6B tokens | - | Unreleased | Large-scale training on Selene. |
| Ernie 3.0 Titan | Dec 2021 | Baidu | 260B | 4 TB | - | Proprietary | Chinese LLM; Ernie Bot lineage. |
| Claude | Dec 2021 | Anthropic | 52B | 400B tokens | - | Proprietary | RLHF-style alignment focus. |
| GLaM | Dec 2021 | Google | 1200B MoE | 1.6T tokens | - | Proprietary | Sparse MoE generalist. |
| Gopher | Dec 2021 | Google DeepMind | 280B | 300B tokens | - | Proprietary | Later led to Chinchilla scaling insights. |
| LaMDA | Jan 2022 | Google | 137B | 1.56T words | - | Proprietary | Dialog-specialized. |
| GPT-NeoX | Feb 2022 | EleutherAI | 20B | 825 GiB | - | Apache-2.0 | Megatron-based. |
| Chinchilla | Mar 2022 | Google DeepMind | 70B | 1.4T tokens | - | Proprietary | Compute-optimal scaling law. |
| PaLM | Apr 2022 | Google | 540B | 768B tokens | - | Proprietary | Pathways large model. |
| OPT | May 2022 | Meta | 175B | 180B tokens | - | Non-commercial | Open replication effort + logbook. |
| YaLM 100B | Jun 2022 | Yandex | 100B | 1.7 TB | - | Apache-2.0 | ENโRU bilingual. |
| Minerva | Jun 2022 | Google | 540B | 38.5B tokens (math) | - | Proprietary | STEM reasoning; from PaLM. |
| BLOOM | Jul 2022 | BigScience / HF | 175B | 350B tokens | - | RAIL | Multilingual open collaboration. |
| Galactica | Nov 2022 | Meta | 120B | 106B tokens | - | CC-BY-NC-4.0 | Scientific corpora. |
| AlexaTM | Nov 2022 | Amazon | 20B | 1.3T tokens | - | Proprietary | Seq2seq architecture. |
| Llama | Feb 2023 | Meta AI | 65B | 1.4T tokens | - | Research-only | Open-weights wave. |
| GPT-4 | Mar 2023 | OpenAI | Unknown | Unknown | - | Proprietary | Multimodal flagship era. |
| Cerebras-GPT | Mar 2023 | Cerebras | 13B | - | - | Apache-2.0 | Chinchilla-optimal training. |
| Falcon | Mar 2023 | TII | 40B | 1T tokens | - | Apache-2.0 | RefinedWeb + curated. |
| BloombergGPT | Mar 2023 | Bloomberg | 50B | 708B mixed | - | Unreleased | Finance-tuned. |
| PanGu-Σ | Mar 2023 | Huawei | 1085B | 329B tokens | - | Proprietary | Very large dense/MoE stack. |
| OpenAssistant | Mar 2023 | LAION | 17B | 1.5T tokens | - | Apache-2.0 | Crowdsourced RLHF data. |
| Jurassic-2 | Mar 2023 | AI21 Labs | Unknown | - | - | Proprietary | API-first. |
| PaLM 2 | May 2023 | Google | 340B | 3.6T tokens | - | Proprietary | Bard / workspace era. |
| YandexGPT | May 2023 | Yandex | Unknown | - | - | Proprietary | Alice assistant. |
| Llama 2 | Jul 2023 | Meta AI | 70B | 2T tokens | - | Llama 2 license | Widespread finetunes. |
| Claude 2 | Jul 2023 | Anthropic | Unknown | - | - | Proprietary | Long-context Claude chat. |
| Granite 13B | Jul 2023 | IBM | 13B | - | - | Proprietary | watsonx.ai stack. |
| Mistral 7B | Sep 2023 | Mistral AI | 7.3B | - | - | Apache-2.0 | Efficient open weights. |
| Claude 2.1 | Nov 2023 | Anthropic | Unknown | - | - | Proprietary | ~200K token context. |
| Grok 1 | Nov 2023 | xAI | 314B | - | - | Apache-2.0 | Open-weight release; X integration. |
| Gemini 1.0 | Dec 2023 | Google DeepMind | Unknown | - | - | Proprietary | Multimodal family. |
| Mixtral 8x7B | Dec 2023 | Mistral AI | 46.7B MoE | - | - | Apache-2.0 | MoE; strong benchmarks. |
| DeepSeek-LLM | Nov 2023 | DeepSeek | 67B | 2T tokens | - | DeepSeek License | EN + Chinese. |
| Phi-2 | Dec 2023 | Microsoft | 2.7B | 1.4T tokens | - | MIT | Textbook-quality data. |
| Gemini 1.5 | Feb 2024 | Google DeepMind | Unknown | - | - | Proprietary | 1M+ token context. |
| Gemini Ultra | Feb 2024 | Google DeepMind | Unknown | - | - | Proprietary | Benchmark-focused tier. |
| Gemma | Feb 2024 | Google DeepMind | 7B | 6T tokens | - | Gemma terms | Open-ish small models. |
| OLMo | Feb 2024 | Allen AI | 7B | 2T tokens | - | Apache-2.0 | Fully open pipeline. |
| Claude 3 | Mar 2024 | Anthropic | Unknown | - | - | Proprietary | Haiku / Sonnet / Opus. |
| DBRX | Mar 2024 | Databricks | 136B | 12T tokens | - | DBRX license | MoE; Mosaic training. |
| Mixtral 8x22B | Apr 2024 | Mistral AI | 141B MoE | - | - | Apache-2.0 | Larger MoE. |
| Phi-3 | Apr 2024 | Microsoft | 14B | 4.8T tokens | - | MIT | SLM marketing wave. |
| Qwen2 | Jun 2024 | Alibaba | 72B | 3T tokens | - | Qwen license | Multilingual. |
| DeepSeek-V2 | Jun 2024 | DeepSeek | 236B MoE | 8.1T tokens | - | DeepSeek License | Economic training. |
| Nemotron-4 | Jun 2024 | NVIDIA | 340B | 9T tokens | - | NVIDIA license | H100 cluster training. |
| Claude 3.5 | Jun 2024 | Anthropic | Unknown | - | - | Proprietary | Sonnet-led coding surge. |
| Llama 3.1 | Jul 2024 | Meta AI | 405B | 15.6T tokens | - | Llama 3 license | 405B flagship open-ish. |
| Grok-2 | Aug 2024 | xAI | Unknown | - | - | xAI license | Later Grok 2.5 source-available. |
| OpenAI o1 | Sep 2024 | OpenAI | Unknown | - | - | Proprietary | Explicit reasoning model. |
| Mistral Large | Nov 2024 | Mistral AI | 123B | - | - | Mistral Research | API flagship. |
| Pixtral | Nov 2024 | Mistral AI | 123B | - | - | Mistral Research | Multimodal. |
| OLMo 2 | Nov 2024 | Allen AI | 32B | 6.6T tokens | - | Apache-2.0 | Open research LM. |
| Phi-4 | Dec 2024 | Microsoft | 14B | 9.8T tokens | - | MIT | SLM continued. |
| DeepSeek-V3 | Dec 2024 | DeepSeek | 671B MoE | 14.8T tokens | - | MIT | Cost-shock open weights. |
| Amazon Nova | Dec 2024 | Amazon | Unknown | - | - | Proprietary | Micro / Lite / Pro. |
| DeepSeek-R1 | Jan 2025 | DeepSeek | 671B | RL only | - | MIT | Reasoning from base. |
| Qwen2.5 | Jan 2025 | Alibaba | 72B | 18T tokens | - | Qwen license | Dense + MoE lineup. |
| MiniMax-Text-01 | Jan 2025 | MiniMax | 456B | 4.7T tokens | - | MiniMax license | Long-context focus. |
| Gemini 2.0 | Feb 2025 | Google DeepMind | Unknown | - | - | Proprietary | Flash / Flash-Lite / Pro. |
| Claude 3.7 | Feb 2025 | Anthropic | Unknown | - | - | Proprietary | Sonnet + extended thinking. |
| GPT-4.5 | Feb 2025 | OpenAI | Unknown | - | - | Proprietary | Largest non-reasoning GPT then. |
| Grok 3 | Feb 2025 | xAI | Unknown | - | - | Proprietary | Massive compute claims. |
| Gemini 2.5 | Mar 2025 | Google DeepMind | Unknown | - | - | Proprietary | Flash / Flash-Lite / Pro. |
| Llama 4 | Apr 2025 | Meta AI | 400B | 40T tokens | - | Llama 4 license | Multimodal natively. |
| OpenAI o3 / o4-mini | Apr 2025 | OpenAI | Unknown | - | - | Proprietary | Reasoning stack. |
| Qwen3 | Apr 2025 | Alibaba | 235B | 36T tokens | - | Apache-2.0 | Many sizes down to 0.6B. |
| Claude 4 | May 2025 | Anthropic | Unknown | - | - | Proprietary | Sonnet + Opus refresh. |
| Sarvam-M | May 2025 | Sarvam AI | 24B | - | - | Apache-2.0 | India-focused reasoning. |
| Grok 4 | Jul 2025 | xAI | Unknown | - | - | Proprietary | Frontier Grok line. |
| Param-1 | Jul 2025 | BharatGen | 2.9B | 5T tokens | - | Unknown | Indic languages. |
| GLM-4.5 | Jul 2025 | Zhipu AI | 355B MoE | 22T tokens | - | MIT | 355B / 106B sizes. |
| GPT-OSS | Aug 2025 | OpenAI | 117B | - | - | Apache-2.0 | 20B + 120B open weights. |
| Claude 4.1 | Aug 2025 | Anthropic | Unknown | - | - | Proprietary | Opus refresh. |
| GPT-5 | Aug 2025 | OpenAI | Unknown | - | - | Proprietary | Mini / nano / full family. |
| DeepSeek-V3.1 | Aug 2025 | DeepSeek | 671B | 15.6T+ tokens | - | MIT | Hybrid thinking modes. |
| Apertus | Sep 2025 | ETH / EPFL | 70B | 15T tokens | - | Apache-2.0 | EU AI Act positioning. |
| Claude Sonnet 4.5 | Sep 2025 | Anthropic | Unknown | - | - | Proprietary | Coding + agents. |
| DeepSeek-V3.2-Exp | Sep 2025 | DeepSeek | 685B | - | - | MIT | DSA sparse attention. |
| GLM-4.6 | Sep 2025 | Zhipu AI | 357B | - | - | Apache-2.0 | Open flagship coding. |
| Gemini 3 | Nov 2025 | Google DeepMind | Unknown | - | - | Proprietary | Deep Think / Pro tiers. |
| Olmo 3 | Nov 2025 | Allen AI | 32B | 5.9T tokens | - | Apache-2.0 | 7B + 32B reasoning. |
| Claude Opus 4.5 | Nov 2025 | Anthropic | Unknown | - | - | Proprietary | Largest Claude then. |
| GPT-5.2 | Dec 2025 | OpenAI | Unknown | - | - | Proprietary | Reasoning + pro workloads. |
| GLM-4.7 | Dec 2025 | Zhipu AI | 355B MoE | - | - | Apache-2.0 | MoE SOTA coding claims. |
| Qwen3-Max-Thinking | Jan 2026 | Alibaba | Unknown | - | - | Proprietary | Adaptive tool use. |
| Kimi K2.5 | Jan 2026 | Moonshot AI | 1040B MoE | 15T tokens | - | Modified MIT | 32B active; multimodal. |
| Claude Opus 4.6 | Feb 2026 | Anthropic | Unknown | - | - | Proprietary | Frontier Claude. |
| GPT-5.3-Codex | Feb 2026 | OpenAI | Unknown | - | - | Proprietary | Agentic coding line. |
| GPT-5.2-Codex | Feb 2026 | OpenAI | Unknown | - | - | Proprietary | Codex family; large context in tools. |
| GLM-5 | Feb 2026 | Zhipu AI | 754B | - | - | MIT | DSA; 200K context. |
| Qwen-Coder-Qoder-1.0 | 2026 | Alibaba | Unknown | - | - | Proprietary | IDE add-on label; Qwen coding line. |
| Qwen3.5-Plus | 2026 | Alibaba | Unknown | - | - | Proprietary | Plus tier in model pickers. |
| MiniMax M2.1 | 2025 | MiniMax | Unknown | - | - | Proprietary | M2 API family; ~205K context. |
| MiniMax M2.1 Lightning | 2025 | MiniMax | Unknown | - | - | Proprietary | Speed-focused M2.1 variant. |
| MiniMax M2.5 | 2025 | MiniMax | Unknown | - | - | Proprietary | Long-context M2 line; ~205K. |
| MiniMax-M2.7 | 2026 | MiniMax | Unknown | - | - | Proprietary | Add-on tier in Cursor-style menus. |
| Param-2 | Feb 2026 | BharatGen | 17B MoE | ~22T tokens | - | Unknown | More Indic langs. |
| Sarvam-1 | Feb 2026 | Sarvam AI | 105B MoE | ~12T tokens | - | Apache-2.0 | India foundation model. |
| GPT-5.4 | Mar 2026 | OpenAI | Unknown | - | - | Proprietary | Thinking + Pro variants. |
| GPT-5.4 Pro | Mar 2026 | OpenAI | Unknown | - | - | Proprietary | Pro tier; ~1M context in pickers. |
| GPT-5.1 Codex | 2026 | OpenAI | Unknown | - | - | Proprietary | Codex line in IDE menus. |
| GPT-5.1 Codex Max | 2026 | OpenAI | Unknown | - | - | Proprietary | Top Codex tier in pickers. |
| Grok 4.20 | 2026 | xAI | Unknown | - | - | Proprietary | Picker label; see Grok 4 article. |
| Gemini 3.1 Pro | 2026 | Google DeepMind | Unknown | - | - | Proprietary | Pro tier naming in tools. |
| Gemini 3 Pro Preview | 2026 | Google DeepMind | Unknown | - | - | Proprietary | Preview label in model lists. |
| Gemini 3.1 Pro Preview | 2026 | Google DeepMind | Unknown | - | - | Proprietary | Preview label in model lists. |
| Gemini 2.5 Pro (partial) | 2025 | Google DeepMind | Unknown | - | - | Proprietary | Pro tier; partial availability. |
| Claude Opus 4.1 | Aug 2025 | Anthropic | Unknown | - | - | Proprietary | Opus tier naming in pickers. |
| Composer 2 | 2026 | Cursor / Anysphere | - | - | - | Proprietary | IDE agent flagship (product). |
| Codex | Aug 2021 | OpenAI | 12B | - | - | Proprietary | Code fine-tune of GPT-3 lineage. |
| InstructGPT | Jan 2022 | OpenAI | 175B | - | - | Proprietary | RLHF alignment showcase. |
| GPT-4 Turbo | Nov 2023 | OpenAI | Unknown | - | - | Proprietary | 128K context; cheaper GPT-4 class. |
| YandexGPT 2 | Sep 2023 | Yandex | Unknown | - | - | Proprietary | Alice assistant update. |
| Llama 3 | Apr 2024 | Meta AI | 8B-70B | 15T tokens | - | Llama 3 license | Dense family before 3.1. |
| GPT-4o | May 2024 | OpenAI | Unknown | - | - | Proprietary | Omni multimodal flagship. |
| YandexGPT 3 Pro | Mar 2024 | Yandex | Unknown | - | - | Proprietary | Alice chatbot. |
| YandexGPT 3 Lite | May 2024 | Yandex | Unknown | - | - | Proprietary | Alice chatbot. |
| Fugaku-LLM | May 2024 | Fujitsu / Titech et al. | 13B | 380B tokens | - | Fugaku terms | CPU-trained on Fugaku. |
| Chameleon | May 2024 | Meta AI | 34B | 4.4T tokens | - | Non-commercial | Early-token fusion multimodal. |
| o1-mini | Sep 2024 | OpenAI | Unknown | - | - | Proprietary | Smaller reasoning model. |
| YandexGPT 4 Lite/Pro | Oct 2024 | Yandex | Unknown | - | - | Proprietary | Alice chatbot. |
| Llama 3.2 | Sep 2024 | Meta AI | 1B-90B | - | - | Llama 3.2 license | Vision + text stack. |
| Llama 3.3 70B | Dec 2024 | Meta AI | 70B | - | - | Llama 3.3 license | Strong 70B after 405B 3.1. |
| DeepSeek-V3-0324 | Mar 2025 | DeepSeek | 671B | 14.8T+ ext. | - | MIT | V3 refresh checkpoint. |
| YandexGPT 5 Lite Pretrain/Pro | Feb 2025 | Yandex | Unknown | - | - | Proprietary | Alice Neural Network. |
| YandexGPT 5 Lite Instruct | Mar 2025 | Yandex | Unknown | - | - | Proprietary | Alice Neural Network. |
| Gemini 2.5 Flash | Mar 2025 | Google DeepMind | Unknown | - | - | Proprietary | Fast tier in 2.5 family. |
| YandexGPT 5.1 Pro | Aug 2025 | Yandex | Unknown | - | - | Proprietary | Alice Neural Network. |
| Alice AI LLM 1.0 | Oct 2025 | Yandex | Unknown | - | - | Proprietary | Alice AI chatbot. |
| Claude Haiku 4.5 | Oct 2025 | Anthropic | Unknown | - | - | Proprietary | Fast Claude tier (companion lineup). |
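To slice this catalog programmatically, a sketch (assuming the 8-column markdown layout above; the license allowlist is a deliberate simplification of what counts as open):

```ts
// Extract open-weight rows from a markdown table shaped like the catalog.
// Cell layout assumed: | Model | Released | Developer | Parameters |
// Training data | Compute / cost | License | Notes |
const openLicenses = new Set(["MIT", "Apache-2.0", "RAIL"]);

function openWeightModels(markdownTable: string): string[] {
  return markdownTable
    .split("\n")
    .filter((line) => line.trim().startsWith("|"))
    .map((line) => line.split("|").map((cell) => cell.trim()))
    .filter((cells) => cells.length >= 9 && openLicenses.has(cells[7]))
    .map((cells) => cells[1]); // Model column
}
```

Header and separator rows fall out automatically because their license cell never matches the allowlist; named licenses such as "Llama 3 license" are intentionally excluded.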