๐ŸŽฎ v1.10.1

๐Ÿ›๏ธ AI Model Museum

A curated collection of landmark language models · Admission free

Welcome. This museum documents notable AI language models โ€” what they introduced, who built them, and why they mattered. Below you'll find a Q1 2026 roster aligned with current IDE model pickers (Cursor-class tooling), a featured spotlight, historical galleries by year, and a searchable master catalog table (parameters, training data, license, notes). Wikipedia & reference links for flagship models live on the Informational Links page. No flash photography. Don't touch the weights.

Collection Museum registry v1.0

Model makers checklist

Labs, companies, and developers that appear as model makers in this museum’s master catalog and on-site exhibits (alphabetical). Each tile shows v1.0 (registry checklist) and the earliest catalog row for that maker, or an exhibit date. Only the five premium provider catalog pages we ship (Google Gemini, OpenAI, Anthropic, xAI, Arcee AI) get a completed badge โ€” click v1.0 on those tiles to open the full-line museum page; everyone else stays registry-only until a catalog exists. Maintainers: sync with MODEL_MAKERS_CHECKLIST.md when adding rows.

Premium flagship  ยท  Q1 2026

Top-tier frontier models commonly offered as premium picks in early 2026 โ€” including Composer 2 as Cursor's own flagship, plus maximum tiers from Anthropic, OpenAI, xAI, Moonshot AI, Google DeepMind, Zhipu GLM-5, DeepSeek-V3.2, Xiaomi MiMo-V2-Pro, and Alibaba Qwen3-Max class lines.

Qwen Code, Cursor, Warp & frontier  ยท  Q1 2026

Companion and specialist models from the same generation โ€” Composer 1.5, Codex, Sonnet/Haiku, GPT-5.x variants, Gemini Flash tiers, and more. Snapshots: Qwen Code add-on billing row (its own card), Cursor context/comparison tables, Warp long model selector in the terminal, and other IDEs where noted. (Composer 2 lives in the flagship row above.)

IDE model menu (example snapshot)

Labels as shown in a typical Q1 2026 picker in Cursor and Google Antigravity โ€” New matches the in-app badge; indicates models that may show a caution in the UI.

  • Gemini 3.1 Pro (High) New
  • Gemini 3.1 Pro (Low) New
  • Gemini 3 Flash
  • Claude Sonnet 4.6 (Thinking)
  • Claude Opus 4.6 (Thinking)
  • GPT-OSS 120B (Medium)

Qwen Code โ€” add-on relative cost (example snapshot)

The add-on row in the Qwen Code UI in Cursor (Q1 2026). Relative cost multipliers are billing weights for these models (not vendor-published specs). List Aโ€“Z.

  • GLM-5 0.5ร—
  • Kimi-K2.5 0.3ร—
  • MiniMax-M2.7 0.2ร—
  • Qwen-Coder-Qoder-1.0 0.2ร—
  • Qwen3.5-Plus 0.2ร—

Warp โ€” long model selector (example snapshot)

Dropdown labels from a Q1 2026 Warp terminal agent session. US-hosted marks regional API endpoints. List Aโ€“Z.

Typical labels (Aโ€“Z)

Cursor โ€” context & comparison tables (example)

Context and score figures from in-app comparison UIs; they change as products update.

Context windows (Z.AI ยท MiniMax ยท OpenAI) โ€” example

Model Provider Context (approx.)
GLM 4.6 Z.AI 200.0K
MiniMax M2.5 MiniMax 204.8K
MiniMax M2.1 MiniMax 204.8K
MiniMax M2.1 Lightning MiniMax 204.8K
GLM 4.7 Z.AI 200.0K
GLM 5 Z.AI 202.8K
GPT-5.3-Codex OpenAI 400.0K
GPT-5.2-Codex OpenAI 400.0K

In-app comparison scores (example)

Model Score Org Context
Claude Opus 4.6 10 Anthropic 1.0M
Claude Opus 4.5 10 Anthropic 200.0K
Claude Sonnet 4.6 10 Anthropic 1.0M
Claude Sonnet 4.5 10 Anthropic 1.0M
GPT-5.4 10 OpenAI 1.0M
GPT-5.4 Pro 10 OpenAI 1.0M
GPT-5.2 10 OpenAI 400.0K
GPT-5.1 10 OpenAI 400.0K
GPT-5.1-Codex-Max 10 OpenAI 400.0K
Gemini 3 Pro Preview 10 Google 1.0M
Gemini 3.1 Pro Preview 10 Google 1.0M
GPT-5 mini 9.9 OpenAI 400.0K
Claude Opus 4.1 9.8 Anthropic 200.0K
GPT-5 9.8 OpenAI 400.0K
Gemini 2.5 Pro (partial) 9.6 Google 1.0M

OpenCode โ€” Select model (example snapshot)

From the OpenCode desktop appโ€™s model picker (Jan 2026). Big Pickle is OpenCodeโ€™s free Zen-tier coding model (often the default). Free = bundled no-cost rows; Recommended matches in-app hints for add-on providers. Official products: OpenCode Zen (curated agent models) and OpenCode Go (low-cost subscription).

Free models provided by OpenCode

Add more models from popular providers

Regional assistant  ยท  Hong Kong

HKChat (ๆธฏ่ฉฑ้€š) โ€” LLM-based assistant for Hong Kong citizens: local life Q&A, bilingual help, and preliminary legal orientation. Not a substitute for professional advice.

Featured Exhibit
Recent Acquisitions  ยท  2025
Established Collection  ยท  2024

Master catalog

Dense index of publicly notable LLM releases and open-weight checkpoints. This is not every private internal checkpoint; it follows the same broad canon as Wikipedia’s List of large language models, with extra gap-fill rows. Figures are often reported ranges or estimates, not audited specs.

Model Released Developer Parameters Training data Compute / cost License Notes
GPT-1 Jun 2018 OpenAI 0.117B - - MIT First GPT; decoder-only transformer.
BERT Oct 2018 Google 0.34B 3.3B words - Apache-2.0 Encoder-only; highly influential.
GPT-2 Feb 2019 OpenAI 1.5B ~10B tokens - MIT Scaled LM generation.
T5 Oct 2019 Google 11B 34B tokens - Apache-2.0 Text-to-text transfer transformer.
XLNet Jun 2019 Google 0.34B 33B words - Apache-2.0 Permutation LM.
GPT-3 May 2020 OpenAI 175B 300B tokens - Proprietary Few-shot learning at scale.
GPT-Neo Mar 2021 EleutherAI 2.7B 825 GiB - MIT Open GPT-3-class alternative.
GPT-J Jun 2021 EleutherAI 6B The Pile - Apache-2.0 Open autoregressive model.
Megatron-Turing NLG Oct 2021 Microsoft / NVIDIA 530B 338.6B tokens - Unreleased Large-scale training on Selene.
Ernie 3.0 Titan Dec 2021 Baidu 260B 4 TB - Proprietary Chinese LLM; Ernie Bot lineage.
Claude Dec 2021 Anthropic 52B 400B tokens - Proprietary RLHF-style alignment focus.
GLaM Dec 2021 Google 1200B MoE 1.6T tokens - Proprietary Sparse MoE generalist.
Gopher Dec 2021 Google DeepMind 280B 300B tokens - Proprietary Later led to Chinchilla scaling insights.
LaMDA Jan 2022 Google 137B 1.56T words - Proprietary Dialog-specialized.
GPT-NeoX Feb 2022 EleutherAI 20B 825 GiB - Apache-2.0 Megatron-based.
Chinchilla Mar 2022 Google DeepMind 70B 1.4T tokens - Proprietary Compute-optimal scaling law.
PaLM Apr 2022 Google 540B 768B tokens - Proprietary Pathways large model.
OPT May 2022 Meta 175B 180B tokens - Non-commercial Open replication effort + logbook.
YaLM 100B Jun 2022 Yandex 100B 1.7 TB - Apache-2.0 ENโ€“RU bilingual.
Minerva Jun 2022 Google 540B 38.5B (math) - Proprietary STEM reasoning; from PaLM.
BLOOM Jul 2022 BigScience / HF 175B 350B tokens - RAIL Multilingual open collaboration.
Galactica Nov 2022 Meta 120B 106B tokens - CC-BY-NC-4.0 Scientific corpora.
AlexaTM Nov 2022 Amazon 20B 1.3T tokens - Proprietary Seq2seq architecture.
Llama Feb 2023 Meta AI 65B 1.4T tokens - Research-only Open-weights wave.
GPT-4 Mar 2023 OpenAI Unknown Unknown - Proprietary Multimodal flagship era.
Cerebras-GPT Mar 2023 Cerebras 13B - - Apache-2.0 Chinchilla-optimal training.
Falcon Mar 2023 TII 40B 1T tokens - Apache-2.0 RefinedWeb + curated.
BloombergGPT Mar 2023 Bloomberg 50B 708B mixed - Unreleased Finance-tuned.
PanGu-ฮฃ Mar 2023 Huawei 1085B 329B tokens - Proprietary Very large dense/MoE stack.
OpenAssistant Mar 2023 LAION 17B 1.5T tokens - Apache-2.0 Crowdsourced RLHF data.
Jurassic-2 Mar 2023 AI21 Labs Unknown - - Proprietary API-first.
PaLM 2 May 2023 Google 340B 3.6T tokens - Proprietary Bard / workspace era.
YandexGPT May 2023 Yandex Unknown - - Proprietary Alice assistant.
Llama 2 Jul 2023 Meta AI 70B 2T tokens - Llama 2 license Widespread finetunes.
Claude 2 Jul 2023 Anthropic Unknown - - Proprietary Long-context Claude chat.
Granite 13B Jul 2023 IBM 13B - - Proprietary watsonx.ai stack.
Mistral 7B Sep 2023 Mistral AI 7.3B - - Apache-2.0 Efficient open weights.
Claude 2.1 Nov 2023 Anthropic Unknown - - Proprietary ~200K token context.
Grok 1 Nov 2023 xAI 314B - - Apache-2.0 Open-weight release; X integration.
Gemini 1.0 Dec 2023 Google DeepMind Unknown - - Proprietary Multimodal family.
Mistral 8x7B Dec 2023 Mistral AI 46.7B MoE - - Apache-2.0 MoE; strong benchmarks.
DeepSeek-LLM Nov 2023 DeepSeek 67B 2T tokens - DeepSeek License EN + Chinese.
Phi-2 Dec 2023 Microsoft 2.7B 1.4T tokens - MIT Textbook-quality data.
Gemini 1.5 Feb 2024 Google DeepMind Unknown - - Proprietary 1M+ token context.
Gemini Ultra Feb 2024 Google DeepMind Unknown - - Proprietary Benchmark-focused tier.
Gemma Feb 2024 Google DeepMind 7B 6T tokens - Gemma terms Open-ish small models.
OLMo Feb 2024 Allen AI 7B 2T tokens - Apache-2.0 Fully open pipeline.
Claude 3 Mar 2024 Anthropic Unknown - - Proprietary Haiku / Sonnet / Opus.
DBRX Mar 2024 Databricks 136B 12T tokens - DBRX license MoE; Mosaic training.
Mixtral 8x22B Apr 2024 Mistral AI 141B MoE - - Apache-2.0 Larger MoE.
Phi-3 Apr 2024 Microsoft 14B 4.8T tokens - MIT SLM marketing wave.
Qwen2 Jun 2024 Alibaba 72B 3T tokens - Qwen license Multilingual.
DeepSeek-V2 Jun 2024 DeepSeek 236B MoE 8.1T tokens - DeepSeek License Economic training.
Nemotron-4 Jun 2024 NVIDIA 340B 9T tokens - NVIDIA license H100 cluster training.
Claude 3.5 Jun 2024 Anthropic Unknown - - Proprietary Sonnet-led coding surge.
Llama 3.1 Jul 2024 Meta AI 405B 15.6T tokens - Llama 3 license 405B flagship open-ish.
Grok-2 Aug 2024 xAI Unknown - - xAI license Later Grok 2.5 source-available.
OpenAI o1 Sep 2024 OpenAI Unknown - - Proprietary Explicit reasoning model.
Mistral Large Nov 2024 Mistral AI 123B - - Mistral Research API flagship.
Pixtral Nov 2024 Mistral AI 123B - - Mistral Research Multimodal.
OLMo 2 Nov 2024 Allen AI 32B 6.6T tokens - Apache-2.0 Open research LM.
Phi-4 Dec 2024 Microsoft 14B 9.8T tokens - MIT SLM continued.
DeepSeek-V3 Dec 2024 DeepSeek 671B MoE 14.8T tokens - MIT Cost-shock open weights.
Amazon Nova Dec 2024 Amazon Unknown - - Proprietary Micro / Lite / Pro.
DeepSeek-R1 Jan 2025 DeepSeek 671B RL only - MIT Reasoning from base.
Qwen2.5 Jan 2025 Alibaba 72B 18T tokens - Qwen license Dense + MoE lineup.
MiniMax-Text-01 Jan 2025 MiniMax 456B 4.7T tokens - MiniMax license Long-context focus.
Gemini 2.0 Feb 2025 Google DeepMind Unknown - - Proprietary Flash / Flash-Lite / Pro.
Claude 3.7 Feb 2025 Anthropic Unknown - - Proprietary Sonnet + extended thinking.
GPT-4.5 Feb 2025 OpenAI Unknown - - Proprietary Largest non-reasoning GPT then.
Grok 3 Feb 2025 xAI Unknown - - Proprietary Massive compute claims.
Gemini 2.5 Mar 2025 Google DeepMind Unknown - - Proprietary Flash / Flash-Lite / Pro.
Llama 4 Apr 2025 Meta AI 400B 40T tokens - Llama 4 license Multimodal natively.
OpenAI o3 / o4-mini Apr 2025 OpenAI Unknown - - Proprietary Reasoning stack.
Qwen3 Apr 2025 Alibaba 235B 36T tokens - Apache-2.0 Many sizes down to 0.6B.
Claude 4 May 2025 Anthropic Unknown - - Proprietary Sonnet + Opus refresh.
Sarvam-M May 2025 Sarvam AI 24B - - Apache-2.0 India-focused reasoning.
Grok 4 Jul 2025 xAI Unknown - - Proprietary Frontier Grok line.
Param-1 Jul 2025 BharatGen 2.9B 5T tokens - Unknown Indic languages.
GLM-4.5 Jul 2025 Zhipu AI 355B MoE 22T tokens - MIT 335B / 106B sizes.
GPT-OSS Aug 2025 OpenAI 117B - - Apache-2.0 20B + 120B open weights.
Claude 4.1 Aug 2025 Anthropic Unknown - - Proprietary Opus refresh.
GPT-5 Aug 2025 OpenAI Unknown - - Proprietary Mini / nano / full family.
DeepSeek-V3.1 Aug 2025 DeepSeek 671B 15.6T+ tokens - MIT Hybrid thinking modes.
Apertus Sep 2025 ETH / EPFL 70B 15T tokens - Apache-2.0 EU AI Act positioning.
Claude Sonnet 4.5 Sep 2025 Anthropic Unknown - - Proprietary Coding + agents.
DeepSeek-V3.2-Exp Sep 2025 DeepSeek 685B - - MIT DSA sparse attention.
GLM-4.6 Sep 2025 Zhipu AI 357B - - Apache-2.0 Open flagship coding.
Gemini 3 Nov 2025 Google DeepMind Unknown - - Proprietary Deep Think / Pro tiers.
Olmo 3 Nov 2025 Allen AI 32B 5.9T tokens - Apache-2.0 7B + 32B reasoning.
Claude Opus 4.5 Nov 2025 Anthropic Unknown - - Proprietary Largest Claude then.
GPT-5.2 Dec 2025 OpenAI Unknown - - Proprietary Reasoning + pro workloads.
GLM-4.7 Dec 2025 Zhipu AI 355B MoE - - Apache-2.0 MoE SOTA coding claims.
Qwen3-Max-Thinking Jan 2026 Alibaba Unknown - - Proprietary Adaptive tool use.
Kimi K2.5 Jan 2026 Moonshot AI 1040B MoE 15T tokens - Modified MIT 32B active; multimodal.
Claude Opus 4.6 Feb 2026 Anthropic Unknown - - Proprietary Frontier Claude.
GPT-5.3-Codex Feb 2026 OpenAI Unknown - - Proprietary Agentic coding line.
GPT-5.2-Codex Feb 2026 OpenAI Unknown - - Proprietary Codex family; large context in tools.
GLM-5 Feb 2026 Zhipu AI 754B - - MIT DSA; 200K context.
Qwen-Coder-Qoder-1.0 2026 Alibaba Unknown - - Proprietary IDE add-on label; Qwen coding line.
Qwen3.5-Plus 2026 Alibaba Unknown - - Proprietary Plus tier in model pickers.
MiniMax M2.1 2025 MiniMax Unknown - - Proprietary M2 API family; ~205K context.
MiniMax M2.1 Lightning 2025 MiniMax Unknown - - Proprietary Speed-focused M2.1 variant.
MiniMax M2.5 2025 MiniMax Unknown - - Proprietary Long-context M2 line; ~205K.
MiniMax-M2.7 2026 MiniMax Unknown - - Proprietary Add-on tier in Cursor-style menus.
Param-2 Feb 2026 BharatGen 17B MoE ~22T tokens - Unknown More Indic langs.
Sarvam-1 Feb 2026 Sarvam AI 105B MoE ~12T tokens - Apache-2.0 India foundation model.
GPT-5.4 Mar 2026 OpenAI Unknown - - Proprietary Thinking + Pro variants.
GPT-5.4 Pro Mar 2026 OpenAI Unknown - - Proprietary Pro tier; ~1M context in pickers.
GPT-5.1 Codex 2026 OpenAI Unknown - - Proprietary Codex line in IDE menus.
GPT-5.1 Codex Max 2026 OpenAI Unknown - - Proprietary Top Codex tier in pickers.
Grok 4.20 2026 xAI Unknown - - Proprietary Picker label; see Grok 4 article.
Gemini 3.1 Pro 2026 Google DeepMind Unknown - - Proprietary Pro tier naming in tools.
Gemini 3 Pro Preview 2026 Google DeepMind Unknown - - Proprietary Preview label in model lists.
Gemini 3.1 Pro Preview 2026 Google DeepMind Unknown - - Proprietary Preview label in model lists.
Gemini 2.5 Pro (partial) 2025 Google DeepMind Unknown - - Proprietary Pro tier; partial availability.
Claude Opus 4.1 Aug 2025 Anthropic Unknown - - Proprietary Opus tier naming in pickers.
Composer 2 2026 Cursor / Anysphere - - - Proprietary IDE agent flagship (product).
Codex Aug 2021 OpenAI 12B - - Proprietary Code fine-tune of GPT-3 lineage.
InstructGPT Jan 2022 OpenAI 175B - - Proprietary RLHF alignment showcase.
GPT-4 Turbo Nov 2023 OpenAI Unknown - - Proprietary 128K context; cheaper GPT-4 class.
YandexGPT 2 Sep 2023 Yandex Unknown - - Proprietary Alice assistant update.
Llama 3 Apr 2024 Meta AI 8-70B 15T tokens - Llama 3 license Dense family before 3.1.
GPT-4o May 2024 OpenAI Unknown - - Proprietary Omni multimodal flagship.
YandexGPT 3 Pro Mar 2024 Yandex Unknown - - Proprietary Alice chatbot.
YandexGPT 3 Lite May 2024 Yandex Unknown - - Proprietary Alice chatbot.
Fugaku-LLM May 2024 Fujitsu / Titech et al. 13B 380B tokens - Fugaku terms CPU-trained on Fugaku.
Chameleon May 2024 Meta AI 34B 4.4T tokens - Non-commercial Early-token fusion multimodal.
o1-mini Sep 2024 OpenAI Unknown - - Proprietary Smaller reasoning model.
YandexGPT 4 Lite/Pro Oct 2024 Yandex Unknown - - Proprietary Alice chatbot.
Llama 3.2 Sep 2024 Meta AI 1-90B - - Llama 3.2 license Vision + text stack.
Llama 3.3 70B Dec 2024 Meta AI 70B - - Llama 3.3 license Strong 70B after 405B 3.1.
DeepSeek-V3-0324 Mar 2025 DeepSeek 671B 14.8T+ ext. - MIT V3 refresh checkpoint.
YandexGPT 5 Lite Pretrain/Pro Feb 2025 Yandex Unknown - - Proprietary Alice Neural Network.
YandexGPT 5 Lite Instruct Mar 2025 Yandex Unknown - - Proprietary Alice Neural Network.
Gemini 2.5 Flash Mar 2025 Google DeepMind Unknown - - Proprietary Fast tier in 2.5 family.
YandexGPT 5.1 Pro Aug 2025 Yandex Unknown - - Proprietary Alice Neural Network.
Alice AI LLM 1.0 Oct 2025 Yandex Unknown - - Proprietary Alice AI chatbot.
Claude Haiku 4.5 Oct 2025 Anthropic Unknown - - Proprietary Fast Claude tier (companion lineup).