Published: August 7, 2025 (Europe/Berlin)

TL;DR

OpenAI launched GPT-5 today and pushed it broadly across ChatGPT (including free users), Microsoft Copilot products, and the OpenAI API. It’s faster, more reliable at multi-step reasoning and coding, and comes in three API sizes—gpt-5gpt-5-mini, and gpt-5-nano—with aggressively lower prices than previous flagships. Microsoft flipped on a new Smart mode that routes tasks to GPT-5 where it helps most. Context is up to 400K tokens on the official docs (with some outlets reporting even higher in marketing copy), and early enterprise positioning frames GPT-5 as a “subject-matter expert” for real work. (OpenAIThe VergeSourceOpenAI Platform)


What actually launched

  • ChatGPT now uses GPT-5, with free users getting access (higher limits on paid tiers). (The VergeWIRED)
  • Microsoft integrated GPT-5 across Copilot, Microsoft 365 Copilot, GitHub Copilot, and Azure AI Foundry via Smart mode model routing. (The VergeSource)
  • Developers get three API variants: gpt-5gpt-5-minigpt-5-nano, plus a separate “GPT-5 in ChatGPT” router system under the hood. (OpenAI)

Headline capabilities

  • Stronger reasoning & coding: OpenAI says GPT-5 beats their previous top reasoning/coding models on internal and public evals (e.g., SWE-bench Verified), and partners like Cursor and GitHub are already lighting it up. (OpenAI)
  • Enterprise focus: Marketed as “our best model for work,” with improved long-context handling, conversation stability, and fewer tool calls to get the same job done. (OpenAI)
  • Ecosystem rollout: GitHub Copilot exposes GPT-5 in public preview; Microsoft’s Copilot Smart mode dynamically switches models for task fit. (The GitHub BlogThe Verge)

Context window and performance

  • OpenAI docs list 400,000 tokens context and 128,000 max output for GPT-5 and the mini/nano variants. That’s built for long specs, multi-file codebases, or large document packs. (Some media mention even larger windows; go by the docs when building production systems.) (OpenAI PlatformOpenAI)

Pricing (API)

OpenAI undercut prior flagships materially:

This is… aggressive. Expect teams to re-run cost/perf bake-offs against Claude, Gemini, and DeepSeek this week.


Availability matrix (quick view)

  • ChatGPT (web & apps): GPT-5 on by default, free users included (paid tiers get more usage). (The Verge)
  • Microsoft: Copilot (consumer & M365), GitHub Copilot, Azure AI Foundry support. (The VergeSource)
  • OpenAI API: gpt-5gpt-5-minigpt-5-nano live today. (OpenAI)

Strategic backdrop (why this matters)

  • First major leap since GPT-4 (2023): Media are treating GPT-5 as the barometer for whether the last two years of AI hype is paying off. (AP News)
  • Workflows over wow-demos: The pitch is “expert-level intelligence you can trust for work,” with Microsoft’s distribution giving it instant enterprise reach. (OpenAIThe Verge)
  • Scale + governance story: OpenAI highlights 700M weekly ChatGPT users and PBC-style moves to align incentives—positioning ahead of regulatory pressure and enterprise risk reviews. (OpenAIAP News)

Early guidance for teams

  1. Re-benchmark immediately. If you’re on GPT-4/4.5 or o-series today, test GPT-5 at your actual workloads—coding agents, RAG pipelines, planning/reasoning tasks. Track latencytool-call count, and token budgets; OpenAI claims fewer tool calls for the same outcomes. (OpenAI)
  2. Exploit the tiering. Use mini/nano for high-volume, low-complexity paths (classification, extraction, draft-summaries) and reserve full GPT-5 for gnarly reasoning or generation where it moves KPIs. (OpenAI)
  3. Long-context ≠ forget retrieval. 400K tokens is generous, but RAG with selective chunking still wins on speed and cost. Treat long context as a safety valve for edge cases, not your default. (OpenAI Platform)
  4. Governance now, not later. If you’re in regulated spaces (health/finance/biotech), refresh your model risk assessments and red-team plans against GPT-5’s new capabilities and failure modes. Major outlets are already flagging “high-risk” misuse areas. (Financial Times)

What to watch next

  • Eval transparency: Expect deeper third-party evals on reasoning and code agents over the next 2–3 weeks. (OpenAI)
  • Pricing pressure on rivals: GPT-5’s pricing may force immediate cuts across the board; look for announcements from Anthropic/Google. (TechCrunch)
  • Router behavior in the wild: Microsoft’s Smart mode and ChatGPT’s internal routing will shape user perception—watch for drift, consistency, and edge-case failures. (The Verge)

Screenshot
You May Also Like

AI Unveiled: 15 Data-Driven Snapshots in 15 Minutes

Fifteen charts. The complete picture of where artificial intelligence stands right now…

Inside the New AI Manhattan Project: 8 Dynamics Shaping the Future

What if I told you that the world’s superpowers are currently engaged…

AI Is Printing Billions of Dollars

The AI bubble everyone keeps talking about? What if it’s not a…

Turn‑key, cross‑platform deployment plan to ship Gemma 3 270M on Android, iOS, and the Web

0) What we’re optimizing for • Footprint & speed: 270M INT4 QAT…