Published: August 7, 2025 (Europe/Berlin)

TL;DR

OpenAI launched GPT-5 today and pushed it broadly across ChatGPT (including free users), Microsoft Copilot products, and the OpenAI API. It’s faster, more reliable at multi-step reasoning and coding, and comes in three API sizes—gpt-5gpt-5-mini, and gpt-5-nano—with aggressively lower prices than previous flagships. Microsoft flipped on a new Smart mode that routes tasks to GPT-5 where it helps most. Context is up to 400K tokens on the official docs (with some outlets reporting even higher in marketing copy), and early enterprise positioning frames GPT-5 as a “subject-matter expert” for real work. (OpenAIThe VergeSourceOpenAI Platform)


What actually launched

  • ChatGPT now uses GPT-5, with free users getting access (higher limits on paid tiers). (The VergeWIRED)
  • Microsoft integrated GPT-5 across Copilot, Microsoft 365 Copilot, GitHub Copilot, and Azure AI Foundry via Smart mode model routing. (The VergeSource)
  • Developers get three API variants: gpt-5gpt-5-minigpt-5-nano, plus a separate “GPT-5 in ChatGPT” router system under the hood. (OpenAI)

Headline capabilities

  • Stronger reasoning & coding: OpenAI says GPT-5 beats their previous top reasoning/coding models on internal and public evals (e.g., SWE-bench Verified), and partners like Cursor and GitHub are already lighting it up. (OpenAI)
  • Enterprise focus: Marketed as “our best model for work,” with improved long-context handling, conversation stability, and fewer tool calls to get the same job done. (OpenAI)
  • Ecosystem rollout: GitHub Copilot exposes GPT-5 in public preview; Microsoft’s Copilot Smart mode dynamically switches models for task fit. (The GitHub BlogThe Verge)

Context window and performance

  • OpenAI docs list 400,000 tokens context and 128,000 max output for GPT-5 and the mini/nano variants. That’s built for long specs, multi-file codebases, or large document packs. (Some media mention even larger windows; go by the docs when building production systems.) (OpenAI PlatformOpenAI)

Pricing (API)

OpenAI undercut prior flagships materially:

This is… aggressive. Expect teams to re-run cost/perf bake-offs against Claude, Gemini, and DeepSeek this week.


Availability matrix (quick view)

  • ChatGPT (web & apps): GPT-5 on by default, free users included (paid tiers get more usage). (The Verge)
  • Microsoft: Copilot (consumer & M365), GitHub Copilot, Azure AI Foundry support. (The VergeSource)
  • OpenAI API: gpt-5gpt-5-minigpt-5-nano live today. (OpenAI)

Strategic backdrop (why this matters)

  • First major leap since GPT-4 (2023): Media are treating GPT-5 as the barometer for whether the last two years of AI hype is paying off. (AP News)
  • Workflows over wow-demos: The pitch is “expert-level intelligence you can trust for work,” with Microsoft’s distribution giving it instant enterprise reach. (OpenAIThe Verge)
  • Scale + governance story: OpenAI highlights 700M weekly ChatGPT users and PBC-style moves to align incentives—positioning ahead of regulatory pressure and enterprise risk reviews. (OpenAIAP News)

Early guidance for teams

  1. Re-benchmark immediately. If you’re on GPT-4/4.5 or o-series today, test GPT-5 at your actual workloads—coding agents, RAG pipelines, planning/reasoning tasks. Track latencytool-call count, and token budgets; OpenAI claims fewer tool calls for the same outcomes. (OpenAI)
  2. Exploit the tiering. Use mini/nano for high-volume, low-complexity paths (classification, extraction, draft-summaries) and reserve full GPT-5 for gnarly reasoning or generation where it moves KPIs. (OpenAI)
  3. Long-context ≠ forget retrieval. 400K tokens is generous, but RAG with selective chunking still wins on speed and cost. Treat long context as a safety valve for edge cases, not your default. (OpenAI Platform)
  4. Governance now, not later. If you’re in regulated spaces (health/finance/biotech), refresh your model risk assessments and red-team plans against GPT-5’s new capabilities and failure modes. Major outlets are already flagging “high-risk” misuse areas. (Financial Times)

What to watch next

  • Eval transparency: Expect deeper third-party evals on reasoning and code agents over the next 2–3 weeks. (OpenAI)
  • Pricing pressure on rivals: GPT-5’s pricing may force immediate cuts across the board; look for announcements from Anthropic/Google. (TechCrunch)
  • Router behavior in the wild: Microsoft’s Smart mode and ChatGPT’s internal routing will shape user perception—watch for drift, consistency, and edge-case failures. (The Verge)

Screenshot
You May Also Like

Mondelez International Proves Generative AI ROI in Global Marketing

By Thorsten Meyer | ThorstenMeyerAI.com When a multinational like Mondelez International reports…

The Great Cloud Shake-Up: AWS and Microsoft Azure Outages Expose the Fragility of Our Digital Backbone

Two massive cloud disruptions within ten days have sent a shockwave through…

Ranking in ChatGPT: The New Playbook for AI Search Visibility (and How to Use ZimmWriter)

“Ranking in ChatGPT” is real—but it’s not the same thing as ranking…

Market Impact and Vertical-Specific Benefits of the October 2025 Google Search Console Reporting Delay and the 24‑Hour View

Introduction Google Search Console (GSC) is an indispensable tool for website owners and…