WayToClawEarn
高影响GitHub Blog / Ars Technica / TechCrunch

GitHub Copilot Goes Usage-Based: AI Credits Spark 10x–50x Cost Surge for Devs

On June 1, 2026, GitHub Copilot switched from flat-rate premium requests to token-based AI Credits. Developers are reporting cost increases of 10x to 50x, with some Pro+ users seeing bills jump from $39 to over $750 per month. We break down the new pricing, per-model token rates, the developer backlash, and what this means for the AI coding tools landscape.

2026年6月12日 · 阅读约 8 分钟

TL;DR

On June 1, 2026, GitHub Copilot completed its transition to usage-based billing, replacing flat-rate premium requests with a token-based system called GitHub AI Credits. The developer community's response has been swift and overwhelmingly negative — reports of costs jumping from $39/month to $750/month, and from $100/month to $3,000/month, are spreading across Reddit, X (Twitter), and GitHub's own discussion threads.

This isn't a minor pricing tweak. It's a fundamental restructuring of how the most widely used AI coding tool charges its users, and it signals a broader shift in the AI-assisted development market.


What Changed on June 1

Before June 1, GitHub Copilot sold flat monthly subscriptions. A Pro ($10/mo) user got 300 premium requests per month, a Pro+ ($39/mo) user got 1,500. If you stayed within that bucket, your bill was fixed regardless of how many tokens your conversations consumed.

Starting June 1, every Copilot plan — Free, Pro, Pro+, Business, and Enterprise — moved to a usage-based system called GitHub AI Credits.

PlanMonthly PriceBase CreditsFlex CreditsTotal CreditsDollar Value
Pro$101,0005001,500$15
Pro+$393,9003,1007,000$70
Max$10010,00010,00020,000$200
Business$19/seatPooled$19/seat

One AI Credit = $0.01 USD. Credits are consumed based on token usage (input tokens + output tokens + cached tokens), and each model has its own per-token rate.

The Per-Model Rate Card

Here's where the real sticker shock comes from. Different models cost dramatically different amounts, and agentic workflows — which GitHub itself has spent the past year aggressively promoting — consume far more tokens than simple completions.

ModelInput Cost (per 1M tokens)Output Cost (per 1M tokens)Credits per 1M output tokens
GPT-5 Mini$0.15$0.6060
Gemini 3 Flash$0.15$0.6060
Claude Sonnet 4.6$3.00$15.001,500
GPT-5.3 Codex$10.00$40.004,000
GPT-5.4$5.00$20.002,000
GPT-5.5$7.50$25.002,500
Claude Opus 4.6$15.00$60.006,000
Gemini 3.1 Pro$1.25$5.00500

Source: GitHub Docs — Models and pricing for GitHub Copilot. Cached tokens are billed at input rates.

Why Developers Are Seeing 10x–50x Increases

The backlash isn't about the pricing model being hard to understand — it's about how quickly agentic workflows burn through credits.

Real-world scenario: A day of agentic coding

A developer using Claude Sonnet 4.6 (the default model for most Copilot agent sessions) on a typical agentic workflow:

  • 1 agentic session: ~15 conversation turns, each averaging ~4,000 input tokens and ~500 output tokens
  • Per session tokens: 60,000 input + 7,500 output
  • Per session cost: (60K × $3/M) + (7.5K × $15/M) = $0.18 + $0.1125 = $0.29
  • 10 sessions per day: $2.90/day
  • 22 working days per month: $63.80

That's $63.80 on top of the $39 Pro+ subscription — and the Pro+ plan only includes $70 worth of credits. A moderately active developer using Claude Sonnet 4.6 could burn through their entire monthly credit pool in 10–12 working days.

For developers using GPT-5.3 Codex or Claude Opus 4.6, the numbers get even worse. A single agentic session with Opus 4.6 costs roughly $0.80-$1.00. Run 10 sessions a day, and your monthly bill hits $176–$220 — on top of your subscription.

Reddit reports include:

  • One developer claiming their Pro+ plan would cost ~$1,000/month under the new credits
  • Another showing their annual cost projection jumping from $348/year to over $6,000/year
  • A power user estimating their monthly bill for agentic Claude Opus usage at $2,700–$3,400

As one HN commenter put it: "GitHub spent the last 18 months selling us on agent mode. Now they're charging us by the token for the privilege."

Why This Matters for the AI Coding Tools Landscape

1. The flat-rate era is ending

GitHub Copilot's move is the clearest signal yet that flat-rate AI coding subscriptions are not sustainable for the platform providers. The compute cost of agentic workflows — especially with frontier models like Claude Opus and GPT-5 — is simply too high to bundle into a fixed-price plan.

Cursor has already introduced tiered pricing ($20/$60/$200 per month) with model-based credit multipliers. Claude Code is bundled into Anthropic's $20/$100/$200 subscription tiers. But Copilot's move is different — it's the first major platform to go purely usage-based, relying on token metering rather than request counting.

2. Power users will ration their usage — or leave

The developers who benefit most from agentic AI coding tools — the ones running complex refactoring, multi-file edits, and automated testing loops — are also the ones most affected by this change. These power users are now the most expensive customers.

Early reactions suggest that many will:

  • Downshift to cheaper models (GPT-5 Mini, Gemini 3 Flash) for routine tasks
  • Ration agent mode to critical sessions, using simpler autocomplete for everyday work
  • Switch to alternative tools like Cursor, Claude Code (with its own subscription), or open-source options like OpenCode and Aider
  • Route through direct API access using third-party wrappers that offer pay-as-you-go pricing at the same or lower rates

3. Cursor, Claude Code, and Codex stand to benefit

Cursor has been the primary beneficiary of the backlash so far. Its tiered pricing ($20 Pro with 500 premium requests, $60 Pro+ with 1,500, $200 Ultra with 6,000) still uses request-based counting rather than token metering. For heavy agent-mode users, this can be significantly cheaper.

Claude Code's pricing confusion from April 2026 is now fully resolved: it's available on Pro ($20/mo), Max 5x ($100/mo), and Max 20x ($200/mo) plans, with usage caps rather than token billing. The simple monthly cap model suddenly looks attractive.

OpenAI Codex has been aggressively adding features. The June 2026 update brought local conversation search, --profile as the primary CLI and sandbox selector, improved MCP setup with concurrent read-only tool execution, and web search for local tasks in the CLI and IDE extension. Codex is available on every ChatGPT plan including Free and Go, with meaningful daily usage starting at Plus ($20/mo). For developers already in the OpenAI ecosystem, it's a compelling fallback.

4. The open-source alternatives gain momentum

Tools like OpenCode (161K GitHub stars, MIT-licensed, GPT-5.5 Scout Agent) and Aider are seeing increased attention. OpenCode is fully open-source and supports local models — for developers willing to trade some capability for cost predictability, these are increasingly viable options.

OpenAI Codex June 2026: The Counterpoint

In the same week that GitHub's pricing change went live, OpenAI quietly shipped a significant update to Codex:

  • Local conversation search: Full-text, case-insensitive search across local session transcripts — previously a pain point for heavy Codex users
  • --profile as primary selector: The CLI now uses --profile as the primary sandbox and model selector, unifying the configuration experience
  • Concurrent MCP tools: Tools advertising readOnlyHint now run in parallel, dramatically speeding up tool-using workflows
  • Web search for local tasks: Codex can now call standalone web search directly from local CLI and IDE Extension tasks, returning plaintext search results
  • Rate-limit reset: Added rate-limit reset notifications in the desktop app, showing time until quota replenishes

The contrast is stark: while GitHub is raising costs for heavy users, OpenAI is adding productivity features to Codex at no additional charge (within existing plan limits). This is the classic platform play — invest in the ecosystem, don't extract from power users.

What to Do If You're Affected

If your Copilot bill has jumped or you're worried about the change, here are practical next steps:

  • Audit your usage first: Check your AI Credits dashboard in VS Code to see which models and workflows consume the most credits
  • Match model to task: Use GPT-5 Mini or Gemini 3 Flash for simple completions and documentation. Reserve Claude Sonnet 4.6 and GPT-5.3 Codex for complex tasks
  • Consider model auto-selection: Auto mode picks the cheapest capable model — a simple switch that can cut costs 40-60%
  • Try Cursor or Claude Code: Both have trial periods. Run your real workflow side-by-side for a week
  • Evaluate direct API access: For heavy agentic users, routing through direct API with a wrapper like Continue.dev can be cheaper than Copilot's bundled credits
  • Watch for OpenCode: If your workflow is CLI-based and you use models through OpenRouter or local inference, OpenCode is worth testing

The Bottom Line

GitHub Copilot's move to usage-based billing is the most significant pricing change in the AI coding tools market since Copilot launched in 2022. It's a rational move for Microsoft — agentic workflows are expensive to serve — but it fundamentally changes the value proposition for power users who were sold on the promise of unlimited AI-assisted coding.

For WayToClawEarn readers who use AI coding tools for content automation, affiliate workflows, and development: this is the moment to diversify your tool stack. Don't bet your entire workflow on a single platform's pricing model, because the flat-rate era is ending.


Published June 12, 2026. Pricing and model availability subject to change.

References

免责声明:本站案例均为知识分享内容,仅供灵感与参考,不构成收益承诺;由此进行的外部执行与结果请自行判断并承担相应责任。
GitHub Copilot Goes Usage-Based: AI Credits Spark 10x–50x Cost Surge for Devs · WayToClawEarn