Logo
FrontierNews.ai

Google's Hidden Pricing Shake-Up: Why Gemini Users Are About to See Usage Limits

Google is preparing a new subscription tier for Gemini that sits between its affordable Pro plan and expensive Ultra option, signaling the company expects a major surge in AI demand. The move comes as the entire AI industry grapples with token scarcity, where heavy users of coding tools and AI agents are burning through their monthly allowances faster than ever. Unlike competitors who have quietly reduced usage limits, Google is taking a different approach: creating more pricing options and giving users transparency into exactly how many tokens they have left.

What Is Google's New 'Ultra Lite' Plan?

Google's latest macOS app update reveals signs of an in-between subscription tier currently codenamed "Neon" and branded as "Google AI Ultra Lite." The plan is designed to bridge the gap between the $20 monthly Pro subscription and the $250 monthly Ultra plan, which is by far the most expensive AI subscription on the market. While Google hasn't announced a final price, industry observers expect it to land somewhere between $50 and $150 per month, potentially matching the $100 price point that both Anthropic and OpenAI offer for their mid-tier plans.

The new tier would primarily offer increased token usage limits compared to the Pro plan, though Google hasn't yet detailed what other exclusive features might come with it. The current Ultra plan includes premium access to features like Project Genie demos, expanded video and music generation through Flow, larger notebooks in NotebookLM, and premium features across other Google applications.

Why Is Google Adding Usage Limits and Transparency Now?

The timing of this announcement matters. The AI industry is experiencing what some call a "token crunch," where users of advanced AI tools are hitting their monthly limits much faster than expected. Claude users have publicly complained about noticeable decreases in per-session usage limits, while GitHub Copilot has dramatically adjusted its pricing structure for many models. Google appears to be getting ahead of this trend by preparing infrastructure to handle increased demand and giving users clear visibility into their consumption.

Alongside the new subscription tier, Google is building a dedicated usage dashboard at gemini.google.com/usage that will show subscribers exactly how many tokens they have remaining. This mirrors the approach Anthropic already offers for Claude, where users can see what percentage of their token budget is available for the current session, when that session resets, and how much is available in their weekly budget. Google's version will track five-hour and weekly usage limits separately, plus overage credits that allow users to go beyond their subscription's standard allotment.

How to Assess Your Current Gemini Usage and Plan Needs

  • Monitor Your Five-Hour Limit: Google's system tracks usage in five-hour windows, so you'll need to understand when your current window resets and how much capacity you have left before starting token-heavy tasks like coding sessions.
  • Check Your Weekly Budget: Beyond the five-hour limit, Google also enforces a separate weekly budget, so you'll want to track both timeframes to avoid unexpected overage charges.
  • Plan Coding Sessions Strategically: Agentic coding tools consume tokens rapidly, so having visibility into your remaining budget helps you decide whether to start a major coding task now or wait a few hours for your limit to reset.
  • Consider Overage Credits: If you occasionally exceed your plan's limits, overage credits let you continue working without immediately upgrading, though this adds cost on top of your subscription.

The introduction of usage transparency is particularly important for developers and power users. Coding agents tend to be extremely token-heavy, consuming far more of a user's monthly budget than typical conversational AI use. By showing users exactly how much capacity they have remaining, Google is helping them make informed decisions about whether their current plan is sufficient or whether they should upgrade.

What Does This Mean for Google's AI Strategy?

Google's move to introduce a mid-tier plan and usage transparency suggests the company is preparing for a significant increase in Gemini adoption. According to recent reporting, Google co-founder Sergey Brin is leading a "strike team" focused on radically improving Gemini's code creation capabilities in time for Google I/O 2026. If that effort succeeds, more developers will likely adopt Gemini for coding tasks, which would push many users beyond their current Pro plan limits and create demand for a higher-tier option without forcing them to jump to the $250 Ultra plan.

This strategy also positions Google more competitively against Anthropic and OpenAI. Both competitors offer $100 monthly plans that sit comfortably between their entry-level and premium tiers. Google's current pricing structure, which jumps from $20 to $250, has been a notable weakness. The new Ultra Lite plan fills that gap and gives cost-conscious power users a more reasonable upgrade path.

At Google I/O 2026, scheduled for May 19, the company is expected to announce major Gemini upgrades and demonstrate new agentic capabilities across Android devices. These announcements will likely drive increased interest in Gemini as a coding and productivity tool, making the timing of the new subscription tier and usage dashboard particularly strategic. Google is essentially preparing its infrastructure and pricing model to handle the wave of new users and use cases it expects to announce in the coming weeks.

For existing Gemini users, the key takeaway is simple: start paying attention to your token consumption now. The usage dashboard will soon give you clear visibility into your spending patterns, and the new Ultra Lite option will provide a middle ground between your current plan and the expensive Ultra tier. Whether you upgrade will depend entirely on how heavily you use Gemini for coding, content generation, and other token-intensive tasks.