Techy Status

    AI

    Google unveils “implicit caching” to lower the cost of using its newest AI models.

    By Allen Kurian Thomas, May 9, 2025

    Google is introducing “implicit caching” in its Gemini API, which it says can cut the cost of using its Gemini 2.5 Pro and 2.5 Flash models by up to 75% on repeated context. The feature is enabled by default and automatically reuses any request prefix that overlaps with one you have previously sent, passing the savings back to you. To qualify for a cache hit, a prompt must be at least 2,048 tokens for 2.5 Pro and 1,024 tokens for 2.5 Flash (roughly 1,500 and 750 words, respectively). Because matching works on the prefix, keep the repeated context at the start of each request and append any new or variable content at the end to maximize cacheability.
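
    The prompt-ordering advice above can be illustrated with a small sketch. It does not call the Gemini API. It only shows why putting stable context first gives two requests a long shared prefix, and it roughly checks a prompt against the minimum sizes quoted in the article. The function names, the model labels ("2.5-pro", "2.5-flash"), and the words-to-tokens ratio are assumptions made for illustration, and the real tokenizer will give different counts.

```python
# Minimum prompt sizes for a cache hit, as quoted in the article.
# The keys are illustrative labels, not official API model identifiers.
MIN_CACHE_TOKENS = {"2.5-pro": 2048, "2.5-flash": 1024}

# Rule of thumb implied by the article: 1,024 tokens is roughly 750 words.
WORDS_PER_TOKEN = 0.75


def estimate_tokens(text: str) -> int:
    """Rough token estimate from a whitespace word count (an assumption,
    not the model's real tokenizer)."""
    return round(len(text.split()) / WORDS_PER_TOKEN)


def build_prompt(stable_context: str, variable_part: str) -> str:
    """Place the repeated context first and the changing content last, so
    successive requests share the longest possible prefix."""
    return f"{stable_context}\n\n{variable_part}"


def shared_prefix_length(a: str, b: str) -> int:
    """Number of leading characters two prompts have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def may_qualify(prompt: str, model: str) -> bool:
    """True if the estimated size meets the article's minimum for the model."""
    return estimate_tokens(prompt) >= MIN_CACHE_TOKENS[model]


if __name__ == "__main__":
    context = "word " * 1600  # stand-in for a long, reused document
    first = build_prompt(context, "Question: summarize section 1.")
    second = build_prompt(context, "Question: list the open issues.")

    # The whole stable context is shared between the two requests.
    print(shared_prefix_length(first, second) >= len(context))
    print(may_qualify(first, "2.5-pro"))
```

    If the variable question were placed before the context instead, the two prompts would diverge within the first few characters and there would be little overlapping prefix to reuse.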

    This move follows criticism of Google’s earlier explicit prompt-caching system, which required developers to manually specify their high-frequency prompts and sometimes resulted in unexpectedly large bills. After those complaints, and a public apology, Google has changed its approach so that caching is automatic. However, the company has not offered third-party verification of the claimed savings, so developers will need to test the feature on their own workloads to confirm the benefits.

    Tags: AI, Gemini, Google