Close Menu
CryptoAINews
  • Cryptocurrency
  • Blockchain
  • Bitcoin News
  • Altcoins
  • Crypto Market Trends
  • Crypto Mining
  • Ethereum
  • AI News
  • Sponsored
  • Advertise
Trending
  • Google Wallet and Pay are getting even better
  • OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks
  • Meet this year’s Doodle for Google winner
  • Trade Across Five Asset Classes with ICM24
  • Bitcoin Testing A Critical Support After Sharp Market-Wide Selloff
  • Will ETH Dump Toward $1K Next?
  • The Trump administration might take an equity stake in OpenAI
  • Build Kaggle Benchmarks Locally
  • AI News
  • Cryptocurrency
  • Blockchain
  • Bitcoin News
  • Altcoins
  • Crypto Market Trends
  • Crypto Mining
  • Ethereum
  • Sponsored
  • Advertise
CryptoAINews
  • Cryptocurrency
  • Blockchain
  • Bitcoin News
  • Altcoins
  • Crypto Market Trends
  • Crypto Mining
  • Ethereum
  • AI News
  • Sponsored
  • Advertise
CryptoAINews
Home » AI News » Flex and Priority tiers in the Gemini API
Gemini API Dials ss 1.width 1300
AI News

Flex and Priority tiers in the Gemini API

CryptoAINewsBy CryptoAINewsApril 3, 2026No Comments2 Mins Read
Share
Facebook Twitter LinkedIn Pinterest Email


Right now, we’re including two new service tiers to the Gemini API: Flex and Priority. These new choices provide you with granular management over value and reliability by way of a single, unified interface.

As AI evolves from easy chat into complicated, autonomous brokers, builders sometimes need to handle two distinct forms of logic:

  • Background duties: Excessive-volume workflows like information enrichment or “considering” processes that do not want immediate responses.
  • Interactive duties: Consumer-facing options like chatbots and copilots the place excessive reliability is required.

Till now, supporting each meant splitting your structure between commonplace synchronous serving and the asynchronous Batch API. Flex and Precedence assist to bridge this hole. Now you can route background jobs to Flex and interactive jobs to Precedence, each utilizing commonplace synchronous endpoints. This eliminates the complexity of async job administration whereas supplying you with the financial and efficiency advantages of specialised tiers.

Flex Inference: scale innovation for 50% much less

Flex Inference is our new cost-optimized tier, designed for latency-tolerant workloads with out the overhead of batch processing.

  • 50% value financial savings: Pay half the value of the Commonplace API by downgrading criticality of your request (making them much less dependable, and including latency).
  • Synchronous simplicity: Not like the Batch API, Flex is a synchronous interface. You employ the identical acquainted endpoints with out managing enter/output recordsdata or polling for job completion.
  • Ultimate use circumstances: Background CRM updates, large-scale analysis simulations, and agentic workflows the place the mannequin “browses” or “thinks” within the background.

Get began quick by merely configuring the service_tier parameter in your request:



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
CryptoAINews
  • Website

Related Posts

Google Wallet and Pay are getting even better

June 7, 2026

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

June 7, 2026

Meet this year’s Doodle for Google winner

June 6, 2026

The Trump administration might take an equity stake in OpenAI

June 6, 2026
Add A Comment
Leave A Reply Cancel Reply

About us

CryptoAINews is an independent digital publication focused on cryptocurrency, blockchain, and artificial intelligence news.

The platform is owned and operated by Robert Grabarevic, providing timely news coverage, market updates, and educational content for a global audience interested in emerging technologies and digital finance.

CryptoAINews is committed to transparent reporting, responsible publishing, and delivering informative content based on publicly available data, verified sources, and industry developments.

All content published on this website is for informational purposes only and does not constitute financial or investment advice.

Top Insights

Google Wallet and Pay are getting even better

June 7, 2026

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

June 7, 2026

Meet this year’s Doodle for Google winner

June 6, 2026
Categories
  • Advertise
  • AI News
  • Altcoins
  • Bitcoin News
  • Blockchain
  • Crypto Market Trends
  • Crypto Mining
  • Cryptocurrency
  • Ethereum
  • Sponsored
  • Imprint-Legal-Notice
  • Author / Publisher Bio
  • Privacy Policy
© 2025 CryptoAINews – Owned & Operated by Robert Grabarevic

Type above and press Enter to search. Press Esc to cancel.