Vitalik Buterin simply printed a analysis proposal that sidesteps the query everybody retains asking: can blockchains run AI fashions?
As a substitute, the analysis claims Ethereum because the privacy-preserving settlement layer for metered AI and API utilization. The publish, co-authored with Davide Crapis on Ethereum Analysis, argues that the true alternative is not placing LLMs on-chain.
The actual alternative lies in building the infrastructure that permits brokers and customers to pay for 1000’s of API calls with out compromising id or creating surveillance trails by means of billing knowledge.
The timing is important as a result of agentic AI is shifting from demonstrations to enterprise roadmaps. Gartner forecasts that 40% of enterprise applications will embody task-specific AI brokers by the tip of 2026, up from below 5% in 2025.
That shift implies a world through which software program autonomously generates large volumes of API calls, making billing rails strategic infrastructure quite than back-office plumbing.
Present metering methods power a selection between Web2 id billing, which depends on API keys and bank cards and leaks profiling knowledge, and on-chain pay-per-call fashions which might be too sluggish, too costly, and hyperlink exercise by means of clear transaction graphs.
The proposal introduces ZK API utilization credit, a fee and anti-abuse primitive constructed on Charge-Limiting Nullifiers.
RLN is a zero-knowledge gadget designed to forestall spam in nameless methods, and the analysis repurposes it for metered entry to companies.
The stream proceeds as follows: customers deposit funds as soon as into a sensible contract, and their dedication is added to an on-chain Merkle tree.
Every API request features a zero-knowledge proof demonstrating that the person is a legitimate depositor with ample credit score for the requested index.
If a person makes an attempt to reuse a ticket index, double-spending their allowance, RLN permits the system to recuperate their secret and slash their stake as an financial penalty.
The publish contains concrete examples. A person deposits 100 USDC and makes 500 hosted LLM queries. One other deposits 10 USDC for 10,000 Ethereum RPC calls.
The structure is explicitly designed for “many calls per deposit,” which means that on-chain exercise scales with the variety of accounts and settlement frequency quite than uncooked inference quantity.
Variable-cost help provides flexibility: customers prepay a most price per name, servers return signed refund tickets for unused quantities, and customers privately accumulate refunds to unlock extra calls with out extra deposits.
Infrastructure is already there
The proposal arrives when the fee substrate for utilization credit already exists at scale.
Stablecoins have a circulating market cap of approximately $307.6 billion, in keeping with DefiLlama, indicating that the on-chain greenback layer is sufficiently liquid to help deposit-based billing for high-frequency companies.
Ethereum’s scaling stack has matured to the purpose the place rollups course of way more exercise than layer-1, with L2Beat exhibiting a roughly 100x scaling issue, with rollups dealing with 1000’s of operations per second in comparison with tens on the Ethereum mainnet.
Common Ethereum transaction charges lately measured around $0.21 on Feb. 7, suggesting that occasional on-chain metering and settlement flows are possible with out prohibitive price.
The design explicitly avoids placing LLMs on-chain. Ethereum competes on impartial settlement, programmable escrow, and verifiable enforcement, not TPU cycles or inference velocity.
The structure treats inference as an off-chain service and the blockchain because the layer that makes fee, metering, and dispute decision credible, with out requiring customers to belief particular person suppliers or to disclose their identities.
If AI service suppliers settle for deposits and depend on Ethereum or layer 2 good contracts to adjudicate slashing, refunds, and disputes, Ethereum turns into the enforcement layer for AI commerce.
The mannequin parallels how Ethereum grew to become the settlement layer for stablecoins and DeFi, not by internet hosting the complete software stack on-chain, however by offering a impartial substrate the place financial agreements are enforced programmatically.
Eventualities with out hype
The on-chain footprint is bounded by settlement cadence, not uncooked name quantity.
In a crypto-native wedge situation focusing on RPC and infrastructure APIs, suppose 250,000 energy customers or brokers undertake utilization credit.
If every performs two on-chain actions monthly, a deposit or top-up plus a withdrawal, that generates roughly 500,000 transactions month-to-month attributable to the rail.
In an AI supplier adoption situation, think about a million customers make use of privacy-preserving credit throughout hosted LLM companies however nonetheless carry out just one to a few on-chain actions month-to-month.
That suggests a million to a few million transactions monthly tied to AI commerce rails, possible targeting layer 2s the place execution is cheaper.
Enterprise agent scenarios increase deposit sizes, elevating the stakes for credible enforcement and making slashing mechanisms extra consequential.
The metadata drawback
The proposal tries to make funds unlinkable, however the analysis thread itself highlights a possible weak point.
A commenter argues that even when nullifiers are cryptographically unlinkable, servers can correlate customers by means of inference-based metadata reminiscent of timing patterns, token counts, and cache hits.
The critique proposes bucketed pricing, with mounted enter and output courses, to scale back leakage. That stress between cryptographic privateness and behavioral metadata is central as to whether the design really delivers on its anonymity objectives.
Implementation actuality presents one other hurdle. The proposal makes use of RLN as a primitive, however the Privateness and Scaling Explorations venture web page notes that RLN is inactive or has been sundown.
Productionizing ZK API utilization credit possible requires sustaining forks or implementing new options quite than counting on current tooling.
RLNJS benchmarks report roughly 800 milliseconds for proof era and 130 milliseconds for verification on an M2 Mac, offering an early sanity examine on efficiency however leaving open questions on cell constraints and production-grade circuits at scale.
The proposal additionally assumes that suppliers will combine the deposit-and-proof stream, settle for stablecoin settlements, and undertake Ethereum or layer 2 contracts for dispute decision.
That is a coordination drawback, not only a technical one. Web2 API suppliers have current billing infrastructure and regulatory readability round identity-linked transactions.
Convincing them to undertake a ZK-based alternative requires demonstrating both a compelling price benefit or a differentiated market phase through which privacy-preserving billing unlocks income they may not in any other case seize.
| Mannequin | The way it payments | What it leaks/breaks | Who it fits |
|---|---|---|---|
| Web2 id billing (API keys + playing cards) | Account-based billing tied to id (API key + fee methodology); supplier meters requests and invoices centrally | Leaks: id linkage + profiling trails throughout requests. Breaks: pseudonymity/self-custody norms. Danger: centralized management (suspension/censorship, single-provider belief) | Mainstream SaaS/API suppliers; enterprises prioritizing compliance, simplicity, and current billing rails |
| Onchain pay-per-call | Every request (or batch) pays onchain per name through transactions/good contracts | Breaks: price/latency for high-frequency calls. Leaks: onchain linkability (transaction graph ties utilization collectively). Friction: UX overhead for repeated txs | Crypto-native companies with low name frequency; circumstances the place transparency/auditability is extra necessary than privateness/throughput |
| ZK API utilization credit (deposit as soon as, many calls) | Person deposits as soon as; every request carries a ZK proof of membership + remaining credit score; slashing for double-use; optionally available refund tickets for variable price | Danger: metadata correlation (timing/token patterns can re-link). Burden: supplier integration + coordination. Maturity: ZK tooling/ops complexity, circuit upkeep | Excessive-frequency APIs (LLMs, RPC, knowledge) the place privateness is a promoting level; agent toolchains; customers needing metering with out identity-based surveillance |
What this implies for Ethereum
If the design features traction, Ethereum’s worth proposition shifts additional towards serving as a impartial enforcement layer for digital commerce quite than a general-purpose computing platform.
The proposal treats blockchain because the settlement substrate the place financial guidelines get enforced credibly, not the place the place purposes run.
Stablecoin velocity may rise as deposits stream into utilization credit score contracts, creating a brand new class of on-chain financial exercise distinct from DeFi hypothesis or NFT buying and selling.
Layer 2 utilization may enhance as suppliers and customers resolve disputes, course of refunds, and deal with slashing occasions on throughput-optimized chains.


The query is whether or not a parallel ecosystem emerges through which privacy-preserving billing turns into a prerequisite for sure person segments.
Enterprises involved about knowledge leakage by means of billing logs, builders constructing agent toolchains that require auditable metering with out surveillance, and energy customers who worth pseudonymous entry to high-frequency companies are all potential early adopters.
Ethereum’s alternative is to function the layer on which AI service markets settle, with out requiring contributors to belief particular person platforms or to sacrifice privateness to billing infrastructure.
The proposal claims Ethereum can implement fee agreements, adjudicate disputes, and allow metered entry with out id linkage in ways in which conventional methods structurally can’t.
Whether or not that declare holds is determined by fixing the metadata correlation drawback, sustaining sturdy ZK implementations, and convincing suppliers that the market justifies the combination price it unlocks.



