
Use the Pega AI Token Cost Calculator to Find Out How to Reduce AI Costs

  • Pega can reduce or eliminate ‘AI Token Tax’ with an efficient way to build and run Agentic Workflows
  • Pega Infinity 26 (available Q3) provides predictable agent outcomes with predictable cost and no metered token charges

 

Pegasystems Inc. (“Pega”) today announced at PegaWorld® that clients can now design, build, and run their agentic workflows across Pega Infinity™ 26 without paying for tokens.

The Pega Predictable AI™ architecture shifts the heavy AI reasoning to design time, so runtime agents are fast, reliable, and dramatically cheaper to run. This directly addresses two of the most pressing obstacles for enterprises trying to scale their use of AI agents: escalating token costs and unreliable outcomes.

Alan Trefler, founder and CEO of Pega, commented:

“Enterprises are quickly waking up to the fact that tokenmaxxing is ridiculous: it can only lead to unsustainable costs and unpredictable results. AI best creates value when it delivers reliable outcomes at scale. That’s why we don’t charge clients based on how many tokens they use, but by the meaningful work they accomplish. Combined with an architecture built for governed execution, Pega now gives organizations unrivaled freedom to use AI agents.”

Market Context: The bill comes due on AI experimentation

Token bills are arriving, and they are shocking enterprise leaders. As organizations look to scale agent experiments to production, LLM providers are converting flat-rate subscriptions to more expensive token-metered pricing, while quietly running up expensive reasoning tokens behind the scenes. The more complex the request, the more reasoning steps it requires, and the more likely it is to produce an inadequate, inconsistent answer.

A Closer Look: The Pega AI architectural difference

Pega applies AI reasoning at design time, when its creative power delivers the most value for reimagining outdated processes and systems. With Pega Blueprint AI™ and the new Pega Infinity Studio™, Pega’s design agents help teams design and build the optimal agentic workflows for their mission-critical business processes. Common examples include servicing a customer request, approving a loan, underwriting a claim, or optimizing a patient experience.

Once the workflows are designed and deployed, Pega shifts to a lighter-weight semantic mode of AI that is better suited to runtime, when agents are called on to process millions of user requests efficiently and consistently. Instead of re-reasoning each new workflow, agents use a lightweight AI query to understand the user's intent, find the best Pega workflow for the job, and then follow it step by step to complete the work. If a specific step needs deeper LLM use (e.g., to parse a document or summarize previous interactions), the step provides specific, bounded instructions to ensure predictability.
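The runtime pattern described above (match the intent, pick a pre-approved workflow, follow its steps in order) can be illustrated with a minimal sketch. This is not Pega's API or implementation: the `Workflow` class, the `handle_request` function, the example workflows, and the keyword-overlap matcher (a simple stand-in for the "lightweight AI query") are all hypothetical and exist only to show the shape of the idea.

```python
import re
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical illustration only; none of these names come from Pega.

@dataclass
class Workflow:
    name: str
    keywords: frozenset          # terms that signal this workflow's intent
    steps: list                  # ordered, pre-approved step functions


def step(name: str) -> Callable[[dict], dict]:
    """Build a placeholder step that records its own execution on the case."""
    def _run(case: dict) -> dict:
        case["completed_steps"].append(name)
        return case
    return _run


def match_workflow(request: str, workflows: list) -> Optional[Workflow]:
    """Stand-in for the lightweight intent query: pick the workflow whose
    keywords overlap most with the words in the request."""
    words = set(re.findall(r"[a-z]+", request.lower()))
    best = max(workflows, key=lambda w: len(w.keywords & words), default=None)
    if best is None or not (best.keywords & words):
        return None  # nothing matched; do not improvise a workflow
    return best


def handle_request(request: str, workflows: list) -> dict:
    """Route a request to a predefined workflow and run its steps in order."""
    workflow = match_workflow(request, workflows)
    if workflow is None:
        return {"request": request, "status": "unmatched"}
    case = {"request": request, "workflow": workflow.name, "completed_steps": []}
    for run_step in workflow.steps:   # fixed order, no re-planning at runtime
        case = run_step(case)
    case["status"] = "resolved"
    return case


WORKFLOWS = [
    Workflow("change_order", frozenset({"change", "order"}),
             [step("look_up_order"), step("apply_change"), step("confirm")]),
    Workflow("approve_loan", frozenset({"loan", "approve", "approval"}),
             [step("check_credit"), step("decide")]),
]

if __name__ == "__main__":
    print(handle_request("I want to change my order", WORKFLOWS))
```

The point of the sketch is that the sequence of steps is fixed when the workflow is defined, so two identical requests always take the same path; only the matching step varies with the input.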

This approach delivers two critical benefits:

  • Predictable outcomes: Re-reasoning each workflow leads to inconsistent and
    unpredictable outcomes. Instead, agents connected to Pega follow
    pre-approved workflows consistently, which is critical for regulated
    industries and smart for everybody.
  • Predictable costs: Pega’s approach uses AI reasoning once at design time,
    rather than inefficiently re-reasoning repeatedly at runtime, making it
    dramatically more efficient and affordable for agents to drive the processes
    that do the most work in a business.

 

Interactive Token Calculator: How much are you wasting on inefficient AI agents?

To help enterprises quantify the benefits of this approach, Pega introduced the AI Token Cost Calculator.

The interactive tool estimates potential savings by comparing Pega AI with token-metered alternatives based on users’ workflow volumes. Many clients can realize savings of more than 20x, depending on workflow complexity and scale.

Pay for the work being done, not thinking about what to do

Pega’s outcomes-based approach charges per completed “case” – a task executed from start to finish – not per seat or per token. For example, when a customer uses an AI agent to change an existing order, that completed interaction is recorded as a single case.

With Pega Infinity 26, available in Q3 this year, clients pay a single, flat price per completed case, regardless of how much Pega AI is used behind the scenes. This aligns cost directly to business value.
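The arithmetic behind a comparison of per-case and token-metered pricing can be sketched in a few lines. This is not the Pega AI Token Cost Calculator and does not reproduce its method; the function names are hypothetical, and every number below is a placeholder chosen for easy arithmetic, not Pega's pricing or any LLM provider's rates.

```python
# Hypothetical illustration only; all figures are placeholders.

def token_metered_cost(cases: int, tokens_per_case: int,
                       price_per_million_tokens: float) -> float:
    """Total cost when every case is billed by the tokens it consumes."""
    return cases * tokens_per_case * price_per_million_tokens / 1_000_000


def per_case_cost(cases: int, price_per_case: float) -> float:
    """Total cost when every completed case has one flat price."""
    return cases * price_per_case


def savings_multiple(metered: float, flat: float) -> float:
    """How many times more the metered option costs than the flat one."""
    if flat <= 0:
        raise ValueError("flat cost must be positive")
    return metered / flat


# Placeholder inputs (assumptions, not real prices or volumes).
CASES_PER_YEAR = 1_000_000
TOKENS_PER_CASE = 50_000            # includes any hidden reasoning tokens
PRICE_PER_MILLION_TOKENS = 10.0     # currency units per million tokens
PRICE_PER_CASE = 0.05               # flat currency units per completed case

metered = token_metered_cost(CASES_PER_YEAR, TOKENS_PER_CASE,
                             PRICE_PER_MILLION_TOKENS)
flat = per_case_cost(CASES_PER_YEAR, PRICE_PER_CASE)

if __name__ == "__main__":
    print(f"token-metered: {metered:,.2f}")
    print(f"per-case:      {flat:,.2f}")
    print(f"multiple:      {savings_multiple(metered, flat):.1f}x")
```

Under the metered model the total scales with both case volume and tokens per case, so more complex workflows cost more; under the per-case model it scales with volume alone.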


