Bounded AI pricing
Pricing
Fixing the cost problem no one wants to talk about
Governed, Pre-Execution Provisioning (G‑PEP)
Governed, pre-execution provisioning (G‑PEP) is an architectural control layer that authorizes, constrains, routes, defers, or denies AI execution before inference based on policy, entitlement, and cost constraints.
FLOP-Based Metering (FBM)
FLOP-Based Metering (FBM) measures AI usage based on compute work performed rather than tokens generated. It treats inference as a computational process whose cost is determined by underlying operations, not textual output.
Normalized Compute Unit (NCU)
A Normalized Compute Unit (NCU) is a standardized unit of measure that represents compute consumption in a consistent and comparable form. It translates underlying computational work into a normalized unit that can be used for estimation, comparison and control.