How to Automate Usage-Based Billing for Finance Tools

While each event log provides detailed information about every API request, you need to calculate aggregate usage to be useful for billing teams and customers. In a query, you can aggregate events by model type on a daily and monthly level. Snowflake’s documentation highlights how you can aggregate monthly with window functions.

select **customer_id**, 
date(**timestamp**) as **timestamp**,
**properties**:model_type,
sum(**properties**:token_count) as daily_records
sum(daily_records) OVER (PARTITION BY **customer_id**, date_trunc('month', **timestamp**) ORDER BY **timestamp**) AS cum_monthly_records
from RAW.AMPLITUDE.EVENT
where event_type = 'language_model' 
group by 1,2,3
order by 1,2,3 desc

aggregate_usage

Definitions:

daily tokens (number): the sum of how many tokens were used on a specific day
agg_monthly_tokens (number): the sum of all tokens used up to that day of the month (inclusive)

You get the following outcome.

customer_id: 1234
date: Jan 12, 2023
model_type: ada
daily_tokens: 100
agg_monthly_tokens: 1460

It may seem like a lot of computation to recalculate from the raw event log every time, but there are numerous benefits to having stateless calculations. Rather than incrementing or decrementing a counter for every event, you always maintain a single source of truth. This introduces:

Increased accuracy: Maintaining usage data in two separate systems can often lead to differences requiring reconciliation. Directly updating your billing metrics with the raw event logs helps you ensure your billable metric is accurate.
Increased auditability: Auditing is critical whenever billing is involved for both customer experience and compliance purposes. You can always analyze your invoices by going back to our event logs.
Calculation logic at any time: Suppose traffic exceeded what expected on ChatAI's launch causing processing to slow down, and they wanted to exclude all requests that took over 1,200 ms to respond; You can easily filter out those events in one line with an aggregate query.

Going from events to aggregate usage is one step closer to billing your customers, but you’ll need to translate usage to actual dollars. There are two options to do this.

The first option is sending all charges to an invoicing platform. To do this you'll need to create a model.

SELECT
  object_construct('internalId', customer_id) AS customer_id,
  CURRENT_DATE() AS invoice_date,
  'ChatAI Demo Invoice' AS memo,
  CONCAT(au.customer_id, invoice_date) AS _id,
  object_construct(
    'item',
    ARRAY_AGG(
      object_construct(
        'quanity',
        monthly_token_count,
        'rate',
        bi.rate,
        'item',
        bi.ns_id
      )
    )
  ) AS items
FROM
  AGGREGATE_USAGE au
  JOIN AI_BILLING_INFO bi ON au.model_type = bi.model_type
GROUP BY 1, 2, 3, 4

At the end of every billing cycle, you will create an invoice.

The second option is sending usage data to a subscription management platform.

When companies start to expand their product lines or have specialized pricing for enterprise customers, it makes sense to migrate to a subscription management platform. These platforms enable you to configure custom prices, plans, and products. The most common integrations are with Stripe and NetSuite, but many subscription management platforms like Metronome or Orb exist.

This example is based on Stripe, but most platforms have similar interfaces. Once you’ve configured your customers and subscriptions within Stripe, you can report usage via the Usage Records table within Hightouch.

Starting from your aggregate_usage record table above, you can join the customer_id and model_type to find the associated Stripe subscription_item_id.

From there you you'll need to create a usage record model.

SELECT ss.subscription_item_id, au.daily_tokens, au.date AS timestamp_
FROM aggegrate_usage au
JOIN stripe_subscriptions ss ON ss.customer_id = au.customer_id 
	AND ss.product = au.model_type

subscription_item_id: 1234
daily_tokens: 3682
timestamp_: 167391672

With a usage record model created, the last step is to map your model to the Stripe usage record table.

After selecting the Usage Record table, you only need four fields to update the usage record:

Subscription Item (string): Subscription Items map a customer to pricing plan
Quantity (integer): The usage quantity for the specified date
Timestamp (timestamp): The timestamp when this usage occurred
Action (enum): Increment or Set the usage record for the timestamp

Implementing Usage-Based Billing with Hightouch and the Cloud Data Warehouse

This playbook will show you how to build usage-based billing process for SaaS products.

Overview

Example Use Case

Getting Started

Step 1: Collecting Event Data

Step 2: Aggregating Usage

Step 3: Calculating Charges

Step 4: Powering a Usage Dashboard

Step 5: Configuring Usage Alerts

Wrapping Up