Skip to main content

window_sum

Definition

A window aggregation function that returns the sum of rows in a range relative to the current row.

Syntax

window_sum(agg_expr)
window_sum(agg_expr, order: order_expr, ...)
window_sum(agg_expr, range, order: order_expr, ...)
window_sum(agg_expr, range, order: order_expr, ..., reset: partition_expr, ...)
window_sum(agg_expr, range, order: order_expr, ..., partition: partition_expr, ...)
Examples
window_sum(count(users.id))
window_sum(count(users.id), order: count(users.id) | desc())
window_sum(count(users.id), -2..2, order: users.created_at | month())
window_sum(count(users.id), order: users.created_at | month(), reset: users.gender)

Input

  • agg_expr (required): An aggregation expression to be summed.
  • range (optional): A range of rows to include in the sum. Negative values indicate rows before the current row, and positive values indicate rows after the current row, while 0 indicates the current row. If the beginning or end of the range is not specified, the range will include all rows from the beginning or end of the table. By default, if the range is not specified:
    • If order is specified, the range is ..0 (from the first row to the current row).
    • If order is not specified, the range is .. (from the first row to the last row).
  • order (required, repeatable): A field that is used for ordering. The order is default to ascending. The order can be set explicitly with asc() or desc().
    danger

    If the specified order does not uniquely identify rows, the result of the function can be non-deterministic. For example, if you use order: users.age, and there are multiple users with the same age in the same partition, the result can be unexpected.

  • partition or reset (repeatable, optional): A field that is used for partitioning the table. If partitions are not specified:
    • If order is specified, the table will be partitioned by all other grouping columns.
    • If order is not specified, the table will be considered as a single partition.

Output

The sum of the current row and the rows within the specified range.

Sample Usages

Running Sum of Count over Order Status
explore {
dimensions {
users.gender,
orders.status
}
measures {
count(orders.id),
_running_sum: window_sum(count(orders.id), order: orders.status)
}
}

Notice that it automatically resets the sum when the gender changes. This is because we only specify order and thus reset defaults to gender. If you want it to not reset, you can order by both gender and status.

Running Sum of Count over Gender and Order Status
explore {
dimensions {
users.gender,
orders.status
}
measures {
count(orders.id),
_running_sum: window_sum(count(orders.id), order: users.gender, order: orders.status)
}
}

By default, the range is ..0 (from the first row to the current row). Thus if we want to sum all rows, we can use .. for the range. And since we are not using relative range, we can omit the order: parameter.

Total Count of Users with percentage
explore {
dimensions {
users.gender,
orders.status
}
measures {
count(orders.id),
pct: count(orders.id) / window_sum(count(orders.id), ..)
}
}

Let us know what you think about this document :)