Aggregators
You can use aggregations at query time to summarize result data.
You can use aggregations at query time to summarize result data. The following aggregators are available:
Analytics Aggregators
eventCount
eventCount computes the number of TelemetryDeck events in the query.
{ "type" : "eventCount", "name" : <output_name, defaults to "Events"> }userCount
userCount computes the number of unique users in the query.
{ "type" : "userCount", "name" : <output_name, defaults to "Users"> }histogram
histogram computes the histogram of a metric. By default, it computes the histogram of the floatValue metric, because that’s the only metric that is guaranteed to be a numerical value in TelemetryDeck events.
{
"type" : "histogram",
"name" : <output_name, defaults to "Histogram">,
"fieldName" : <metric_name, defaults to floatValue>,
"numBuckets" : <number_of_buckets, defaults to 10>
}Numerical Aggregators
Count
count computes the count of rows that match the filters.
{ "type" : "count", "name" : <output_name> }DoubleSum
doubleSum computes and stores the sum of values as a 64-bit floating point value.
The doubleSum aggregator takes the following properties:
name: Output name for the summed valuefieldName: Name of the metric column to sum over
{ "type" : "doubleSum", "name" : <output_name>, "fieldName" : <metric_name> }DoubleMin
doubleMin computes the minimum of all metric values and Double.POSITIVE_INFINITY.
{ "type" : "doubleMin", "name" : <output_name>, "fieldName" : <metric_name> }DoubleMax
doubleMax computes the maximum of all metric values and Double.NEGATIVE_INFINITY.
{ "type" : "doubleMax", "name" : <output_name>, "fieldName" : <metric_name> }Averaging Aggregators
DoubleMean
doubleMean computes and returns the arithmetic mean of a column’s values as a 64-bit floating point value.
Warning: this aggregator is mean 😡😡
{ "type" : "doubleMean", "name" : <output_name>, "fieldName" : <metric_name> }Unique Aggregators
ThetaSketch
A theta sketch gives you the number of unique values that a dimension has in your query.
Theta sketches are a probabilistic data structure used for the count-distinct problem. They allow us to quickly count elements in sets, such as the set of users in the aggregation buckets
{
"fieldName": "clientUser",
"name": "count",
"type": "thetaSketch"
}Cardinality
cardinality computes the cardinality of a dimension.
{
"byRow": false,
"fields": ["clientUser"],
"name": "a0",
"round": true,
"type": "cardinality"
}First and Last Aggregators
DoubleFirst
doubleFirst computes the first value of all metric values.
{ "type" : "doubleFirst", "name" : <output_name>, "fieldName" : <metric_name> }DoubleLast
doubleLast computes the last value of all metric values.
{ "type" : "doubleLast", "name" : <output_name>, "fieldName" : <metric_name> }StringFirst
stringFirst computes the first value of all metric values.
{ "type" : "stringFirst", "name" : <output_name>, "fieldName" : <metric_name> }StringLast
stringLast computes the last value of all metric values.
{ "type" : "stringLast", "name" : <output_name>, "fieldName" : <metric_name> }ANY Aggregators
DoubleAny
Returns any value including null. This aggregator can simplify and optimize the performance by returning the first encountered value (including null).
doubleAny returns any double metric value.
{ "type" : "doubleAny", "name" : <output_name>, "fieldName" : <metric_name> }StringAny
Returns any value including null. This aggregator can simplify and optimize the performance by returning the first encountered value (including null).
stringAny returns any string metric value.
{ "type" : "stringAny", "name" : <output_name>, "fieldName" : <metric_name> }Misc
Filtered
A filtered aggregator wraps any given aggregator, but only aggregates the values for which the given dimension filter matches.
This makes it possible to compute the results of a filtered and an unfiltered aggregation simultaneously, without having to issue multiple queries, and use both results as part of post-aggregations.
Note: If only the filtered results are required, consider putting the filter on the query itself, which will be much faster since it doesn’t require scanning all the data.
Filtered aggregators also support relative time intervals.
{
"type": "filtered",
"filter": {
"type": "and",
"fields": [
{
"type": "selector",
"dimension": "type",
"value": "InsightShown"
}
]
},
"aggregator": {
"type": "thetaSketch",
"name": "InsightShown",
"fieldName": "clientUser"
}
}