Skip to content

telemetry_metrics.distribution is not reporting sensible results #20

@alco

Description

@alco

I am not sure whether the error is in the metric collection or in the export but, given a metric definition like

distribution("vm.monitor.long_gc.timeout", tags: [:process_type], unit: :millisecond)

I'm seeing it reported to Honeycomb (via Otel export) as follows:

vm.monitor.long_gc.timeout.avg:    668
vm.monitor.long_gc.timeout.count:  1
vm.monitor.long_gc.timeout.max:    0
vm.monitor.long_gc.timeout.min:    0
vm.monitor.long_gc.timeout.p001:   500
vm.monitor.long_gc.timeout.p01:    500
vm.monitor.long_gc.timeout.p05:    500
vm.monitor.long_gc.timeout.p10:    500
vm.monitor.long_gc.timeout.p20:    500
vm.monitor.long_gc.timeout.p25:    500
vm.monitor.long_gc.timeout.p50:    500
vm.monitor.long_gc.timeout.p75:    500
vm.monitor.long_gc.timeout.p80:    500
vm.monitor.long_gc.timeout.p90:    500
vm.monitor.long_gc.timeout.p95:    500
vm.monitor.long_gc.timeout.p99:    500
vm.monitor.long_gc.timeout.p999:   500
vm.monitor.long_gc.timeout.sum:    668

So, even though the attributes *.min and *.max are present, it's no use plotting them on a chart. I don't think the percentiles make any sense either. Since *.count is 1, there must have been a single data point collected, so the avg value is the actual value collected, most likely?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions