Skip to main content

Spanpruning Processor

Status Available in: contrib Maintainers: @portertech Source: opentelemetry-collector-contrib

Supported Telemetry

Traces

Overview

Overview

The Span Pruning Processor identifies duplicate or similar leaf spans within a single trace, groups them, and replaces each group with a single aggregated summary span. When leaf spans are aggregated, the processor also recursively aggregates their parent spans if all children of those parents are being aggregated. Leaf spans are spans that are not referenced as a parent by any other span in the trace. They typically represent the last actions in an execution call stack (e.g., individual database queries, HTTP calls to external services). Spans are grouped by:
  1. Span name - spans must have the same name
  2. Span kind - spans must have the same kind (Internal, Server, Client, Producer, Consumer)
  3. Status code - spans must have the same status (OK, Error, or Unset)
  4. TraceState - spans must have identical TraceState values (for Consistent Probability Sampling compatibility)
  5. Configured attributes - spans must have matching values for attributes specified in group_by_attributes
  6. Parent span name - leaf spans must share the same parent span name to be grouped together
Parent spans are eligible for aggregation when all of their children are aggregated, they share the same name, kind, and status code, and they are not root spans. This processor is useful for reducing trace data volume while preserving meaningful information about repeated operations.

Use Cases

  • Database query optimization: When an application makes many similar database queries (e.g., N+1 queries), aggregate them into a single summary span
  • Batch operations: Consolidate many similar leaf operations into a single representative span
  • Cost reduction: Reduce trace storage costs by eliminating redundant span data

Configuration

processors:
  spanpruning:
    # Attributes to use for grouping similar leaf spans (supports glob patterns)
    # Spans with the same name AND same values for matching attributes will be grouped
    # Examples:
    #   - "db.*" matches db.operation, db.name, db.statement, etc.
    #   - "http.request.*" matches http.request.method, http.request.header, etc.
    #   - "db.operation" matches only the exact key "db.operation"
    group_by_attributes:
      - "db.*"
      - "http.method"

    # Minimum number of similar leaf spans required before aggregation
    # Default: 5
    min_spans_to_aggregate: 3

    # Maximum depth of parent span aggregation above leaf spans
    # 0 = only aggregate leaf spans (no parent aggregation)
    # -1 = unlimited depth
    # Default: 1
    max_parent_depth: 1

    # Prefix for aggregation statistics attributes
    # Default: "aggregation."
    aggregation_attribute_prefix: "batch."

Configuration Options

FieldTypeDefaultDescription
group_by_attributes[]string[]Attribute patterns for grouping (supports glob patterns like db.*)
min_spans_to_aggregateint5Minimum group size before aggregation occurs
max_parent_depthint1Max depth of parent aggregation (0=none, -1=unlimited)
aggregation_attribute_prefixstring”aggregation.”Prefix for aggregation statistics attributes

Glob Pattern Support

The group_by_attributes field supports glob patterns for matching attribute keys:
PatternMatches
db.*db.operation, db.name, db.statement, etc.
http.request.*http.request.method, http.request.header.content-type, etc.
rpc.*rpc.method, rpc.service, rpc.system, etc.
db.operationOnly the exact key db.operation
When multiple attributes match a pattern, they are all included in the grouping key (sorted alphabetically for consistency).

Summary Span

When spans are aggregated, the summary span includes:

Properties

  • Name: Original span name (e.g., SELECT)
  • TraceID: Same as original spans
  • SpanID: Newly generated unique ID
  • ParentSpanID: Same as original spans (common parent)
  • Kind: Same as template span (inherited from slowest span)
  • StartTimestamp: Earliest start time of all spans in the group
  • EndTimestamp: Latest end time of all spans in the group
  • Status: Same as original spans (spans are grouped by status code)
  • TraceState: Inherited from the template span (preserved for Consistent Probability Sampling compatibility)
  • Attributes: Inherited from the slowest span in the group
Note: The summary span’s duration (EndTimestamp - StartTimestamp) represents the total time window covered by all aggregated spans, which may exceed duration_max_ns. For example, if spans overlap or are staggered, the time range can be larger than any individual span’s duration. Use duration_max_ns to find the slowest individual operation.

What Gets Aggregated Away

When spans are aggregated into a summary span, the following data from non-template spans is lost:
DataBehavior
Span EventsEvents from the template (slowest) span are preserved
Span LinksLinks from the template span are preserved
AttributesNon-matching attribute values are lost
Individual TimestampsOriginal start/end times replaced by the group’s time range
SpanIDsOriginal SpanIDs are replaced by a single summary SpanID

Aggregation Attributes

The following attributes are added to the summary span (shown with default aggregation_attribute_prefix: "aggregation."):
AttributeTypeDescription
<prefix>is_summaryboolAlways true to identify summary spans
<prefix>span_countint64Number of spans that were aggregated
<prefix>duration_min_nsint64Minimum duration in nanoseconds
<prefix>duration_max_nsint64Maximum duration in nanoseconds
<prefix>duration_avg_nsint64Average duration in nanoseconds
<prefix>duration_total_nsint64Total duration in nanoseconds

Pipeline Placement

This processor is designed to work best when placed after processors that ensure complete traces are available:
service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [groupbytrace, spanpruning, batch]
      exporters: [otlp]
Or with tail sampling:
service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [tail_sampling, spanpruning, batch]
      exporters: [otlp]

Example

Basic Example

A trace with repeated database queries (some failing): Before Processing:
root-span (parent)
β”œβ”€β”€ SELECT (leaf) - duration: 10ms, db.operation: select, status: OK
β”œβ”€β”€ SELECT (leaf) - duration: 15ms, db.operation: select, status: OK
β”œβ”€β”€ SELECT (leaf) - duration: 12ms, db.operation: select, status: OK
β”œβ”€β”€ SELECT (leaf) - duration: 50ms, db.operation: select, status: Error
β”œβ”€β”€ SELECT (leaf) - duration: 45ms, db.operation: select, status: Error
└── INSERT (leaf) - duration: 20ms, db.operation: insert, status: OK
After Processing (with min_spans_to_aggregate: 2):
root-span (parent)
β”œβ”€β”€ SELECT (summary, status: OK)
β”‚   - aggregation.is_summary: true
β”‚   - aggregation.span_count: 3
β”‚   - aggregation.duration_min_ns: 10000000
β”‚   - aggregation.duration_max_ns: 15000000
β”‚   - aggregation.duration_avg_ns: 12333333
β”œβ”€β”€ SELECT (summary, status: Error)
β”‚   - aggregation.is_summary: true
β”‚   - aggregation.span_count: 2
β”‚   - aggregation.duration_min_ns: 45000000
β”‚   - aggregation.duration_max_ns: 50000000
β”‚   - aggregation.duration_avg_ns: 47500000
└── INSERT (unchanged - only 1 span, below threshold)
Note: Spans with different status codes are grouped separately, preserving error information.

Recursive Parent Aggregation Example

When spans are aggregated, the processor also checks if their parent spans can be aggregated. Parent spans are eligible for aggregation when:
  1. All of their children are being aggregated
  2. They share the same name, kind, and status code with other eligible parents
  3. They are not root spans (must have a parent)
  4. At least 2 parents meet the criteria
Before Processing (with min_spans_to_aggregate: 2, group_by_attributes: ["db.op"]):
root
β”œβ”€β”€ handler (status: OK)
β”‚   └── SELECT (db.op=select, status: OK) ───┐
β”œβ”€β”€ handler (status: OK)                      β”‚ leaf group A: 3 OK SELECTs
β”‚   └── SELECT (db.op=select, status: OK) ────
β”œβ”€β”€ handler (status: OK)                      β”‚
β”‚   └── SELECT (db.op=select, status: OK) β”€β”€β”€β”˜
β”œβ”€β”€ handler (status: Error)
β”‚   └── SELECT (db.op=select, status: Error) ┐ leaf group B: 2 Error SELECTs
β”œβ”€β”€ handler (status: Error)                   β”‚
β”‚   └── SELECT (db.op=select, status: Error) β”˜
β”œβ”€β”€ handler (status: OK)
β”‚   └── INSERT (db.op=insert, status: OK) ──── only 1, below threshold
└── worker (status: OK)
    └── SELECT (db.op=select, status: OK) ──── different parent name
After Processing:
root
β”œβ”€β”€ handler (summary, status: OK, span_count: 3)
β”‚   └── SELECT (summary, status: OK, span_count: 3)
β”œβ”€β”€ handler (summary, status: Error, span_count: 2)
β”‚   └── SELECT (summary, status: Error, span_count: 2)
β”œβ”€β”€ handler (status: OK)
β”‚   └── INSERT (status: OK) ─────────────────────────── unchanged
└── worker (status: OK)
    └── SELECT (status: OK) ─────────────────────────── unchanged
Why each span was handled this way:
SpanResultReason
3x handler (OK) with SELECT childrenAggregatedAll children aggregated, same name+kind+status
3x SELECT (OK) under handlerAggregatedSame name + kind + status + attributes + parent name
2x handler (Error) with SELECT childrenAggregatedAll children aggregated, same name+kind+status
2x SELECT (Error) under handlerAggregatedSame name + kind + status + attributes + parent name
handler (OK) with INSERT childUnchangedChild not aggregated (only 1 INSERT)
INSERT (OK)UnchangedBelow threshold (only 1 span)
worker (OK)UnchangedChild not aggregated
SELECT (OK) under workerUnchangedDifferent parent name than other SELECTs

Limitations

  • Requires complete traces for accurate leaf detection
  • Summary span inherits attributes from the slowest span in the group
  • Parent spans are only aggregated when ALL their children are aggregated

Consistent Probability Sampling (CPS) Compatibility

The processor is designed to be compatible with Consistent Probability Sampling (CPS). CPS uses TraceState to carry sampling metadata (ot=th:...;rv:...) where:
  • th (threshold) indicates the sampling probability threshold
  • rv (randomness value) provides consistent randomness for sampling decisions
Why TraceState matters for aggregation: Spans with different TraceState values represent different sampling populations with different β€œadjusted counts” (weights). Aggregating them together would produce statistically incorrect summaries and break downstream sampling decisions. The processor uses exact TraceState matching (not just the th value) because:
  • The rv value affects sampling decisions
  • Vendor-specific keys may have semantic meaning
  • Key ordering may be significant

Telemetry

The processor emits the following metrics to help monitor its operation:

Counters

MetricDescription
otelcol_processor_spanpruning_spans_receivedTotal number of spans received by the processor
otelcol_processor_spanpruning_spans_prunedTotal number of spans removed by aggregation
otelcol_processor_spanpruning_aggregations_createdTotal number of aggregation summary spans created
otelcol_processor_spanpruning_traces_processedTotal number of traces processed

Histograms

MetricDescription
otelcol_processor_spanpruning_aggregation_group_sizeDistribution of the number of spans per aggregation group
otelcol_processor_spanpruning_processing_durationTime taken to process each batch of traces (in seconds)
These metrics can be used to:
  • Monitor the effectiveness of span pruning (compare spans_received vs spans_pruned)
  • Track the compression ratio achieved by aggregation
  • Identify processing bottlenecks via processing_duration
  • Understand aggregation patterns via aggregation_group_size

Configuration

Example Configuration

spanpruning:
  group_by_attributes:
    - "db.operation"
  min_spans_to_aggregate: 5
  aggregation_attribute_prefix: "aggregation."

spanpruning/custom:
  group_by_attributes:
    - "db.operation"
    - "db.name"
  min_spans_to_aggregate: 3
  aggregation_attribute_prefix: "batch."

Last generated: 2026-04-13