- [FEATURE] Add parquet block format #1479 #1531 #1564 (@annanay25, @mdisibio)
- [FEATURE] Mark
log_received_traces
as deprecated. New flag islog_received_spans
. Extend distributor spans logger with optional features to include span attributes and a filter by error status. #1465 (@faustodavid) - [FEATURE] Add tags option for s3 backends. This allows new objects to be written with the configured tags. #1442 (@stevenbrookes)
- [CHANGE] metrics-generator: Changed added metric label
instance
to__metrics_gen_instance
to reduce collisions with custom dimensions. #1439 (@joe-elliott) - [CHANGE] Don't enforce
max_bytes_per_tag_values_query
when set to 0. #1447 (@joe-elliott) - [CHANGE] Add new querier service in deployment jsonnet to serve
/status
endpoint. #1474 (@annanay25) - [CHANGE] Swapped out Google Cloud Functions serverless docs and build for Google Cloud Run. #1483 (@joe-elliott)
- [CHANGE] BREAKING CHANGE Change spanmetrics metric names and labels to match OTel conventions. #1478 (@mapno) Old metric names:
traces_spanmetrics_duration_seconds_{sum,count,bucket}
New metric names:
traces_spanmetrics_latency_{sum,count,bucket}
Additionally, default label span_status
is renamed to status_code
.
- [CHANGE] Update to Go 1.18 #1504 (@annanay25)
- [CHANGE] Change tag/value lookups to return partial results when reaching response size limit instead of failing #1517 (@mdisibio)
- [CHANGE] Change search to be case-sensitive #1547 (@mdisibio)
- [CHANGE] Relax Hedged request defaults for external endpoints. #1566 (@joe-elliott)
querier: search: external_hedge_requests_at: 4s -> 8s external_hedge_requests_up_to: 3 -> 2
- [CHANGE] BREAKING CHANGE Include emptyDir for metrics generator wal storage in jsonnet #1556 (@zalegrala) Jsonnet users will now need to specify a storage request and limit for the generator wal. _config+:: { metrics_generator+: { ephemeral_storage_request_size: '10Gi', ephemeral_storage_limit_size: '11Gi', }, }
- [FEATURE] metrics-generator: support per-tenant processor configuration #1434 (@kvrhdn)
- [FEATURE] Include rollout dashboard #1456 (@zalegrala)
- [FEATURE] Add SentinelPassword configuration for Redis #1463 (@zalegrala)
- [ENHANCEMENT] Added the ability to have a per tenant max search duration. #1421 (@joe-elliott)
- [ENHANCEMENT] metrics-generator: expose max_active_series as a metric #1471 (@kvrhdn)
- [ENHANCEMENT] Azure Backend: Add support for authentication with Managed Identities. #1457 (@joe-elliott)
- [ENHANCEMENT] Add metric to track feature enablement #1459 (@zalegrala)
- [ENHANCEMENT] Added s3 config option
insecure_skip_verify
#1470 (@zalegrala) - [ENHANCEMENT] Added polling option to reduce issues in Azure
blocklist_poll_jitter_ms
#1518 (@joe-elliott) - [ENHANCEMENT] Add a config to query single ingester instance based on trace id hash for Trace By ID API. (1484)[grafana#1484] (@sagarwala, @bikashmishra100, @ashwinidulams)
- [ENHANCEMENT] Add blocklist metrics for total backend objects and total backend bytes #1519 (@ie-pham)
- [ENHANCEMENT] Adds
tempo_querier_external_endpoint_hedged_roundtrips_total
to count the total hedged requests #1558 (@joe-elliott) BREAKING CHANGE Removed deprecated metricstempodb_(gcs|s3|azure)_request_duration_seconds
in favor oftempodb_backend_request_duration_seconds
. These metrics have been deprecated since v1.1. - [BUGFIX] Fix nil pointer panic when the trace by id path errors. #1441 (@joe-elliott)
- [BUGFIX] Update tempo microservices Helm values example which missed the 'enabled' key for thriftHttp. #1472 (@hajowieland)
- [BUGFIX] Fix race condition in forwarder overrides loop. 1468 (@mapno)
- [BUGFIX] Fix v2 backend check on span name to be substring #1538 (@mdisibio)
- [BUGFIX] Fix wal check on span name to be substring #1548 (@mdisibio)
- [BUGFIX] Prevent ingester panic "cannot grow buffer" #1258 (@mdisibio)
- [BUGFIX] metrics-generator: do not remove x-scope-orgid header in single tenant modus #1554 (@kvrhdn)
- [BUGFIX] metrics-generator: don't inject X-Scope-OrgID header for single-tenant setups 1417 (@kvrhdn)
- [BUGFIX] compactor: populate
compaction_objects_combined_total
andtempo_discarded_spans_total{reason="trace_too_large_to_compact"}
metrics again 1420 (@mdisibio) - [BUGFIX] distributor: prevent panics when concurrently calling
shutdown
to forwarder's queueManager 1422 (@mapno)
- [CHANGE] Vulture now exercises search at any point during the block retention to test full backend search.
BREAKING CHANGE Dropped
tempo-search-retention-duration
parameter. #1297 (@joe-elliott) - [CHANGE] Updated storage.trace.pool.queue_depth default from 200->10000. #1345 (@joe-elliott)
- [CHANGE] Update alpine images to 3.15 #1330 (@zalegrala)
- [CHANGE] Updated flags
-storage.trace.azure.storage-account-name
and-storage.trace.s3.access_key
to no longer to be considered as secrets #1356 (@simonswine) - [CHANGE] Add warning threshold for TempoIngesterFlushes and adjust critical threshold #1354 (@zalegrala)
- [CHANGE] Include lambda in serverless e2e tests #1357 (@zalegrala)
- [CHANGE] Replace mixin TempoIngesterFlushes metric to only look at retries #1354 (@zalegrala)
- [CHANGE] Update the jsonnet for single-binary to include clustering #1391 (@zalegrala)
BREAKING CHANGE After this change, the port specification has moved under
$._config.tempo
to avoid global port spec. - [FEATURE]: v2 object encoding added. This encoding adds a start/end timestamp to every record to reduce proto marshalling and increase search speed. BREAKING CHANGE After this rollout the distributors will use a new API on the ingesters. As such you must rollout all ingesters before rolling the distributors. Also, during this period, the ingesters will use considerably more resources and as such should be scaled up (or incoming traffic should be heavily throttled). Once all distributors and ingesters have rolled performance will return to normal. Internally we have observed ~1.5x CPU load on the ingesters during the rollout. #1227 (@joe-elliott)
- [FEATURE] Added metrics-generator: an optional components to generate metrics from ingested traces #1282 (@mapno, @kvrhdn)
- [FEATURE] Allow the compaction cycle to be configurable with a default of 30 seconds #1335 (@willdot)
- [FEATURE] Add new config options for setting GCS metadata on new objects (@zalegrala)
- [ENHANCEMENT] Enterprise jsonnet: add config to create tokengen job explicitly #1256 (@kvrhdn)
- [ENHANCEMENT] Add new scaling alerts to the tempo-mixin #1292 (@mapno)
- [ENHANCEMENT] Improve serverless handler error messages #1305 (@joe-elliott)
- [ENHANCEMENT] Added a configuration option
search_prefer_self
to allow the queriers to do some work while also leveraging serverless in search. #1307 (@joe-elliott) - [ENHANCEMENT] Make trace combination/compaction more efficient #1291 (@mdisibio)
- [ENHANCEMENT] Add Content-Type headers to query-frontend paths #1306 (@wperron)
- [ENHANCEMENT] Partially persist traces that exceed
max_bytes_per_trace
during compaction #1317 (@joe-elliott) - [ENHANCEMENT] Make search respect per tenant
max_bytes_per_trace
and addedskippedTraces
to returned search metrics. #1318 (@joe-elliott) - [ENHANCEMENT] Improve serverless consistency by forcing a GC before returning. #1324 (@joe-elliott)
- [ENHANCEMENT] Add forwarding queue from distributor to metrics-generator. #1331 (@mapno)
- [ENHANCEMENT] Add hedging to queries to external endpoints. #1350 (@joe-elliott)
New config options and defaults:
** BREAKING CHANGE ** Querier options related to search have moved under a
querier: search: external_hedge_requests_at: 5s external_hedge_requests_up_to: 3
search
block:becomesquerier: search_query_timeout: 30s search_external_endpoints: [] search_prefer_self: 2
querier: search: query_timeout: 30s prefer_self: 2 external_endpoints: []
- [ENHANCEMENT] Added tenant ID (instance ID) to
trace too large message
. #1385 (@cristiangsp) - [ENHANCEMENT] Add a startTime and endTime parameter to the Trace by ID Tempo Query API to improve query performance #1388 (@sagarwala, @bikashmishra100, @ashwinidulams)
- [BUGFIX] Correct issue where Azure "Blob Not Found" errors were sometimes not handled correctly #1390 (@joe-elliott)
- [BUGFIX]: Enable compaction and retention in Tanka single-binary #1352
- [BUGFIX]: Remove unnecessary PersistentVolumeClaim #1245
- [BUGFIX] Fixed issue when query-frontend doesn't log request details when request is cancelled #1136 (@adityapwr)
- [BUGFIX] Update OTLP port in examples (docker-compose & kubernetes) from legacy ports (55680/55681) to new ports (4317/4318) #1294 (@mapno)
- [BUGFIX] Fixes min/max time on blocks to be based on span times instead of ingestion time. #1314 (@joe-elliott)
- Includes new configuration option to restrict the amount of slack around now to update the block start/end time. #1332 (@joe-elliott)
storage: trace: wal: ingestion_time_range_slack: 2m0s
- Includes a new metric to determine how often this range is exceeded:
tempo_warnings_total{reason="outside_ingestion_time_slack"}
- Includes new configuration option to restrict the amount of slack around now to update the block start/end time. #1332 (@joe-elliott)
- [BUGFIX] Prevent data race / ingester crash during searching by trace id by using xxhash instance as a local variable. #1387 (@bikashmishra100, @sagarwala, @ashwinidulams)
- [BUGFIX] Fix spurious "failed to mark block compacted during retention" errors #1372 (@mdisibio)
- [BUGFIX] Fix error message "Writer is closed" by resetting compression writer correctly on the error path. #1379 (@annanay25)
- [BUGFIX] Fixed an issue where the query-frontend would corrupt start/end time ranges on searches which included the ingesters [#1295] (@joe-elliott)
- [BUGFIX] Fixed panic when using etcd as ring's kvstore #1260 (@mapno)
- [FEATURE]: Add support for inline environments. #1184 @irizzant
- [CHANGE] Search: Add new per-tenant limit
max_bytes_per_tag_values_query
to limit the size of tag-values response. #1068 (@annanay25) - [CHANGE] Reduce MaxSearchBytesPerTrace
ingester.max-search-bytes-per-trace
default to 5KB #1129 @annanay25 - [CHANGE] BREAKING CHANGE The OTEL GRPC receiver's default port changed from 55680 to 4317. #1142 (@tete17)
- [CHANGE] Remove deprecated method
Push
fromtempopb.Pusher
#1173 (@kvrhdn) - [CHANGE] Upgrade cristalhq/hedgedhttp from v0.6.0 to v0.7.0 #1159 (@cristaloleg)
- [CHANGE] Export trace id constant in api package #1176
- [CHANGE] GRPC
1.33.3
=>1.38.0
broke compatibility withgogoproto.customtype
. Enforce the use of gogoproto marshalling/unmarshalling for Tempo, Cortex & Jaeger structs. #1186 (@annanay25) - [CHANGE] BREAKING CHANGE Remove deprecated ingester gRPC endpoint and data encoding. The current data encoding was introduced in v1.0. If running earlier versions, first upgrade to v1.0 through v1.2 and allow time for all blocks to be switched to the "v1" data encoding. #1215 (@mdisibio)
- [FEATURE] Added support for full backend search. #1174 (@joe-elliott)
BREAKING CHANGE Moved
querier.search_max_result_limit
andquerier.search_default_result_limit
toquery_frontend.search.max_result_limit
andquery_frontend.search.default_result_limit
- [ENHANCEMENT]: Improve variables expansion support #1212 @irizzant
- [ENHANCEMENT] Expose
upto
parameter on hedged requests for each backend withhedge_requests_up_to
. #1085 (@joe-elliott) - [ENHANCEMENT] Search: drop use of TagCache, extract tags and tag values on-demand #1068 (@kvrhdn)
- [ENHANCEMENT] Jsonnet: add
$._config.namespace
to filter by namespace in cortex metrics #1098 (@mapno) - [ENHANCEMENT] Add middleware to compress frontend HTTP responses with gzip if requested #1080 (@kvrhdn, @zalegrala)
- [ENHANCEMENT] Allow query disablement in vulture #1117 (@zalegrala)
- [ENHANCEMENT] Improve memory efficiency of compaction and block cutting. #1121 #1130 (@joe-elliott)
- [ENHANCEMENT] Include metrics for configured limit overrides and defaults: tempo_limits_overrides, tempo_limits_defaults #1089 (@zalegrala)
- [ENHANCEMENT] Add Envoy Proxy panel to
Tempo / Writes
dashboard #1137 (@kvrhdn) - [ENHANCEMENT] Reduce compactionCycle to improve performance in large multitenant environments #1145 (@joe-elliott)
- [ENHANCEMENT] Added max_time_per_tenant to allow for independently configuring polling and compaction cycle. #1145 (@joe-elliott)
- [ENHANCEMENT] Add
tempodb_compaction_outstanding_blocks
metric to measure compaction load #1144 (@mapno) - [ENHANCEMENT] Update mixin to use new backend metric #1151 (@zalegrala)
- [ENHANCEMENT] Make
TempoIngesterFlushesFailing
alert more actionable #1157 (@dannykopping) - [ENHANCEMENT] Switch open-telemetry/opentelemetry-collector to grafana/opentelemetry-collectorl fork, update it to 0.40.0 and add missing dependencies due to the change #1142 (@tete17)
- [ENHANCEMENT] Allow environment variables for Azure storage credentials #1147 (@zalegrala)
- [ENHANCEMENT] jsonnet: set rollingUpdate.maxSurge to 3 for distributor, frontend and queriers #1164 (@kvrhdn)
- [ENHANCEMENT] Reduce search data file sizes by optimizing contents #1165 (@mdisibio)
- [ENHANCEMENT] Add
tempo_ingester_live_traces
metric #1170 (@mdisibio) - [ENHANCEMENT] Update compactor ring to automatically forget unhealthy entries #1178 (@mdisibio)
- [ENHANCEMENT] Added the ability to pass ISO8601 date/times for start/end date to tempo-cli query api search #1208 (@joe-elliott)
- [ENHANCEMENT] Prevent writes to large traces even after flushing to disk #1199 (@mdisibio)
- [BUGFIX] Add process name to vulture traces to work around display issues #1127 (@mdisibio)
- [BUGFIX] Fixed issue where compaction sometimes dropped spans. #1130 (@joe-elliott)
- [BUGFIX] Ensure that the admin client jsonnet has correct S3 bucket property. (@hedss)
- [BUGFIX] Publish tenant index age correctly for tenant index writers. #1146 (@joe-elliott)
- [BUGFIX] Ingester startup panic
slice bounds out of range
#1195 (@mdisibio) - [BUGFIX] Update goreleaser install method to
go install
. # (@mapno) - [BUGFIX] tempo-mixin: remove TempoDB Access panel from
Tempo / Reads
, metrics don't exist anymore #1218 (@kvrhdn)
- [BUGFIX] Fix defaults for MaxBytesPerTrace (ingester.max-bytes-per-trace) and MaxSearchBytesPerTrace (ingester.max-search-bytes-per-trace) #1109 (@bitprocessor)
- [BUGFIX] Ignore empty objects during compaction #1113 (@mdisibio)
- [CHANGE] BREAKING CHANGE Drop support for v0 and v1 blocks. See 1.1 changelog for details #919 (@joe-elliott)
- [CHANGE] Renamed CLI flag from
--storage.trace.maintenance-cycle
to--storage.trace.blocklist_poll
. This is a breaking change #897 (@mritunjaysharma394) - [CHANGE] update jsonnet alerts and recording rules to use
job_selectors
andcluster_selectors
for configurable unique identifier labels #935 (@kevinschoonover) - [CHANGE] Modify generated tag keys in Vulture for easier filtering #934 (@zalegrala)
- [CHANGE] BREAKING CHANGE Consolidate status information onto /status endpoint #952 @zalegrala)
The following endpoints moved.
/runtime_config
moved to/status/runtime_config
/config
moved to/status/config
/services
moved to/status/services
- [CHANGE] BREAKING CHANGE Change ingester metric
ingester_bytes_metric_total
in favor ofingester_bytes_received_total
#979 (@mapno) - [CHANGE] Add troubleshooting language to config for
server.grpc_server_max_recv_msg_size
andserver.grpc_server_max_send_msg_size
when handling large traces #1023 (@thejosephstevens) - [CHANGE] Parse search query tags from
tags
query parameter #1055 (@kvrhdn) - [FEATURE] Add ability to search ingesters for traces #806 (@mdisibio)
- [FEATURE] Add runtime config handler #936 (@mapno)
- [FEATURE] Search WAL reload and compression(versioned encoding) support #1000 (@annanay25, @mdisibio)
- [FEATURE] Added ability to add a middleware to the OTel receivers' consume function #1015 (@chaudum)
- [FEATURE] Add ScalableSingleBinary operational run mode #1004 (@zalegrala)
- [FEATURE] Added a jsonnet library for Grafana Enterprise Traces (GET) deployments #1096
- [ENHANCEMENT] Added "query blocks" cli option. #876 (@joe-elliott)
- [ENHANCEMENT] Added "search blocks" cli option. #972 (@joe-elliott)
- [ENHANCEMENT] Added traceid to
trace too large message
. #888 (@mritunjaysharma394) - [ENHANCEMENT] Add support to tempo workloads to
overrides
from single configmap in microservice mode. #896 (@kavirajk) - [ENHANCEMENT] Make
overrides_config
block name consistent with Loki and Cortex in microservice mode. #906 (@kavirajk) - [ENHANCEMENT] Changes the metrics name from
cortex_runtime_config_last_reload_successful
totempo_runtime_config_last_reload_successful
#945 (@kavirajk) - [ENHANCEMENT] Updated config defaults to reflect better capture operational knowledge. #913 (@joe-elliott)
ingester: trace_idle_period: 30s => 10s # reduce ingester memory requirements with little impact on querying flush_check_period: 30s => 10s query_frontend: query_shards: 2 => 20 # will massively improve performance on large installs storage: trace: wal: encoding: none => snappy # snappy has been tested thoroughly and ready for production use block: bloom_filter_false_positive: .05 => .01 # will increase total bloom filter size but improve query performance bloom_filter_shard_size_bytes: 256KiB => 100 KiB # will improve query performance compactor: compaction: chunk_size_bytes: 10 MiB => 5 MiB # will reduce compactor memory needs compaction_window: 4h => 1h # will allow more compactors to participate in compaction without substantially increasing blocks
- [ENHANCEMENT] Make s3 backend readError logic more robust #905 (@wei840222)
- [ENHANCEMENT] Include additional detail when searching for traces #916 (@zalegrala)
- [ENHANCEMENT] Add
gen index
andgen bloom
commands to tempo-cli. #903 (@annanay25) - [ENHANCEMENT] Implement trace comparison in Vulture #904 (@zalegrala)
- [ENHANCEMENT] Improve zstd read throughput using zstd.Decoder #948 (@joe-elliott)
- [ENHANCEMENT] Dedupe search records while replaying WAL #940 (@annanay25)
- [ENHANCEMENT] Add status endpoint to list the available endpoints #938 (@zalegrala)
- [ENHANCEMENT] Compression updates: Added s2, improved snappy performance #961 (@joe-elliott)
- [ENHANCEMENT] Add search block headers #943 (@mdisibio)
- [ENHANCEMENT] Add search block headers for wal blocks #963 (@mdisibio)
- [ENHANCEMENT] Add support for vulture sending long running traces #951 (@zalegrala)
- [ENHANCEMENT] Support global denylist and per-tenant allowlist of tags for search data. #960 (@annanay25)
- [ENHANCEMENT] Add
search_query_timeout
to querier config. #984 (@kvrhdn) - [ENHANCEMENT] Include simple e2e test to test searching #978 (@zalegrala)
- [ENHANCEMENT] Jsonnet: add
$._config.memcached.memory_limit_mb
#987 (@kvrhdn) - [ENHANCEMENT] Upgrade jsonnet-libs to 1.19 and update tk examples #1001 (@mapno)
- [ENHANCEMENT] Shard tenant index creation by tenant and add functionality to handle stale indexes. #1005 (@joe-elliott)
- [ENHANCEMENT] BREAKING CHANGE Support partial results from failed block queries #1007 (@mapno)
Querier
GET /querier/api/traces/<traceid>
response's body has been modified to returntempopb.TraceByIDResponse
instead of simplytempopb.Trace
. This will cause a disruption of the read path during rollout of the change. - [ENHANCEMENT] Add
search_default_limit
andsearch_max_result_limit
to querier config. #1022 #1044 (@kvrhdn) - [ENHANCEMENT] Add new metric
tempo_distributor_push_duration_seconds
#1027 (@zalegrala) - [ENHANCEMENT] Add query parameter to show the default config values and the difference between the current values and the defaults. #1045 (@MichelHollands)
- [ENHANCEMENT] Adding metrics around ingester flush retries #1049 (@dannykopping)
- [ENHANCEMENT] Performance: More efficient distributor batching #1075 (@joe-elliott)
- [ENHANCEMENT] Allow search disablement in vulture #1069 (@zalegrala)
- [ENHANCEMENT] Jsonnet: add
$._config.search_enabled
, correctly sethttp_api_prefix
in config #1072 (@kvrhdn) - [ENHANCEMENT] Performance: Remove WAL contention between ingest and searches #1076 (@mdisibio)
- [ENHANCEMENT] Include tempo-cli in the release #1086 (@zalegrala)
- [ENHANCEMENT] Add search on span status #1093 (@mdisibio)
- [ENHANCEMENT] Slightly improved compression performance #1094 (@bboreham)
- [BUGFIX] Update port spec for GCS docker-compose example #869 (@zalegrala)
- [BUGFIX] Fix "magic number" errors and other block mishandling when an ingester forcefully shuts down #937 (@mdisibio)
- [BUGFIX] Fix compactor memory leak #806 (@mdisibio)
- [BUGFIX] Fix an issue with WAL replay of zero-length search data when search is disabled. #968 (@annanay25)
- [BUGFIX] Set span's tag
span.kind
toclient
in query-frontend #975 (@mapno) - [BUGFIX] Nil check overrides module in the
/status
handler #994 (@mapno) - [BUGFIX] Several bug fixes for search contention and panics #1033 (@mdisibio)
- [BUGFIX] Fixes
tempodb_backend_hedged_roundtrips_total
to correctly count hedged roundtrips. #1079 (@joe-elliott) - [BUGFIX] Update go-kit logger package to remove spurious debug logs #1094 (@bboreham)
- [CHANGE] Upgrade Cortex from v1.9.0 to v1.9.0-131-ga4bf10354 #841 (@aknuds1)
- [CHANGE] Change default tempo port from 3100 to 3200 #770 (@MurzNN)
- [CHANGE] Jsonnet: use dedicated configmaps for distributors and ingesters #775 (@kvrhdn)
- [CHANGE] Docker images are now prefixed by their branch name #828 (@jvrplmlmn)
- [CHANGE] Update to Go 1.17 #953
- [FEATURE] Added the ability to hedge requests with all backends #750 (@joe-elliott)
- [FEATURE] Added a tenant index to reduce bucket polling. #834 (@joe-elliott)
- [ENHANCEMENT] Added hedged request metric
tempodb_backend_hedged_roundtrips_total
and a new storage agnostictempodb_backend_request_duration_seconds
metric that supersedes the soon-to-be deprecated storage specific metrics (tempodb_azure_request_duration_seconds
,tempodb_s3_request_duration_seconds
andtempodb_gcs_request_duration_seconds
). #790 (@JosephWoodward) - [ENHANCEMENT] Performance: improve compaction speed with concurrent reads and writes #754 (@mdisibio)
- [ENHANCEMENT] Improve readability of cpu and memory metrics on operational dashboard #764 (@bboreham)
- [ENHANCEMENT] Add
azure_request_duration_seconds
metric. #767 (@JosephWoodward) - [ENHANCEMENT] Add
s3_request_duration_seconds
metric. #776 (@JosephWoodward) - [ENHANCEMENT] Add
tempo_ingester_flush_size_bytes
metric. #777 (@bboreham) - [ENHANCEMENT] Microservices jsonnet: resource requests and limits can be set in
$._config
. #793 (@kvrhdn) - [ENHANCEMENT] Add
-config.expand-env
cli flag to support environment variables expansion in config file. #796 (@Ashmita152) - [ENHANCEMENT] Add ability to control bloom filter caching based on age and/or compaction level. Add new cli command
list cache-summary
. #805 (@annanay25) - [ENHANCEMENT] Emit traces for ingester flush operations. #812 (@bboreham)
- [ENHANCEMENT] Add retry middleware in query-frontend. #814 (@kvrhdn)
- [ENHANCEMENT] Add
-use-otel-tracer
to use the OpenTelemetry tracer, this will also capture traces emitted by the gcs sdk. Experimental: not all features are supported (i.e. remote sampling). #842 (@kvrhdn) - [ENHANCEMENT] Add
/services
endpoint. #863 (@kvrhdn) - [ENHANCEMENT] Include distributed docker-compose example #859 (@zalegrala)
- [ENHANCEMENT] Added "query blocks" cli option. #876 (@joe-elliott)
- [ENHANCEMENT] Add e2e integration test for GCS. #883 (@annanay25)
- [ENHANCEMENT] Added traceid to
trace too large message
. #888 (@mritunjaysharma394) - [ENHANCEMENT] Add support to tempo workloads to
overrides
from single configmap in microservice mode. #896 (@kavirajk) - [ENHANCEMENT] Make
overrides_config
block name consistent with Loki and Cortex in microservice mode. #906 (@kavirajk) - [ENHANCEMENT] Make
overrides_config
mount name statictempo-overrides
in the tempo workloads in microservice mode. #906 (@kavirajk) - [ENHANCEMENT] Reduce compactor memory usage by forcing garbage collection. #915 (@joe-elliott)
- [ENHANCEMENT] Implement search in vulture. #944 (@zalegrala)
- [BUGFIX] Allow only valid trace ID characters when decoding #854 (@zalegrala)
- [BUGFIX] Queriers complete one polling cycle before finishing startup. #834 (@joe-elliott)
- [BUGFIX] Update port spec for GCS docker-compose example #869 (@zalegrala)
- [BUGFIX] Cortex upgrade to fix an issue where unhealthy compactors can't be forgotten #878 (@joe-elliott)
- [BUGFIX] Guard against negative dataLength #763 (@joe-elliott)
- [CHANGE] Mark
-auth.enabled
as deprecated. New flag is-multitenancy.enabled
and is set to false by default. This is a breaking change if you were relying on auth/multitenancy being enabled by default. #646 - [ENHANCEMENT] Performance: Improve Ingester Record Insertion. #681
- [ENHANCEMENT] Improve WAL Replay by not rebuilding the WAL. #668
- [ENHANCEMENT] Add config option to disable write extension to the ingesters. #677
- [ENHANCEMENT] Preallocate byte slices on ingester request unmarshal. #679
- [ENHANCEMENT] Reduce marshalling in the ingesters to improve performance. #694 This change requires a specific rollout process to prevent dropped spans. First, rollout everything except distributors. After all ingesters have updated you can then rollout distributors to the latest version. This is due to changes in the communication between ingesters <-> distributors.
- [ENHANCEMENT] Allow setting the bloom filter shard size with support dynamic shard count.#644
- [ENHANCEMENT] GCS SDK update v1.12.0 => v.15.0, ReadAllWithEstimate used in GCS/S3 backends. #693
- [ENHANCEMENT] Add a new endpoint
/api/echo
to test the query frontend is reachable. #714 - [BUGFIX] Fix Query Frontend grpc settings to avoid noisy error log. #690
- [BUGFIX] Zipkin Support - CombineTraces. #688
- [BUGFIX] Zipkin support - Dedupe span IDs based on span.Kind (client/server) in Query Frontend. #687
- [BUGFIX] Azure Backend - Fix an issue with the append method on the Azure backend. #736
License Change v0.7.0 and future versions are licensed under AGPLv3 #660
- [CHANGE] Add
json
struct tags to overrides'Limits
struct in addition toyaml
tags. #656 - [CHANGE] Update to Go 1.16, latest OpenTelemetry proto definition and collector #546
- [CHANGE]
max_spans_per_trace
limit override has been removed in favour ofmax_bytes_per_trace
. This is a breaking change to the overrides config section. #612 - [CHANGE] Add new flag
-ingester.lifecycler.ID
to manually override the ingester ID with which to register in the ring. #625 - [CHANGE]
ingestion_rate_limit
limit override has been removed in favour ofingestion_rate_limit_bytes
.ingestion_burst_size
limit override has been removed in favour ofingestion_burst_size_bytes
. This is a breaking change to the overrides config section. #630 - [FEATURE] Add page based access to the index file. #557
- [FEATURE] (Experimental) WAL Compression/checksums. #638
- [ENHANCEMENT] Add a Shutdown handler to flush data to backend, at "/shutdown". #526
- [ENHANCEMENT] Queriers now query all (healthy) ingesters for a trace to mitigate 404s on ingester rollouts/scaleups. This is a breaking change and will likely result in query errors on rollout as the query signature b/n QueryFrontend & Querier has changed. #557
- [ENHANCEMENT] Add list compaction-summary command to tempo-cli #588
- [ENHANCEMENT] Add list and view index commands to tempo-cli #611
- [ENHANCEMENT] Add a configurable prefix for HTTP endpoints. #631
- [ENHANCEMENT] Add kafka receiver. #613
- [ENHANCEMENT] Upgrade OTel collector to
v0.21.0
. #613 - [ENHANCEMENT] Add support for Cortex Background Cache. #640
- [BUGFIX] Fixes permissions errors on startup in GCS. #554
- [BUGFIX] Fixes error where Dell ECS cannot list objects. #561
- [BUGFIX] Fixes listing blocks in S3 when the list is truncated. #567
- [BUGFIX] Fixes where ingester may leave file open #570
- [BUGFIX] Fixes a bug where some blocks were not searched due to query sharding and randomness in blocklist poll. #583
- [BUGFIX] Fixes issue where wal was deleted before successful flush and adds exponential backoff for flush errors #593
- [BUGFIX] Fixes issue where Tempo would not parse odd length trace ids #605
- [BUGFIX] Sort traces on flush to reduce unexpected recombination work by compactors #606
- [BUGFIX] Ingester fully persists blocks locally to reduce amount of work done after restart #628
- [CHANGE] Fixed ingester latency spikes on read #461
- [CHANGE] Ingester cut blocks based on size instead of trace count. Replace ingester
traces_per_block
setting withmax_block_bytes
. This is a breaking change. #474 - [CHANGE] Refactor cache section in tempodb. This is a breaking change b/c the cache config section has changed. #485
- [CHANGE] New compactor setting for max block size data instead of traces. #520
- [CHANGE] Change default ingester_client compression from gzip to snappy. #522
- [CHANGE/BUGFIX] Rename
tempodb_compaction_objects_written
andtempodb_compaction_bytes_written
metrics totempodb_compaction_objects_written_total
andtempodb_compaction_bytes_written_total
. #524 - [CHANGE] Replace tempo-cli
list block
--check-dupes
option with--scan
and collect additional stats #534 - [FEATURE] Added block compression. This is a breaking change b/c some configuration fields moved. #504
- [CHANGE] Drop Vulture Loki dependency. This is a breaking change. #509
- [ENHANCEMENT] Serve config at the "/config" endpoint. #446
- [ENHANCEMENT] Switch blocklist polling and retention to different concurrency mechanism, add configuration options. #475
- [ENHANCEMENT] Add S3 options region and forcepathstyle #431
- [ENHANCEMENT] Add exhaustive search to combine traces from all blocks in the backend. #489
- [ENHANCEMENT] Add per-tenant block retention #77
- [ENHANCEMENT] Change index-downsample to index-downsample-bytes. This is a breaking change #519
- [BUGFIX] Upgrade cortex dependency to v1.7.0-rc.0+ to address issue with forgetting ring membership #442 #512
- [BUGFIX] No longer raise the
tempodb_blocklist_poll_errors_total
metric if a block doesn't have meta or compacted meta. #481] - [BUGFIX] Replay wal completely before ingesting new spans. #525
- [CHANGE] Redo tempo-cli with basic command structure and improvements #385
- [CHANGE] Add content negotiation support and sharding parameters to Querier #375
- [CHANGE] Remove S3 automatic bucket creation #404
- [CHANGE] Compactors should round robin tenants instead of choosing randomly #420
- [CHANGE] Switch distributor->ingester communication to more efficient PushBytes method. This is a breaking change when running in microservices mode with separate distributors and ingesters. To prevent errors ingesters must be fully upgraded first, then distributors.
- [CHANGE] Removed disk_cache. This is a breaking change b/c there is no disk cache. Please use redis or memcached. #441
- [CHANGE] Rename IngestionMaxBatchSize to IngestionBurstSize. This is a breaking change. #445
- [ENHANCEMENT] Add docker-compose example for GCS along with new backend options #397
- [ENHANCEMENT] tempo-cli list blocks usability improvements #403
- [ENHANCEMENT] Reduce active traces locking time. #449
- [ENHANCEMENT] Added
tempo_distributor_bytes_received_total
as a per tenant counter of uncompressed bytes received. #453 - [BUGFIX] Compactor without GCS permissions fail silently #379
- [BUGFIX] Prevent race conditions between querier polling and ingesters clearing complete blocks #421
- [BUGFIX] Exclude blocks in last active window from compaction #411
- [BUGFIX] Mixin: Ignore metrics and query-frontend route when checking for TempoRequestLatency alert. #440
- [FEATURE] Add support for Azure Blob Storage backend #340
- [FEATURE] Add Query Frontend module to allow scaling the query path #400
- [CHANGE] From path.Join to filepath.Join #338
- [CHANGE] Upgrade Cortex from v1.3.0 to v.1.4.0 #341
- [CHANGE] Compact more than 2 blocks at a time #348
- [CHANGE] Remove tempodb_compaction_duration_seconds metric. #360
- [ENHANCEMENT] Add tempodb_compaction_objects_combined metric. #339
- [ENHANCEMENT] Added OpenMetrics exemplar support. #359
- [ENHANCEMENT] Add tempodb_compaction_objects_written metric. #360
- [ENHANCEMENT] Add tempodb_compaction_bytes_written metric. #360
- [ENHANCEMENT] Add tempodb_compaction_blocks_total metric. #360
- [ENHANCEMENT] Add support for S3 V2 signatures. #352
- [ENHANCEMENT] Add support for Redis caching. #354
- [BUGFIX] Frequent errors logged by compactor regarding meta not found #327
- [BUGFIX] Fix distributors panicking on rollout #343
- [BUGFIX] Fix ingesters occassionally double flushing #364
- [BUGFIX] Fix S3 backend logs "unsupported value type" #381
- [CHANGE] Bloom filters are now sharded to reduce size and improve caching, as blocks grow. This is a breaking change and all data stored before this change will not be queryable. #192
- [CHANGE] Rename maintenance cycle to blocklist poll. #315
- [ENHANCEMENT] CI checks for vendored dependencies using
make vendor-check
. Update CONTRIBUTING.md to reflect the same before checking in files in a PR. #274 - [ENHANCEMENT] Add warnings for suspect configs. #294
- [ENHANCEMENT] Add command line flags for s3 credentials. #308
- [ENHANCEMENT] Support multiple authentication methods for S3 (IRSA, IAM role, static). #320
- [ENHANCEMENT] Add per tenant bytes counter. #331
- [BUGFIX] S3 multi-part upload errors #306
- [BUGFIX] Increase Prometheus
notfound
metric on tempo-vulture. #301 - [BUGFIX] Return 404 if searching for a tenant id that does not exist in the backend. #321
- [BUGFIX] Prune in-memory blocks from missing tenants. #314