Skip to content

Downsampling performance analysis and improvement #90226

Closed
@salvatore-campagna

Description

@salvatore-campagna

Description

We would like to measure performances of downsampling operations using Rally. For this purpose we need to include a new challenge to the existing tsdb Rally track. The new challenge will measure latency for a limited set of downsampling operations using different values for the fixed_interval parameter. As part of the analysis we need to collect JFR recordings and flame graphs so that we can spot areas of the code we can improve.

Right now the tsdb track uses a dataset including more than 116M documents for a total JSON file size of more than 120 GB, which results in a 32.5 GB index. The plan is to measure downsampling latency with a single thread implementation, a single node Elasticsearch cluster and a single shard.

Metadata

Metadata

Labels

:StorageEngine/RollupTurn fine-grained time-based data into coarser-grained data>enhancementTeam:AnalyticsMeta label for analytical engine team (ESQL/Aggs/Geo)

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions