Skip to content

Integrate rollup search with _search endpoint #48005

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
csoulios opened this issue Oct 14, 2019 · 6 comments
Closed

Integrate rollup search with _search endpoint #48005

csoulios opened this issue Oct 14, 2019 · 6 comments
Labels
>feature :StorageEngine/Rollup Turn fine-grained time-based data into coarser-grained data Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo)

Comments

@csoulios
Copy link
Contributor

csoulios commented Oct 14, 2019

Currently searching in rollups is facilitated by a specific endpoint named _rollup_search. For seamless integration of rollups we should remove this endpoint and implement its functionality in the _search endpoint.

When running aggregations using the existing _search endpoint we must implement the following:

  • Merge rollup buckets with regular buckets.
  • Ensure that partial buckets can merge across shards (e.g. when rollover is not on a clean interval).
  • Modify existing aggregators to pull doc_count from rollup indices, rather than just incrementing. This is needed so a "rollup metric" can provide doc counts for avg and bucketing aggregations, by allowing the aggregator framework to treat a single document as if it were multiple docs.

Producing rollups is discussed in #48003

@csoulios csoulios added the :StorageEngine/Rollup Turn fine-grained time-based data into coarser-grained data label Oct 14, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-analytics-geo (:Analytics/Rollup)

@timroes
Copy link
Contributor

timroes commented Nov 18, 2019

💯 This would be very awesome and ease a lot of code in Kibana, if we would have the integration in the regular _search endpoint.

@lizozom
Copy link

lizozom commented Apr 30, 2020

@csoulios is there a target version for this change?

Anyway, please ping @elastic/kibana-app-arch when this is being implemented, so we can go ahead an implement elastic/kibana#64873. 🙏

@rjernst rjernst added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label May 4, 2020
@tangpf877
Copy link

tangpf877 commented Jul 2, 2020

Is there any news about rollup? I am wandering when _rollup_search endpoint will be removed, and what the ILM side will be like in the future? Does 'Create an ILM rollup action' means rollup will proceed automatically and we needn't start an rollup job? Thanks for reply

@csoulios
Copy link
Contributor Author

csoulios commented Jul 3, 2020

Hi @benjam1n77, rollups refactoring is a task we are actively working on. You can track progress and related discussions in this ticket: #42720

When it comes to ILM and rollups, the goal is to stop having rollups as continues jobs. Instead rollup will be an action in the ILM configuration. See the relevant ticket for more: #48003

@wchaparro
Copy link
Member

With the 8.7 release of Elasticsearch, we have made a new downsampling capability associated with the new time series datastreams functionality generally available (GA). This capability was in tech preview in ILM since 8.5. Downsampling provides a method to reduce the footprint of your time series data by storing it at reduced granularity. The downsampling process rolls up documents within a fixed time interval into a single summary document. Each summary document includes statistical representations of the original data: the min, max, sum, value_count, and average for each metric. Data stream time series dimensions are stored unchanged.

Downsampling is superior to rollup because:

  • Downsampled indices are searched through the _search API
  • It is possible to query multiple downsampled indices together with raw data indices
  • The pre-aggregation is based on the metrics and time series definitions in the index mapping so very little configuration is required (i.e. much easier to add new time serieses)
  • Downsampling is managed as an action in ILM
  • It is possible to downsample a downsampled index, and reduce granularity as the index ages
  • The performance of the pre-aggregation process is superior in downsampling, as it builds on the time_series index mode infrastructure

Because of the introduction of this new capability, we are deprecating the rollups functionality, which never left the Tech Preview/Experimental status, in favor of downsampling and thus we are closing this issue. We encourage you to migrate your solution to downsampling and take advantage of the new TSDB functionality.

@wchaparro wchaparro closed this as not planned Won't fix, can't repro, duplicate, stale Jun 23, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
>feature :StorageEngine/Rollup Turn fine-grained time-based data into coarser-grained data Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo)
Projects
None yet
Development

Successfully merging a pull request may close this issue.

7 participants