Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787) #26856

thomas11 · 2017-10-02T19:15:52Z

The linked issue should contain all information necessary to review.

There might be better ways to test this change, my experience with the Elasticsearch test design is limited.

I also have a working patch against the 5.6 branch, which I care about more at this point (having the fix in a future 5.x release). Let me know if you'd like me to submit that one separately.

Thanks!

elasticmachine · 2017-10-02T19:15:54Z

Since this is a community submitted pull request, a Jenkins build has not been kicked off automatically. Can an Elastic organization member please verify the contents of this patch and then kick off a build manually?

jpountz

Changes look good. Let me make it run through CI.

jpountz · 2017-10-06T09:37:26Z

@elasticmachine please test it

…26856)

* ccr: (42 commits) [DOCS] Added info about snapshotting your data before an upgrade. Add documentation about disabling `_field_names`. (elastic#26813) Remove UnsortedNumericDoubleValues (elastic#26817) Fix IndexOutOfBoundsException in histograms for NaN doubles (elastic#26787) (elastic#26856) [TEST] Added skipping the `headers` feature to the Bulk REST YAML test Update type-field.asciidoc Fix search_after with geo distance sorting (elastic#26891) Use proper logging placeholder for Netty logging Add Netty channel information on write and flush failure Remove deploying in JBoss documentation Document JVM option MaxFDLimit for macOS () Add additional low-level logging handler () Unwrap causes when maybe dying Change log level on write and flush failure to warn [TEST] add test to ensure legacy list syntax in yml works fine Bump BWC version for settings serialization to 6.1.0 Removed void token filter entries and added two tests Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238) Fix toString() in SnapshotStatus (elastic#26852) elastic#26870 change bwc version for fuzzy_transpositions to 6.1 after backport ...

* ccr: (110 commits) Use LF line endings in Painless generated files (elastic#26822) [DOCS] Added info about snapshotting your data before an upgrade. Add documentation about disabling `_field_names`. (elastic#26813) Remove UnsortedNumericDoubleValues (elastic#26817) Fix IndexOutOfBoundsException in histograms for NaN doubles (elastic#26787) (elastic#26856) [TEST] Added skipping the `headers` feature to the Bulk REST YAML test Update type-field.asciidoc Fix search_after with geo distance sorting (elastic#26891) Use proper logging placeholder for Netty logging Add Netty channel information on write and flush failure Remove deploying in JBoss documentation Document JVM option MaxFDLimit for macOS () Add additional low-level logging handler () Unwrap causes when maybe dying Change log level on write and flush failure to warn [TEST] add test to ensure legacy list syntax in yml works fine Bump BWC version for settings serialization to 6.1.0 Removed void token filter entries and added two tests Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238) Fix toString() in SnapshotStatus (elastic#26852) ...

In this test we were randomizing different values but minDocCount was hardcoded to 1. It's important to test other values, especially `0` as it's the default. The test needed some adapting in the way buckets are randomly generated: all aggs need to share the same interval, minDocCount and emptyBucketInfo. Also assertions need to take into account that more (or less) buckets are expected depending on minDocCount. This was originated by elastic#35921 and its need to test adding empty buckets as part of the reduce phase. Also relates to elastic#26856 as one more key comparison needed to use `Double.compare` to properly handle `NaN` values, this was triggered by the increased test coverage.

In `InternalHistogramTests` we were randomizing different values but `minDocCount` was hardcoded to `1`. It's important to test other values, especially `0` as it's the default. To make this possible, the test needed some adapting in the way buckets are randomly generated: all aggs need to share the same `interval`, `minDocCount` and `emptyBucketInfo`. Also assertions need to take into account that more (or less) buckets are expected depending on `minDocCount`. This was originated by #35921 and its need to test adding empty buckets as part of the reduce phase. Also relates to #26856 as one more key comparison needed to use `Double.compare` to properly handle `NaN` values, which was triggered by the increased test coverage.

Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787)

37e30a8

jpountz self-requested a review October 5, 2017 09:09

jpountz approved these changes Oct 6, 2017

View reviewed changes

jpountz merged commit 16431a6 into elastic:master Oct 6, 2017

jpountz pushed a commit that referenced this pull request Oct 6, 2017

Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787) (#…

1a2f265

…26856)

thomas11 deleted the thomas11/histogram-nan-26787-master branch October 6, 2017 15:40

jpountz pushed a commit that referenced this pull request Oct 6, 2017

Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787) (#…

a02a6a1

…26856)

jpountz pushed a commit that referenced this pull request Oct 6, 2017

Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787) (#…

a9e0f06

…26856)

jpountz added :Analytics/Aggregations Aggregations >bug v5.6.4 v6.0.0 labels Oct 6, 2017

jpountz mentioned this pull request Oct 6, 2017

Histogram aggregation fails on NaN #26787

Closed

jpountz added v6.1.0 v7.0.0 labels Oct 9, 2017

javanna added v5.6.3 and removed v5.6.4 labels Oct 9, 2017

lcawl added v6.0.0-rc2 and removed v6.0.0 labels Oct 30, 2017

lcawl removed the v6.1.0 label Dec 12, 2017

javanna mentioned this pull request Nov 28, 2018

Increase InternalHistogramTests coverage #36004

Merged

colings86 added v7.0.0-beta1 and removed v7.0.0 labels Feb 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787) #26856

Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787) #26856

thomas11 commented Oct 2, 2017

elasticmachine commented Oct 2, 2017

jpountz left a comment

jpountz commented Oct 6, 2017

Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787) #26856

Fix IndexOutOfBoundsException in histograms for NaN doubles (#26787) #26856

Conversation

thomas11 commented Oct 2, 2017

elasticmachine commented Oct 2, 2017

jpountz left a comment

Choose a reason for hiding this comment

jpountz commented Oct 6, 2017