Make tests baseline comparison more lenient #619


Closed
sfilipi opened this issue Jul 31, 2018 · 4 comments
Labels
test related to tests

Comments

@sfilipi
Member

sfilipi commented Jul 31, 2018

There are a few tests that are disabled because the baseline comparisons fail on the 5th decimal for some of the numbers generated, on some OS.

As an example, the RegressorOlsTest() in PredictorTests fails only in the Mac debug run, because just one of the 4896 generated predictions doesn't match:

baseline:
2625 5 5.09176636 0.091766357421875 0.0084210643544793129

Mac debug run predictions:
2625 5 5.091751 0.0917510986328125 0.0084182641003280878

I think we should make the tests more lenient to failures like this, modifying the comparison with the baseline to:
1- Have a sensitivity threshold: compare only up to the 4th, or 6th, decimal digit.
2- Count the failures, and declare the test as failed if more than, say, 3% of the predictions/lines differ?

@justinormont @TomFinley @Zruty0 @zeahmed are those acceptable ranges?
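A minimal sketch of the two-part leniency described above (a decimal-digit tolerance plus a mismatch-ratio cutoff). This is illustrative Python, not ML.NET's actual baseline-comparison code; the function names and the line format assumed here (whitespace-separated numeric columns, as in the example lines above) are assumptions.

```python
def lines_match(expected, actual, decimals=4):
    """Compare two whitespace-separated lines of numbers,
    tolerating differences beyond the given decimal digit."""
    e_vals = expected.split()
    a_vals = actual.split()
    if len(e_vals) != len(a_vals):
        return False
    tol = 10.0 ** -decimals
    for e, a in zip(e_vals, a_vals):
        try:
            if abs(float(e) - float(a)) > tol:
                return False
        except ValueError:
            # Non-numeric tokens must match exactly.
            if e != a:
                return False
    return True

def baseline_diff_ratio(baseline_lines, output_lines, decimals=4):
    """Fraction of lines that differ beyond the tolerance;
    a test would fail only if this exceeds e.g. 0.03 (3%)."""
    mismatches = sum(
        not lines_match(b, o, decimals)
        for b, o in zip(baseline_lines, output_lines)
    )
    return mismatches / max(len(baseline_lines), 1)

# The differing line from the issue:
baseline = ["2625 5 5.09176636 0.091766357421875 0.0084210643544793129"]
mac_run  = ["2625 5 5.091751 0.0917510986328125 0.0084182641003280878"]

# At 4 decimals the values are within tolerance, so nothing mismatches;
# at 6 decimals the same line counts as a mismatch.
assert baseline_diff_ratio(baseline, mac_run, decimals=4) == 0.0
assert baseline_diff_ratio(baseline, mac_run, decimals=6) == 1.0
```

Under this scheme the Mac debug example would pass with a 4-decimal threshold, since the values differ only around the 5th decimal, and 1 mismatching line out of 4896 is well under a 3% cutoff either way.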

@zeahmed
Contributor

zeahmed commented Jul 31, 2018

Can we afford to have platform specific thresholds?

@sfilipi
Member Author

sfilipi commented Jul 31, 2018

We can, but the downside is that sometimes, as in the example above, you end up checking in an entire platform-specific baseline file because a single line differs.
We could have a system of baselines and overrides for each baseline: if no override exists, default to the baseline.
@Ivanidzo4ka what do you think?
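A hypothetical sketch of the override scheme described above: look for a platform-specific baseline file first and fall back to the shared one. The file-naming convention (`<test>.<os>.txt`) is an assumption for illustration, not ML.NET's actual layout.

```python
import os
import platform

def resolve_baseline(baseline_dir, test_name):
    """Prefer a platform-specific baseline (e.g. RegressorOlsTest.darwin.txt)
    over the shared default (RegressorOlsTest.txt) when one exists."""
    os_tag = platform.system().lower()  # e.g. 'windows', 'linux', 'darwin'
    override = os.path.join(baseline_dir, f"{test_name}.{os_tag}.txt")
    default = os.path.join(baseline_dir, f"{test_name}.txt")
    return override if os.path.exists(override) else default
```

This keeps the common case cheap: only the lines that genuinely diverge per platform justify committing an override file, and every other test keeps a single shared baseline.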

@sfilipi sfilipi added the test related to tests label Jul 31, 2018
@Ivanidzo4ka
Contributor

I think we have too many issues, and nobody reads them :)
#410

@sfilipi
Member Author

sfilipi commented Jul 31, 2018

Touché :)
Duplicate of #410

Let's continue the conversation on the original issue.

@sfilipi sfilipi closed this as completed Aug 1, 2018
@ghost ghost locked as resolved and limited conversation to collaborators Mar 29, 2022

3 participants