sql/stats: bring back guard against non-zero NumRange in forecasts #144037

michae2 · 2025-04-08T05:27:54Z

It occurred to me tonight that the code we removed in #143955 not only generated a sentry report, but also returned an error instead of producing a faulty histogram. I too hope that #93892 is now fixed, but it seems wise to at least keep some code that guards against faulty histograms, even if we don't think the sentry report is necessary any more.

Informs: #93892

Epic: None

Release note: None

It occurred to me tonight that the code we removed in cockroachdb#143955 not only generated a sentry report, but also returned an error instead of producing a faulty histogram. I too hope that cockroachdb#93892 is now fixed, but it seems wise to at least keep some code that guards against faulty histograms, even if we don't think the sentry report is necessary any more. Informs: cockroachdb#93892 Epic: None Release note: None

blathers-crl · 2025-04-08T05:27:58Z

It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR?

_{🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.}

cockroach-teamcity · 2025-04-08T05:28:04Z

This change is

yuzefovich

Thanks for catching this! I agree that more guardrails never hurt - I didn't realize that the check was useful as a guardrail.

Reviewed 1 of 1 files at r1, all commit messages.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @michae2)

pkg/sql/stats/forecast.go line 335 at r1 (raw file):

		forecast.setHistogramBuckets(hist)

		// Verify that the first two buckets (the initial NULL bucket and the first

nit: this verification is stricter than the one we do in props/histogram.go - there we return "the first bucket should have NumRange=0" assertion error in two spots, but in both we're only looking at 0th bucket and only NumRange value (multiplied by selectivity). Why do we deviate here and verify more?

michae2

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @yuzefovich)

pkg/sql/stats/forecast.go line 335 at r1 (raw file):

Previously, yuzefovich (Yahor Yuzefovich) wrote…

nit: this verification is stricter than the one we do in props/histogram.go - there we return "the first bucket should have NumRange=0" assertion error in two spots, but in both we're only looking at 0th bucket and only NumRange value (multiplied by selectivity). Why do we deviate here and verify more?

In props/histogram.go in both filter and maxDistinctValuesCount we don't know whether we're working with a portion of a histogram (one that has already been filtered) or an entire histogram. So we only check the first bucket.

Here in forecasting we know we're working with the entire histogram, including the synthesized NULL bucket if it exists, so we might as well check that there's nothing between the synthesized NULL bucket and the first non-NULL value.

michae2 · 2025-04-08T17:13:57Z

TFTR!

bors r=yuzefovich

craig · 2025-04-08T18:12:55Z

Build failed (retrying...):

examples_orms

craig · 2025-04-08T20:09:58Z

This PR was included in a batch that successfully built, but then failed to merge into master (it was a non-fast-forward update). It will be automatically retried.

yuzefovich

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @michae2)

pkg/sql/stats/forecast.go line 335 at r1 (raw file):

Previously, michae2 (Michael Erickson) wrote…

In props/histogram.go in both filter and maxDistinctValuesCount we don't know whether we're working with a portion of a histogram (one that has already been filtered) or an entire histogram. So we only check the first bucket.

Here in forecasting we know we're working with the entire histogram, including the synthesized NULL bucket if it exists, so we might as well check that there's nothing between the synthesized NULL bucket and the first non-NULL value.

I see, makes sense about the bucket, thanks.

Other part of my comment was about also verifying DistinctRange - it doesn't look like we assert anything about that in props/histogram. Do we add that check here since NumRange = 0 and DistinctRange > 0 doesn't make sense, in general, even if we don't assert that later?

craig · 2025-04-08T21:24:52Z

Build succeeded:

michae2

Reviewable status: complete! 0 of 0 LGTMs obtained

pkg/sql/stats/forecast.go line 335 at r1 (raw file):

Previously, yuzefovich (Yahor Yuzefovich) wrote…

I see, makes sense about the bucket, thanks.

Other part of my comment was about also verifying DistinctRange - it doesn't look like we assert anything about that in props/histogram. Do we add that check here since NumRange = 0 and DistinctRange > 0 doesn't make sense, in general, even if we don't assert that later?

yes, that's right

michae2 requested a review from yuzefovich April 8, 2025 05:27

michae2 requested a review from a team as a code owner April 8, 2025 05:27

michae2 requested a review from a team April 8, 2025 05:28

yuzefovich approved these changes Apr 8, 2025

View reviewed changes

michae2 commented Apr 8, 2025

View reviewed changes

yuzefovich reviewed Apr 8, 2025

View reviewed changes

craig bot merged commit c285f81 into cockroachdb:master Apr 8, 2025
24 checks passed

celeste-cockroachdb bot added the target-release-25.2.0 label Apr 8, 2025

michae2 deleted the unrevert-guard branch April 8, 2025 21:34

michae2 commented Apr 8, 2025

View reviewed changes

celeste-cockroachdb bot added v25.2.0-prerelease and removed target-release-25.2.0 labels Apr 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

sql/stats: bring back guard against non-zero NumRange in forecasts #144037

sql/stats: bring back guard against non-zero NumRange in forecasts #144037

Uh oh!

michae2 commented Apr 8, 2025

Uh oh!

blathers-crl bot commented Apr 8, 2025

Uh oh!

cockroach-teamcity commented Apr 8, 2025

Uh oh!

yuzefovich left a comment

Uh oh!

michae2 left a comment

Uh oh!

michae2 commented Apr 8, 2025

Uh oh!

craig bot commented Apr 8, 2025

Uh oh!

craig bot commented Apr 8, 2025

Uh oh!

yuzefovich left a comment

Uh oh!

craig bot commented Apr 8, 2025

Uh oh!

Uh oh!

michae2 left a comment

Uh oh!

Uh oh!

sql/stats: bring back guard against non-zero NumRange in forecasts #144037

sql/stats: bring back guard against non-zero NumRange in forecasts #144037

Uh oh!

Conversation

michae2 commented Apr 8, 2025

Uh oh!

blathers-crl bot commented Apr 8, 2025

Uh oh!

cockroach-teamcity commented Apr 8, 2025

Uh oh!

yuzefovich left a comment

Choose a reason for hiding this comment

Uh oh!

michae2 left a comment

Choose a reason for hiding this comment

Uh oh!

michae2 commented Apr 8, 2025

Uh oh!

craig bot commented Apr 8, 2025

Uh oh!

craig bot commented Apr 8, 2025

Uh oh!

yuzefovich left a comment

Choose a reason for hiding this comment

Uh oh!

craig bot commented Apr 8, 2025

Uh oh!

Uh oh!

michae2 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!