stats,opt: performance improvements for histograms #39178

rytaft · 2019-07-30T19:21:22Z

stats,opt: move histogram bucket distinct count estimation code

This commit moves the code to estimate the number of distinct values
per histogram bucket out of the optimizer and into the stats package.
The purpose of doing this is to remove the overhead of calculating the
number of distinct values per bucket from the critical path of query
planning, and instead perform the calculation offline when initially
generating the histogram.

opt: performance improvements for histograms

This commit improves the performance of histograms in the optimizer
by avoiding allocating and copying the full histogram unless strictly
necessary. Additionally, it changes the code for filtering histograms
to use binary search instead of performing a linear scan.

As a result of both of these commits, the overhead of histograms is reduced. Prior to these changes, the overhead of histograms caused 14% lower throughput and 29% higher latency when running kv with 95% reads. After, it caused only 4.6% lower throughput and 11% higher latency. (Clearly there is still work to do, but this is some progress...)

cockroach-teamcity · 2019-07-30T19:21:32Z

This change is

RaduBerinde

Nice changes!

Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @rytaft)

pkg/sql/opt/memo/statistics_builder.go, line 558 at r2 (raw file):

	colStat := sb.copyColStat(colSet, s, inputColStat)
	if inputColStat.Histogram != nil {
		if s.Selectivity != 1 || scan.HardLimit.IsSet() {

In the HardLimit case, we're not modifying the histogram AFAICT

pkg/sql/opt/memo/statistics_builder.go, line 567 at r2 (raw file):

	}

	if s.Selectivity != 1 {

[nit] would be a bit more clear if we moved these two inside the block above

justinj

Do you have any benchmarks showing the improvement this PR gives?

Reviewed 19 of 19 files at r1, 5 of 6 files at r2.
Reviewable status: complete! 2 of 0 LGTMs obtained (waiting on @justinj and @rytaft)

pkg/sql/opt/props/histogram.go, line 159 at r2 (raw file):

		return 0, false

	default:

would it ever be relevant to return 1 if lowerBound=upperBound or is this function never called like that?

pkg/sql/stats/histogram.go, line 90 at r1 (raw file):

				break
			} else if c > 0 {
				panic("samples not sorted")

nit/not actually a part of this change, but should this be an AssertionFailedf?

pkg/sql/stats/histogram.proto, line 42 at r1 (raw file):

    // value because it is estimated by distributing the known distinct
    // count for the column among the buckets, in proportion to the number
    // of rows in each bucket.

nit: consider mentioning that this value is in fact derived from the rest of the data, but is included to avoid re-computing it later?

rytaft

TFTRs!

Do you have any benchmarks showing the improvement this PR gives?

The PR message has some perf numbers - is that not what you had in mind?

Reviewable status: complete! 0 of 0 LGTMs obtained (and 2 stale) (waiting on @justinj and @RaduBerinde)

pkg/sql/opt/memo/statistics_builder.go, line 558 at r2 (raw file):

Previously, RaduBerinde wrote…

In the HardLimit case, we're not modifying the histogram AFAICT

We actually might be, in finalizeFromRowCount. But I realized that we already account for the limit with cardinality, so I removed the check for the hard limit below -- all will be updated if needed in finalizeFromRowCount.

pkg/sql/opt/memo/statistics_builder.go, line 567 at r2 (raw file):

Previously, RaduBerinde wrote…

[nit] would be a bit more clear if we moved these two inside the block above

How? We still want to execute this block even if the histogram is nil.

pkg/sql/opt/props/histogram.go, line 159 at r2 (raw file):

Previously, justinj (Justin Jaffray) wrote…

would it ever be relevant to return 1 if lowerBound=upperBound or is this function never called like that?

No, because the upperBound is exclusive. So in that case we still want to return 0.

pkg/sql/stats/histogram.go, line 90 at r1 (raw file):

Previously, justinj (Justin Jaffray) wrote…

nit/not actually a part of this change, but should this be an AssertionFailedf?

Done.

pkg/sql/stats/histogram.proto, line 42 at r1 (raw file):

Previously, justinj (Justin Jaffray) wrote…

nit: consider mentioning that this value is in fact derived from the rest of the data, but is included to avoid re-computing it later?

Done.

justinj · 2019-07-31T19:22:06Z

Ah nice! I missed that—I only looked at the commit messages in reviewable

RaduBerinde

Reviewable status: complete! 0 of 0 LGTMs obtained (and 2 stale) (waiting on @justinj and @rytaft)

pkg/sql/opt/memo/statistics_builder.go, line 558 at r2 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

We actually might be, in finalizeFromRowCount. But I realized that we already account for the limit with cardinality, so I removed the check for the hard limit below -- all will be updated if needed in finalizeFromRowCount.

I see. This seems fragile; the condition here has to match what finalizeFromRowCount is doing. I don't have a good solution though.. perhaps move the condition inside a canFinalizeFromRowCountModifyHistogram helper (and put that next to finalizeFromRowCount), and assert that it returns true when we modify the histogram (inside finalizeFromRowCount).

pkg/sql/opt/memo/statistics_builder.go, line 567 at r2 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

How? We still want to execute this block even if the histogram is nil.

The block could be

colStat.Histogram = inputColStat.Histogram
if <need copy> {
  colStat.Histogram = colStat.Histogram.Copy()
  ...
}

But I didn't realize finalizeFromRowCount plays a role, so this doesn't help much.

This commit moves the code to estimate the number of distinct values per histogram bucket out of the optimizer and into the stats package. The purpose of doing this is to remove the overhead of calculating the number of distinct values per bucket from the critical path of query planning, and instead perform the calculation offline when initially generating the histogram. Release note: None

rytaft

Reviewable status: complete! 0 of 0 LGTMs obtained (and 2 stale) (waiting on @justinj and @RaduBerinde)

pkg/sql/opt/memo/statistics_builder.go, line 558 at r2 (raw file):

Previously, RaduBerinde wrote…

I see. This seems fragile; the condition here has to match what finalizeFromRowCount is doing. I don't have a good solution though.. perhaps move the condition inside a canFinalizeFromRowCountModifyHistogram helper (and put that next to finalizeFromRowCount), and assert that it returns true when we modify the histogram (inside finalizeFromRowCount).

Thinking about this more, I think it would be much cleaner if histograms are immutable. Histogram.Filter already returns a new histogram without changing the original, so I changed Histogram.ApplySelectivity to do the same thing. There are cases where this may cause a histogram to get copied twice (e.g., a scan that is both constrained and limited), but in the general case I think this makes everything much cleaner and more efficient, and will save us a lot of headaches down the road. Let me know what you think.

pkg/sql/opt/memo/statistics_builder.go, line 567 at r2 (raw file):

Previously, RaduBerinde wrote…

The block could be
colStat.Histogram = inputColStat.Histogram
if <need copy> {
  colStat.Histogram = colStat.Histogram.Copy()
  ...
}
But I didn't realize finalizeFromRowCount plays a role, so this doesn't help much.

Done.

RaduBerinde

Reviewable status: complete! 1 of 0 LGTMs obtained (and 1 stale) (waiting on @justinj and @rytaft)

pkg/sql/opt/memo/statistics_builder.go, line 558 at r2 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

Thinking about this more, I think it would be much cleaner if histograms are immutable. Histogram.Filter already returns a new histogram without changing the original, so I changed Histogram.ApplySelectivity to do the same thing. There are cases where this may cause a histogram to get copied twice (e.g., a scan that is both constrained and limited), but in the general case I think this makes everything much cleaner and more efficient, and will save us a lot of headaches down the road. Let me know what you think.

Awesome, sounds great to me.

This commit improves the performance of histograms in the optimizer by avoiding allocating and copying the full histogram unless strictly necessary. Additionally, it changes the code for filtering histograms to use binary search instead of performing a linear scan. Release note: None

rytaft

Thanks!

bors r+

Reviewable status: complete! 0 of 0 LGTMs obtained (and 2 stale) (waiting on @justinj)

39178: stats,opt: performance improvements for histograms r=rytaft a=rytaft **stats,opt: move histogram bucket distinct count estimation code** This commit moves the code to estimate the number of distinct values per histogram bucket out of the optimizer and into the stats package. The purpose of doing this is to remove the overhead of calculating the number of distinct values per bucket from the critical path of query planning, and instead perform the calculation offline when initially generating the histogram. **opt: performance improvements for histograms** This commit improves the performance of histograms in the optimizer by avoiding allocating and copying the full histogram unless strictly necessary. Additionally, it changes the code for filtering histograms to use binary search instead of performing a linear scan. ---- As a result of both of these commits, the overhead of histograms is reduced. Prior to these changes, the overhead of histograms caused 14% lower throughput and 29% higher latency when running kv with 95% reads. After, it caused only 4.6% lower throughput and 11% higher latency. (Clearly there is still work to do, but this is some progress...) Co-authored-by: Rebecca Taft <becca@cockroachlabs.com>

craig · 2019-08-01T15:23:51Z

Build succeeded

GitHub CI (Cockroach)

rytaft requested review from a team July 30, 2019 19:21

rytaft requested a review from a team as a code owner July 30, 2019 19:21

rytaft requested review from a team July 30, 2019 19:21

RaduBerinde approved these changes Jul 31, 2019

View reviewed changes

justinj reviewed Jul 31, 2019

View reviewed changes

rytaft force-pushed the perf-fixes branch from 80b804d to 4131602 Compare July 31, 2019 19:13

rytaft commented Jul 31, 2019

View reviewed changes

RaduBerinde reviewed Jul 31, 2019

View reviewed changes

rytaft force-pushed the perf-fixes branch from 4131602 to ab69367 Compare August 1, 2019 13:27

rytaft commented Aug 1, 2019

View reviewed changes

RaduBerinde reviewed Aug 1, 2019

View reviewed changes

rytaft force-pushed the perf-fixes branch from ab69367 to 3243046 Compare August 1, 2019 14:31

rytaft commented Aug 1, 2019

View reviewed changes

craig bot merged commit 3243046 into cockroachdb:master Aug 1, 2019

knz mentioned this pull request Nov 10, 2019

User-facing changes in 19.2 that were not picked up in release notes cockroachdb/docs#5819

Closed

rytaft deleted the perf-fixes branch April 2, 2020 22:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stats,opt: performance improvements for histograms #39178

stats,opt: performance improvements for histograms #39178

rytaft commented Jul 30, 2019

cockroach-teamcity commented Jul 30, 2019

RaduBerinde left a comment

justinj left a comment

rytaft left a comment

justinj commented Jul 31, 2019

RaduBerinde left a comment

rytaft left a comment

RaduBerinde left a comment

rytaft left a comment

craig bot commented Aug 1, 2019

stats,opt: performance improvements for histograms #39178

stats,opt: performance improvements for histograms #39178

Conversation

rytaft commented Jul 30, 2019

cockroach-teamcity commented Jul 30, 2019

RaduBerinde left a comment

Choose a reason for hiding this comment

justinj left a comment

Choose a reason for hiding this comment

rytaft left a comment

Choose a reason for hiding this comment

justinj commented Jul 31, 2019

RaduBerinde left a comment

Choose a reason for hiding this comment

rytaft left a comment

Choose a reason for hiding this comment

RaduBerinde left a comment

Choose a reason for hiding this comment

rytaft left a comment

Choose a reason for hiding this comment

craig bot commented Aug 1, 2019

Build succeeded