Query Optimization - Student Labs

The following labs take you through a series of query tuning scenarios and let you discover ways to observe, diagnose, and optimize query performance with CockroachDB.

Labs Prerequisites

Local Deployment

  1. Build the dev cluster following these instructions.

  2. You also need:

Shared Cluster Deployment

SSH into the Jumpbox using the IP address provided by the Instructor.

Lab 0 - Create database and load data

Local Deployment

Connect to any node and use the workload simulator to load the TPC-C data with 50 warehouses. This will take about 25 minutes.

docker exec -it roach-newyork-1 bash -c "./cockroach workload init tpcc --drop --db tpcc --warehouses 50 postgres://root@127.0.0.1:26257?sslmode=disable"

Connect to the database

# use cockroach sql, defaults to localhost:26257
cockroach sql --insecure -d tpcc

# or use the --url param for any other host:
cockroach sql --url "postgresql://localhost:26258/tpcc?sslmode=disable"

# or use psql
psql -h localhost -p 26257 -U root tpcc

Shared Cluster Deployment

Create TPCC database

cockroach workload init tpcc --drop --db tpcc --warehouses 50

This should create the TPCC database like so:

$ cockroach workload init tpcc --drop --db tpcc --warehouses 50

I221201 04:04:21.209451 1 ccl/workloadccl/fixture.go:318  [-] 1  starting import of 9 tables
I221201 04:04:21.512376 22 ccl/workloadccl/fixture.go:481  [-] 2  imported 2.6 KiB in warehouse table (50 rows, 0 index entries, took 238.121653ms, 0.01 MiB/s)
I221201 04:04:22.449613 23 ccl/workloadccl/fixture.go:481  [-] 3  imported 50 KiB in district table (500 rows, 0 index entries, took 1.17532982s, 0.04 MiB/s)
I221201 04:04:24.287214 27 ccl/workloadccl/fixture.go:481  [-] 4  imported 6.0 MiB in new_order table (450000 rows, 0 index entries, took 3.012691714s, 1.99 MiB/s)
I221201 04:04:24.335870 28 ccl/workloadccl/fixture.go:481  [-] 5  imported 7.9 MiB in item table (100000 rows, 0 index entries, took 3.061350464s, 2.57 MiB/s)
I221201 04:04:24.994671 26 ccl/workloadccl/fixture.go:481  [-] 6  imported 78 MiB in order table (1500000 rows, 1500000 index entries, took 3.720090812s, 20.95 MiB/s)
I221201 04:04:25.057877 25 ccl/workloadccl/fixture.go:481  [-] 7  imported 109 MiB in history table (1500000 rows, 0 index entries, took 3.783504121s, 28.74 MiB/s)
I221201 04:04:29.978697 24 ccl/workloadccl/fixture.go:481  [-] 8  imported 881 MiB in customer table (1500000 rows, 1500000 index entries, took 8.704391776s, 101.21 MiB/s)
I221201 04:04:30.860104 29 ccl/workloadccl/fixture.go:481  [-] 9  imported 1.5 GiB in stock table (5000000 rows, 0 index entries, took 9.585533076s, 160.05 MiB/s)
I221201 04:04:31.452226 30 ccl/workloadccl/fixture.go:481  [-] 10  imported 839 MiB in order_line table (15004305 rows, 0 index entries, took 10.177731905s, 82.41 MiB/s)
I221201 04:04:31.452575 1 ccl/workloadccl/fixture.go:326  [-] 11  imported 3.4 GiB bytes in 9 tables (took 10.242946335s, 337.26 MiB/s)

Log in to the database:

cockroach sql --database tpcc --insecure
SHOW TABLES;
  schema_name | table_name | type  | owner | estimated_row_count
--------------+------------+-------+-------+----------------------
  public      | customer   | table | root  |             1500000
  public      | district   | table | root  |                 500
  public      | history    | table | root  |             1500000
  public      | item       | table | root  |              100000
  public      | new_order  | table | root  |              450000
  public      | order      | table | root  |             1500000
  public      | order_line | table | root  |            15004305
  public      | stock      | table | root  |             5000000
  public      | warehouse  | table | root  |                  50
(9 rows)

Time: 38ms total (execution 37ms / network 1ms)

Lab 1 - Optimization

Run the following query and observe the performance.

SELECT ol_number, SUM(ol_quantity)
FROM order_line
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;
  ol_number | sum
------------+-------
          1 |  905
          2 |  805
          3 |  735
          4 | 1000
          5 |  750
          6 |  755
          7 |  665
          8 |  720
          9 |  465
         10 |  345
         11 |  415
         12 |  305
         13 |  230
         14 |  120
         15 |  100
(15 rows)

Time: 3.361s total (execution 3.362s / network -0.001s)

This is taking too long! We want the Response Time to be much faster. To view the query plan, we can use the EXPLAIN command.

Let's check the verbose query plan using EXPLAIN (VERBOSE) to see how we can optimize it.

EXPLAIN (VERBOSE) SELECT ol_number, SUM(ol_quantity)
FROM order_line
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;
            tree           |        field        |    description     |                   columns                    |  ordering
---------------------------+---------------------+--------------------+----------------------------------------------+-------------
                           | distribution        | full               |                                              |
                           | vectorized          | true               |                                              |
  sort                     |                     |                    | (ol_number, sum)                             | +ol_number
   │                       | estimated row count | 15                 |                                              |
   │                       | order               | +ol_number         |                                              |
   └── group               |                     |                    | (ol_number, sum)                             |
        │                  | estimated row count | 15                 |                                              |
        │                  | aggregate 0         | sum(ol_quantity)   |                                              |
        │                  | group by            | ol_number          |                                              |
        └── project        |                     |                    | (ol_number, ol_quantity)                     |
             └── filter    |                     |                    | (ol_w_id, ol_number, ol_quantity, ol_amount) |
                  │        | estimated row count | 6157               |                                              |
                  │        | filter              | ol_amount > 9990   |                                              |
                  └── scan |                     |                    | (ol_w_id, ol_number, ol_quantity, ol_amount) |
                           | estimated row count | 5608610            |                                              |
                           | table               | order_line@primary |                                              |
                           | spans               | /31-               |                                              |
(17 rows)

Time: 2ms total (execution 1ms / network 1ms)

The optimizer estimates it needs to scan 5,608,610 rows, a lot! Let's confirm which fields make up the primary index of table order_line.

SHOW CREATE TABLE order_line;
  table_name |                                                             create_statement
-------------+--------------------------------------------------------------------------------------------------------------------------------------------
  order_line | CREATE TABLE public.order_line (
             |     ol_o_id INT8 NOT NULL,
             |     ol_d_id INT8 NOT NULL,
             |     ol_w_id INT8 NOT NULL,
             |     ol_number INT8 NOT NULL,
             |     ol_i_id INT8 NOT NULL,
             |     ol_supply_w_id INT8 NULL,
             |     ol_delivery_d TIMESTAMP NULL,
             |     ol_quantity INT8 NULL,
             |     ol_amount DECIMAL(6,2) NULL,
             |     ol_dist_info CHAR(24) NULL,
             |     CONSTRAINT "primary" PRIMARY KEY (ol_w_id ASC, ol_d_id ASC, ol_o_id DESC, ol_number ASC),
             |     CONSTRAINT fk_ol_w_id_ref_order FOREIGN KEY (ol_w_id, ol_d_id, ol_o_id) REFERENCES public."order"(o_w_id, o_d_id, o_id),
             |     CONSTRAINT fk_ol_supply_w_id_ref_stock FOREIGN KEY (ol_supply_w_id, ol_i_id) REFERENCES public.stock(s_w_id, s_i_id),
             |     FAMILY "primary" (ol_o_id, ol_d_id, ol_w_id, ol_number, ol_i_id, ol_supply_w_id, ol_delivery_d, ol_quantity, ol_amount, ol_dist_info)
             | )

We can see that primary includes ol_w_id, so the scan can leverage the index to jump quickly to the right keys, but it doesn't include ol_amount, the other field used in the filter.

We can create an index on ol_amount to reduce the number of rows to scan and improve performance. For testing, we will create 2 indexes to show the value of STORING.

First, however, let's gather an explain plan with the statistics used by the optimizer. We use EXPLAIN with OPT,VERBOSE as parameters.

EXPLAIN (OPT,VERBOSE) SELECT ol_number, SUM(ol_quantity)
FROM order_line
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;
                                                                                                                                            text
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
  sort
   ├── columns: ol_number:4 sum:13
   ├── immutable
   ├── stats: [rows=15, distinct(4)=15, null(4)=0]
   ├── cost: 7223657.24
   ├── key: (4)
   ├── fd: (4)-->(13)
   ├── ordering: +4
   ├── prune: (13)
   └── group-by
        ├── columns: ol_number:4 sum:13
        ├── grouping columns: ol_number:4
        ├── immutable
        ├── stats: [rows=15, distinct(4)=15, null(4)=0]
        ├── cost: 7223655.76
        ├── key: (4)
        ├── fd: (4)-->(13)
        ├── prune: (13)
        ├── select
        │    ├── columns: ol_w_id:3 ol_number:4 ol_quantity:8 ol_amount:9
        │    ├── immutable
        │    ├── stats: [rows=5335.03271, distinct(3)=19, null(3)=0, distinct(4)=15, null(4)=0, distinct(9)=939.439645, null(9)=0, distinct(3,9)=5335.03271, null(3,9)=0]
        │    │   histogram(3)=  0 258.75 0 243.02 0 294.48 0 271.61 0 281.62 0 248.74 0 270.18 0 281.62 0 301.63 0 290.2 0 301.63 0 268.75 0 278.76 0 295.91 0 295.91 0 285.91 0 264.46 0 294.48 0 307.35
        │    │                <---- 31 ----- 32 ----- 33 ----- 34 ----- 35 ----- 36 ----- 37 ----- 38 ----- 39 ---- 40 ----- 41 ----- 42 ----- 43 ----- 44 ----- 45 ----- 46 ----- 47 ----- 48 ----- 49 -
        │    │   histogram(9)=  0   0    3953.2  1381.8
        │    │                <--- 9990 -------- 9999.61
        │    ├── cost: 7223495.54
        │    ├── prune: (4,8)
        │    ├── interesting orderings: (+3)
        │    ├── scan order_line
        │    │    ├── columns: ol_w_id:3 ol_number:4 ol_quantity:8 ol_amount:9
        │    │    ├── constraint: /3/2/-1/4: [/31 - ]
        │    │    ├── stats: [rows=5599605.82, distinct(3)=19, null(3)=0]
        │    │    │   histogram(3)=  0 2.7158e+05 0 2.5507e+05 0 3.0909e+05 0 2.8508e+05 0 2.9558e+05 0 2.6107e+05 0 2.8358e+05 0 2.9558e+05 0 3.1659e+05 0 3.0459e+05 0 3.1659e+05 0 2.8208e+05 0 2.9258e+05 0 3.1059e+05 0 3.1059e+05 0 3.0009e+05 0 2.7758e+05 0 3.0909e+05 0 3.2259e+05
        │    │    │                <------ 31 --------- 32 --------- 33 --------- 34 --------- 35 --------- 36 --------- 37 --------- 38 --------- 39 --------- 40 --------- 41 --------- 42 --------- 43 --------- 44 --------- 45 --------- 46 --------- 47 --------- 48 --------- 49 ---
        │    │    ├── cost: 7167499.46
        │    │    └── interesting orderings: (+3)
        │    └── filters
        │         └── ol_amount:9 > 9990 [outer=(9), immutable, constraints=(/9: (/9990 - ]; tight)]
        └── aggregations
             └── sum [as=sum:13, outer=(8)]
                  └── ol_quantity:8
(42 rows)

Time: 3ms total (execution 1ms / network 2ms)

Cost is roughly calculated by:

  • Estimating how much time each node in the query plan will use to process all results
  • Modeling how data flows through the query plan
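
These estimates come from table statistics, which CockroachDB collects automatically by default. As a quick aside (not part of the original lab), you can inspect and refresh the statistics the optimizer has for order_line:

-- inspect the statistics behind the optimizer's row estimates
SHOW STATISTICS FOR TABLE order_line;

-- refresh them manually if they look stale (may take a while on a large table)
CREATE STATISTICS order_line_stats FROM order_line;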

Adding the ANALYZE keyword will both execute the query and show the plan in a graphical format, exposing the query's runtime performance.

EXPLAIN ANALYZE SELECT ol_number, SUM(ol_quantity)
FROM order_line
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;
  automatic |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          url
------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
    true    | https://cockroachdb.github.io/distsqlplan/decode.html#eJy8Vttu4zYQfe9XEPMUo0ositT1yc7G2xp17NRO0KatYSjWwBEgiVqK6sYN8ln9gX5ZITm7thytVs46-6ghZ3jOmeGhHiH7EIEHg9-vRv3hmPTH_dHtHwNycjGcXc9-HXXIbDAavLsmIlokeXyHUiNZHp-IaPEh9xMVqnWHvJ9OLomQAcpFFCZIfvt5MB2QYs_HRRiQv3JdZ0iY3iH98UUZ92ORJ-rTiuu6eof8NJ3cXJHz2-1RZDK9GEwrIdAgEQGO_Rgz8P4EChowmGuQSrHELBOyCD-Wm4bBA3i6BmGS5qoIzzVYCongPYIKVYTgwbV_F-EU_QBlt6gVoPLDqCy9JdRLZRj7cg0azFI_yTzSZXbX7ToO7dJT0OB9GCmUHum5u5Q8z7sYvBte9kegwSRXHulxreeABne-Wt5jRkSu0iJuOcXZKk-jnSDVOQMNhhOiwhg9ws44ZZZlZ0WBtcKMSPQDjxgWI5fhOWggxcdPQWZxalId5k8abAo-C5Apf4Xg0SetvUj91UriyldCdllVo16Buz--XYwn14vxzWh00qOdQqaby5Oe0amhWkfUBA3wAZe5CkXyzJaecW7HBdfYfyB_41IJGf6DAYkxFnJN_CgSS19h4BFq6OSX8HyP6xb-3Zrc-9n9C-Tzp60exhf12NbJk3IkMKhUKqs0KGa-gWKsRjDD_O_fdnLROrm2UrBDRmMmpELZtfdJ_vh6Ik5bHs00eIuO1vWzBF_b1LE4FWnXqe7eJ8nbznfB8kvYzQp22t7C9EMsjJ7u2NgRXIw57CV5y9B3TYyeuSa3LHPfxCh3X5qYoVu6aRotTewrMu1cSeN7mZhrWW2H-egeRo_oYfwtBDPqboXF38TFvjIbzy5mvdLFaqmw1kxe62M1RKaYpSLJcK-79ZX1ousYrHAzIpnI5RKvpFiWx2w-J2VeGQgwU5tVuvkYJpulAuBuMt1PprvJdiWZlmhUrZW2VvqMU5vatPTTQ6AY3wKl7umiLVtuOJ97XsVrNOrOmnVnjcm8SnY_mTcmu80nm43JVnOy9Z3HpXiCbIO739apZtTHnyzuPv8wVGHYjco7zco7hwzMkS6qZRsFjfnTD_8HAAD__xGesv4=
(1 row)

Time: 3.432s total (execution 3.433s / network -0.001s)

[screenshot: plan-1]

In the plan above, for example, we see that 2 nodes are required to execute the query: as ol_w_id is part of the primary key of index/table order_line@primary, the optimizer can easily apply the filter condition WHERE ol_w_id > 30 by spanning only the ranges where this condition is true. The ranges are spread over 2 nodes:

  • Node3 spans ranges 31-37;
  • Node1 spans ranges from 37 to the end.

This is done by the TableReader process, and you can see that this process also applies the filter ol_amount > 9990 as it reads through the rows. It outputs columns @4 and @8, that is, ol_number and ol_quantity, in batches of tuples, as the execution is vectorized.

The Aggregator process receives such tuples and:

  • performs the grouping by hash @1, where @1 is the first element in the tuple, ol_number;
  • computes the sum SUM(@2), where @2 is the second element in the tuple, ol_quantity.

The output is sent to the Sorter process, which performs the ORDER BY ol_number part of the query (ordered @1+), before the gateway node sends the result set back to the client.
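
If you want to check for yourself which nodes hold which ranges of the table, a quick way (an optional aside, assuming SHOW RANGES is available in your CockroachDB version) is:

-- list the table's ranges with their start/end keys, leaseholder, and replicas
SHOW RANGES FROM TABLE order_line;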

Refer to our docs for the complete instructions on how to read every section of the plan.

With this information at hand, let's proceed with creating the indexes.

-- these will take a few minutes to create
CREATE INDEX idx_ol_amount ON order_line(ol_amount);
-- same as above but storing ol_quantity as a value - useful for our sum()
CREATE INDEX idx_ol_amount_storing_ol_quantity ON order_line(ol_amount) STORING (ol_quantity);
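
CREATE INDEX runs as a background schema-change job. As an optional check (not in the original lab), you can confirm both indexes exist and watch their creation jobs:

-- list the table's indexes; the storing column shows that ol_quantity
-- is stored in idx_ol_amount_storing_ol_quantity
SHOW INDEXES FROM order_line;

-- check the progress of the index creation, if still running
SHOW JOBS;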

Let's review the Response Time again, explicitly selecting which index to use. Remember, primary is the main index of the table.

-- using primary
SELECT ol_number, SUM(ol_quantity)
FROM order_line@primary
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;

-- using the first index
SELECT ol_number, SUM(ol_quantity)
FROM order_line@idx_ol_amount
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;

-- using the index that stores also the ol_quantity
SELECT ol_number, SUM(ol_quantity)
FROM order_line@idx_ol_amount_storing_ol_quantity
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;
  ol_number | sum
------------+-------
          1 |  905
          2 |  805
          3 |  735
          4 | 1000
          5 |  750
          6 |  755
          7 |  665
          8 |  720
          9 |  465
         10 |  345
         11 |  415
         12 |  305
         13 |  230
         14 |  120
         15 |  100
(15 rows)

Time: 4.267s total (execution 4.268s / network -0.001s)

  ol_number | sum
------------+-------
          1 |  905
          2 |  805
          3 |  735
          4 | 1000
          5 |  750
          6 |  755
          7 |  665
          8 |  720
          9 |  465
         10 |  345
         11 |  415
         12 |  305
         13 |  230
         14 |  120
         15 |  100
(15 rows)

Time: 98ms total (execution 96ms / network 2ms)

  ol_number | sum
------------+-------
          1 |  905
          2 |  805
          3 |  735
          4 | 1000
          5 |  750
          6 |  755
          7 |  665
          8 |  720
          9 |  465
         10 |  345
         11 |  415
         12 |  305
         13 |  230
         14 |  120
         15 |  100
(15 rows)

Time: 15ms total (execution 14ms / network 2ms)

Review the query plan of the original query to see what the optimizer will choose now that we have added the indexes.

EXPLAIN (VERBOSE) SELECT ol_number, SUM(ol_quantity)
FROM order_line
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;
            tree           |        field        |           description            |                   columns                    |  ordering
---------------------------+---------------------+----------------------------------+----------------------------------------------+-------------
                           | distribution        | full                             |                                              |
                           | vectorized          | true                             |                                              |
  sort                     |                     |                                  | (ol_number, sum)                             | +ol_number
   │                       | estimated row count | 15                               |                                              |
   │                       | order               | +ol_number                       |                                              |
   └── group               |                     |                                  | (ol_number, sum)                             |
        │                  | estimated row count | 15                               |                                              |
        │                  | aggregate 0         | sum(ol_quantity)                 |                                              |
        │                  | group by            | ol_number                        |                                              |
        └── project        |                     |                                  | (ol_number, ol_quantity)                     |
             └── filter    |                     |                                  | (ol_w_id, ol_number, ol_quantity, ol_amount) |
                  │        | estimated row count | 6157                             |                                              |
                  │        | filter              | ol_w_id > 30                     |                                              |
                  └── scan |                     |                                  | (ol_w_id, ol_number, ol_quantity, ol_amount) |
                           | estimated row count | 6673                             |                                              |
                           | table               | order_line@idx_ol_amount_storing_ol_quantity |                                              |
                           | spans               | /9.99E+3/PrefixEnd-              |                                              |
(17 rows)

Time: 2ms total (execution 1ms / network 1ms)

Very good! As expected, the Optimizer chose idx_ol_amount_storing_ol_quantity over primary because of the far lower estimated number of rows to scan (6,673 vs 5,599,606).

You might, however, wonder how storing ol_quantity in the second index affected the speed of execution so dramatically. Pull the verbose plan for the query using the index idx_ol_amount:

EXPLAIN (VERBOSE) SELECT ol_number, SUM(ol_quantity)
FROM order_line@idx_ol_amount
WHERE ol_w_id > 30
   AND ol_amount > 9990
GROUP BY ol_number
ORDER BY ol_number;
              tree              |        field        |             description              |                      columns                      |  ordering
--------------------------------+---------------------+--------------------------------------+---------------------------------------------------+-------------
                                | distribution        | full                                 |                                                   |
                                | vectorized          | true                                 |                                                   |
  sort                          |                     |                                      | (ol_number, sum)                                  | +ol_number
   │                            | estimated row count | 15                                   |                                                   |
   │                            | order               | +ol_number                           |                                                   |
   └── group                    |                     |                                      | (ol_number, sum)                                  |
        │                       | estimated row count | 15                                   |                                                   |
        │                       | aggregate 0         | sum(ol_quantity)                     |                                                   |
        │                       | group by            | ol_number                            |                                                   |
        └── project             |                     |                                      | (ol_number, ol_quantity)                          |
             └── index join     |                     |                                      | (ol_w_id, ol_number, ol_quantity, ol_amount)      |
                  │             | estimated row count | 6157                                 |                                                   |
                  │             | table               | order_line@primary                   |                                                   |
                  │             | key columns         | ol_w_id, ol_d_id, ol_o_id, ol_number |                                                   |
                  └── filter    |                     |                                      | (ol_o_id, ol_d_id, ol_w_id, ol_number, ol_amount) |
                       │        | estimated row count | 2494                                 |                                                   |
                       │        | filter              | ol_w_id > 30                         |                                                   |
                       └── scan |                     |                                      | (ol_o_id, ol_d_id, ol_w_id, ol_number, ol_amount) |
                                | estimated row count | 6673                                 |                                                   |
                                | table               | order_line@idx_ol_amount             |                                                   |
                                | spans               | /9.99E+3/PrefixEnd-                  |                                                   |
(21 rows)

Time: 3ms total (execution 2ms / network 1ms)

As index idx_ol_amount doesn't store the ol_quantity field, an index join with primary is required to fetch that field before computing the sum. This extra step is expensive and thus takes longer to execute: compare it with the plan selected by the optimizer, above.

When you are done with this lab, feel free to drop idx_ol_amount:

DROP INDEX idx_ol_amount;

You can learn more about the Cost-Based Optimizer in our docs.

Learn more about the Vectorized Query Execution.
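
As a side note, the vectorized: true line in the plans above reflects the vectorize session variable. A minimal sketch, keeping in mind that the valid settings vary by CockroachDB version:

-- check whether vectorized execution is enabled for this session
SHOW vectorize;

-- toggle it to compare batched vs row-at-a-time execution
SET vectorize = 'off';
SET vectorize = 'on';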

Lab 2 - Joining mechanisms

Run the following query:

SELECT w_name, w_city, sum(ol_amount)
FROM order_line
INNER JOIN warehouse ON (w_id = ol_supply_w_id)
WHERE ol_supply_w_id > 40
GROUP BY 1,2;
  w_name | w_city |     sum
---------+--------+---------------
  10     | 19     | 449060385.59
  9      | 17     | 451801641.36
  9      | 18     | 452314890.95
  9      | 11     | 447365923.46
  7      | 14     | 896184081.57
  6      | 12     | 449354824.30
  9      | 20     | 452794529.04
  7      | 19     | 449416001.35
(8 rows)

Time: 7.249s total (execution 7.252s / network -0.004s)

It's a bit slow... Check the query plan:

EXPLAIN (VERBOSE) SELECT w_name, w_city, sum(ol_amount)
FROM order_line
INNER JOIN warehouse ON (w_id = ol_supply_w_id)
WHERE ol_supply_w_id > 40
GROUP BY 1,2;
             tree             |        field        |        description        |                      columns                      | ordering
------------------------------+---------------------+---------------------------+---------------------------------------------------+-----------
                              | distribution        | full                      |                                                   |
                              | vectorized          | true                      |                                                   |
  group                       |                     |                           | (w_name, w_city, sum)                             |
   │                          | estimated row count | 9                         |                                                   |
   │                          | aggregate 0         | sum(ol_amount)            |                                                   |
   │                          | group by            | w_name, w_city            |                                                   |
   └── project                |                     |                           | (ol_amount, w_name, w_city)                       |
        └── hash join (inner) |                     |                           | (ol_supply_w_id, ol_amount, w_id, w_name, w_city) |
             │                | estimated row count | 2701275                   |                                                   |
             │                | equality            | (ol_supply_w_id) = (w_id) |                                                   |
             │                | right cols are key  |                           |                                                   |
             ├── filter       |                     |                           | (ol_supply_w_id, ol_amount)                       |
             │    │           | estimated row count | 2701275                   |                                                   |
             │    │           | filter              | ol_supply_w_id > 40       |                                                   |
             │    └── scan    |                     |                           | (ol_supply_w_id, ol_amount)                       |
             │                | estimated row count | 15004305                  |                                                   |
             │                | table               | order_line@primary        |                                                   |
             │                | spans               | FULL SCAN                 |                                                   |
             └── scan         |                     |                           | (w_id, w_name, w_city)                            |
                              | estimated row count | 9                         |                                                   |
                              | table               | warehouse@primary         |                                                   |
                              | spans               | /41-                      |                                                   |
(22 rows)

Time: 4ms total (execution 3ms / network 1ms)

It's using a hash join. For the sake of testing, let's question the choice made by the Optimizer: let's force this query to use each of the supported CockroachDB join methods, LOOKUP, HASH, and MERGE, and compare the Response Times.

-- INNER LOOKUP JOIN
SELECT w_name, w_city, sum(ol_amount)
FROM order_line
INNER LOOKUP JOIN warehouse ON (w_id = ol_supply_w_id)
WHERE ol_supply_w_id > 40
GROUP BY 1,2;
  w_name | w_city |     sum
---------+--------+---------------
  9      | 20     | 452794529.04
  7      | 14     | 896184081.57
  7      | 19     | 449416001.35
  10     | 19     | 449060385.59
  9      | 11     | 447365923.46
  6      | 12     | 449354824.30
  9      | 18     | 452314890.95
  9      | 17     | 451801641.36
(8 rows)

Time: 7.989s total (execution 7.994s / network -0.005s)
-- INNER HASH JOIN
SELECT w_name, w_city, sum(ol_amount)
FROM order_line@primary
INNER HASH JOIN warehouse ON (w_id = ol_supply_w_id)
WHERE ol_supply_w_id > 40
GROUP BY 1,2;
[...]
Time: 6.704s total (execution 6.707s / network -0.003s)
-- INNER MERGE JOIN
SELECT w_name, w_city, sum(ol_amount)
FROM order_line@primary
INNER MERGE JOIN warehouse ON (w_id = ol_supply_w_id)
WHERE ol_supply_w_id > 40
GROUP BY 1,2;
[...]
Time: 8.320s total (execution 8.304s / network 0.016s)

The hash join was indeed the fastest, so the Optimizer chose correctly. However, a MERGE join is usually the preferred join mechanism. Let's create an index on ol_supply_w_id that also stores ol_amount, so we avoid both the full scan and the hash join.

CREATE INDEX idx_ol_supp_w_id ON order_line(ol_supply_w_id) STORING (ol_amount);

EXPLAIN (VERBOSE) SELECT w_name, w_city, sum(ol_amount)
FROM order_line
INNER JOIN warehouse ON (w_id = ol_supply_w_id)
WHERE ol_supply_w_id > 40
GROUP BY 1,2;
              tree             |        field        |         description         |                      columns                      |    ordering
-------------------------------+---------------------+-----------------------------+---------------------------------------------------+------------------
                               | distribution        | full                        |                                                   |
                               | vectorized          | true                        |                                                   |
  group                        |                     |                             | (w_name, w_city, sum)                             |
   │                           | estimated row count | 9                           |                                                   |
   │                           | aggregate 0         | sum(ol_amount)              |                                                   |
   │                           | group by            | w_name, w_city              |                                                   |
   └── project                 |                     |                             | (ol_amount, w_name, w_city)                       |
        └── merge join (inner) |                     |                             | (ol_supply_w_id, ol_amount, w_id, w_name, w_city) |
             │                 | estimated row count | 2683769                     |                                                   |
             │                 | equality            | (ol_supply_w_id) = (w_id)   |                                                   |
             │                 | right cols are key  |                             |                                                   |
             │                 | merge ordering      | +"(ol_supply_w_id=w_id)"    |                                                   |
             ├── scan          |                     |                             | (ol_supply_w_id, ol_amount)                       | +ol_supply_w_id
             │                 | estimated row count | 2683769                     |                                                   |
             │                 | table               | order_line@idx_ol_supp_w_id |                                                   |
             │                 | spans               | /41-                        |                                                   |
             └── scan          |                     |                             | (w_id, w_name, w_city)                            | +w_id
                               | estimated row count | 9                           |                                                   |
                               | table               | warehouse@primary           |                                                   |
                               | spans               | /41-                        |                                                   |
(20 rows)

Time: 3ms total (execution 2ms / network 1ms)

Perfect, we have a merge join using our new index. Let's see how it performs:

SELECT w_name, w_city, sum(ol_amount)
FROM order_line
INNER JOIN warehouse ON (w_id = ol_supply_w_id)
WHERE ol_supply_w_id > 40
GROUP BY 1,2;
  w_name | w_city |     sum
---------+--------+---------------
  9      | 11     | 447365923.46
  6      | 12     | 449354824.30
  9      | 20     | 452794529.04
  7      | 14     | 896184081.57
  9      | 18     | 452314890.95
  7      | 19     | 449416001.35
  10     | 19     | 449060385.59
  9      | 17     | 451801641.36
(8 rows)

Time: 2.905s total (execution 2.905s / network -0.001s)

Good job, we were able to cut the Response Time by more than half! Mind you, we still have to scan an estimated 2,683,769 rows.

Read more about Joins in our docs.

Lab 3 - Time Travel Queries: reading a previous snapshot

The following report query is run to populate a dashboard within your application.

SELECT h_w_id, count(*)
FROM history
WHERE h_w_id < 10
GROUP BY 1
ORDER BY 1;
  h_w_id | count
---------+--------
       0 | 30000
       1 | 30000
       2 | 30000
       3 | 30000
       4 | 30000
       5 | 30000
       6 | 30000
       7 | 30000
       8 | 30000
       9 | 30000
(10 rows)

Time: 158ms total (execution 156ms / network 1ms)

While experimenting, someone accidentally uploaded some old data with today's date:

INSERT INTO history (h_c_id, h_c_d_id, h_c_w_id, h_d_id, h_w_id, h_date, h_amount, h_data)
SELECT h_c_id, h_c_d_id, h_c_w_id, h_d_id, h_w_id, now(), h_amount, h_data
FROM history
WHERE h_w_id = 0;
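
Since the bad rows were all stamped with now(), a quick way to spot them (an illustrative check, not part of the original lab) is to count the history rows dated today:

-- the accidental duplicates all carry today's timestamp
SELECT count(*) FROM history WHERE h_date >= current_date();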

Run the report query again to show the additional data.

SELECT h_w_id, count(*)
FROM history
WHERE h_w_id < 10
GROUP BY 1
ORDER BY 1;
  h_w_id | count
---------+--------
       0 | 60000
       1 | 30000
       2 | 30000
       3 | 30000
       4 | 30000
       5 | 30000
       6 | 30000
       7 | 30000
       8 | 30000
       9 | 30000

You can use the AS OF SYSTEM TIME clause to query the table at a previous state. You can leverage this feature to exclude the recently added data.

SELECT h_w_id, count(*)
FROM history AS OF SYSTEM TIME '-1m'
WHERE h_w_id < 10
GROUP BY 1
ORDER BY 1;
  h_w_id | count
---------+--------
       0 | 30000
       1 | 30000
       2 | 30000
       3 | 30000
       4 | 30000
       5 | 30000
       6 | 30000
       7 | 30000
       8 | 30000
       9 | 30000
(10 rows)

Time: 270ms total (execution 269ms / network 1ms)

Bingo! You can find more info about reading historical snapshot data using AS OF SYSTEM TIME here.

Time Travel Queries are also useful to resolve hot spots: they relax the requirement to read solely from the leaseholder, allowing all replicas to serve read requests. Read more in the Follower Reads topology.
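
As a sketch, here is the report query rewritten as a follower read; this assumes a CockroachDB version and license where the follower_read_timestamp() function is available:

-- serve the read from the nearest replica instead of the leaseholder;
-- follower_read_timestamp() picks a timestamp old enough that any replica can serve it
SELECT h_w_id, count(*)
FROM history AS OF SYSTEM TIME follower_read_timestamp()
WHERE h_w_id < 10
GROUP BY 1
ORDER BY 1;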

Lab 4 - Query Tracing and Troubleshooting

Connect to the Admin UI at http://localhost:8080. We are going to cover how to monitor and analyze our CockroachDB cluster. The Overview page is a great place to start.

From the Admin UI you can gain a lot of information about your database: its size, the number of tables, the CREATE TABLE statements, table ranges and columns, etc.

[screenshot: admin-ui-1]

From the above screenshot, we see that the database size is 3.8GiB spread over 9 tables and 45 ranges. The biggest table is order_line with 1.9 GiB in size.

Moving to the Statements page, we can see the most recently executed statements along with some statistics.

[screenshot: admin-ui-2]

Hover over a query and the full skeleton will appear. Click on the query to see more details, including the Logical Plan and Execution Stats.

[screenshot: admin-ui-3]

You can enable diagnostics on any statement: activate tracing for a query by clicking the Activate link under the Diagnostics column.

Run the query again, then collect the bundle zip file from the Admin UI.

[screenshot: admin-ui-4]

Alternatively, you can create the Statement Bundle from the SQL prompt:

EXPLAIN ANALYZE (DEBUG) SELECT w_name, w_city, sum(ol_amount)
FROM order_line
INNER JOIN warehouse ON (w_id = ol_supply_w_id)
WHERE ol_supply_w_id > 40
GROUP BY 1,2;
                                        text
------------------------------------------------------------------------------------
  Statement diagnostics bundle generated. Download from the Admin UI (Advanced
  Debug -> Statement Diagnostics History), via the direct link below, or using
  the command line.
  Admin UI: http://roach-newyork-1:8080
  Direct link: http://roach-newyork-1:8080/_admin/v1/stmtbundle/612293770916331521
  Command line: cockroach statement-diag list / download

Explore the data gathered for the query execution. This data will be helpful if you are experiencing a performance issue and need advice from Cockroach Labs.

Let's start Jaeger to view the trace file:

docker run --rm -d --name jaeger \
  -e COLLECTOR_ZIPKIN_HTTP_PORT=9411 \
  -p 5775:5775/udp \
  -p 6831:6831/udp \
  -p 6832:6832/udp \
  -p 5778:5778 \
  -p 16686:16686 \
  -p 14268:14268 \
  -p 14250:14250 \
  -p 9411:9411 \
  jaegertracing/all-in-one:1.18

Open the Jaeger UI at http://localhost:16686 and import the trace-jaeger.json file collected in the statement bundle.

[screenshot: jaeger-ui]

From here you can analyze every step of the query's execution and discover possible bottlenecks.

Final thoughts

Congratulations! You are now familiar with many of the techniques used to optimize the performance of your cluster, and you have a deeper understanding of the architecture and the role that secondary indexes can play. You also practiced troubleshooting and diagnosing a slow-running query.

Some suggested material to further expand on this topic can be found in our docs:

Blog: