Much slower query on metadata compared with CockroachDB and PostgreSQL #15224

bnuzhouwei · 2022-12-06T07:31:53Z

Jira Link: DB-4403

Description

The following SQL:

 select count(0) FROM pg_attribute t1 JOIN pg_class t2 ON t1.attrelid = t2.oid

Took 0.9 s on YugabyteDB, but only 0.014 s on CockroachDB and PostgreSQL.

ddorian · 2022-12-06T10:02:12Z

@bnuzhouwei can you run explain analyze on the query in all 3 cases?

ddorian · 2022-12-06T10:12:48Z

Also, what's the hardware and how many nodes are you using?

bnuzhouwei · 2022-12-06T10:17:15Z

Only a single node, and my machine is:

cpu: 12700K
ram: 4*32G DDR4 3200
2T SSD

The Docker Compose file I use to create the test environment:

version: '2'

volumes:
  yb-master-data-1:
  yb-tserver-data-1:

services:
  yb-master:
      image: yugabytedb/yugabyte:latest
      container_name: yb-master-n1
      volumes:
      - yb-master-data-1:/mnt/master
      command: [ "/home/yugabyte/bin/yb-master",
                "--fs_data_dirs=/mnt/master",
                "--master_addresses=yb-master-n1:7100",
                "--rpc_bind_addresses=yb-master-n1:7100",
                "--replication_factor=1"]
      ports:
      - "7000:7000"
      environment:
        SERVICE_7000_NAME: yb-master

  yb-tserver:
      image: yugabytedb/yugabyte:latest
      container_name: yb-tserver-n1
      volumes:
      - yb-tserver-data-1:/mnt/tserver
      command: [ "/home/yugabyte/bin/yb-tserver",
                "--fs_data_dirs=/mnt/tserver",
                "--enable_ysql",
                "--rpc_bind_addresses=yb-tserver-n1:9100",
                "--tserver_master_addrs=yb-master-n1:7100"]
      ports:
      - "9042:9042"
      - "5433:5433"
      - "9000:9000"
      environment:
        SERVICE_5433_NAME: ysql
        SERVICE_9042_NAME: ycql
        SERVICE_6379_NAME: yedis
        SERVICE_9000_NAME: yb-tserver
      depends_on:
      - yb-master

I am testing YugabyteDB and want to adapt my app to it. My app needs to read the metadata of all tables and columns to create SQL dynamically.

That is how I found that YugabyteDB performs poorly on queries that join metadata tables, while CockroachDB and PostgreSQL are much faster.

Could this be because the metadata tables in YugabyteDB lack indexes?

ddorian · 2022-12-06T10:35:40Z

Could this be because the metadata tables in YugabyteDB lack indexes?

You have to paste the explain analyze output of the queries.

The reason is probably that the metadata is located in yb-master. yb-tserver has to make RPCs to it, and that is not very efficient.
We're working on making this more efficient.

How often are you running metadata queries?

bnuzhouwei · 2022-12-06T10:39:00Z

I don't know how to get the explain analyze output of the queries.

The query is fast when it reads each table on its own, but slow when it joins the tables.

-- fast:
select * from pg_attribute;
select * from pg_class;
-- very slow:
select count(0) FROM pg_attribute t1 JOIN pg_class t2 ON t1.attrelid = t2.oid;

ddorian · 2022-12-06T11:04:00Z

Run the query below and paste the output:

explain analyze select count(0) FROM pg_attribute t1 JOIN pg_class t2 ON t1.attrelid = t2.oid;

How often are you running metadata queries?

Please answer

FranckPachot · 2022-12-06T11:35:37Z

Hi,
I see that you are on :latest, which is the preview version.
You can set yb_bnl_batch_size to 100, which will make a huge difference:

yugabyte=#  select count(0) FROM pg_attribute t1 JOIN pg_class t2 ON t1.attrelid = t2.oid;
 count
-------
  2547
(1 row)

Time: 24032.212 ms (00:24.032)

yugabyte=# set yb_bnl_batch_size=100;
SET
Time: 14.998 ms

yugabyte=#  select count(0) FROM pg_attribute t1 JOIN pg_class t2 ON t1.attrelid = t2.oid;
 count
-------
  2547
(1 row)

Time: 52.533 ms
yugabyte=#

bnuzhouwei · 2022-12-07T01:45:36Z

@FranckPachot Yes, it is much faster after setting yb_bnl_batch_size, but the setting only applies to the current session.
How can I set this variable as a global setting?

FranckPachot · 2022-12-07T02:51:27Z

You can add --ysql_pg_conf_csv=yb_bnl_batch_size=1000 when starting yb-tserver

bnuzhouwei · 2022-12-07T03:01:40Z

I am on PostgreSQL 11.2-YB-2.14.5.0-b0 on x86_64-pc-linux-gnu, compiled by clang version 12.0.1 (https://github.com/yugabyte/llvm-project.git bdb147e675d8c87cee72cc1f87c4b82855977d94), 64-bit, and the problem is still there.

Also, there is no yb_bnl_batch_size variable to set in this version.

How often are you running metadata queries?

I need the schema tables for almost every query, because my engine uses FillSchema to get a DataTable of the schema, and an admin UI is then generated automatically from that schema table.

bnuzhouwei · 2022-12-07T03:09:24Z

explain analyze select count(0) FROM pg_attribute t1 JOIN pg_class t2 ON t1.attrelid = t2.oid

Aggregate  (cost=216.39..216.40 rows=1 width=8) (actual time=1753.865..1753.865 rows=1 loops=1)
  ->  Nested Loop  (cost=0.00..213.89 rows=1000 width=0) (actual time=3.446..1752.203 rows=4423 loops=1)
        ->  Seq Scan on pg_attribute t1  (cost=0.00..100.00 rows=1000 width=4) (actual time=2.863..6.896 rows=4423 loops=1)
        ->  Index Scan using pg_class_oid_index on pg_class t2  (cost=0.00..0.11 rows=1 width=4) (actual time=0.387..0.387 rows=1 loops=4423)
              Index Cond: (oid = t1.attrelid)
Planning Time: 0.115 ms
Execution Time: 1753.921 ms
Peak Memory Usage: 31 kB
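
A rough reading of this plan, offered as a sketch rather than a definitive diagnosis: the inner index scan on pg_class ran once per pg_attribute row (loops=4423) at about 0.387 ms each, which would account for nearly all of the execution time. The arithmetic can be checked with a plain query (the column alias is only illustrative):

```sql
-- Values copied from the plan above: loops=4423, 0.387 ms per inner index scan.
-- 4423 * 0.387 = 1711.701 ms, against a reported execution time of 1753.921 ms.
select 4423 * 0.387 as estimated_inner_scan_ms;
```

The join cost here is dominated by the number of per-row lookups, not by the single sequential scan, which took under 7 ms.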

FranckPachot · 2022-12-07T16:30:11Z

OK, for versions that do not have Batched Nested Loop, I have an ugly workaround:

set random_page_cost=1e42

but it is better to do that only for queries on the dictionary, ideally as a hint: /*+ Set(random_page_cost 1e42) */
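
For example, the hint could be attached to the query from this issue so that the setting applies to that statement only. This is a sketch that assumes hint comments (pg_hint_plan syntax) are enabled in your YSQL version; the resulting plan is not shown here and should be confirmed with explain analyze.

```sql
/*+ Set(random_page_cost 1e42) */
select count(0) FROM pg_attribute t1 JOIN pg_class t2 ON t1.attrelid = t2.oid;
-- The hint block is kept as the very first thing in the statement, with no other
-- comment before it. It raises random_page_cost for this statement only, leaving
-- the session default untouched for ordinary application queries.
```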

bnuzhouwei · 2022-12-13T01:31:15Z

This is not a good user experience. How do I set the default configuration?

Double-clicking a table in Navicat to open it is also very slow.

The metadata does cause many performance issues.

FranckPachot · 2022-12-13T05:34:40Z

The defaults can be set at cluster, database, user, connection, session, transaction, or query level.
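
A sketch of a few of those levels, using yb_bnl_batch_size from earlier in this thread (so only on versions that have that parameter). The role name app_user is hypothetical, the database name yugabyte is taken from the psql prompt above, and 1000 is simply the value suggested earlier.

```sql
-- Database level: applies to new sessions that connect to this database.
ALTER DATABASE yugabyte SET yb_bnl_batch_size = 1000;

-- User level: applies to new sessions of this role (app_user is hypothetical).
ALTER ROLE app_user SET yb_bnl_batch_size = 1000;

-- Session level: applies until the session ends or the value is reset.
SET yb_bnl_batch_size = 1000;

-- Transaction level: reverts automatically at COMMIT or ROLLBACK.
BEGIN;
SET LOCAL yb_bnl_batch_size = 1000;
select count(0) FROM pg_attribute t1 JOIN pg_class t2 ON t1.attrelid = t2.oid;
COMMIT;
```

The cluster level corresponds to the --ysql_pg_conf_csv flag on yb-tserver mentioned earlier, and the query level to a hint comment.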

I don't know FillSchema, but reading the schema tables for each query is not what an application is supposed to do. This will never scale.

m-iancu · 2023-01-17T00:51:23Z

Closing this as the issue is identified -- the fundamental issue (changing the defaults) should be tracked in: #14070.

bnuzhouwei added area/ysql Yugabyte SQL (YSQL) status/awaiting-triage Issue awaiting triage labels Dec 6, 2022

yugabyte-ci added kind/bug This issue is a bug priority/medium Medium priority issue labels Dec 6, 2022

m-iancu closed this as completed Jan 17, 2023
