DPLT-1049 Provision separate DB per user #132

morgsmccauley · 2023-07-17T04:08:21Z

This PR updates the Lambda Runner provisioning step to create a separate Postgres database per user. This improves security by preventing users from manipulating other users' data through the SQL they provide.

I've added the high-level steps, along with some notes on the changes to each, below:

1. Create database/user in Postgres

It's possible to run SQL against Postgres via Hasura, but the provided statements are run within a transaction block. As CREATE DATABASE cannot be run within a transaction, we need a direct connection to Postgres to create new databases.
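As a rough sketch of this step (not the code in this PR): the `SqlClient` interface, `quoteIdentifier`, `quoteLiteral`, `buildProvisionStatements`, and `provisionDatabase` are hypothetical names. The sketch assumes a direct-connection client with a `query` method similar to the one node-postgres exposes, and that `standard_conforming_strings` is on, so doubling single quotes is enough to escape a literal. Each statement is sent as its own query, with no explicit BEGIN, so that CREATE DATABASE is not wrapped in a transaction block.

```typescript
// Minimal client shape (assumption); node-postgres clients expose a compatible `query` method.
interface SqlClient {
  query(sql: string): Promise<unknown>;
}

// Quote a Postgres identifier: wrap in double quotes and double any embedded ones.
function quoteIdentifier(name: string): string {
  return '"' + name.replace(/"/g, '""') + '"';
}

// Quote a string literal: wrap in single quotes and double any embedded ones.
function quoteLiteral(value: string): string {
  return "'" + value.replace(/'/g, "''") + "'";
}

// Build the statements that create a user and a database only that user can connect to.
function buildProvisionStatements(user: string, password: string, database: string): string[] {
  const quotedUser = quoteIdentifier(user);
  const quotedDatabase = quoteIdentifier(database);
  return [
    `CREATE USER ${quotedUser} WITH PASSWORD ${quoteLiteral(password)}`,
    `CREATE DATABASE ${quotedDatabase} OWNER ${quotedUser}`,
    `REVOKE CONNECT ON DATABASE ${quotedDatabase} FROM PUBLIC`,
  ];
}

// Run each statement separately over the direct connection.
async function provisionDatabase(
  client: SqlClient,
  user: string,
  password: string,
  database: string
): Promise<void> {
  for (const statement of buildProvisionStatements(user, password, database)) {
    await client.query(statement);
  }
}
```

Keeping statement construction separate from execution makes the escaping easy to unit test without a running database.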

Cryptographically random passwords are generated for each PG user. These will be stored by Hasura in plain text within its metadata table. Exposing the password/connection string via an environment variable isn't really feasible, as this would require creating a secret, updating the Cloud Run revision, and restarting the instance. We can look at more secure options for this in the future.
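A minimal sketch of password generation, assuming Node's built-in `crypto` module is used. `generatePassword` and its default length of 16 are hypothetical and not taken from this PR.

```typescript
import { randomBytes } from "node:crypto";

// Generate a URL-safe random password of exactly `length` characters.
// base64url encodes every 3 bytes as 4 characters, so `length` random bytes
// always produce at least `length` characters before truncation.
function generatePassword(length: number = 16): string {
  return randomBytes(length).toString("base64url").slice(0, length);
}
```

The base64url alphabet (letters, digits, `-`, `_`) avoids characters that would need percent-encoding when the password is embedded in a connection string.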

2. Add database to Hasura

Database connections are added to Hasura as 'data sources'. This makes up the majority of the changes to Hasura Client: making the source configurable. As mentioned above, these connections are stored in the Hasura metadata tables and are therefore persisted across restarts.

A database is created per account, so we must check whether the data source exists before provisioning, to avoid attempting to recreate it when the account creates its second indexer.
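The existence check could be sketched roughly as follows. It assumes the metadata object has the shape returned by Hasura's `export_metadata` (a top-level `sources` array) and that the source is added with a `pg_add_source` request. The function names and the `HasuraMetadata` type are hypothetical, not the Hasura Client methods in this PR.

```typescript
// Subset of the exported Hasura metadata; only the fields used here (assumption).
interface HasuraMetadata {
  sources: Array<{ name: string }>;
}

// True if a data source with this name is already registered in Hasura.
function sourceExists(metadata: HasuraMetadata, sourceName: string): boolean {
  return metadata.sources.some((source) => source.name === sourceName);
}

// Build the metadata API request body that registers a Postgres database as a source.
function buildAddSourceRequest(sourceName: string, databaseUrl: string) {
  return {
    type: "pg_add_source",
    args: {
      name: sourceName,
      configuration: { connection_info: { database_url: databaseUrl } },
    },
  };
}

// Return the request to send, or null when the account's source already exists.
function planDatasourceProvisioning(
  metadata: HasuraMetadata,
  sourceName: string,
  databaseUrl: string
) {
  return sourceExists(metadata, sourceName)
    ? null
    : buildAddSourceRequest(sourceName, databaseUrl);
}
```

For example, an account that already has one indexer would get `null` back here, and only the schema for the new function would be provisioned.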

3. Run user provided SQL

There are two potential conflicts when provisioning an endpoint:

Postgres - a user could theoretically create two different functions with the same tables. To avoid conflicting table names, a new schema is created per function. Since this is scoped within the account's DB, we can name it after just the function, rather than a concatenation of both account/function as we do currently.
GraphQL - Hasura exposes all tables from all databases, so with the above we could end up with conflicts when two different accounts create the same named function and table(s). To avoid this conflict, each database has its own namespace, which scopes all top-level fields to that namespace.

The end result is a query like the following:

query {
  account_name {
    function_name_table_name {
      column
    }
  }
}
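To illustrate how this shape could arise, here is a sketch assuming the namespace is set through the `customization` field that Hasura accepts on a source, and that tables in a non-default schema are exposed as `schema_table`. `buildSourceCustomization` and `buildNamespacedQuery` are hypothetical helpers, and the trailing `_` on the type name prefix is an assumption based on the commit titles.

```typescript
// Source customization: scope all root fields under `namespace`, and prefix
// GraphQL type names so types from different sources do not collide.
function buildSourceCustomization(namespace: string) {
  return {
    root_fields: { namespace },
    type_names: { prefix: `${namespace}_` },
  };
}

// Build a query against one table of one function, nested under the account namespace.
function buildNamespacedQuery(
  namespace: string,
  schema: string,
  table: string,
  columns: string[]
): string {
  return [
    "query {",
    `  ${namespace} {`,
    `    ${schema}_${table} {`,
    ...columns.map((column) => `      ${column}`),
    "    }",
    "  }",
    "}",
  ].join("\n");
}
```

Calling `buildNamespacedQuery("account_name", "function_name", "table_name", ["column"])` reproduces the query shown above.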

Instead of namespaces, we could create a DB schema named after both account ID/function, which would result in the top-level fields we have currently, i.e. account_name_function_name_table_name. But I believe the namespace reduces the noise and makes GraphQL operations more readable.

4. Track tables, foreign key relationships, and add permissions

These steps are mostly unchanged, but have been slightly refactored to take into account execution against different databases/sources.
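As an illustration of what passing the source through looks like, the sketch below builds Hasura metadata API request bodies using `bulk`, `pg_track_table`, and `pg_create_select_permission`. The helper names and the role name are hypothetical, and the permission shown (all columns, no row filter) is an assumption rather than what this PR configures.

```typescript
// Track every table of one schema in one source with a single bulk request.
function buildTrackTablesRequest(source: string, schema: string, tableNames: string[]) {
  return {
    type: "bulk",
    args: tableNames.map((name) => ({
      type: "pg_track_table",
      args: { source, table: { schema, name } },
    })),
  };
}

// Grant a role read access to all columns and rows of one tracked table.
function buildSelectPermissionRequest(
  source: string,
  schema: string,
  tableName: string,
  role: string
) {
  return {
    type: "pg_create_select_permission",
    args: {
      source,
      table: { schema, name: tableName },
      role,
      permission: { columns: "*", filter: {} },
    },
  };
}
```

The only difference from a single-database setup is that `source` and `schema` are passed in explicitly rather than defaulted.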

Other Considerations

By default, all users are able to connect to the template databases, which are used as 'templates' for new databases. To prevent users from modifying these templates we should remove PUBLIC access from them:

REVOKE CONNECT ON DATABASE template0 FROM PUBLIC;
REVOKE CONNECT ON DATABASE template1 FROM PUBLIC;


morgsmccauley added 7 commits July 18, 2023 08:02

feat: Add datasource to hasura

5f36a18

chore: Add pg

d1f6166

feat: Create PG DB restricted to user

f2e3e3f

feat: Provision user DB and add to Hasura

001425e

refactor: Use postgres pool

868ba2c

feat: Run user sql and track in hasura

46b3896

test: Test separate db per user provisioning

0c2a6e4

morgsmccauley force-pushed the DPLT-1049-db-per-user branch 3 times, most recently from d5403f0 to 62ff02b Compare July 17, 2023 23:19

feat: Generate random passwords for user DBs

2116c94

morgsmccauley force-pushed the DPLT-1049-db-per-user branch from 62ff02b to 2116c94 Compare July 17, 2023 23:20

morgsmccauley added 4 commits July 18, 2023 13:32

feat: Escape user input before executing sql

f2c9269

fix: Correctly espace pg user password

d68f2fd

refactor: Get table names via hasura provided method

539a559

refactor: Extract default schema to constant

07e61b5

morgsmccauley force-pushed the DPLT-1049-db-per-user branch from b7d5e1a to 07e61b5 Compare July 18, 2023 02:55

morgsmccauley added 2 commits July 18, 2023 15:15

feat: Add method to check if user api is provisioned

992cc20

feat: Provision separate DB per user in runner

ab8856c

morgsmccauley force-pushed the DPLT-1049-db-per-user branch from 146fa74 to ab8856c Compare July 18, 2023 03:24

morgsmccauley marked this pull request as ready for review July 18, 2023 09:08

morgsmccauley requested a review from a team as a code owner July 18, 2023 09:08

This comment was marked as resolved.


morgsmccauley added 6 commits July 19, 2023 10:00

refactor: Move default password length to fn signature

12cb3f8

refactor: Adjust provisioner function args

1a35f43

feat: Provision schema per function

fe998e5

fix: Check both source/schema when verifying provisioning status

aec4195

fix: Skip provisioning datasource if it exists

8e82c4b

fix: Add trailing _ to graphql typename prefix

58397e9

morgsmccauley removed the request for review from a team July 19, 2023 00:11

morgsmccauley requested a review from a team July 19, 2023 00:11

gabehamilton approved these changes Jul 20, 2023


morgsmccauley merged commit 8e18566 into main Jul 21, 2023
1 check passed

morgsmccauley deleted the DPLT-1049-db-per-user branch July 21, 2023 01:19

morgsmccauley added a commit that referenced this pull request Jul 21, 2023

Revert "DPLT-1049 Provision separate DB per user (#132)"

61ec730

This reverts commit 8e18566.

This was referenced Jul 21, 2023

DPLT-1049 Revert separate user DB provisioning #143

Merged

DPLT-1049 Provision separate DB per user #144

Merged

DPLT-1072 Prod Release #145

Merged

morgsmccauley mentioned this pull request Apr 22, 2024

test stable branch git fix up #687

Closed
