-
Notifications
You must be signed in to change notification settings - Fork 87
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Detect conflicts between workspaces in the account for groups and tables #299
Conversation
Codecov ReportAttention:
Additional details and impacted files@@ Coverage Diff @@
## main #299 +/- ##
==========================================
- Coverage 83.27% 79.24% -4.03%
==========================================
Files 30 31 +1
Lines 2146 2265 +119
Branches 366 394 +28
==========================================
+ Hits 1787 1795 +8
- Misses 279 390 +111
Partials 80 80 ☔ View full report in Codecov by Sentry. |
Should we consider a "warehouse" for assessment/migration? Pulling from all workspaces? |
@FastLee so how warehouse should work across 1000 workspaces? Do we want to select a warehouse from each of those?.. what if warehouses have different names or not existing? We also may need to create a "main workspace" for migration, but i am not sure if it's simpler than storing configuration on a laptop |
Customers will not love the idea of creating a workspace to run a tool, and no customer thus far has EVER upgraded all their workspaces at once. This will probably never happen. Therefore, we should aim to make this chunkable using the workspace as a chunking point and we should follow simple and predictable logic when trying to reconcile workspace local groups with account groups, and document this so people know how it works. Here are the scenarios that come to mind. Scenario 1 Scenario 2 Scenario 3 Scenario 4 Scenario 5 |
We have no way of knowing if SCIM sync is configured, because it's the external tool calling our APIs. Scim sync is something done by identity admins |
We need an issue for this scenario, please create.
Migration probably would happen in workspace-by-workspace job-in-UI triggering. We can automate that also, if customers asks/votes. That's for groups. We didn't cover service principals at all 😉 on aws, account-level and service level spns may have the same/purpose, but different application_id, that is recorded in both grants and generic permissions. Now for tables, there also needs to be a report on table/db inconsistency - like And the team(s) that are driving UC Migration within account would make a decision after some time in review (of excel spreadsheet). By the way, we can split UCX installation across different Azure Subscriptions. And every installation would just focus on defining target catalog mapping per database. But here are unanswered questions:
We can technically support both db_to_catalog and workspace_to_catalog, and even at the same time, but db_to_catalog will override workspace_to_catalog. We also need default_catalog_for_workspace, if workspace_to_catalog is set (default catalog for all workspaces is set per metastore).. We can also do another override for tables, but we have unanswered questions:
Speaking of metastores, in the beginning, there needs to be workspace_to_metastore mapping with default_metastore_for_workspace. Can we come up with a good default mapping here? Coarse or fine grained? Select between the two? Ask for inline input? How many conflicts we expect to justify the need to create/support custom mapping? the last very important question is what future-proof configuration format might we need for this mapping. That's why I don't want any configuration after the assessment step. |
Workspace to metastore mapping can be solved with region:
|
stale PR, will need to start from scratch |
Fixes #83
Fixes #22