-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Namespace destination output #1921
Comments
Questions on how to handle this change to the protocol here: |
Open-questions to discuss and act on this week:
See the Open-question section here: |
After discussions, we will be splitting the work on this topic as follow:
I will create issues or PR and assign them to the next milestone instead of carrying this issue (which is big and might go through multiple iterations/milestones) |
Just adding another user request for this issue. They want to sync postgres to postgres. They don't like that public_ is prepended at the front. They just want the destination db to have exactly the same table names as the source one. @ChristopheDuong I know this is a use case you've mentioned before too. Will this be something that is possible when this project is complete? |
Yes we can handle it at the end of the project as I am also referring to it in this comment: |
Just a quick +1 with some additional info on the use-case I have. The primary use of Airbyte for me is to take data from various places and put it into a DB intended for OLAP use-cases. When these sources were made, we did not consider this possibility so multiple DBs contain a notion of 'user' with the name 'users'. Without namespacing on a connection (or on the source) I'm stuck with overwriting What I'm currently doingRoughly, |
In the latest Airbyte versions, there is now a prefix namespace on the connection page so you can solve this kind of conflict! |
Thank you! |
@roshan with 0.21.0-alpha, supported connectors will automatically duplicated the source schema into the destination. See this documentation - should make it even easier for you! |
Just adding another user request for this issue. We want to sync Postgres to BigQuery. We don't like the final dataset to be public (default namespace not overwritten by the BigQuery destination setting). Is there a way to workaround this? |
I'm going to close this issue now as it's too big in scope to be useful and somewhat out-of-date. Our next step for this will be #3481. I will go through the rest of the linked issues and deal with them at a later date. |
Tell us about the problem you're trying to solve
Problems:
1. Multiple sources can sync streams to the same destination, resulting in conflicts
2. A single source wants to sync to multiple destinations:
3. Advance configuration to make complex sync pipelines
4. Constraints on naming:
5. Cleaning, customizing things up:
Current State:
Schema name is defined in the destination connector (should be normalized)
The source’s stream name is:
_
(to mimic the “clean_name” displayed in UI)source_schema_name
.table_name
table_name
Final naming is therefore:
destination_schema
.stream_namedestination_schema
(.source_schema
).table_name
_airbyte_raw_
prefix to table namesDescribe the solution you’d like
Source_schema
.Table name
is the source’s stream nameschema_override
(namespace) andtable_override
src_conn_name
+ _ +Source_schema
.table_name
schema_override
.table_override
_airbyte_raw_
prefix to schema instead? (see Problem 4 and 5)The text was updated successfully, but these errors were encountered: