Add high level overview to normalization doc. #6445

avaidyanatha · 2021-09-24T21:22:11Z

Main Changes

Makes the Basic Normalization doc a little more readable to first-time deployers.

cgardens · 2021-09-27T15:30:41Z

docs/understanding-airbyte/basic-normalization.md

@@ -50,6 +44,24 @@ The [normalization rules](basic-normalization.md#Rules) are _not_ configurable.

 Airbyte places the json blob version of your data in a table called `_airbyte_raw_<stream name>`. If basic normalization is turned on, it will place a separate copy of the data in a table called `<stream name>`. Under the hood, Airbyte is using dbt, which means that the data only ingresses into the data store one time. The normalization happens as a query within the datastore. This implementation avoids extra network time and costs.

+## Why does Airbyte have Basic Normalization?
+
+At its core, Airbyte is geared to handle the EL \(Extract Load\) steps of an ELT process. These steps can also be referred in Airbyte's dialect as "Source" and "Destination".


I think it would be helpful to explain why the raw table exists since that is something we get questions about a lot.

e.g. (you can word it better) A core tenant of the ELT approach is that the E and L steps mutate the data as little as possible. By getting a copy of the unmodified data into the destination, we reduce the need for resending data in the future, because the "original" data is already in the destination. If you change your mind on how you want to materialize the data, Airbyte can use the untouched raw version that is already in the destination to do it and doesn't need to resend anything.

(of course we do actually resend data in a lot of cases right now, but aspirationally this is what we are going for and why we adhere to this philosophy.

This absolutely makes sense and I think it's good to explain why it exists. I've included a short explanation on the philosophy.

CLAassistant · 2021-09-27T17:23:39Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

Abhi Vaidyanatha seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

Add high level overview to normalization

f970108

avaidyanatha added the area/documentation Improvements or additions to documentation label Sep 24, 2021

avaidyanatha requested review from marcosmarxm, sherifnada, ChristopheDuong and cgardens September 24, 2021 21:22

avaidyanatha changed the title ~~Add high level overview to normalization~~ Add high level overview to normalization doc. Sep 24, 2021

avaidyanatha temporarily deployed to more-secrets September 24, 2021 21:23 Inactive

cgardens approved these changes Sep 27, 2021

View reviewed changes

ChristopheDuong approved these changes Sep 27, 2021

View reviewed changes

marcosmarxm approved these changes Sep 27, 2021

View reviewed changes

Address review comments

15108aa

avaidyanatha temporarily deployed to more-secrets September 28, 2021 21:05 Inactive

avaidyanatha merged commit 6b19bf4 into master Sep 28, 2021

avaidyanatha deleted the abhi/normalize-normalization branch September 28, 2021 21:23

This was referenced Sep 29, 2021

Bump Airbyte version from 0.29.22-alpha to 0.30.0-alpha #6532

Closed

Bump Airbyte version from 0.29.22-alpha to 0.30.0-alpha #6539

Closed

Bump Airbyte version from 0.29.22-alpha to 0.30.0-alpha #6548

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add high level overview to normalization doc. #6445

Add high level overview to normalization doc. #6445

avaidyanatha commented Sep 24, 2021 •

edited

Loading

cgardens Sep 27, 2021

avaidyanatha Sep 28, 2021

CLAassistant commented Sep 27, 2021

Add high level overview to normalization doc. #6445

Add high level overview to normalization doc. #6445

Conversation

avaidyanatha commented Sep 24, 2021 • edited Loading

Main Changes

cgardens Sep 27, 2021

Choose a reason for hiding this comment

avaidyanatha Sep 28, 2021

Choose a reason for hiding this comment

CLAassistant commented Sep 27, 2021

avaidyanatha commented Sep 24, 2021 •

edited

Loading