Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

openlineage, snowflake: do not run external queries for Snowflake when #39113

Merged
merged 1 commit into from
Apr 22, 2024

Conversation

mobuchowski
Copy link
Contributor

Currently, we call Snowflake (and other DBs) to get schemas of tables used in particular queries.

However, Snowflake connector management is buggy and leaves us in a hanging, deadlocked state sometimes, which causes OpenLineage collection to fail and .

This PR prevents that by never calling Snowflake during OpenLineage method execution, and relying purely on information received from SQL parsing and Airflow Connection.

The negative consequence of that PR is lack of schema information in OpenLineage events, but this is a good tradeoff - most important feature of OpenLineage should be not affecting running tasks.

@RNHTTR
Copy link
Contributor

RNHTTR commented Apr 18, 2024

tests are failing ☹️

Signed-off-by: Maciej Obuchowski <obuchowski.maciej@gmail.com>
@mobuchowski mobuchowski force-pushed the snowflake-openlineage-dontuseexternalconnection branch from e94fb08 to 3b26f58 Compare April 18, 2024 15:53
@mobuchowski mobuchowski requested a review from o-nikolas as a code owner April 18, 2024 15:53
@mobuchowski
Copy link
Contributor Author

@RNHTTR not anymore :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants