Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-43920][SQL][CONNECT] Create sql/api module #41426

Closed
wants to merge 5 commits into from

Conversation

amaliujia
Copy link
Contributor

@amaliujia amaliujia commented Jun 1, 2023

What changes were proposed in this pull request?

We need a sql/api module to host public API like DataType, Row, etc. This module can be shared between Catalyst and Spark Connect client so that client do not need to depend on Catalyst anymore.

Why are the changes needed?

Towards Spark Connect client do not need to depend on Catalyst anymore.

Does this PR introduce any user-facing change?

No

How was this patch tested?

N/A

@amaliujia
Copy link
Contributor Author

@hvanhovell @cloud-fan

<sbt.project.name>sql-api</sbt.project.name>
</properties>

<dependencies>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we add common/util as a dependency?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I will add it once we move code into this module.

Copy link
Contributor

@hvanhovell hvanhovell left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Contributor

@LuciferYang LuciferYang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sqlApi module shouldn't participate in mima check in the current version

@amaliujia
Copy link
Contributor Author

@LuciferYang right. I updated the SparkBuild to exclude it.

@LuciferYang
Copy link
Contributor

Waiting CI

@hvanhovell
Copy link
Contributor

Merging.

@hvanhovell hvanhovell closed this in 15202e5 Jun 8, 2023
Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1, LGTM.

Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

BTW, sorry for asking this question after merging.

  1. What is the minimal target for migration in order to help connect module? According to the PR description, Row is also mentioned. So,sql/catalyst/src/main/scala/org/apache/spark/sql/types is not enough?
  2. Does it cause many MIMA exceptions against old catalyst module?

@amaliujia
Copy link
Contributor Author

BTW, sorry for asking this question after merging.

  1. What is the minimal target for migration in order to help connect module? According to the PR description, Row is also mentioned. So,sql/catalyst/src/main/scala/org/apache/spark/sql/types is not enough?
  2. Does it cause many MIMA exceptions against old catalyst module?

In current scope, Row, coders, data types are required. However we may discover more during the process.

@dongjoon-hyun
Copy link
Member

Got it. Let's see it in the actual PR~

czxm pushed a commit to czxm/spark that referenced this pull request Jun 12, 2023
### What changes were proposed in this pull request?

We need a sql/api module to host public API like DataType, Row, etc. This module can be shared between Catalyst and Spark Connect client so that client do not need to depend on Catalyst anymore.

### Why are the changes needed?

Towards Spark Connect client do not need to depend on Catalyst anymore.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

N/A

Closes apache#41426 from amaliujia/add_sql_api.

Authored-by: Rui Wang <rui.wang@databricks.com>
Signed-off-by: Herman van Hovell <herman@databricks.com>
HyukjinKwon pushed a commit that referenced this pull request Jul 25, 2024
### What changes were proposed in this pull request?
The pr aims to update doc `sql/README.md`.

### Why are the changes needed?
After #41426, We have added a subproject `API` to our `SQL moudle`, so we need to update the doc `sql/README.md` synchronously.

### Does this PR introduce _any_ user-facing change?
Yes, make the doc clearer and more accurate.

### How was this patch tested?
Manually test.

### Was this patch authored or co-authored using generative AI tooling?
No.

Closes #47476 from panbingkun/minor_docs.

Authored-by: panbingkun <panbingkun@baidu.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
ilicmarkodb pushed a commit to ilicmarkodb/spark that referenced this pull request Jul 29, 2024
### What changes were proposed in this pull request?
The pr aims to update doc `sql/README.md`.

### Why are the changes needed?
After apache#41426, We have added a subproject `API` to our `SQL moudle`, so we need to update the doc `sql/README.md` synchronously.

### Does this PR introduce _any_ user-facing change?
Yes, make the doc clearer and more accurate.

### How was this patch tested?
Manually test.

### Was this patch authored or co-authored using generative AI tooling?
No.

Closes apache#47476 from panbingkun/minor_docs.

Authored-by: panbingkun <panbingkun@baidu.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
fusheng-rd pushed a commit to fusheng-rd/spark that referenced this pull request Aug 6, 2024
### What changes were proposed in this pull request?
The pr aims to update doc `sql/README.md`.

### Why are the changes needed?
After apache#41426, We have added a subproject `API` to our `SQL moudle`, so we need to update the doc `sql/README.md` synchronously.

### Does this PR introduce _any_ user-facing change?
Yes, make the doc clearer and more accurate.

### How was this patch tested?
Manually test.

### Was this patch authored or co-authored using generative AI tooling?
No.

Closes apache#47476 from panbingkun/minor_docs.

Authored-by: panbingkun <panbingkun@baidu.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
attilapiros pushed a commit to attilapiros/spark that referenced this pull request Oct 4, 2024
### What changes were proposed in this pull request?
The pr aims to update doc `sql/README.md`.

### Why are the changes needed?
After apache#41426, We have added a subproject `API` to our `SQL moudle`, so we need to update the doc `sql/README.md` synchronously.

### Does this PR introduce _any_ user-facing change?
Yes, make the doc clearer and more accurate.

### How was this patch tested?
Manually test.

### Was this patch authored or co-authored using generative AI tooling?
No.

Closes apache#47476 from panbingkun/minor_docs.

Authored-by: panbingkun <panbingkun@baidu.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants