Skip to content
This repository has been archived by the owner on Sep 18, 2023. It is now read-only.

Encountered Hadoop version (3.2.1) conflict issue on AWS EMR-6.3.0 #368

Closed
haojinIntel opened this issue Jun 16, 2021 · 8 comments · Fixed by #402
Closed

Encountered Hadoop version (3.2.1) conflict issue on AWS EMR-6.3.0 #368

haojinIntel opened this issue Jun 16, 2021 · 8 comments · Fixed by #402
Assignees
Labels

Comments

@haojinIntel
Copy link
Collaborator

For OAP-1.2, we target to support all features on cloud. We choose emr-6.3.0 and the versions of software are showed below:
image
We try to start spark-shell after adding extra configurations for native-sql-engine and we encounter the issue like:
image
Is the issued caused by incompatible hadoop version?

@haojinIntel
Copy link
Collaborator Author

@zhouyuan @weiting-chen @zhixingheyi-tian Please help to track the issue. Thanks!

@zhouyuan
Copy link
Collaborator

please try to re-build your package by -Phadoop-3.2

@weiting-chen
Copy link
Collaborator

Yes, I can confirm this issue is from hadoop version support.
I can pass this issue in AWS EMR6.3.0 by using hadoop.version 3.2.1 in pom.xml
Please notice "-Phadoop-3.2" is using hadoop 3.2.0, I am not sure it can pass in AWS EMR.

@weiting-chen
Copy link
Collaborator

After the compile has been passed, another issue is NoSuchMethodError when creating external tables.
Add screenshot in below:
screenshot

@zhouyuan
Copy link
Collaborator

the CaseInsensitiveMap is a util class under spark catalyst
can you try to add spark-catalyst_2.12-3.1.1.jar to your classpath?
https://mvnrepository.com/artifact/org.apache.spark/spark-catalyst_2.12/3.1.1

@haojinIntel
Copy link
Collaborator Author

@zhouyuan @weiting-chen After I rebuild native-sql-engine by "mvn clean package -am -DskipTests -Dcpp_tests=OFF -Dbuild_arrow=OFF -Dstatic_arrow=OFF -Dbuild_protobuf=ON -Phadoop-3.2", we meet new issue like:
image

@weiting-chen weiting-chen self-assigned this Jun 17, 2021
@weiting-chen
Copy link
Collaborator

Let's use Hadoop 3.2.1 and ignore the error in 3.2.0 since AWS EMR 6.3.0 supports 3.2.1.
I will try to build one version that can run on AWS EMR 6.3.0 first.
We can use this issue to track the status.

@zhixingheyi-tian zhixingheyi-tian changed the title Support to use native-sql-engine on AWS EMR-6.3.0. Encountered Hadoop version (3.2.1) conflict issue on AWS EMR-6.3.0 Jun 30, 2021
@weiting-chen
Copy link
Collaborator

Done to test TPC-DS on AWS EMR 6.3.0.

  1. Pass all the 99 TPC-DS queries in both Datasource v1 and v2 API(turn of adaptive). There are some configurations that must be set for AWS EMR.
  2. Fail to run TPC-DS when adaptive is enabled, the error message "NoSuchMethodError" shows in below screenshot:
    adaptive_error

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants