[SPARK-3771][SQL] AppendingParquetOutputFormat should use reflection to prevent from breaking binary-compatibility. #2638
Test FAILed.
QA tests have started for PR 2638 at commit
QA tests have finished for PR 2638 at commit
A particular instance of Spark will be built for a particular version of Hadoop and/or YARN. It is not at this point a universal binary anyway, so I do not think it is necessary to add this indirection via reflection. That is, if you are deploying on Hadoop 1, you need to build Spark for Hadoop 1, and similarly for Hadoop 2.
@srowen, Thank you for your comment.
Can one of the admins verify this patch?
@marmbrus, Thank you for your comment.
QA tests have started for PR 2638 at commit
QA tests have finished for PR 2638 at commit
Test PASSed.
Thanks! Merged.
The original problem is SPARK-3764.
`AppendingParquetOutputFormat` uses a binary-incompatible method, `context.getTaskAttemptID`. This makes Spark itself binary-incompatible: if Spark is built against hadoop-1, the resulting artifact works only with hadoop-1, and vice versa.
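The approach named in the title, looking the method up by name at runtime so the call site is not bound to one Hadoop line at compile time, can be sketched roughly as below. This is a minimal Java illustration, not the code from this PR. `ContextAsClass`, `ContextAsInterface`, and `InterfaceBackedContext` are hypothetical stand-ins for the two shapes `TaskAttemptContext` can take (as far as I know, a class in hadoop-1 and an interface in hadoop-2), and the returned `String` stands in for `TaskAttemptID`.

```java
import java.lang.reflect.Method;

// Sketch only: every type here is a hypothetical stand-in, not a Hadoop or Spark class.
final class ReflectiveTaskAttemptId {

    // Stand-in for a context type that is declared as a concrete class.
    public static class ContextAsClass {
        public String getTaskAttemptID() {
            return "attempt_from_class";
        }
    }

    // Stand-in for a context type that is declared as an interface.
    public interface ContextAsInterface {
        String getTaskAttemptID();
    }

    // Implementation of the interface-shaped stand-in.
    public static class InterfaceBackedContext implements ContextAsInterface {
        @Override
        public String getTaskAttemptID() {
            return "attempt_from_interface";
        }
    }

    // Resolves getTaskAttemptID by name on the runtime class of the context,
    // so this method's bytecode holds no direct reference to either
    // the class-shaped or the interface-shaped type.
    static Object getTaskAttemptID(Object context) {
        try {
            Method method = context.getClass().getMethod("getTaskAttemptID");
            return method.invoke(context);
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException(
                "getTaskAttemptID is not callable on " + context.getClass().getName(), e);
        }
    }
}
```

Calling `ReflectiveTaskAttemptId.getTaskAttemptID(...)` with either a `ContextAsClass` or an `InterfaceBackedContext` goes through the same lookup. The trade-off is the one raised in the review comments: the reflective call gives up compile-time checking, which is only worthwhile if a single artifact really needs to run against both Hadoop lines.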