make dist land in maven style directory pattern #57

robert3005 · 2016-11-16T14:06:07Z

Changes the place where we put dist so it conforms to maven layout

org/apache/spark/spark-dist/$version/spark-dist-$version.tgz

@pwoody @ash211 @mccheah . I will have to merge it to test.

…conf ## What changes were proposed in this pull request? This is an effort to reduce the difference between Hive and Spark. Spark supports case-sensitivity in columns. Especially, for Struct types, with `spark.sql.caseSensitive=true`, the following is supported. ```scala scala> sql("select named_struct('a', 1, 'A', 2).a").show +--------------------------+ |named_struct(a, 1, A, 2).a| +--------------------------+ | 1| +--------------------------+ scala> sql("select named_struct('a', 1, 'A', 2).A").show +--------------------------+ |named_struct(a, 1, A, 2).A| +--------------------------+ | 2| +--------------------------+ ``` And vice versa, with `spark.sql.caseSensitive=false`, the following is supported. ```scala scala> sql("select named_struct('a', 1).A, named_struct('A', 1).a").show +--------------------+--------------------+ |named_struct(a, 1).A|named_struct(A, 1).a| +--------------------+--------------------+ | 1| 1| +--------------------+--------------------+ ``` However, types are considered different. For example, SET operations fail. ```scala scala> sql("SELECT named_struct('a',1) union all (select named_struct('A',2))").show org.apache.spark.sql.AnalysisException: Union can only be performed on tables with the compatible column types. struct<A:int> <> struct<a:int> at the first column of the second table;; 'Union :- Project [named_struct(a, 1) AS named_struct(a, 1)#57] : +- OneRowRelation$ +- Project [named_struct(A, 2) AS named_struct(A, 2)#58] +- OneRowRelation$ ``` This PR aims to support case-insensitive type equality. For example, in Set operation, the above operation succeed when `spark.sql.caseSensitive=false`. ```scala scala> sql("SELECT named_struct('a',1) union all (select named_struct('A',2))").show +------------------+ |named_struct(a, 1)| +------------------+ | [1]| | [2]| +------------------+ ``` ## How was this patch tested? Pass the Jenkins with a newly add test case. Author: Dongjoon Hyun <dongjoon@apache.org> Closes apache#18460 from dongjoon-hyun/SPARK-21247.

…ntir#57)

Robert Kruszewski added 2 commits November 16, 2016 13:57

make dist land in maven style directory pattern

11b8c46

tar the same thing

2c39532

robert3005 merged commit 6928f1a into master Nov 17, 2016

robert3005 deleted the robertk/publish-dist-maven-style branch November 17, 2016 18:18

robert3005 mentioned this pull request Nov 18, 2016

Publish dist maven style #47

Closed

16pierre pushed a commit to 16pierre/spark that referenced this pull request May 24, 2021

Improvement: Conda environments symlink into the Python tempdir (pala…

9a3a628

…ntir#57)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

make dist land in maven style directory pattern #57

make dist land in maven style directory pattern #57

robert3005 commented Nov 16, 2016

make dist land in maven style directory pattern #57

make dist land in maven style directory pattern #57

Conversation

robert3005 commented Nov 16, 2016