Skip to content

Latest commit

 

History

History
199 lines (171 loc) · 6.01 KB

release-2.0.0.md

File metadata and controls

199 lines (171 loc) · 6.01 KB

.NET for Apache Spark 2.0.0 Release Notes

Deprecation for Spark 2.3

We have officially dropped support for Spark 2.3 in the 2.0.0 release. The last Spark 2.3 release (2.3.4) was back in September 2019, and no new release is planned for Spark 2.3. Since there have been no new features introduced for Spark 2.3 in the last few releases of .NET for Apache Spark, if you are relying on Spark 2.3, you should be able to continue using .NET for Apache Spark 1.x.

New Features/Improvements

  • Support for Spark 3.1.0 APIs (#886, #887, #888, #889, #890, #893)
  • Support for Spark 2.4.8, 3.1.2 (#953)
  • Add Exists(string path) to Microsoft.Spark.Hadoop.Fs.FileSystem (#909)
  • Add Version() to Microsoft.Spark.SparkContext and Microsoft.Spark.Sql.SparkSession (#919)
  • Add Avro ToAvro, FromAvro APIs (#805)
  • Extensions.DotNet.Interactive add ENV var to control disposal of tmp dir (#952)
  • Add public APIs to access JvmBridge, JvmObjectReference, IJvmObjectReferenceProvider (#951)

    This is exposed to help users interact with the JVM. It is provided with limited support and should be used with caution.

Bug Fixes

  • None

Infrastructure / Documentation / Etc.

  • None

Breaking Changes

  • Add 'Z' to the string format in Timestamp.ToString() to indicate UTC time (#897)

Known Issues

  • Broadcast variables do not work with dotnet-interactive (#561)
  • UDFs defined using class objects with closures does not work with dotnet-interactive (#619)
  • In dotnet-interactive blocking Spark methods that require external threads to unblock them does not work. ie StreamingQuery.AwaitTermination requires StreamingQuery.Stop to unblock (#736)

Compatibility

Backward compatibility

The following table describes the oldest version of the worker that the current version is compatible with, along with new features that are incompatible with the worker.

Oldest compatible Microsoft.Spark.Worker version
v2.0.0

Forward compatibility

The following table describes the oldest version of .NET for Apache Spark release that the current worker is compatible with.

Oldest compatible .NET for Apache Spark release version
v2.0.0

Supported Spark Versions

The following table outlines the supported Spark versions along with the microsoft-spark JAR to use with:

Spark Version microsoft-spark JAR
2.4.0 microsoft-spark-2-4_2.11-2.0.0.jar
2.4.1
2.4.3
2.4.4
2.4.5
2.4.6
2.4.7
2.4.2 Not supported
3.0.0 microsoft-spark-3-0_2.12-2.0.0.jar
3.0.1
3.0.2
3.1.1 microsoft-spark-3-1_2.12-2.0.0.jar
3.1.2

Supported Delta Versions

The following table outlines the supported Delta versions along with the Microsoft.Spark.Extensions version to use with:

Delta Version Microsoft.Spark.Extensions.Delta
0.1.0 2.0.0
0.2.0
0.3.0
0.4.0
0.5.0
0.6.0
0.6.1
0.7.0
0.8.0

Supported Hyperspace Versions

The following table outlines the supported Hyperspace versions along with the Microsoft.Spark.Extensions version to use with:

Hyperspace Version Microsoft.Spark.Extensions.Hyperspace
0.1.0 2.0.0
0.2.0
0.3.0
0.4.0