All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog.
- Added metadata caching using the `fs.oci.caching.object.metadata.enabled` and `fs.oci.caching.object.metadata.spec` configuration keys. Note that there is no check for consistency, and if your data in Object Storage changes, the cache may return outdated data. Therefore, it is most appropriate when your data is read-only and does not change. Use caution when applying these settings.
- Added read-ahead and parquet caching. The read-ahead feature is configured using `fs.oci.io.read.ahead` and `fs.oci.io.read.ahead.blocksize`. Parquet caching, which requires `fs.oci.io.read.ahead=true`, is controlled using `fs.oci.caching.object.parquet.enabled` and `fs.oci.caching.object.parquet.spec`. Note that there is no check for consistency, and if your data in Object Storage changes, the cache may return outdated data. Therefore, it is most appropriate when your data is read-only and does not change. Use caution when applying these settings.
- Added Jersey client logging, configured using `fs.oci.client.jersey.logging.enabled`, `fs.oci.client.jersey.logging.level`, and `fs.oci.client.jersey.logging.verbosity`.
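
The dependency between parquet caching and read-ahead noted above is easy to get wrong in a hand-edited configuration. The following is a minimal sketch of a pre-flight check, assuming the settings have already been loaded into a plain dictionary of strings. The helper name `parquet_cache_misconfigured` is hypothetical and is not part of the connector; only the two property names come from this changelog.

```python
def parquet_cache_misconfigured(conf):
    """Return True if parquet caching is enabled while read-ahead is not.

    `conf` is assumed to be a dict mapping property names to string values;
    this is an illustrative check, not the connector's own validation.
    """
    def is_true(key):
        # Treat a missing key as "false"; compare case-insensitively.
        return str(conf.get(key, "false")).strip().lower() == "true"

    return (is_true("fs.oci.caching.object.parquet.enabled")
            and not is_true("fs.oci.io.read.ahead"))
```

A deployment script could call this before submitting a job and refuse to continue when it returns `True`.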
- Updated to Hadoop version 3.3.0
- Updated to OCI Java SDK version 1.33.1
- Updated to OCI Java SDK version 1.25.2
- Fixed a potential data corruption problem with `RefreshableOnNotAuthenticatedProvider`. We recommend that you update to version 3.2.1.3 or later. For details, see oracle#35
- Updated to OCI Java SDK version 1.23.1
- Updated to Hadoop version 3.2.1
- Updated to OCI Java SDK version 1.22.1
- Release incorporates the `hdfs-full` module.
- Updated to OCI Java SDK version 1.22.0
- Updated to OCI Java SDK version 1.17.5
- Updated to OCI Java SDK version 1.17.0
- Added DelayStrategy that resets the exponential backoff between retries after reaching a maximum time, configurable using `fs.oraclebmc.client.retry.reset.threshold.seconds`
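
To illustrate the idea of a backoff that resets, here is a minimal sketch. It is not the connector's DelayStrategy: the function name, the doubling factor, and the interpretation that the delay resets once it would exceed the threshold are all assumptions made for this example.

```python
def backoff_delays(base_seconds, reset_threshold_seconds, attempts):
    """Return the delay before each retry for a resetting exponential backoff.

    Assumption: the delay doubles after every attempt and falls back to
    `base_seconds` once the next delay would exceed the threshold.
    """
    delays = []
    delay = base_seconds
    for _ in range(attempts):
        delays.append(delay)
        delay *= 2
        if delay > reset_threshold_seconds:
            # Start the exponential sequence over instead of growing further.
            delay = base_seconds
    return delays
```

With a base of 1 second and a threshold of 8 seconds, six attempts give the delays 1, 2, 4, 8, 1, 2, so a long-running retry loop never waits longer than the threshold between attempts.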
- Updated to Java SDK version 1.14.0
- Updated to Java SDK version 1.6.2
- Updated version number to stem from Hadoop version 2.9.2
- Fix race condition in `BmcFileBackedOutputStream#createBufferFile`
- Support for retries upon failures. Retry timeout is configurable via `fs.oci.client.retry.timeout.seconds`
- Updated to Java SDK version 1.5.12
- BmcDirectFSInputStream#read now attempts to retry the read from the service when an IOException is thrown
- Updated to Java SDK version 1.4.2
- Added relocation for shaded packages `javax.annotation`, `javax.validation` and `javax.inject`
- Updated version number to stem from Hadoop version 2.7.7
- Updated to latest Java SDK (1.2.49) to leverage the updated Object Storage UploadManager with HTTP proxy support
- The configuration option `MULTIPART_MIN_PART_SIZE_IN_MB` is now deprecated in favor of `MULTIPART_PART_SIZE_IN_MB` to correspond with the configuration changes for the UploadManager in the Java SDK
- Bouncy Castle and JSR-305 jars are no longer bundled within the distribution jar and must now be included in the Hadoop CLASSPATH. Required third-party jars are bundled under the `third-party/lib` folder of the distribution zip archive
- Support for configuring an HTTP proxy. More information can be found here
- Disabled caching of stale key id and private key in the `InstancePrincipalsCustomAuthenticator` class
- Updated to latest Java SDK (1.2.42) to pick up bug fixes
- Enabled progress reporting to Application Master during upload operation
- Enabled usage in a Hadoop deployment with Kerberos
- Updated to latest Java SDK (1.2.41) to pick up bug fixes
- Added build instructions and fixed broken GitHub links in README
- Updated version number to stem from Hadoop version 2.7.2
- Release to GitHub
- Support instance principals authentication
- Replaced copy+delete rename operation with renameObject to improve performance
- Fetching the private key password now uses 'getPassword' from the Configuration instead of getting the string in plaintext
- Added ability to override configuration based on bucket and namespace being accessed
- Maven packages renamed from "oracle-bmc-" to "oci-" (group id renamed from "com.oracle.bmc.sdk" to "com.oracle.oci.sdk")
- Renamed configuration properties (from "oraclebmc" to "oci"); old properties are deprecated (see "Deprecated" below).
- Renamed HDFS scheme (from "oraclebmc" to "oci"); old scheme is deprecated (see "Deprecated" below).
- HTTP user agent changed from "Oracle-BMC_HDFS_Connector/" to "Oracle-HDFS_Connector/"
- The old configuration properties ("oraclebmc") are deprecated; please use ("oci") instead. The old properties still work for backward compatibility, as long as the corresponding new property isn't set at the same time.
- The old HDFS scheme ("oraclebmc") is deprecated; please use "oci" instead. The old scheme still works for backward compatibility.
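
The backward-compatibility rule for renamed properties can be pictured as a lookup with a fallback. The sketch below is illustrative only and does not reproduce the connector's code: the helper `resolve_property`, the key `fs.oci.example.setting`, and the choice to let the new key win when both are present are assumptions, since the changelog does not say what happens in that case.

```python
def resolve_property(conf, new_key):
    """Look up `new_key`, falling back to its deprecated "oraclebmc" name.

    `conf` is assumed to be a dict of property names to string values.
    """
    # Derive the deprecated name by swapping the "oci" segment for "oraclebmc".
    old_key = new_key.replace("fs.oci.", "fs.oraclebmc.", 1)
    if new_key in conf:
        # Assumption: the new name takes precedence when both are set.
        return conf[new_key]
    return conf.get(old_key)
```

For example, `resolve_property({"fs.oraclebmc.example.setting": "a"}, "fs.oci.example.setting")` returns `"a"`, so a configuration that still uses only the old names keeps working.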
- Updated to latest Java SDK (1.2.5) to pick up change for request id truncation (to fix multipart uploads)
- Changed properties and constants to allow for more useful documentation
- Updated maven shade plugin to non-snapshot version
- Internal changes for how properties are loaded
- Support to use multi-part uploads when saving files
- Configuration options to tune multi-part upload behavior (or disable it)
- Bug in directory listing resulting in duplicate directories
- Concurrency issue when creating directory placeholders
- Improved "list directory" performance for large directories
- Using correct Date header for object creation time
- Bug with seek operation
- Updated to Oracle Cloud Infrastructure Java SDK 1.2.0
- Shading a few more dependencies (hk2)
- Doc updates
- Abstract Filesystem to support usage within YARN and Spark
- Updated to Oracle BMCS Java SDK 1.1.0 to pick up bug fixes
- License/copyright headers added to all source files as part of the build
- Now relocating shaded packages for Bouncycastle, Apache Commons, Glassfish
- Updated to Oracle Cloud Infrastructure Java SDK 1.0.1 to pick up bug fixes
- Including MD5 validation during copy operations
- Initial Release
- Support added for Hadoop 2.7.2 using Oracle Cloud Infrastructure Services Java SDK 1.0.0