Modifying interleaved fastq format to be hadoop version independent. #289
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Final modifications. Removed the isSplittable function, as the method signature depends on the Hadoop version (specifically, the InputFileFormat API changes from Hadoop 2.2->2.3). Now, we inherit directly from the base InputFileFormat implementation, which always returns true (i.e., the file can be split). As a condition of this, we must disallow compressed interleaved FASTQ files. This condition is OK because compressed files cannot be split, and the interleaved FASTQ format (which is our own ad hoc definition of a file format) is only used to make splitting simpler.
Also, I noticed that the adam-core module still incorrectly depended on the old adam-format submodule. This was not causing tests to fail because Sonatype still contains a snapshot of adam-format. We should probably delete that snapshot...