Ozone is a part of the Apache Hadoop project. The bug tracking system for Ozone is under the Apache Jira project named HDDS.
If you are familiar with contributing to Apache Hadoop, then you already know everything you need to know to start filing Ozone bugs and submitting patches.
If you have never contributed to Apache Hadoop before, then you may find it useful to read How To Contribute.
This document summarizes the contribution process and describes how it differs from the Hadoop process.
We welcome contributions of:
- Code. File a bug and submit a patch, or pick up any one of the unassigned Jiras.
- Documentation Improvements: You can submit improvements to either:
- Ozone website. Instructions are here: Modifying the Ozone Website
- Developer docs. These are markdown files checked into the Apache Hadoop Source tree.
- The wiki pages: Please contact us at ozone-dev@hadoop.apache.org and we can provide you write access to the wiki.
- Testing: We always need help to improve our testing
- Unit Tests (JUnit / Java)
- Acceptance Tests (docker + robot framework)
- Blockade tests (python + blockade)
- Performance: We have multiple types of load generator / benchmark tools (ozone freon, ozone genesis), which can be used to test a cluster and report problems.
- Bug reports pointing out broken functionality, docs, or suggestions for improvements are always welcome!
If you have any questions, please don't hesitate to contact us:
- email: use ozone-dev@hadoop.apache.org.
- chat: You can find the #ozone channel at the ASF slack. Invite link is here
- meeting: We have weekly meetings which are open to anybody. Feel free to join and ask any questions.
Requirements to compile the code:
- Unix System
- JDK 1.8 or higher
- Maven 3.5 or later
- Internet connection for first build (to fetch all Maven and Hadoop dependencies)
Additional requirements to run your first pseudo cluster:
- docker
- docker-compose
Additional requirements to execute different types of tests:
- Robot framework (for executing acceptance tests)
- docker-compose (to start pseudo cluster, also used for blockade and acceptance tests)
- blockade (for executing network fault-injection tests)
- jq (for executing acceptance tests)
Optional dependencies:
- hugo to include the documentation in the web ui.
(Standard development tools such as make, gcc, etc. are required.)
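Before the first build it can help to confirm that the required tools are on your PATH. The following is a minimal sketch, not part of the Ozone tooling: the function name require_tools is hypothetical, and it only checks that a command exists, not that its version is new enough.

```shell
# Hypothetical helper: report which of the given commands are missing from PATH.
# Prints one "missing: <name>" line per absent tool and returns non-zero
# if at least one tool is absent. Versions are NOT checked.
require_tools() {
  local missing=0
  local tool
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing: $tool"
      missing=1
    fi
  done
  return "$missing"
}
```

For example, `require_tools java mvn docker docker-compose` covers the tools needed to compile the code and start a pseudo cluster; add `jq` or `hugo` if you need them.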
After installing the requirements (especially maven) the build is as simple as:
mvn clean install -DskipTests
And you can start your first cluster:
cd hadoop-ozone/dist/target/ozone-*-SNAPSHOT/compose/ozone
docker-compose up -d --scale datanode=3
The hadoop-ozone/dev-support/checks directory contains helper scripts to build and check your code (including findbugs and checkstyle). Use them if you don't know the exact maven goals / parameters.
These scripts are executed by the CI servers, so it's always good to run them locally before creating a PR.
- Use -DskipShade to exclude ozonefs jar file creation from the release. It's much faster, but you can't test the Hadoop Compatible file system.
- Use -DskipRecon to exclude the Recon build (Web UI and monitoring) from the build. It saves about 2 additional minutes.
- Use -Pdist to build a distribution (without this profile you won't have the final tar file).
- Use -Pdocker-build to build a docker image which includes Ozone.
- Use -Ddocker.image=repo/name to define the name of your docker image.
- Use -Pdocker-push to push the created docker image to the docker registry.
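The flags above can be appended to the basic build command. The sketch below only prints a command line so you can inspect it before running it. The function name ozone_build_cmd and the mode names "fast", "dist" and "docker" are made up for this example, and whether -Pdocker-build must be combined with other profiles is not stated here, so treat the "docker" line as an untested assumption.

```shell
# Hypothetical helper: print (not run) a Maven command line for a build mode.
# The mode names are invented for this sketch; the flags are the documented ones.
ozone_build_cmd() {
  local cmd="mvn clean install -DskipTests"
  case "${1:-}" in
    # Quick local build: skip the shaded ozonefs jar and the Recon build.
    fast)   cmd="$cmd -DskipShade -DskipRecon" ;;
    # Build the final tar file as well.
    dist)   cmd="$cmd -Pdist" ;;
    # Build a docker image; the image name defaults to the placeholder repo/name.
    docker) cmd="$cmd -Pdocker-build -Ddocker.image=${2:-repo/name}" ;;
    *)      echo "unknown mode: ${1:-}" >&2; return 1 ;;
  esac
  echo "$cmd"
}
```

Calling `ozone_build_cmd fast` prints the basic build command with -DskipShade and -DskipRecon appended. Printing instead of executing keeps the helper safe to experiment with.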
We use github pull requests instead of uploading patches to JIRA. The main contribution workflow is as follows:
- Fork the apache/hadoop-ozone github repository (first time)
- Create a new Jira in the HDDS project (eg. HDDS-1234)
- Create a local branch for your contribution (eg. git checkout -b HDDS-1234)
- Create your commits and push your branches to your personal fork.
- Create a pull request on github UI
- Please include the Jira link, problem description and testing instruction
- Set the Jira to "Patch Available" state
- Address any review comments if applicable by pushing new commits to the PR.
- When addressing review comments, there is no need to squash your commits. This makes it easy for reviewers to only review the incremental changes. The committer will take care to squash all your commits before merging to master.
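The command-line part of this workflow can be sketched as a small dry-run helper. It prints the git commands for one contribution instead of running them. The function name pr_workflow is hypothetical, the remote name "origin" is assumed to point at your personal fork, and the commit message format is an assumption rather than a documented rule.

```shell
# Hypothetical helper: print the git commands for one contribution.
# Assumptions: "origin" is your personal fork of apache/hadoop-ozone,
# and the commit message format "<jira>. <summary>" is only illustrative.
pr_workflow() {
  local jira="${1:-}"
  local summary="${2:-}"
  # Accept only ids that start with "HDDS-" followed by a digit.
  case "$jira" in
    HDDS-[0-9]*) ;;
    *) echo "expected a Jira id like HDDS-1234, got: $jira" >&2; return 1 ;;
  esac
  echo "git checkout -b $jira"
  echo "git commit -m \"$jira. $summary\""
  echo "git push origin $jira"
}
```

After the push, the pull request itself is created in the github UI. Later review fixes are pushed to the same branch as additional commits.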
We follow the code convention of the Hadoop project (2 spaces instead of tabs, 80 char line width, ASF licence headers). The code is checked with checkstyle, findbugs and various test frameworks.
Please don't post / commit any code with violations. Unlike the checks in Hadoop, which report only newly introduced violations, these checks report all existing violations.
The easiest way to check your contribution is using the simplified shell scripts under hadoop-ozone/dev-support/checks. The problems will be printed out on the standard output.
For example:
hadoop-ozone/dev-support/checks/rat.sh
hadoop-ozone/dev-support/checks/checkstyle.sh
hadoop-ozone/dev-support/checks/findbugs.sh
Executing rat and checkstyle is very fast. Findbugs is slightly slower. Executing unit.sh takes about 30 minutes.
The same scripts are executed by the github PR checker.
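To run several checks in one go and get a summary at the end, a small wrapper can help. This is a minimal sketch, not an official script: the function name run_checks is hypothetical, and it assumes that each check script is executable and exits with a non-zero status when it finds problems, which is not confirmed here.

```shell
# Hypothetical wrapper: run the named check scripts from a directory
# and list the ones that fail.
# Assumption: each <name>.sh is executable and exits non-zero on failure.
run_checks() {
  local dir="$1"
  shift
  local failed=""
  local check
  for check in "$@"; do
    echo "== $check =="
    # Keep going after a failure so that every check gets a chance to run.
    if ! "$dir/$check.sh"; then
      failed="$failed $check"
    fi
  done
  if [ -n "$failed" ]; then
    echo "failed checks:$failed"
    return 1
  fi
  echo "all checks passed"
}
```

From the project root, `run_checks hadoop-ozone/dev-support/checks rat checkstyle findbugs` would run the three fast checks and report which of them failed.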
It's always good practice (and fast) to test with the related docker-compose based pseudo clusters:
cd hadoop-ozone/dist/target/ozone-*-SNAPSHOT/compose/ozone
./test.sh
(To test S3 use compose/ozones3, to test security use compose/ozonesecure, etc.)
If you have very good reasons, you can ignore any Findbugs warning. Your reason can be recorded with the @SuppressFBWarnings annotation.
@SuppressFBWarnings(value="AT_OPERATION_SEQUENCE_ON_CONCURRENT_ABSTRACTION",
justification="The method is synchronized and this is the only place "+
"dnsToUuidMap is modified")
private synchronized void addEntryTodnsToUuidMap(
...
As Ozone uses Apache Maven it can be developed from any IDE. As IntelliJ is a common choice, here are some suggestions to use it for Ozone development:
Ozone components depend on the maven classpath. We generate a classpath descriptor from the maven pom.xml files to use exactly the same classpath at runtime.
As a result, it's easy to start all the components from IDE as the right classpath (without provided scope) has already been set.
To start Ozone from IntelliJ:
- Stop your IDE
- Execute the ./hadoop-ozone/dev-support/intellij/install-runconfigs.sh helper script.
- Start the IDE
- New runner definitions are available from the Run menu.
Checkstyle plugin may help to detect violations directly from the IDE.
- Install the Checkstyle+IDEA plugin from File -> Settings -> Plugins
- Open File -> Settings -> Other settings -> Checkstyle and Add (+) a new Configuration File
- Description: Ozone
- Use a local checkstyle ./hadoop-hdds/dev-support/checkstyle/checkstyle.xml
- Check the pom.xml for the current version of the used checkstyle and use the same version with the plugin (File -> Settings -> Other settings -> Checkstyle)
- Open the Checkstyle Tool Window, select the Ozone rule and execute the check
IntelliJ may not pick up protoc generated classes as they can be very large. If the protoc files can't be compiled, try the following:
- Open Help -> Edit custom properties menu.
- Add the idea.max.intellisense.filesize=5000 entry
- Restart your IDE