Skip to content

Latest commit

 

History

History
532 lines (329 loc) · 24.7 KB

CONTRIBUTING.rst

File metadata and controls

532 lines (329 loc) · 24.7 KB

Contributing

Contributions are welcome, and they are greatly appreciated! Every little bit helps, and credit will always be given.

You can contribute in many ways:

Types of Contributions

Implement Features, Indices or Indicators

xclim's structure makes it easy to create and register new user-defined indices and indicators. For the general implementation of indices and their wrapping into indicators, refer to :ref:`notebooks/extendxclim:Extending xclim` and :ref:`notebooks/customize:Customizing and controlling xclim`.

Look through the GitHub issues for features; Anything tagged with "enhancement" and "help wanted" is open to whoever wants to implement it.

General to-do list for implementing a new Indicator:

  1. Implement the indice

  2. Add unit tests

    • Indices are best tested with made up, idealized data to explicitly test the edge cases. Many pytest fixtures are available to help this data generation.
    • Tests should be added as one or more functions in tests/test_indices.py, see other tests for inspiration.
  3. Add the indicator

  4. Add unit tests

    • Indicators are best tested with real data, also looking at missing value propagation and metadata formatting. In addition to the atmosds fixture, only datasets that can be accessed with :py:func:`xclim.testing.open_dataset` should be used. For convenience, this special function is accessible as the open_dataset pytest fixture.
    • Tests are added in the most relevant tests/test_{variable}.py file.
  5. Add French translations

    xclim comes with an internationalization module and all "official" indicators (those in xclim.atmos.indicators) must have a french translation added to src/xclim/data/fr.json. This part can be done by the core team after you open a Pull Request.

Note

If you are adding new translations to the library (for languages other than French), please begin by opening a discussion on the xclim Discussions page to coordinate the scope and implementation of these translations.

General notes for implementing new bias-adjustment methods:

  • Method are implemented as classes in src/xclim/sdba/adjustment.py.
  • If the algorithm gets complicated and would generate many dask tasks, it should be implemented as functions wrapped by :py:func:`~xclim.sdba.map_blocks` or :py:func:`~xclim.sdba.map_groups` in src/xclim/sdba/_adjustment.py.
  • xclim doesn't implement monolithic multi-parameter methods, but rather smaller modular functions to construct post-processing workflows.
  • If you are working on numba-accelerated function that uses @guvectorize, consider disabling caching during the development phase and reactivating it once all changes are ready for review. This is done by commenting cache=True in the decorator.

Report Bugs

Report bugs at https://github.com/Ouranosinc/xclim/issues.

If you are reporting a bug, please include:

  • Your operating system name and version.
  • Any details about your local setup that might be helpful in troubleshooting.
  • Detailed steps to reproduce the bug.

Fix Bugs

Look through the GitHub issues for bugs. Anything tagged with "bug" and "help wanted" is open to whoever wants to implement it.

Write Documentation

xclim could always use more documentation, whether as part of the official xclim docs, in docstrings, or even on the web in blog posts, articles, and such.

To reference documents (article, presentation, thesis, etc) in the documentation or in a docstring, xclim uses sphinxcontrib-bibtex. Metadata of the documents is stored as BibTeX entries in the docs/references.bib file. To properly generate internal reference links, we suggest using the following roles:

  • For references cited in the References section of function docstrings, use :cite:cts:`label`.
  • For in-text references with first author and year, use :cite:t:`label`.
  • For reference citations in parentheses, use :cite:p:`label`.

Multiple references can be added to a single role using commas (e.g. :cite:cts:`label1,label2,label3`). For more information see: sphinxcontrib-bibtex.

Submit Feedback

The best way to send feedback is to file an issue at: https://github.com/Ouranosinc/xclim/issues.

If you are proposing a feature:

  • Explain in detail how it would work.
  • Keep the scope as narrow as possible, to make it easier to implement.
  • The xclim development team welcomes you and is always on hand to help. :)

Get Started!

Ready to contribute? Here's how to set up xclim for local development.

  1. Fork the xclim repo on GitHub.

  2. Clone your fork locally:

    git clone git@github.com:{my_github_username}/xclim.git
    cd xclim/
  3. Create a development environment. We recommend using conda:

    conda create -n xclim python=3.10 --file=environment.yml
    python -m pip install -e --no-deps .
  4. Create a branch for local development:

    git checkout -b name-of-your-bugfix-or-feature

    Now you can make your changes locally!

  5. Before committing your changes, we ask that you install pre-commit in your development environment. Pre-commit runs git hooks that ensure that your code resembles that of the project and catches and corrects any small errors or inconsistencies when you git commit:

    # To install the necessary pre-commit hooks:
    pre-commit install
    # To run pre-commit hooks manually:
    pre-commit run --all-files

    Instead of pre-commit, you can also verify your changes using the Make recipe for code linting checks:

    make lint

    Or, alternatively, you can check individual hooks manually with black, isort, ruff, flake8, flake8-rst-docstrings, nbqa, blackdoc, and yamllint:

    black --check xclim tests
    isort --check xclim tests
    ruff xclim tests
    flake8 --config=.flake8 xclim tests
    nbqa black --check docs
    nbqa isort --check docs
    blackdoc --check --exclude=src/xclim/indices/__init__.py xclim
    blackdoc --check docs
    yamllint --config-file=.yamllint.yaml xclim
  6. When features or bug fixes have been contributed, unit tests and doctests have been added, or notebooks have been updated, use $ pytest to test them:

    pytest --no-cov --nbval --dist=loadscope --rootdir=tests/ docs/notebooks --ignore=docs/notebooks/example.ipynb  # for notebooks, exclusively.
    pytest --no-cov --rootdir=tests/ --xdoctest src/xclim  # for doctests, exclusively.
    pytest  # for all unit tests, excluding doctests and notebooks.
    pytest -m "not slow"  # for all unit tests, excluding doctests, notebooks, and "slow" marked tests.

    Alternatively, one can use $ tox to run very specific testing configurations, as GitHub Workflows would do when a Pull Request is submitted and new commits are pushed:

    tox -e py310-coverage  # run tests on Python 3.10, reporting code coverage
    tox -e py313-upstream  # run tests on Python 3.13, with upstream dependencies
    tox -e py311-prefetch-offline -- -m "not slow"  # run tests on Python 3.11, force download of testing, ensure tests are all offline, exclude "slow" marked tests
    tox -e py312-lmoments -- -m "not slow"  # run tests on Python 3.12, installing lmoments3, excluding "slow" marked tests
    tox -e notebooks,doctests  # run the notebook-based tests, then run the doctests
    
    tox -m test  # run the standard tests used in GitHub Workflows

    Warning

    Starting from xclim v0.46.0, when running tests with tox, any pytest markers passed to pyXX builds (e.g. -m "not slow") must be passed to tox directly. This can be done as follows:

    $ tox -e py310 -- -m "not slow"
    The exceptions to this rule are:

    notebooks and doctests: these configurations do not pass test markers to its pytest call. offline: this configuration runs by default with the -m "not requires_internet" test marker. Be aware that running tox and manually setting a pytest marker will override this default.

    Note

    xclim tests are organized to support the pytest-xdist plugin for distributed testing across workers or CPUs. In order to benefit from multiple processes, add the flag --numprocesses=auto or -n auto to your pytest calls.

    When running tests via tox, numprocesses is set to the number of logical cores available (numprocesses=logical), with a maximum amount of 8.

  7. Docs should also be tested to ensure that the documentation will build correctly on ReadTheDocs. This can be performed in a number of ways:

    # To run in a contained virtualenv environment
    $ tox -e docs
    # or, alternatively, to build the docs directly
    $ make docs

    Note

    When building the documentation, the default behaviour is to evaluate notebooks ('nbsphinx_execute = "auto"'), rather than simply parse the content ('nbsphinx_execute = "never"'). Due to their complexity, this is a very computationally demanding task and should only be performed when necessary (i.e.: when the notebooks have been modified).

    In order to speed up documentation builds, setting a value for the environment variable "SKIP_NOTEBOOKS" (e.g. "$ export SKIP_NOTEBOOKS=1") will prevent the notebooks from being evaluated on all subsequent "$ tox -e docs" or "$ make docs" invocations.

  8. After clearing the previous checks, commit your changes and push your branch to GitHub:

    git add *
    git commit -m "Your detailed description of your changes."

    If installed, pre-commit will run checks at this point:

    • If no errors are found, changes will be committed.
    • If errors are found, modifications will be made and warnings will be raised if intervention is needed.
    • After addressing errors and effecting changes, simply git commit again:
    git push origin name-of-your-bugfix-or-feature
  9. Submit a pull request through the GitHub website.

Pull Request Guidelines

Before you submit a pull request, please follow these guidelines:

  1. Open an issue on our GitHub repository with your issue that you'd like to fix or feature that you'd like to implement.

  2. Perform the changes, commit and push them either to new a branch within Ouranosinc/xclim or to your personal fork of xclim.

    Warning

    Try to keep your contributions within the scope of the issue that you are addressing. While it might be tempting to fix other aspects of the library as it comes up, it's better to simply to flag the problems in case others are already working on it.

    Consider adding a "# TODO:" or "# FIXME:" comment if the need arises.

  3. Pull requests should raise test coverage for the xclim library. Code coverage is an indicator of how extensively tested the library is.

    Note

    If you are adding a new set of functions, they must be tested and coverage percentage should not significantly decrease.

  4. If the pull request adds functionality, your functions should include docstring explanations. So long as the docstrings are syntactically correct, sphinx-autodoc will be able to automatically parse the information. Please ensure that the docstrings and documentation adhere to the following standards (badly formed docstrings will fail build tests):

    Note

    If you aren't accustomed to writing documentation in reStructuredText (.rst), we encourage you to spend a few minutes going over the incredibly well-summarized reStructuredText Primer from the sphinx-doc maintainer community.

  5. The pull request should work for all currently-supported Python versions as well as raise test coverage. Pull requests are also checked for documentation build status and for PEP8 compliance. The build statuses and build errors for pull requests can be found at: https://github.com/Ouranosinc/xclim/actions

    Note

    The currently-supported Python versions are loosely based on the Scientific Python Ecosystem's SPEC 0 schedule. Generally, when numpy and xarray drop support for a dependency, xclim will follow suit in a subsequent release. For more information, see the SPEC 0 Schedule

    Warning

    PEP8, black, pytest (with xdoctest) and pydocstyle (for numpy docstrings) conventions are strongly enforced. Ensure that your changes pass all tests prior to pushing your final commits to your branch. Code formatting errors are treated as build errors and will block your pull request from being accepted.

  6. The version changes (CHANGELOG.rst) should briefly describe changes introduced in the Pull request. Changes should be organized by type (ie: New indicators, New features and enhancements, Breaking changes, Bug fixes, Internal changes) and the GitHub Pull Request, GitHub Issue. Your name and/or GitHub handle should also be listed among the contributors to this version. This can be done as follows:

    Contributors to this version: John Jacob Jingleheimer Schmidt (:user:`username`).
    
    Internal changes
    ^^^^^^^^^^^^^^^^
    * Updated the contribution guidelines. (:issue:`868`, :pull:`869`).

If this is your first contribution to Ouranosinc/xclim, we ask that you also add your name to the AUTHORS.rst, under Contributors as well as to the .zenodo.json, at the end of the creators block.

Updating Testing Data

If your code changes require changes to the testing data of xclim (i.e.: modifications to existing datasets or new datasets), these changes must be made via a Pull Request at the xclim-testdata repository.

xclim allows for developers to test specific branches/versions or forks of the xclim-testdata repository via the XCLIM_TESTDATA_BRANCH and XCLIM_TESTDATA_REPO environment variables, respectively, either through export, e.g.:

export XCLIM_TESTDATA_BRANCH="my_new_branch_of_testing_data"
export XCLIM_TESTDATA_REPO="https://github.com/my_username/xclim-testdata"

pytest
# or, alternatively:
tox

or by setting the variable at runtime:

env XCLIM_TESTDATA_BRANCH="my_new_branch_of_testing_data" XCLIM_TESTDATA_REPO="https://github.com/my_username/xclim-testdata" pytest
# or, alternatively:
env XCLIM_TESTDATA_BRANCH="my_new_branch_of_testing_data" XCLIM_TESTDATA_REPO="https://github.com/my_username/xclim-testdata" tox

This will ensure that tests load the appropriate testing data from this branch or repository before running.

If you anticipate not having internet access, we suggest prefetching the testing data from xclim-testdata repository and storing it in your local cache. This can be done by running the following console command:

xclim prefetch_testing_data

If your development branch relies on a specific branch of Ouranosinc/xclim-testdata, you can specify this using environment variables:

export XCLIM_TESTDATA_BRANCH="my_new_branch_of_testing_data"
xclim prefetch_testing_data

or, alternatively, with the --branch option:

xclim prefetch_testing_data --branch my_new_branch_of_testing_data --repo "https://github.com/my_username/xclim-testdata"

If you wish to test a specific branch using GitHub CI, this can be set in .github/workflows/main.yml:

env:
  XCLIM_TESTDATA_BRANCH: my_new_branch_of_testing_data

Warning

In order for a Pull Request to be allowed to merge to the main development branch, this variable must match the latest tagged commit name on xclim-testdata repository. We suggest merging changed testing data first, tagging a new version of xclim-testdata, then re-running tests on your Pull Request at Ouranosinc/xclim with the newest tag.

Running Tests in Offline Mode

xclim testing is designed with the assumption that the machine running the tests has internet access. Many calls to xclim functions will attempt to download data or verify checksums from the Ouranosinc/xclim-testdata repository. This can be problematic for developers working on features where internet access is not reliably available.

If you wish to ensure that your feature or bugfix can be developed without internet access, xclim leverages the pytest-socket plugin so that testing can be run in "offline" mode by invoking pytest with the following options:

pytest --disable-socket --allow-unix-socket -m "not requires_internet"

or, alternatively, using tox :

tox -e offline

These options will disable all network calls and skip tests marked with the requires_internet marker. The --allow-unix-socket option is required to allow the pytest-xdist plugin to function properly.

Tips

To run a subset of tests, we suggest a few approaches. For running only a test file:

pytest tests/test_xclim.py

To skip all slow tests:

pytest -m "not slow"

To run all conventions tests at once:

pre-commit run --all-files

Versioning

In order to update and release the library to PyPI, it's good to use a semantic versioning scheme. The method we use is as follows:

major.minor.patch-release

Major releases denote major changes resulting in a stable API;

Minor is to be used when adding a module, process or set of components;

Patch should be used for bug fixes and optimizations;

Release is a keyword used to specify the degree of production readiness (dev [, and optionally, release]). Only versions built from the main development branch will ever have this marker!

Build is a keyword used to specify the build number. Only versions built from the main development branch will ever have this number!

An increment to the Major or Minor will reset the Release to beta. When a build is promoted above beta (ie: the release/stable version), it's a good idea to push this version towards PyPI.

Packaging and Deployment

This section serves as a reminder for the maintainers on how to prepare the library for a tagged version and how to deploy packages to TestPyPI and PyPI.

When a new version has been minted (features have been successfully integrated test coverage and stability is adequate), maintainers should update the pip-installable package (wheel and source release) on PyPI as well as the binary on conda-forge.

From a new branch (e.g. prepare-v123), open a Pull Request and make sure all your changes to support a new version are committed (update the entry for newest version in CHANGELOG.rst), then run:

bump-my-version bump <option>  # possible options: major / minor / patch / release / build

These commands will increment the version and create a commit with an autogenerated message.

For PyPI releases/stable versions, ensure that the last version bumping command run is $ bump-my-version bump release to remove the -dev. These changes can now be merged to the prepare-v123 branch:

git push origin prepare-v123

With this performed, we can tag a version that will act as the GitHub-provided stable source archive. Be sure to only tag from the `main` branch when all changes from PRs have been merged! The commands needed are:

git tag v1.2.3
git push --tags

Note

Starting from October, 2021, all tags pushed to GitHub will trigger a build and publish a package to TestPyPI by default. TestPyPI is a testing ground that is not indexed or easily available to pip. The test package can be found at xclim on TestPyPI.

The Automated Approach

The simplest way to package xclim is to "publish" a version on GitHub. GitHub CI Actions are presently configured to build the library and publish the packages on PyPI automatically.

When publishing on GitHub, maintainers will need to generate the release notes for the current version, replacing the :issue:, :pull:, and :user: tags. The xclim CLI offers a helper function for performing this action:

# For Markdown format (needed when publishing a new version on GitHub):
xclim release_notes -m
# For ReStructuredText format (offered for convenience):
xclim release_notes -r

Note

The changelog should not extend past those entries relevant for the current version.

Warning

A published version on PyPI can never be overwritten. Be sure to verify that the package published at https://test.pypi.org/project/xclim/ matches expectations before publishing a version on GitHub.

The Manual Approach

The manual approach to library packaging for general support (pip wheels) requires that the flit library is installed.

From the command line on your Linux distribution, simply run the following from the clone's main dev branch:

# To build the packages (sources and wheel)
flit build

# To upload to PyPI
flit publish

The new version based off of the version checked out will now be available via pip ($ pip install xclim).

Releasing on conda-forge

Initial Release

In order to prepare an initial release on conda-forge, we strongly suggest consulting the following links:

Subsequent releases

If the conda-forge feedstock recipe is built from PyPI, then when a new release is published on PyPI, regro-cf-autotick-bot will open Pull Requests automatically on the conda-forge feedstock. It is up to the conda-forge feedstock maintainers to verify that the package is building properly before merging the Pull Request to the main branch.

Before updating the main conda-forge recipe, we strongly suggest performing the following checks:
  • Ensure that dependencies and dependency versions correspond with those of the tagged version, with open or pinned versions for the host requirements.
  • If possible, configure tests within the conda-forge build CI (e.g. imports: xclim, commands: pytest xclim)