Thank you for considering contributing to Kedro-Viz! We welcome contributions in the form of pull requests, issues or code reviews. You can add to code, or simply send us spelling and grammar fixes or extra tests. Contribute anything that you think improves the community for us all!
The following sections describe our vision and the contribution process.
The Kedro team pledges to foster and maintain a welcoming and friendly community in all of our spaces. All members of our community are expected to follow our Code of Conduct and we will do our best to enforce those principles and build a happy environment where everyone is treated with respect and dignity.
We use GitHub Issues to keep track of known bugs. We keep a close eye on them and try to make it clear when we have an internal fix in progress. Before reporting a new issue, please do your best to ensure your problem hasn't already been reported. If so, it's often better to just leave a comment on an existing issue, rather than create a new one. Old issues also can often include helpful tips and solutions to common problems.
If you are looking for help with your code, please consider posting a question on Stack Overflow or our Discord channel.
If you have already checked the existing issues on GitHub and are still convinced that you have found odd or erroneous behaviour then please file a new issue. We have a template that helps you provide the necessary information we'll need in order to address your query.
If you have new ideas for Kedro-Viz functionality then please open a GitHub issue with the label Type: Enhancement
. Please describe in your own words the feature you would like to see, why you need it, and how it should work.
If you're unsure where to begin contributing to Kedro-Viz, please start by looking through the good first issues
and Request: Help Wanted
on GitHub.
Typically, small contributions to Kedro-Viz are more preferable due to an easier review process, but we accept any new features if they prove to be essential for the functioning of the plugin or if we believe that they are used by most projects.
Working on your first pull request? You can learn how from these resources:
- Fork the project
- Develop your contribution in a new branch and open a PR against the
main
branch - Make sure the CI builds are green (have a look at the section Running checks locally below)
- Update the PR according to the reviewer's comments
Note: We only accept contributions under the Apache 2.0 license and you should have permission to share the submitted code.
- Aim for cross-platform compatibility on Windows, macOS and Linux, and support recent versions of major browsers
- We use SemVer for versioning
- Our code is designed to be compatible with Python 3.9 onwards
- We use Anaconda as a preferred virtual environment
- We use PEP 8 conventions for all Python code
- We use Google docstrings for code comments
- We use PEP 484 type hints for all user-facing functions / class methods e.g.
def count_truthy(elements: List[Any]) -> int:
return sum(1 for elem in elements if elem)
- Keep UI elements fully keyboard accessible, and aim to support screen-readers where possible
- Maintain a high level of animation performance, and minimise page-load time
- Comply with the set of colours defined in the Kedro-Viz Style Guide for all colour variables and usage
- We practice Trunk-Based Development.
- We bias towards small, complete pieces of work that can be merged into the trunk. "Complete" is defined as:
- A working user-journey, however small
- A backward-compatible change that paves the way for future features implementations
- A non-breaking refactoring of the code
- Note: to be considered complete, the branch must include tests (end to end or unit tests) for the newly introduced feature or fix
- We embrace freedom to make exception when absolutely necessary.
- We use a naming convention that helps us keep track of branches in a logical, consistent way. All branches should have the hyphen-separated convention of:
<type-of-change>/<short-description-of-change>
e.g.feature/awesome-new-feature
.
Types of changes | Description |
---|---|
docs |
Changes to the documentation of the plugin |
feature |
Non-breaking change which adds functionality |
chore |
Non-breaking change which updates UX, UI, dependencies, etc. |
fix |
Non-breaking change which fixes an issue |
tests |
Changes to project unit (tests/ ) and / or integration (features/ ) tests |
We require that all contributions comply with the Developer Certificate of Origin (DCO). This certifies that the contributor wrote or otherwise has the right to submit their contribution.
All commits must be signed off by including a Signed-off-by
line in the commit message:
This is my commit message
Signed-off-by: Random J Developer <random@developer.example.org>
The sign-off can be added automatically to your commit message using the -s
option:
git commit -s -m "This is my commit message"
To avoid needing to remember the -s
flag on every commit, you might like to set up an alias for git commit -s
. Alternatively, run make sign-off
to setup a commit-msg
Git hook that automatically signs off all commits (including merge commits) you make while working on the Kedro-Viz repository.
- PRs for releasable changes must always be made against
main
. As per Trunk-based Development convention,main
branch must always be releasable. - PRs must update the release note for the upcoming release with details about the change.
- Request review on a PR only as the PR is fully ready for review, i.e. no debugging statement, no stylistic non-compliance.
- Publish a draft PR if you need informal, fast feedback on a work in progress.
- A demo build of frontend changes is automatically created at
http://kedro-viz-fe.s3-website.eu-west-2.amazonaws.com/<your-branch-name>
for every new branch. - Make sure you tick items in the PR checklist as appropriate.
- Focus on correctness, as opposed to nit-picking
- Strive for consistency with existing codebase
- Disagree and commit fast before discussion derails to bike shedding
- Mark optional review suggestions with
Optional
orNit
. These suggestions should still be respected, discussed and resolved, but the PR author can choose to disagree - Assume review suggestions and questions are made from a helpful, knowledge-sharing, collaborative frame by default
- Normalise pointing out improvements unrelated to the current PR in review, then file a GitHub issue to resolve it in another PR
Note: We suggest using the latest release of Node.js v18 in your development environment.
First clone this repo, then download and install dependencies:
npm install
Now you're ready to begin development. Start the development server:
npm start
Note: If you face any issues running this, we recommend installing Node.js v18:
- Delete
package-lock.json
andnode_modules
.- Run
npm install
to reinstall dependencies.
This will serve the app at localhost:4141, and watch files in /src
for changes. It will also update the /lib
directory, which contains a Babel-compiled copy of the source. This directory is exported to npm
, and is used when importing as a React component into another application. It is updated automatically when you save in case you need to test/debug it locally (e.g. with npm link
). You can also update it manually, by running
npm run lib
Lastly, we have in place a pre-commit and pre-push hook. Before committing, the pre-commit hook will lint and prettify your changed files. Before pushing those committed changes, the pre-push hook will run our JavaScript test suite. This ensures the local changes haven't caused any breakages, and if they have, you'll be notified and can remedy then and there (note: you may need to restart your code editor or source control application for these hooks to work properly).
Kedro-Viz uses a unique identifier to determine the data source from one of several available sources. You can configure this by appending a query string to the URL, e.g. http://localhost:4141/?data=random
. Alternatively, you can set it with an environment variable when starting up the dev server:
DATA=random npm start
These are the supported dataset identifiers:
Identifier | Data source |
---|---|
json (default) |
/public/api/main |
random |
Randomly-generated data |
demo |
/src/utils/data/demo.mock.js |
spaceflights |
/src/utils/data/spaceflights.mock.json |
By default in production, the app asynchronously loads JSON from the /api/main
endpoint. You can replicate this in development by placing a JSON dataset in /public/api/main
, using main
as the name of the file, without an extension. Note that operating systems often add hidden file extensions, so you might need to use a CLI to confirm the filename.
Alternatively, you can synchronously load one of the mock datasets in /src/utils/data
. The 'spaceflights' dataset is mainly used as mock data for unit testing.
Finally, you can use pseudo-random data, which is procedurally-generated on page load, and is often useful for local development. Random data can be seeded with a hash string, which will allow you to replicate a generated layout. You can supply a seed with a seed
query string in the URL, e.g. http://localhost:4141/?data=random&seed=oM4xauN4Whyse
. If you do not supply a seed query to the URL, the app will generate a new pseudo-random seed on every browser refresh, and will output it to the browser console in case you wish to reuse it.
Note: Kedro-Viz>=7.0.0 will not work with projects created with Kedro<=0.17.0. Please consider migrating your project to Kedro>=0.18.0 before you develop against the latest version of Kedro-Viz.
Before launching a development server with a real Kedro project, you'd need to have Python(>=3.9) installed. We strongly recommend setting up conda to manage your Python versions and virtual environments. You can visit Kedro's guide to installing conda for more information.
The Kedro-Viz repository comes with an example project in the demo-project
folder. This is used on the public demo. To use it in your development environment, you need to install both the Kedro-Viz dependencies and a minimal set of dependencies for the demo project:
pip3 install -r package/test_requirements.txt -r demo-project/src/docker_requirements.txt
Now build the application with:
make build
As far as the development server for the backend is concerned, you only need to run make build
once when you first setup the project. You can then launch the server with:
make run
This command will launch a Kedro-Viz backend server at localhost:4142 and serve data from demo-project
. If you wish to also launch the frontend app then execute npm start
in a separate terminal window. localhost:4141 will then pull data from the backend Kedro-Viz server that is running on port 4142.
If you wish to point the backend server to a different Kedro project then you can do so by altering the PROJECT_PATH
variable:
make run PROJECT_PATH=<path-to-your-test-project>/new-kedro-project
Note: Once the backend development server is launched at port 4142, the local app will always pull data from that server. To prevent this, you can comment out the proxy setting in
package.json
and restart the dev server at port 4141.
Kedro-Viz provides a SQLiteSessionStore
that users can use in their project to enable experiment tracking functionality. If you want to use this session store with the development server, make sure you don't use a relative path when specifying the store's location in settings.py
. For example, demo-project
specifies the local data
directory within a project as the session store's location as follows:
from kedro_viz.integrations.kedro.sqlite_store import SQLiteStore
SESSION_STORE_ARGS = {"path": str(Path(__file__).parents[2] / "data")}
Owing to this coupling between the project settings and Kedro-Viz, if you wish to execute any Kedro commands on demo-project
(including kedro run
), you will need to install the Kedro-Viz Python package. To install your local development version of the package, run:
pip3 install -e package
Since Kedro 0.18, a session can only contain one run. In Kedro-Viz, once a session has been retrieved from the store we always use the terminology "run" rather than "session", e.g. run_id
rather than session_id
.
- Scope out major journeys from acceptance criteria from the ticket for manual end-to-end testing
- Write any other necessary tests (e.g. unit tests, snapshot tests, etc.) needed to give us enough confidence on the implementation
Kedro-Viz uses Jest for running JavaScript tests, with Enzyme and Testing-Library to mount React components and mock the DOM. You can run tests as follows:
npm test
You can also inspect and debug tests:
npm run test:debug
And check test coverage:
npm run test:coverage
See the Create-React-App docs for further information on JS testing.
Kedro-Viz uses Cypress for running JavaScript end-to-end tests. All the Kedro-Viz dependencies need to be installed before running the tests.
Run end-to-end tests using the below command:
npm run cy:test
You can also run Cypress in head mode for active development. Please note that you need to start your frontend and backend applications separately before running the below command.
npm run cy:dev
You can simulate how the published package will behave when imported into another JavaScript application by running
npm run lib-test
This script uses npm pack
to package Kedro-Viz as a tarball, then copies it to a boilerplate React app in /tools/test-lib/react-app
and installs it. Next, the script runs a simple server at http://localhost:1337
, which will open in the browser so that you can check that everything is working as it should.
You can also run automated tests on this demo app by copying and installing the tarball package, navigating to the test directory and running Jest:
npm run lib-test:setup
cd tools/test-lib/react-app
npm test
Before running Python tests, install test requirements:
pip install -r test_requirements.txt
make pytest
make e2e-tests
make lint
Python dependencies in Kedro-Viz are usually updated automatically through tools like Dependabot or triggered by deprecation warnings. When updating dependencies, especially the lower-bound version, it's essential to also update the lower-requirements.txt
file manually. This ensures that our e2e tests reflect the latest dependencies, maintaining project stability and compatibility.
If the lower-bound e2e test fails, it indicates that some dependencies may not work correctly with Kedro-Viz. To resolve this, update the problematic dependency in both requirements.txt
and lower-requirements.txt
.
Please ensure you have installed Gitpod on your browser extensions - Gitpod browser extension
1. Once you open a pull request, click on the Open
icon on the right
2. It will open the gitpod workspace automatically
3. Please wait until the frontend and backend builds are successful. You can check it by clicking on frontend
and backend
tabs in the terminal. (Estimated build time : 5 minutes)
4. Unblock pop-up ads and refresh your browser. Kedro-Viz should be running on your gitpod workspace
- Practice frequent, staggered releases.
- Create a branch for release. The branch name should follow the pattern
release/* (e.g. release/6.4.0)
- Update RELEASE.md with the latest release note.
Note: The heading for the release note should follow the pattern
# Release <release-number> (e.g. # Release 6.4.0)
- Bump the release version number by running:
make version VERSION=<version-to-release>
Notes: Kedro-Viz uses Semantic Versioning as the versioning scheme.
- Commit, raise a PR and get it merged into
main
. - Once the release commit is in
main
, a GitHub action gets triggered which automatically publishes a release with a version tag and release notes based on RELEASE.md.
Note: The release tag corresponds to
__version__
frompackage/kedro_viz/__init__.py
and the release notes is the content under the heading# Release <release-number> (e.g. # Release 6.4.0)
in RELEASE.md.
- Once the release tag is published on GitHub, a CircleCI job will be triggered to push it to npm and PyPI accordingly.
- Once the new version is on PyPI, you can deploy it to https://demo.kedro.org by merging
main
into thedemo
branch. A CI job will automatically build a container using the demo project with the newly released version and deploy it.
git checkout demo
git merge main
git push