commands This module contains the cli commands.
configs This module contains estimator configuration files.
datasets This module contains different datasets. The dataset classes contain knowledge on how the dataset should be loaded into memory.
estimators This module contains estimators, which are used for training and evaluating models on the datasets.
evaluation_metrics This module contains metrics that are used by the different estimators and are specified in the estimator config file.
io This module contains functionality that relates to writing/downloading/uploading to/from different sources.
stats This module contains code for visualizing and gathering statistics on the dataset.

Unit testing

We use pytest to run tests located under tests/. Run the entire test suite with

pytest

or run an individual test file, for example:

pytest tests/test_visual.py
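As a minimal sketch of what a test file under tests/ might look like, the example below defines a small helper and two tests for it. The file name `tests/test_example.py` and the helper `clip_to_unit_interval` are hypothetical and are not part of datasetinsights.

```python
# tests/test_example.py (hypothetical file name, for illustration only)


def clip_to_unit_interval(value):
    """Clamp a number to the range [0, 1]. Hypothetical helper."""
    return max(0.0, min(1.0, float(value)))


def test_clip_keeps_values_inside_range():
    # A value already inside [0, 1] should come back unchanged.
    assert clip_to_unit_interval(0.25) == 0.25


def test_clip_limits_values_outside_range():
    # Values below 0 or above 1 should be pulled to the nearest bound.
    assert clip_to_unit_interval(-3) == 0.0
    assert clip_to_unit_interval(7) == 1.0
```

With its default discovery rules, pytest collects functions whose names start with `test_` from files named `test_*.py`, so a file like this would run as part of the full suite or on its own when passed by path.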

Style Guide

We follow the Black code style for this repository, with the maximum line length set to 80. We enforce this style by formatting Python code with Black, and we use isort to sort Python imports.

Before submitting a pull request, run:

pre-commit run --all-files

Fix all issues highlighted by flake8. To allow an exception, such as a long URL line in a docstring, add # noqa: E501 <describe reason> to the specific line that violates the rule. See the flake8 documentation to learn more about how to ignore flake8 errors.
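The snippet below is a small sketch of how such an exception might look on a single long line. The constant name and the URL are placeholders invented for illustration; only the `# noqa: E501` marker followed by a short reason comes from the guideline above.

```python
# Hypothetical constant; the URL is a placeholder, not a real project link.
# The trailing marker asks flake8 to skip the line-length check (E501) for
# this one line only, and the text after the code records the reason.
DATASET_README_URL = "https://example.com/a/deliberately/long/path/that/pushes/this/line/past/eighty/characters"  # noqa: E501 long URL cannot be wrapped
```

Keeping the marker on the offending line, rather than disabling E501 globally, means every other line is still checked against the 80-character limit.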

Some editors support automatic formatting on save, for example VS Code.

Writing documentation

Datasetinsights uses Google style for formatting docstrings. Lines inside docstring blocks must be limited to 80 characters, with exceptions for content such as long URLs or tables.
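For illustration, here is a sketch of a Google-style docstring that stays within the 80-character limit. The function `mean_iou` is a hypothetical example written for this guide and is not taken from the datasetinsights codebase.

```python
def mean_iou(intersections, unions):
    """Compute the mean intersection over union across classes.

    Args:
        intersections (list[float]): Per-class intersection areas.
        unions (list[float]): Per-class union areas. Each value must be
            greater than zero.

    Returns:
        float: The average of the per-class ratios.

    Raises:
        ValueError: If the two lists differ in length or are empty.
    """
    if not intersections or len(intersections) != len(unions):
        raise ValueError("inputs must be non-empty and of equal length")
    # Divide each intersection by its union, then average the ratios.
    ratios = [i / u for i, u in zip(intersections, unions)]
    return sum(ratios) / len(ratios)
```

The `Args`, `Returns`, and `Raises` sections are the standard Google-style headings; continuation lines within an entry are indented one extra level.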

Building documentation

Follow instructions here.
