dianagamedi/TDDSpark

PySpark with TDD

An example of how to apply test-driven development (TDD) to data science code written with PySpark.

Motivation

Data science projects are becoming more common, driven by commercial demand.

Setup

Environment

This project includes a setup for a virtual environment. To create the environment and install the required libraries, run:

make setup

Test

In the Makefile, set PYTHONPATH to the path where the tests are located, so that imports resolve when the tests run.

All tests should live under the test folder.
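The effect of PYTHONPATH can be illustrated with a small sketch. Python adds every PYTHONPATH entry to sys.path at startup, and a module is importable only if its directory is on that list. The sketch below reproduces this with a temporary directory and a hypothetical module named solution_demo (an assumption for illustration, not a file from this repository):

```python
import importlib
import sys
import tempfile
from pathlib import Path


def can_import(module_name: str) -> bool:
    """Return True if module_name can be imported with the current sys.path."""
    importlib.invalidate_caches()
    sys.modules.pop(module_name, None)  # force a fresh lookup
    try:
        importlib.import_module(module_name)
        return True
    except ModuleNotFoundError:
        return False


with tempfile.TemporaryDirectory() as project_dir:
    # Hypothetical module, created only for this demonstration.
    Path(project_dir, "solution_demo.py").write_text("ANSWER = 42\n")

    # The directory is not on sys.path yet, so the import fails.
    before = can_import("solution_demo")

    # Adding the directory is what a PYTHONPATH entry does at startup.
    sys.path.insert(0, project_dir)
    after = can_import("solution_demo")

    # Clean up so the demonstration leaves no trace.
    sys.path.remove(project_dir)
    sys.modules.pop("solution_demo", None)

print(before, after)
```

Assuming no other module named solution_demo is installed, the first lookup fails and the second succeeds, which is the same failure and fix as a ModuleNotFoundError raised when pytest cannot find the code under test.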

To run the tests:

make test

Useful resources

pytest: ModuleNotFoundError: No module named solution

About

Implementing TDD with PySpark for unit testing
