Changelog

All notable changes to the CD4Py tool will be documented in this file. The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[Unreleased]

[0.1.0] - 2020-09-11

Added

  • A parallel tokenizer for Python source code files.
  • A library module for pre-processing tokenized files, calculating TF-IDF, finding k-nearest neighbors (KNN), and identifying duplicate files.
  • A command-line interface for detecting duplicate files in Python projects.