gokinjo

What is this?

A feature extraction library based on k-nearest neighbor algorithm in Python
- k-NN based feature has experience of being used on 1st place solution of Kaggle competition (see references)
Be able to switch backend of k-NN algorithm
- scikit-learn (default)
- annoy
FYI: "gokinjo" is meant neighborhood in japanese.

Prerequisite

Python 3.6 or later
setuptools >= 30.0.3.0

How to install

From PyPI

$ pip install gokinjo

With annoy backend

$ pip install "gokinjo[annoy]"

From source code

$ pip install git+https://github.com/momijiame/gokinjo.git

Quick start

step 1: generate example data

import numpy as np
x0 = np.random.rand(500) - 0.5
x1 = np.random.rand(500) - 0.5
X = np.array(list(zip(x0, x1)))
y = np.array([1 if i0 * i1 > 0 else 0 for i0, i1 in X])

step 2: plot the above

from matplotlib import pyplot as plt
plt.scatter(X[:, 0], X[:, 1], c=y)
plt.show()

It is not linearly separable obviously.

step 3: extract k-NN feature with K-Fold

from gokinjo import knn_kfold_extract
X_knn = knn_kfold_extract(X, y)

step 4: plot the above

plt.scatter(X_knn[:, 0], X_knn[:, 1], c=y)
plt.show()

It looks like almost linearly separable.

Usage example

Please see examples in GitHub repository.

How to setup a development environment

$ pip install -e ".[develop]"
$ pytest

References

The competition which k-NN feature was used on 1st place solution
- https://www.kaggle.com/c/otto-group-product-classification-challenge/discussion/14335
R implementation
- https://github.com/davpinto/fastknn
Super respectable another Python implementation
- https://github.com/upura/knnFeat

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.circleci		.circleci
examples		examples
gokinjo		gokinjo
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
entry_points.cfg		entry_points.cfg
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gokinjo

What is this?

Prerequisite

How to install

From PyPI

With annoy backend

From source code

Quick start

Usage example

How to setup a development environment

References

About

Releases

Packages

Languages

License

momijiame/gokinjo

Folders and files

Latest commit

History

Repository files navigation

gokinjo

What is this?

Prerequisite

How to install

From PyPI

With annoy backend

From source code

Quick start

Usage example

How to setup a development environment

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages