added requirements and preparations for version 0.1.0
1 parent
9bf904b
commit 903f1c0
Showing 5 changed files with 64 additions and 5 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
mock>=2.0.0 | ||
nose>=1.3.7 | ||
numpy>=1.7.1 | ||
pycuda>=2016.1.1 | ||
pyopencl>=2015.2.4 | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,38 @@ | ||
# Roadmap for the Kernel Tuner | ||
|
||
This roadmap presents an overview of the features we are currently planning to | ||
implement. Please note that this is a living document that will evolve as | ||
priorities grow and shift. | ||
|
||
### version 0.2.0 | ||
|
||
This is the list of features that we want to have implemented by the next version. | ||
|
||
* Option to store tuning results in a file (e.g. json, csv, ... ) | ||
* Option to set a function that performs output verification, instead of numpy.allclose() | ||
* Option to change defaults for 'block_size_x', and so on | ||
* Option to set a function that computes search space restriction, instead of a list of strings | ||
* Option to set compiler name, when using C backend | ||
* Option to set compiler options | ||
|
||
### version 1.0.0 | ||
|
||
These features are to be implemented by version 1.0.0, but may already be | ||
implemented in earlier versions. | ||
|
||
* Tuning kernels in parallel on a single node | ||
* Tuning kernels in parallel on a set of nodes in a GPU cluster | ||
* Tuning kernels using machine learning or search strategies | ||
* Store tuning results in a database and provide an API for analysis | ||
|
||
### Low priority | ||
|
||
These are the things that we would like to implement, but we currently have no | ||
demand for them. If you are interested in any of these, let us know! | ||
|
||
* Tuning compiler options in combination with other kernel parameters | ||
* Example that tunes a kernel using thread block re-indexing | ||
* Example host code that runs a pipeline of kernels | ||
* Example CUDA host code that uses runtime compilation | ||
|
||
|
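Two of the 0.2.0 roadmap items replace a fixed mechanism with a user-supplied function: output verification (instead of `numpy.allclose()`) and search space restriction (instead of a list of strings). The roadmap does not specify what these functions would look like, so the following is only a minimal sketch of the idea. The names `verify_outputs`, `restrict`, and `search_space` are hypothetical and not part of the Kernel Tuner API, and `math.isclose` from the standard library is used as a stand-in for `numpy.allclose()` to keep the example dependency-free.

```python
import itertools
import math


def verify_outputs(reference, results, abs_tol=1e-6):
    """Hypothetical user-supplied verification function.

    Compares each result against its reference element-wise.
    A reference entry of None means "do not check this output".
    """
    for ref, res in zip(reference, results):
        if ref is None:
            continue
        if len(ref) != len(res):
            return False
        # math.isclose is a stand-in here; numpy.allclose uses a
        # slightly different tolerance formula.
        if not all(math.isclose(a, b, abs_tol=abs_tol) for a, b in zip(ref, res)):
            return False
    return True


def restrict(config):
    """Hypothetical restriction expressed as a function of one configuration,
    rather than as a string such as "block_size_x*block_size_y<=1024"."""
    return config["block_size_x"] * config["block_size_y"] <= 1024


def search_space(tune_params, restriction):
    """Yield every parameter combination that satisfies the restriction."""
    names = list(tune_params)
    for values in itertools.product(*(tune_params[n] for n in names)):
        config = dict(zip(names, values))
        if restriction(config):
            yield config


# Assumed example parameters; 'block_size_x' is the only name taken from the roadmap.
tune_params = {"block_size_x": [32, 64, 1024], "block_size_y": [1, 2, 32]}
configs = list(search_space(tune_params, restrict))
```

A function-based restriction can express conditions that are awkward to write as strings, for example ones that depend on helper computations, at the cost of being harder to log or serialize alongside the tuning results.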