Fix edit api #940

sahithyaravi · 2020-08-06T10:41:32Z

Reference Issue

What does this PR implement/fix? Explain your changes.

the get_arff function is not reliable and sometimes causes errors for dense vs sparse datasets.
I have fixed this by using get_data instead.
The get_arff did not work for some dense datasets. When we use the data returned by get_arff to construct the new dataset, it resulted in errors during dataset publish.

How should this PR be tested?

Any other comments?

This api is going to change based on server changes in future. We are going to get rid of the cloning except when data itself changes

doc/progress.rst

codecov-commenter · 2020-08-06T12:40:28Z

Codecov Report

Merging #940 into develop will decrease coverage by 0.01%.
The diff coverage is 83.33%.

@@             Coverage Diff             @@
##           develop     #940      +/-   ##
===========================================
- Coverage    87.79%   87.78%   -0.02%     
===========================================
  Files           37       37              
  Lines         4433     4437       +4     
===========================================
+ Hits          3892     3895       +3     
- Misses         541      542       +1

Impacted Files	Coverage Δ
openml/datasets/functions.py	`93.90% <83.33%> (-0.22%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 5d9c69c...991f6ef. Read the comment docs.

@mfeurer

* Create first section: Creating Custom Flow * Add Section: Using the Flow It is incomplete as while trying to explain how to format the predictions, I realized a utility function is required. * Allow run description text to be custom Previously the description text that accompanies the prediction file was auto-generated with the assumption that the corresponding flow had an extension. To support custom flows (with no extension), this behavior had to be changed. The description can now be passed on initialization. The description describing it was auto generated from run_task is now correctly only added if the run was generated through run_flow_on_task. * Draft for Custom Flow tutorial * Add minimal docstring to OpenMLRun I am not for each field what the specifications are. * Process code review feedback In particular: - text changes - fetch true labels from the dataset instead * Use the format utility function in automatic runs To format the predictions. * Process @mfeurer feedback * Rename arguments of list_evaluations (#933) * list evals name change * list evals - update * adding config file to user guide (#931) * adding config file to user guide * finished requested changes * Edit api (#935) * version1 * minor fixes * tests * reformat code * check new version * remove get data * code format * review comments * fix duplicate * type annotate * example * tests for exceptions * fix pep8 * black format * Adding support for scikit-learn > 0.22 (#936) * Preliminary changes * Updating unit tests for sklearn 0.22 and above * Triggering sklearn tests + fixes * Refactoring to inspect.signature in extensions * Add flake8-print in pre-commit (#939) * Add flake8-print in pre-commit config * Replace print statements with logging * Fix edit api (#940) * fix edit api * Update subflow paragraph * Check the ClassificationTask has class label set * Test task is of supported type * Add tests for format_prediction * Adding Python 3.8 support (#916) * Adding Python 3.8 support * Fixing indentation * Execute test cases for 3.8 * Testing * Making install script fail * Process feedback Neeratyoy * Test Exception with Regex Also throw NotImplementedError instead of TypeError for unsupported task types. Added links in the example. * change edit_api to reflect server (#941) * change edit_api to reflect server * change test and example to reflect rest API changes * tutorial comments * Update datasets_tutorial.py * Create first section: Creating Custom Flow * Add Section: Using the Flow It is incomplete as while trying to explain how to format the predictions, I realized a utility function is required. * Allow run description text to be custom Previously the description text that accompanies the prediction file was auto-generated with the assumption that the corresponding flow had an extension. To support custom flows (with no extension), this behavior had to be changed. The description can now be passed on initialization. The description describing it was auto generated from run_task is now correctly only added if the run was generated through run_flow_on_task. * Draft for Custom Flow tutorial * Add minimal docstring to OpenMLRun I am not for each field what the specifications are. * Process code review feedback In particular: - text changes - fetch true labels from the dataset instead * Use the format utility function in automatic runs To format the predictions. * Process @mfeurer feedback * Update subflow paragraph * Check the ClassificationTask has class label set * Test task is of supported type * Add tests for format_prediction * Process feedback Neeratyoy * Test Exception with Regex Also throw NotImplementedError instead of TypeError for unsupported task types. Added links in the example. Co-authored-by: Bilgecelik <38037323+Bilgecelik@users.noreply.github.com> Co-authored-by: marcoslbueno <38478211+marcoslbueno@users.noreply.github.com> Co-authored-by: Sahithya Ravi <44670788+sahithyaravi1493@users.noreply.github.com> Co-authored-by: Neeratyoy Mallik <neeratyoy@gmail.com> Co-authored-by: zikun <33176974+zikun@users.noreply.github.com>

sahithyaravi added 2 commits August 6, 2020 12:25

fix edit api

4848aca

progress

953d975

sahithyaravi requested a review from PGijsbers August 6, 2020 10:41

PGijsbers reviewed Aug 6, 2020

View reviewed changes

doc/progress.rst Outdated Show resolved Hide resolved

Update progress.rst

991f6ef

PGijsbers approved these changes Aug 7, 2020

View reviewed changes

PGijsbers merged commit 7d51a76 into develop Aug 7, 2020

PGijsbers deleted the fix_edit_api branch August 7, 2020 08:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix edit api #940

Fix edit api #940

sahithyaravi commented Aug 6, 2020 •

edited

Loading

codecov-commenter commented Aug 6, 2020 •

edited

Loading

Fix edit api #940

Fix edit api #940

Conversation

sahithyaravi commented Aug 6, 2020 • edited Loading

Reference Issue

What does this PR implement/fix? Explain your changes.

How should this PR be tested?

Any other comments?

codecov-commenter commented Aug 6, 2020 • edited Loading

Codecov Report

sahithyaravi commented Aug 6, 2020 •

edited

Loading

codecov-commenter commented Aug 6, 2020 •

edited

Loading