Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature][Task Plugin] Add DVC task plugin for MLops scenario (#10372) #10407

Merged
merged 2 commits into from
Jun 21, 2022

Conversation

jieguangzhou
Copy link
Member

@jieguangzhou jieguangzhou commented Jun 10, 2022

Purpose of the pull request

DVC (Data Version Control) is an excellent open-source version control system for machine learning projects.

The DVC plugin is used to use the data version management function of DVC on DolphinScheduler, helping users to carry out data version management easily.

The plugin provides the following three functions:

  • Init DVC: Initialize the Git repository as a DVC repository and bind the address where the data is stored to store the actual data.
  • Upload: Add or update specific data to the repository and record the version tag.
  • Download: Download a specific version of data from the repository.

This PR close #10372.

Brief change log

image

image

image

Verify this pull request

Verified by unit test.

Copy link
Member

@zhongjiajie zhongjiajie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

NIT

Copy link
Contributor

@caishunfeng caishunfeng left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM of the backend part 👍

@codecov-commenter
Copy link

codecov-commenter commented Jun 14, 2022

Codecov Report

Merging #10407 (7f2ddca) into dev (6e4b2e6) will decrease coverage by 0.08%.
The diff coverage is 68.86%.

@@             Coverage Diff              @@
##                dev   #10407      +/-   ##
============================================
- Coverage     40.85%   40.76%   -0.09%     
- Complexity     4846     4877      +31     
============================================
  Files           886      893       +7     
  Lines         36029    36339     +310     
  Branches       3999     4065      +66     
============================================
+ Hits          14718    14812      +94     
- Misses        19852    20066     +214     
- Partials       1459     1461       +2     
Impacted Files Coverage Δ
...dolphinscheduler/plugin/task/dvc/DvcConstants.java 0.00% <0.00%> (ø)
...lphinscheduler/plugin/task/dvc/DvcTaskChannel.java 0.00% <0.00%> (ø)
...heduler/plugin/task/dvc/DvcTaskChannelFactory.java 0.00% <0.00%> (ø)
...ache/dolphinscheduler/plugin/task/dvc/DvcTask.java 72.58% <72.58%> (ø)
...olphinscheduler/plugin/task/dvc/DvcParameters.java 100.00% <100.00%> (ø)
...dolphinscheduler/plugin/task/dvc/TaskTypeEnum.java 100.00% <100.00%> (ø)
...uler/server/master/runner/EventExecuteService.java 0.00% <0.00%> (ø)
...er/server/master/runner/FailoverExecuteThread.java 0.00% <0.00%> (ø)
...r/server/master/runner/MasterSchedulerService.java 0.00% <0.00%> (ø)
...ver/master/consumer/TaskPriorityQueueConsumer.java 0.00% <0.00%> (ø)
... and 5 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6e4b2e6...7f2ddca. Read the comment docs.

@jieguangzhou
Copy link
Member Author

@zhongjiajie @songjianet PTAL

@zhongjiajie
Copy link
Member

zhongjiajie commented Jun 14, 2022

@zhongjiajie @songjianet PTAL

backend and docs LGTM, please task a look the frontend code @songjianet

zhuangchong
zhuangchong previously approved these changes Jun 20, 2022
Copy link
Contributor

@zhuangchong zhuangchong left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

Update docs/docs/en/guide/task/dvc.md

change repository_name xxxx to <YOUR-NAME-OR-ORG>

Co-authored-by: Jiajie Zhong <zhongjiajie955@gmail.com>

fix repository name xxx to <YOUR-NAME-OR-ORG>

fix doc

add fix comment

change dvgTaskType to enumeration

change TaskTypeEnum to reside in its own source file.
@sonarcloud
Copy link

sonarcloud bot commented Jun 21, 2022

SonarCloud Quality Gate failed.    Quality Gate failed

Bug A 0 Bugs
Vulnerability A 0 Vulnerabilities
Security Hotspot A 0 Security Hotspots
Code Smell A 2 Code Smells

22.6% 22.6% Coverage
0.0% 0.0% Duplication

@zhuangchong zhuangchong merged commit 36e20cd into apache:dev Jun 21, 2022
hstdream pushed a commit to hstdream/dolphinscheduler that referenced this pull request Jun 23, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend feature new feature UI ui and front end related
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

[Feature][Task Plugin] Add DVC task plugin
6 participants