Releases: volcano-sh/volcano
v0.4.1
v0.4.1 (2020-07-14)
- UpdateJob when Job annotations or labels changed (#919, @Thor-wl)
- Fix panic in controller (#901, @Thor-wl)
- Support scale up and down (#796, @hzxuzhonghu)
- Fix crd to support job patch ops (#794, @hzxuzhonghu)
v1.0.0
1.0 What's New
1. GPU Sharing
Volcano now supports gpu sharing between different pods (#852, @tizhou86, @hzxuzhonghu).
2. Preempt and reclaim enhancement
Volcano is now able to support preempt for batch job (#738, @carmark).
3. Dynamic scale up and down
Volcano job now supports dynamically scale up and down (#787, @hzxuzhonghu).
4. Support integrate with flink operator
Users are now able to run flink job with volcano. Follow the instructions here to make use of the feature. @hzxuzhonghu).
5. Support DAG job with argo
Users are now able to run DAG job with volcano. Follow the instructions here to make use of the feature. @alcorf-mizar).
Other Notable Changes
- Update go version to 1.14 (#886, @hzxuzhonghu)
- Bump to k8s 1.18 to keep up with kubernetes (#855, @hzxuzhonghu)
- Add mindspore example (#845, @lyd911)
- Add golangCi-lint check (#799, #821, #824, #825, #827, #829, #830, #833, #835, #841. @masihtehrani @daixiang0 @Thor-wl)
- Allow specifying admission webhook port (#832, @hzxuzhonghu)
- Add support for most requested priority (#831, @daixiang0)
- Set pod DNSPolicy to ClusterFirstWithHostNet when hostnetwork set (#779, @hzxuzhonghu)
- Add bestNodeFn for plugins to select best node of its own (#790, @jiangkaihua)
- Add e2e cases for drf (#905, @Thor-wl)
- Add e2e cases for reclaim (#898 #906, @alcorj-mizar)
- Add e2e cases for preempt (#892, @Thor-wl )
- Add e2e for queue (#872, @Thor-wl)
- Add test code for queue controller (#858, @alcorj-mizar)
Bug Fixes
- Fix panic in controller (#903, @Thor-wl)
- Fix panic in allocate (#843, @k82cn)
- Fix job phase transition time set (#789, @hzxuzhonghu)
- Fix crd to support job patch ops (#786, @hzxuzhonghu)
v0.4.0
v0.4.0 (2020-04-09)
- #756 [controller] Set
BlockOwnerDeletion
to true when create PodGroup (@xiaogaozi) - #754 [controller] Set
Queue
field when pod has queue name annotation (@xiaogaozi) - #746 Fix volcano job phase setting (@hzxuzhonghu)
- #745 Use go mod to manage dependencies (@tizhou86)
- #733 Added resources predicate in allocate action (@k82cn)
- #722 Added a flag
disable-network-policy
to disable Network Policy (@EDGsheryl) - #709 Use openssl to sign certificate instead of using k8s (@hzxuzhonghu)
- #702 Added env var for scheduler name (@k82cn)
- #693 Remove scheduling.v1alpha1 and scheduling.v1alpha2 API (@thandayuthapani)
- #681 Refactor events/action (@k82cn)
v0.4-beta
v0.4 (2020-04-07)
- #756 [controller] Set
BlockOwnerDeletion
to true when create PodGroup (@xiaogaozi) - #754 [controller] Set
Queue
field when pod has queue name annotation (@xiaogaozi) - #746 Fix volcano job phase setting (@hzxuzhonghu)
- #745 Use go mod to manage dependencies (@tizhou86)
- #733 Added resources predicate in allocate action (@k82cn)
- #722 Added a flag
disable-network-policy
to disable Network Policy (@EDGsheryl) - #709 Use openssl to sign certificate instead of using k8s (@hzxuzhonghu)
- #702 Added env var for scheduler name (@k82cn)
- #693 Remove scheduling.v1alpha1 and scheduling.v1alpha2 API (@thandayuthapani)
- #681 Refactor events/action (@k82cn)
v0.3.0
This is to support go get
v0.3
- #670 Added Shareit as one of adopter (@k82cn)
- #666 Add command vjobs, vqueues and unit tests. (@jiangkaihua)
- #667 Revert "Gen install yaml without v1alpha queue and poggroup" (@hzxuzhonghu)
- #664 Add capability in crd declarition (@hzxuzhonghu)
- #663 add defaultQPS and defaultBurst for webhook-manager (@yuzhaojing)
- #661 Change queue update in cli and e2e test to patch (@sivanzcw)
- #660 Build new CLI by default. (@k82cn)
- #656 Add vcommands: vcancel, vsuspend, vresume. (@jiangkaihua)
- #658 Added scheduling v1beta1 API. (@k82cn)
- #659 Add admission for queue (@sivanzcw)
- #634 Do not create jobs until pg inqueue (@hzxuzhonghu)
- #651 Reclaim Enhancement: Add Reclaimable parameter for queue (@sivanzcw)
- #655 Auto generate code, change Copyright 2019 to Copyright 2020 (@sivanzcw)
- #647 Considering best-effort pods when calculating ready task number (@sivanzcw)
- #653 Gen install yaml without v1alpha queue and poggroup (@hzxuzhonghu)
- #654 Remove pdb support (@hzxuzhonghu, @k82cn)
- #652 Use relative path for doc. (@k82cn)
- #633 fix the the Getting started link in contribute.md (@ruiyinchen)
- #644 Refactor webhook org. (@k82cn)
- #642 Added cherry_pick_pull.sh (@k82cn)
- #643 Added Volcano Intro. (@k82cn)
- #638 Remove duplicated check in jobEnqueueableFn of proportion (@zionwu)
- #637 Update version to 0.3 (@k82cn)
- #630 Added Roadmap (@k82cn)
- #636 Modify error check of return error (@sivanzcw)
- #631 Push job back to queue if task is assigned in reclaim action (@zionwu)
- #632 remove redundant type conversion (@YesterdayxD)
- #605 Rename binaries. (@k82cn)
- #627 remove repeated code. (@YesterdayxD)
- #626 Added Xiaohongshu as one of adopters (@k82cn)
- #625 Update job_controller_util.go (@YesterdayxD)
- #541 Pipeline task if task's request resource less than the releasing resource of node during performing allocate action (@sivanzcw)
- #622 vcctl command line enhancement (@jiangkaihua)
- #610 Added hosts into environment. (@k82cn)
- #614 Update factory.go (@YesterdayxD)
- #613 Added VC_TASK_INDEX and added env to initContainers. (@k82cn)
- #609 Fixed build error of release-pkg. (@k82cn)
- #608 Enahcement cli. (@k82cn)
- #607 Fixed localup cluster script. (@k82cn)
- #606 Update webhook path. (@k82cn)
- #575 Admission Refactor. (@k82cn)
- #603 change storage of ssh pem from configmap to secret for ssh plugin (@sivanzcw)
- #601 Added localup script. (@k82cn)
- #600 Remove kar, kube-batch. (@k82cn)
- #599 Change lessequal function in Reclaimable function (@sivanzcw)
- #597 when delete pod, a new shadowgroup will be created (@invalid-email-address)
- #570 added priority based preemption to priority plugin (@mateuszlitwin)
- #588 Cleanup e2e framework to speed up e2e (@hzxuzhonghu)
- #591 disp job in default queue (@jiangkaihua)
- #590 Support queue action by vcctl (@sivanzcw)
- #589 Upgrade helm to v3.0.1 (@hzxuzhonghu)
- #592 Add Vivo as adopter (@k82cn)
- #587 Add arguments for action (@sivanzcw)
- #585 use future idle resources when checking if task can fit node (@mateuszlitwin)
- #512 Add queue controller about state (@sivanzcw)
- #586 dep ensure (@sivanzcw)
- #584 change node not found errors (@invalid-email-address)
- #581 Change Statement unevict method to call UpdateTask (@yodarshafrir1)
- #578 Add explict info about what todo to update generated yaml (@hzxuzhonghu)
- #577 Enable CI verify (@hzxuzhonghu)
- #576 Enable networkpolicy create/get permission (@hzxuzhonghu)
- #572 fix validate victims check for preempt action (@zionwu)
- #567 Update admission to use pflag. (@k82cn)
- #564 Fixed build error. (@k82cn)
- #566 Fix wrong condition for reclaim action (@zionwu)
- #563 Update to klog. (@k82cn)
- #542 modify the 'vcctl job run' function (@jiangkaihua)
- #552 Support networkpolicy (@hzxuzhonghu)
- #560 Move myself to controller owner (@hzxuzhonghu)
- #547 Modify comments on OnPodCreate function of svc plugin (@sivanzcw)
- #544 Simplify job pvc create process (@hzxuzhonghu)
- #515 ssh plugin support specifying private/public keys path (@hzxuzhonghu)
- #537 Add queueAction queueEvent queueRequest type (@sivanzcw)
- #536 Add QTT as adopter (@k82cn)
- #535 Add the --publish-not-ready-addresses param for the svc plugin (@zrss)
- #527 Add svc hosts volumeMount for InitContainers (@zrss)
- #525 Fixed import order. (@k82cn)
- #523 pdb bug 修复 (@chenshaojin)
- #520 Modify scheduling events for pod and podgroup (@sivanzcw)
- #517 Add filter function for command watching of job controller (@sivanzcw)
- #518 Added Gitter (@k82cn)
- #507 fix filter NotReady node (@wangyuqing4)
- #513 fix podgroup phase (@wangyuqing4)
- #511 Umbrealla cleanups (@wangyuqing4)
- #510 Rename imported package alias (@hzxuzhonghu)
- #508 Add state parameter to queueSpec and queueStatus for queue (@sivanzcw)
- #501 Add queue state management design proposal (@sivanzcw)
- #506 Add events for pod with pipelined state (@sivanzcw)
- #504 Dynamic loading comfigmap about action and plugins of scheduler, move loadSchedulerConf processing from run to runOnce (@sivanzcw)
- #502 Fix deprecated dind in favor of kind in develop doc (@akillcool)
- #499 correct podgroup creating bug for single pod without ownerreference (@sivanzcw)
- #498 refresh volumes logic (@lminzhw, @dingtsh1)
- [#500](https://github.com/volcano-sh/volcano/pul...
v0.2
- #117 Implement queue Capability, donot allow podgroup enqueue when queue capability reached (@hzxuzhonghu)
- #172 Show Queue's status in
vkctl queue
sub-command (@SrinivasChilveri) - #173 Add "vkctl job delete xxx" (@SrinivasChilveri)
- #200 Disable preempt & relcaim action by default (@thandayuthapani)
- #205 Check Queue exist in admission controller (@thandayuthapani)
- #184 Use install job to generate secret for admission service (@TommyLike)
- #149 Added Job garbage collector, cleanup Job after a configured ttl (@hzxuzhonghu)
- #176 Retain pod with Succeeded/Failed phase (@lminzhw)
- #170 Support Job Priority (@TommyLike)
- #108 Resolve the golint issues (@nikita15p,@Rajadeepan)
- #137 Pass conformance test (@shivramsrivastava)
- #358 Fair-share scheduling of namespace cross queues (@lminzhw)
- #306 Fix the scheduler panic whenever the GPU is lost on node (@william-wang)
- #288 Migrate volcano-sh/scheduler into volcano-sh/volcano (@kevin-wangzefeng)
- #168 Speed up e2e, do not just add an e2e test (@SrinivasChilveri,@thandayuthapani,@Rajadeepan)
- #286 Update abbreviation of Volcano from vk-* to vc-*m, including binary and docker images (@asifdxtreme)
- #93 Allow multiple sync job workers run in parallel (@SrinivasChilveri)
- #329 Move admission webhook configuration registeration into admission server from yaml (@TommyLike)
- #335 User experience improvement (@TommyLike)
- #325 Contributor experience improvement (@hzxuzhonghu,@SrinivasChilveri)
- #386 Valid policy action
RestartTask
is prevented (@hzxuzhonghu) - #266 Support multiple events in job lifecycle policy (@asifdxtreme)
- #364 Migrate queue/podgroup to v1alpha2 (@hzxuzhonghu)
- #401 Add PodGroupController to create shadow PodGroup (@wangyuqing4)
- #384 The job task resync logic is not right (@hzxuzhonghu)
- #370 Refactor Delay Pod Creation by admission controller (@wangyuqing4)
- #380 Support binpack policy (@lminzhw)
v0.1
Features
-
IndexedJob
-
Multiple Pod template
-
Error handling of Pod/Job
-
Queue/Job command line
-
Delay Pod Creation
-
Job plugins
-
env
: set VK_TASK_INDEX to each container, is a index for giving the identity to container -
svc
: create headlessSerivce
and *.host to enable pods communicate -
ssh
: sign in ssh without password, e.g. use command mpirun or mpiexec
-
Docker Images:
docker pull volcanosh/vk-scheduler:v0.1
docker pull volcanosh/vk-controllers:v0.1
docker pull volcanosh/vk-admission:v0.1