This repository has been archived by the owner on Sep 18, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Update pai yaml merge method #2369
Merged
Merged
Changes from 60 commits
Commits
Show all changes
65 commits
Select commit
Hold shift + click to select a range
704b50e
Merge pull request #200 from microsoft/master
SparkSnail 5b0034e
Merge pull request #204 from microsoft/master
SparkSnail 8fe2588
Merge pull request #205 from microsoft/master
SparkSnail 9fae194
Merge pull request #206 from microsoft/master
SparkSnail c785655
Merge pull request #207 from microsoft/master
SparkSnail 2f5272c
Merge pull request #208 from microsoft/master
SparkSnail 1892bc2
Merge pull request #209 from microsoft/master
SparkSnail 7c1ab11
Merge pull request #210 from microsoft/master
SparkSnail 8c203f3
Merge pull request #211 from microsoft/master
SparkSnail d7a62f6
check pylint for nni_cmd
SparkSnail e259d10
fix id error
SparkSnail 4997295
Merge pull request #212 from microsoft/master
SparkSnail c037a7c
Merge pull request #213 from microsoft/master
SparkSnail 7620e7c
Merge pull request #214 from microsoft/master
SparkSnail d16dbe9
Merge pull request #215 from microsoft/master
SparkSnail 9ce751d
Merge pull request #216 from microsoft/master
SparkSnail a0846f2
Merge pull request #217 from microsoft/master
SparkSnail cd3a912
Merge pull request #218 from microsoft/master
SparkSnail 32efaa3
Merge pull request #219 from microsoft/master
SparkSnail 543239c
Merge pull request #220 from microsoft/master
SparkSnail 36e6e35
Merge pull request #221 from microsoft/master
SparkSnail f9ee589
Merge pull request #222 from microsoft/master
SparkSnail b9a7a95
Merge pull request #223 from microsoft/master
SparkSnail 1a5c017
Merge pull request #224 from microsoft/master
SparkSnail 392460a
Merge pull request #225 from microsoft/master
SparkSnail 9bafa4c
Merge pull request #226 from microsoft/master
SparkSnail c23b807
Merge pull request #227 from microsoft/master
SparkSnail 4132f62
Merge pull request #228 from microsoft/master
SparkSnail 92c2ce7
add merge config
SparkSnail 956b413
fix comments
SparkSnail 0a37820
use deepmerge package
SparkSnail d07fec1
add semicolon
SparkSnail a803684
add annotation
SparkSnail 1970f15
set trial config optional
SparkSnail a1dab9f
add doc
SparkSnail c58f49b
sort package.json
SparkSnail 4f66d0c
Merge pull request #229 from microsoft/master
SparkSnail 129c4a5
Merge pull request #230 from microsoft/master
SparkSnail 4163f26
add yarn.lock
e2ceede
revert change
3fe117f
Merge pull request #231 from microsoft/master
SparkSnail aa31674
Merge pull request #233 from microsoft/master
SparkSnail 1d74ae5
Merge pull request #234 from microsoft/master
SparkSnail 75028bd
Merge pull request #235 from microsoft/master
SparkSnail 4773c91
Merge pull request #236 from microsoft/master
SparkSnail 3ee0961
Merge pull request #237 from microsoft/master
SparkSnail 0fb7862
Merge pull request #238 from microsoft/master
SparkSnail 6c3148c
Merge pull request #239 from microsoft/master
SparkSnail b4773e1
Merge pull request #240 from microsoft/master
SparkSnail 6728799
Merge pull request #241 from microsoft/master
SparkSnail 1b9daa3
Merge pull request #242 from microsoft/master
SparkSnail e0c2c0e
Merge pull request #243 from microsoft/master
SparkSnail bd9fda5
fix conflict
SparkSnail 0e1ad4f
update pai yaml merge
SparkSnail 61c8eaf
remove deepmerge package
SparkSnail 052b3ec
fix comments
SparkSnail ce11bf7
add doc for paiConfigPath
SparkSnail 8aca557
fix comments
SparkSnail 8a3768f
update doc
SparkSnail f4b4775
fix pylint
SparkSnail e29b58a
Merge pull request #244 from microsoft/master
SparkSnail caadc04
fix comments
SparkSnail 0516655
remove unused doc
SparkSnail 9201699
format doc
SparkSnail 45ce90d
update yarn.loc
SparkSnail File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -92,8 +92,17 @@ Compared with [LocalMode](LocalMode.md) and [RemoteMachineMode](RemoteMachineMod | |
* Required key. Set the mount path in your container used in PAI. | ||
* paiStoragePlugin | ||
* Optional key. Set the storage plugin name used in PAI. If it is not set in trial configuration, it should be set in the config file specified in `paiConfigPath` field. | ||
* command | ||
* Optional key. Set the commands used in PAI container. | ||
* paiConfigPath | ||
* Optional key. Set the file path of pai job configuration, the file is in yaml format. | ||
If users set paiConfigPath in NNI's configuration file, the `command`, `paiStoragePlugin`, `virtualCluster`, `image`, `memoryMB`, `cpuNum`, `gpuNum` in `trial` filed will be replaced by configurations from `paiConfigPath`. | ||
``` | ||
Note: | ||
|
||
1. If users set multiple taskRoles in PAI's configuration file, NNI will wrap all of these taksRoles and start multiple tasks in one trial job, users should ensure that only one taskRole report metric to NNI, otherwise there might be some conflict error. | ||
2. The job name in PAI's configuration file will be replaced by a new job name, the new job name is created by NNI, the name format is nni_exp_${this.experimentId}_trial_${trialJobId} | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. better to switch point 1 and point 2. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. fixed. |
||
``` | ||
|
||
|
||
Once complete to fill NNI experiment config file and save (for example, save as exp_pai.yml), then run the following command | ||
|
@@ -104,6 +113,12 @@ to start the experiment in pai mode. NNI will create OpenPAI job for each trial, | |
You can see jobs created by NNI in the OpenPAI cluster's web portal, like: | ||
![](../../img/nni_pai_joblist.jpg) | ||
|
||
|
||
|
||
<center class="half"> | ||
<img src="https://github.com/JSong-Jia/NNI-Student-Program-2020/blob/master/QR%20Code.png?raw=true" /> | ||
</center> | ||
|
||
Notice: In pai mode, NNIManager will start a rest server and listen on a port which is your NNI WebUI's port plus 1. For example, if your WebUI port is `8080`, the rest server will listen on `8081`, to receive metrics from trial job running in Kubernetes. So you should `enable 8081` TCP port in your firewall rule to allow incoming traffic. | ||
|
||
Once a trial job is completed, you can goto NNI WebUI's overview page (like http://localhost:8080/oview) to check trial's information. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If users set
paiConfigPath
in NNI's configuration file, no need to specify the fieldscommand
,paiStoragePlugin
,virtualCluster
,image
,memoryMB
,cpuNum
,gpuNum
intrial
configuration. These fields will use the values from the config file specified bypaiConfigPath
.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed.