This repository has been archived by the owner on Sep 18, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Add PAI environment setup steps in DOC #2011
Merged
Merged
Changes from 41 commits
Commits
Show all changes
42 commits
Select commit
Hold shift + click to select a range
704b50e
Merge pull request #200 from microsoft/master
SparkSnail 5b0034e
Merge pull request #204 from microsoft/master
SparkSnail 8fe2588
Merge pull request #205 from microsoft/master
SparkSnail 9fae194
Merge pull request #206 from microsoft/master
SparkSnail c785655
Merge pull request #207 from microsoft/master
SparkSnail 2f5272c
Merge pull request #208 from microsoft/master
SparkSnail 1892bc2
Merge pull request #209 from microsoft/master
SparkSnail 7c1ab11
Merge pull request #210 from microsoft/master
SparkSnail 8c203f3
Merge pull request #211 from microsoft/master
SparkSnail d7a62f6
check pylint for nni_cmd
SparkSnail e259d10
fix id error
SparkSnail 4997295
Merge pull request #212 from microsoft/master
SparkSnail c037a7c
Merge pull request #213 from microsoft/master
SparkSnail 7620e7c
Merge pull request #214 from microsoft/master
SparkSnail d16dbe9
Merge pull request #215 from microsoft/master
SparkSnail 9ce751d
Merge pull request #216 from microsoft/master
SparkSnail a0846f2
Merge pull request #217 from microsoft/master
SparkSnail cd3a912
Merge pull request #218 from microsoft/master
SparkSnail 32efaa3
Merge pull request #219 from microsoft/master
SparkSnail 543239c
Merge pull request #220 from microsoft/master
SparkSnail 36e6e35
Merge pull request #221 from microsoft/master
SparkSnail f9ee589
Merge pull request #222 from microsoft/master
SparkSnail b9a7a95
Merge pull request #223 from microsoft/master
SparkSnail 1a5c017
Merge pull request #224 from microsoft/master
SparkSnail 392460a
Merge pull request #225 from microsoft/master
SparkSnail 9bafa4c
Merge pull request #226 from microsoft/master
SparkSnail c23b807
Merge pull request #227 from microsoft/master
SparkSnail 4132f62
Merge pull request #228 from microsoft/master
SparkSnail 4c7ae00
add foreground mode
SparkSnail 9b5acf3
remove pass
SparkSnail a4159da
fix windows pipe
SparkSnail fb0af3b
fix pylint
SparkSnail 536a9cb
fix pylint
SparkSnail 4f66d0c
Merge pull request #229 from microsoft/master
SparkSnail 129c4a5
Merge pull request #230 from microsoft/master
SparkSnail 954a1b1
fix comments
26a000b
fix eslint
8a94c33
Merge branch 'master' of https://github.com/SparkSnail/nni into dev-n…
2f644f0
add doc for PAI
3fe117f
Merge pull request #231 from microsoft/master
SparkSnail 25a6394
Merge branch 'master' of https://github.com/SparkSnail/nni into dev-f…
65e0c27
fix comments
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -3,7 +3,33 @@ | |
NNI supports running an experiment on [OpenPAI](https://github.com/Microsoft/pai) (aka pai), called pai mode. Before starting to use NNI pai mode, you should have an account to access an [OpenPAI](https://github.com/Microsoft/pai) cluster. See [here](https://github.com/Microsoft/pai#how-to-deploy) if you don't have any OpenPAI account and want to deploy an OpenPAI cluster. In pai mode, your trial program will run in pai's container created by Docker. | ||
|
||
## Setup environment | ||
Install NNI, follow the install guide [here](../Tutorial/QuickStart.md). | ||
Step 1. Install NNI, follow the install guide [here](../Tutorial/QuickStart.md). | ||
Step 2. Get PAI token. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Suggest adding an extra blank line between steps. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. fixed. |
||
Click `My profile` button in the top-right side of PAI's webprotal. | ||
![](../../img/pai_token_button.jpg) | ||
Find the token management region, copy one of the token as your account token. | ||
![](../../img/pai_token_profile.jpg) | ||
Step 3. Mount NFS storage to local machine. | ||
Click `Submit job` button in PAI's webportal. | ||
![](../../img/pai_job_submission_page.jpg) | ||
Find the data management region in job submission page. | ||
![](../../img/pai_data_management_page.jpg) | ||
The `DEFAULT_STORAGE`field is the path to be mounted in PAI's container when a job is started. The `Preview container paths` is the NFS host and path that PAI provided, you need to mount the corresponding host and path to your local machine first, then NNI could use the PAI's NFS storage. | ||
For example, use the following command: | ||
``` | ||
sudo mount nfs://gcr-openpai-infra02:/pai/data /local/mnt | ||
``` | ||
Then the `/data` folder in container will be mounted to `/local/mnt` folder in your local machine. | ||
You could use the following configuration in your NNI's config file: | ||
``` | ||
nniManagerNFSMountPath: /local/mnt | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Extra indent? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. fixed. |
||
containerNFSMountPath: /data | ||
``` | ||
Step 4. Get PAI's storage plugin name. | ||
Contact PAI's admin, and get the PAI's storage plugin name for NFS storage. The default storage name is `teamwise_storage`, the configuration in NNI's config file is in following value: | ||
``` | ||
paiStoragePlugin: teamwise_storage | ||
``` | ||
|
||
## Run an experiment | ||
Use `examples/trials/mnist-annotation` as an example. The NNI config YAML file's content is like: | ||
|
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do we need to point out that the setup steps is for paiyarn or paiK8s?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, this doc is only for paik8s, do not need to point out again. paiYarn has another doc
paiYarnMode.md
.