Add native support for v2 config #3466

liuzhe-lz · 2021-03-22T05:52:29Z

No description provided.

J-shang · 2021-03-24T08:44:01Z

nni/experiment/config/common.py

@@ -66,6 +68,7 @@ class ExperimentConfig(ConfigBase):
    assessor: Optional[_AlgorithmConfig] = None
    advisor: Optional[_AlgorithmConfig] = None
    training_service: Union[TrainingServiceConfig, List[TrainingServiceConfig]]
+    _deprecated: Optional[str, Any] = None


What does this contain?

Deprecated fields, as well as fields that v2 does not support (yet)...
It appears in at least 3 places, and I put the comments elsewhere because this file is "more visible" to end users.
This is a workaround for backward compatibility.
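
To illustrate the kind of workaround described here, the following is a minimal sketch of a config class that parks fields the new schema does not model in a catch-all attribute. `SketchConfig`, `from_dict`, and the field names are hypothetical and are not NNI's actual implementation; the dict shape of `_deprecated` is also an assumption.

```python
from dataclasses import dataclass, fields
from typing import Any, Dict, Optional


@dataclass
class SketchConfig:
    # Hypothetical stand-in for a v2 config class; not NNI's ExperimentConfig.
    experiment_name: Optional[str] = None
    trial_concurrency: int = 1
    # Catch-all for fields the schema does not model (assumed to be a dict).
    _deprecated: Optional[Dict[str, Any]] = None


def from_dict(data: Dict[str, Any]) -> SketchConfig:
    """Build a config, moving unknown keys into `_deprecated`."""
    known = {f.name for f in fields(SketchConfig)} - {'_deprecated'}
    kwargs = {k: v for k, v in data.items() if k in known}
    leftover = {k: v for k, v in data.items() if k not in known}
    # Store None rather than an empty dict when nothing was left over.
    return SketchConfig(**kwargs, _deprecated=leftover or None)
```

With this shape, old keys survive a load without the schema having to declare each one.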

J-shang · 2021-03-25T01:36:55Z

nni/tools/nnictl/algo_management.py

+            remove_algo_meta_data(meta['builtinName'])
+            save_algo_meta_data(meta)
+        else:
+            print_error(f'Cannot overwrite builtin algorithm')


Do we need a return here?

I don't really understand the old logic, but it seems not.
We have no restriction on the order of meta_list, so, as you can see, the old code calls verify_algo_import on only part of the list. I believe the correct logic is to verify either all entries or none.
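
The "verify all or verify none" idea can be sketched as a two-pass routine. This is only an illustration: `register_all_or_none` is a hypothetical helper, and `verify` and `save` are stand-ins passed in as callables, since the real signatures of `verify_algo_import` and `save_algo_meta_data` are not shown in this thread.

```python
from typing import Callable, Dict, List


def register_all_or_none(
    meta_list: List[Dict[str, str]],
    verify: Callable[[Dict[str, str]], bool],
    save: Callable[[Dict[str, str]], None],
) -> bool:
    """Save every entry only if all entries pass verification."""
    # First pass: verify everything, so the outcome does not depend on list order.
    if not all(verify(meta) for meta in meta_list):
        return False
    # Second pass: nothing is saved unless the whole list was accepted.
    for meta in meta_list:
        save(meta)
    return True
```

Splitting verification from saving means a bad entry late in the list cannot leave earlier entries half-registered.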

J-shang · 2021-03-25T05:15:55Z

ts/nni_manager/rest_server/restHandler.ts

@@ -160,7 +160,7 @@ class NNIRestHandler {
    }

    private startExperiment(router: Router): void {
-        router.post('/experiment', expressJoi(ValidationSchemas.STARTEXPERIMENT), (req: Request, res: Response) => {
+        router.post('/experiment', (req: Request, res: Response) => {


Do we no longer need to validate here?

I think validating in the launcher is enough; validating twice is not DRY.
If a user calls this API from a custom client, it's at their own risk.

J-shang · 2021-03-25T05:17:52Z

ts/nni_manager/training_service/kubernetes/kubeflow/kubeflowConfig.ts

@@ -10,7 +10,7 @@ import { AzureStorage, KeyVaultConfig, KubernetesClusterConfig, KubernetesCluste
 } from '../kubernetesConfig';

 // operator types that kubeflow supported
-export type KubeflowOperator = 'tf-operator' | 'pytorch-operator' ;
+export type KubeflowOperator = string;  // 'tf-operator' | 'pytorch-operator'


Why did we change this?

The unpack logic before this yields a string, and tsc complained a lot about type mismatches, so...
Too lazy.

J-shang · 2021-04-07T03:06:25Z

nni/experiment/config/convert.py

-        data['logDir'] = data.pop('experimentWorkingDirectory')
+def to_v2(v1) -> ExperimentConfig:
+    platform = v1.pop('trainingServicePlatform')
+    assert platform in ['local', 'remote', 'openpai']


What about aml, kubeflow, ...?

K8S-based training services will stick with v1.
AML was a mistake and has been fixed.
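
One way to picture "some platforms convert, the rest stay on v1" is a guard that returns the input untouched for unsupported platforms. This is a sketch under assumptions, not the PR's `to_v2`: `convert_or_keep_v1` is a hypothetical helper, the `trainingService` key in the output is an assumed name, and only the three platform strings come from the quoted diff.

```python
from typing import Any, Dict

# Platforms accepted by the assertion in the quoted diff.
V2_PLATFORMS = ('local', 'remote', 'openpai')


def convert_or_keep_v1(v1: Dict[str, Any]) -> Dict[str, Any]:
    """Return a v2-style dict for supported platforms, else the v1 dict as-is."""
    platform = v1.get('trainingServicePlatform')
    if platform not in V2_PLATFORMS:
        # For example, Kubernetes-based services: stay on the v1 path.
        return v1
    # Copy everything except the v1 platform key; the input is not mutated.
    v2 = {k: v for k, v in v1.items() if k != 'trainingServicePlatform'}
    # 'trainingService' is an assumed key name, used for illustration only.
    v2['trainingService'] = {'platform': platform}
    return v2
```

Unlike the `assert` in the diff, this variant falls back instead of failing, which is one possible reading of "will stick with v1".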

ts/nni_manager/core/nnimanager.ts

SparkSnail · 2021-04-08T11:12:04Z

ts/nni_manager/training_service/reusable/test/trialDispatcher.test.ts

-        chai.assert.equal(environmentService.testGetEnvironments().size, 2, "as env not reused, so only 2 envs should be here.");
-        const trials = await trialDispatcher.listTrialJobs();
-        chai.assert.equal(trials.length, 2, "there should be 2 trials");
+        //trialDispatcher.setClusterMetadata(


Is the UT case fixed?

Forgot...
Please let me fix it during the bug bash.

SparkSnail · 2021-04-08T11:19:47Z

ts/webui/src/static/experimentConfig.ts

+    classArgs?: object;
+}
+
+export interface ExperimentConfig {


Is versionCheck missing?

Currently version check is bound to debug. I think it's enough for this release.

SparkSnail · 2021-04-08T11:20:09Z

ts/webui/src/static/experimentConfig.ts

+    maxExperimentDuration?: string;
+    maxTrialNumber?: number;
+    nniManagerIp?: string;
+    //useAnnotation: boolean;


Is annotation no longer supported?

Annotation is handled inside nnictl. No other module is aware of whether it is user-written code and search space or de-annotated code and search space.
nni.Experiment does not support annotation for now.

liuzhe and others added 11 commits March 12, 2021 18:43

local v2 backend

f4b6fc0

Merge branch 'master' into dev-config2

9f1ff18

local nnictl v1

24133bc

add missing file

beb057c

webui basic

2d9999d

fix lint

7980e91

remote reuse

4c07d28

remote legacy

1ccf483

remote v1

8629966

pai draft

507430f

kubeflow

e0e0d6a

liuzhe-lz mentioned this pull request Mar 22, 2021

Add native support for v2 config #3315

Closed

SparkSnail mentioned this pull request Mar 24, 2021

NNI 2021 Mar~Apr Iteration Planning #3445

Closed

78 tasks

SparkSnail requested review from SparkSnail and J-shang March 24, 2021 02:47

J-shang reviewed Mar 25, 2021

SparkSnail marked this pull request as draft March 26, 2021 08:57

liuzhe and others added 4 commits April 2, 2021 12:08

fix some ut

374873a

revert k8s

ca85da8

bugfix

5448a42

try fix ut

4a94396

liuzhe-lz marked this pull request as ready for review April 7, 2021 02:19

liuzhe-lz added 2 commits April 7, 2021 10:20

fix webui lint

7bec4fb

fix custom tuner

8c226a1

J-shang reviewed Apr 7, 2021

SparkSnail reviewed Apr 8, 2021

ts/nni_manager/core/nnimanager.ts (resolved)

SparkSnail reviewed Apr 8, 2021

fix rollback

77d7f36

liuzhe added 7 commits April 9, 2021 12:51

Merge remote-tracking branch 'lz/dev-config2' into dev-config2

5905590

Merge branch 'master' into dev-config2

b8d3746

make reuse mode default

3477bfa

bugfix

940bbd7

fix custom tuner

dadb7ff

fix multi-thread

ef2f46e

disable multi-phase

d273805

SparkSnail approved these changes Apr 9, 2021

J-shang approved these changes Apr 9, 2021

liuzhe added 3 commits April 9, 2021 16:13

Merge remote-tracking branch 'ms/master' into dev-config2

8933636

disable multi-phase test

f49c3cc

set default value to experiment working directory

746f1f6

SparkSnail merged commit 817ec68 into microsoft:master Apr 9, 2021

liuzhe-lz deleted the dev-config2 branch June 17, 2021 03:26
