
initial draft of operator-testing-tool proposal for openshift-4.3 #1916

Merged (3 commits) on Oct 7, 2019
Conversation

jmccormick2001 (Contributor):

Description of the change:

This is the draft proposal document for the operator-testing-tool for openshift-4.3.

Motivation for the change:

Not a breaking change or code fix, just the proposal for a new feature.

@openshift-ci-robot openshift-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Sep 13, 2019
jmrodri (Member) commented Sep 13, 2019:

Make sure to run the markdown checker, /hack/ci/marker --root=doc

./hack/ci/marker --root=doc
Found broken reference (  ->  ) at doc/proposals/openshift-4.3/operator-testing-tool.md:27
Found broken reference (  ->  ) at doc/proposals/openshift-4.3/operator-testing-tool.md:28
Found broken reference (  ->  ) at doc/proposals/openshift-4.3/operator-testing-tool.md:29
Found broken reference (  ->  ) at doc/proposals/openshift-4.3/operator-testing-tool.md:30
Found broken reference (  ->  ) at doc/proposals/openshift-4.3/operator-testing-tool.md:31
Found broken reference (openshift/docs -> openshift/docs) at doc/proposals/openshift-4.3/operator-testing-tool.md:31
Found broken reference (maturity levels -> maturity levels) at doc/proposals/openshift-4.3/operator-testing-tool.md:170
Found broken reference (maturity-levels -> maturity-levels) at doc/proposals/openshift-4.3/operator-testing-tool.md:170
Found broken reference (optional -> optional) at doc/proposals/openshift-4.3/operator-testing-tool.md:234
make: *** [test/markdown] Error 1

jmrodri (Member) commented Sep 13, 2019:

Actually we might have to fix the marker checker to handle the proposal format

@jmrodri jmrodri closed this Sep 13, 2019
@jmrodri jmrodri reopened this Sep 13, 2019
jmccormick2001 (Contributor, Author):

/test marker

- "/enhancements/our-past-effort.md"
---

# operator-testing-tool
Contributor:

Suggested change
# operator-testing-tool
## New Operator Testing Tool for Operator SDK

camilamacedo86 (Contributor), Sep 19, 2019:

Also, after starting to read it, it becomes clear this will not help users/devs implement testing; I understand that it is a tool to check a project's QA. So, WDYT about:

Suggested change
# operator-testing-tool
New Operator QA Tool for Operator SDK (Scorecard improvements and extraction)

Member:

I would suggest that we match the epic name directly.

Suggested change
# operator-testing-tool
# Tooling for Testing Operators

Contributor (Author):

updated in the next version.

superseded-by:
- "/enhancements/our-past-effort.md"
---

camilamacedo86 (Contributor), Sep 19, 2019:

Suggested change

It is not part of the template, WDYT about removing it?
Also, note that the statuses are not the same as the ones suggested by @joelanford here: #1929 (comment)

Member:

@camilamacedo86 We're using this template for enhancement proposal-style proposals.

If it makes sense, we could consider moving to that template for all future proposals.


A scheme for applying the labels to the tests would need to be developed as well as label names to categorize tests.

The “source of truth” for validation would need to be established such as which library the scorecard would use to perform CR/CSV/bundle validation.
camilamacedo86 (Contributor), Sep 19, 2019:

CR/CRD/RBAC -> by default
OLM files (CSV) are optional - IMHO they should be checked if they exist in the project, and/or a flag should be used for it.

Maybe something like: operator-sdk-qa operator --olm=false

### Risks and Mitigations

The scorecard would implement a version flag in the CLI to allow users to migrate from current functionality to the proposed functionality (e.g. v1alpha2):
* operator-sdk scorecard --version v1alpha2
camilamacedo86 (Contributor), Sep 19, 2019:

IMHO: It should be a goal to check whether the versions follow good practices, and in this way be able to check the latest one. I mean, how users should version the source should follow a rule such as semantic versioning (semver).

Member:

This version is the version of the output produced by the scorecard. It has nothing to do with operator or CSV versions.

With respect to versioning operator source code, CRDs and CSVs, I agree that it would be nice to be able to help users understand if they're following semver, but that's definitely out-of-scope for this epic, and possibly out-of-scope for scorecard.

However, one possible new test that we could think about in the future would be to round-trip CRs between different versions to see if they result in the operator taking the same actions against the cluster.

The exit code of scorecard would be 0 if all tests passed. The exit code would be non-zero if tests failed. With this change scores are essentially replaced with a list of pass/fail(s).

The scorecard would by default run all tests even if a single test fails. Using a CLI flag such as the one below would cause test execution to stop on a failure:
* operator-sdk scorecard -l ‘testgroup=required’ --fail-fast
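
For illustration only, here is a minimal Go sketch of how a list of pass/fail results could drive the --fail-fast behavior and the non-zero exit code described above. The type, function, and test names are hypothetical, not the scorecard's actual code:

```go
package main

import (
	"fmt"
	"os"
)

// TestResult is a hypothetical result type used only for this sketch.
type TestResult struct {
	Name   string
	Passed bool
}

// runTests executes each test in order; with failFast it stops at the first failure.
func runTests(tests []func() TestResult, failFast bool) []TestResult {
	var results []TestResult
	for _, t := range tests {
		r := t()
		results = append(results, r)
		if failFast && !r.Passed {
			break
		}
	}
	return results
}

func main() {
	// Illustrative tests only.
	tests := []func() TestResult{
		func() TestResult { return TestResult{Name: "spec-block-exists", Passed: true} },
		func() TestResult { return TestResult{Name: "status-block-exists", Passed: false} },
	}

	exitCode := 0
	for _, r := range runTests(tests, true) {
		fmt.Printf("%s: passed=%v\n", r.Name, r.Passed)
		if !r.Passed {
			exitCode = 1 // non-zero exit code when any test fails
		}
	}
	os.Exit(exitCode)
}
```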
camilamacedo86 (Contributor), Sep 19, 2019:

Since we will no longer have scores, I think we should change the name in order to make it more intuitive. I think it should be something like operator-sdk project-qa. I am not good with names... however, something that shows that the purpose of this is to check the QA of the project.

Member:

I think the name "scorecard" still holds since we're still running a bunch of generic black-box tests and reporting to developers (and eventually users, I think) about which tests passed and failed.

Tests can fall into 3 groups: required, recommended, or optional. Tests can also be categorized as static or runtime. Labels for these groups and categories would allow a test to be more precisely specified by an end user.

Possible examples of specifying what tests to run are as follows:
* operator-sdk scorecard -l ‘testgroup in (required,recommended,optional)’
Contributor:

In practice, what will be the difference between recommended and optional? In my POV, all non-mandatory recommendations are optional. WDYT?
IMHO: It should be mandatory and recommended.

camilamacedo86 (Contributor), Sep 19, 2019:

If optional is related to the tests implemented by the devs when they extend its API and create their own plugins, I think we could give it a more intuitive name such as plugins or additional.

Member:

I agree. @dmesser can you describe what the difference is between recommended and optional?

As a user, why would I want to run optional, but not recommended, tests?

Do we have a breakdown of which of the current tests fall into these two/three categories?

If not, I think we should propose what we think and circulate with the OLM and CVP teams for feedback.

Possible examples of specifying what tests to run are as follows:
* operator-sdk scorecard -l ‘testgroup in (required,recommended,optional)’
* operator-sdk scorecard -l ‘testtype in (runtime,static)’
* operator-sdk scorecard -l updatecrtest=true
camilamacedo86 (Contributor), Sep 19, 2019:

What does it mean?

  • (runtime,static)
  • updatecrtest

Member:

I think these are just examples of syntax for selecting tests based on labels.

  • testtype in (runtime,static) would mean any test where the testtype label is either "runtime" OR "static"
  • updatecrtest=true would mean any test where the updatecrtest label is "true"

The labels are arbitrary, so these label examples will not necessarily be the ones we define on the tests.
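
A minimal sketch of that idea, assuming the label selectors reuse k8s.io/apimachinery for parsing and matching (as suggested later in this conversation); the label keys and the selector string are illustrative only:

```go
package main

import (
	"fmt"

	"k8s.io/apimachinery/pkg/labels"
)

func main() {
	// Labels a test might declare (hypothetical keys/values).
	testLabels := labels.Set{
		"testgroup":    "required",
		"testtype":     "runtime",
		"updatecrtest": "true",
	}

	// Selector string a user might pass via a -l flag.
	sel, err := labels.Parse("testtype in (runtime,static),updatecrtest=true")
	if err != nil {
		panic(err)
	}

	// The scorecard would run the test only if its labels match the selector.
	fmt.Println("run test:", sel.Matches(testLabels))
}
```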

Rough point estimate for this story is 8 points.
#### Story 3 - Common Validation
The scorecard would use a common validation codebase to verify bundle contents. This story supports a single “source of truth” and avoids duplication or variance of validation logic. The scorecard user would be able to specify a bundle on the command line; that bundle would then be validated. For example:
* operator-sdk scorecard --bundle ./my-operator
Contributor:

--config I think is better (bundle is nomenclature added in the community repo which, for me at least, was not intuitive at all).

Member:

IIRC, --config is already used to define the config file used by the scorecard. I think --bundle is good because "bundle" is well defined in the context of the Operator Framework and it's what we're expecting to be passed to this flag.

One caveat/question: @dmesser Are we expecting to test the entire set of bundles that make up a channel/catalog, or are we scoping the scorecard to a single bundle at a time?

Tasks for this story include:
* determine the correct validation library to use
* add support for a bundle directory flag to the CLI
* change the scorecard code to use that validation library when validating a CR, CSV, or given a bundle directory
Contributor:

IMHO: It should be:

  • CR/CRD/RBAC -> by default
  • OLM files (CSV) are optional - IMHO they should be checked if they exist in the project, and/or a flag should be used for it.

Note: In my POV, bundle is not intuitive nomenclature.

Member:

I think one issue with the scorecard today is that it has too much configuration complexity. The following flags can all be used to define how scorecard runs. Some are optional, some are required, and some are conditionally required. There are also interdependencies between them that can cause confusion.

      --basic-tests                  Enable basic operator checks (default true)
      --cr-manifest strings          Path to manifest for Custom Resource (required) (specify flag multiple times for multiple CRs)
      --crds-dir string              Directory containing CRDs (all CRD manifest filenames must have the suffix 'crd.yaml') (default "deploy/crds")
      --csv-path string              Path to CSV being tested
      --global-manifest string       Path to manifest for Global resources (e.g. CRD manifests)
      --namespace string             Namespace of custom resource created in cluster
      --namespaced-manifest string   Path to manifest for namespaced resources (e.g. RBAC and Operator manifest)
      --olm-deployed                 The OLM has deployed the operator. Use only the CSV for test data
      --olm-tests                    Enable OLM integration checks (default true)

If we go all-in on OLM integration (which I believe is a goal), then I think we should require a bundle and leverage other OLM integrations (e.g. installing OLM and running an operator with OLM) so that we can reduce configuration complexity and make running and troubleshooting scorecard much more straightforward.

@dmesser @shawn-hurley WDYT?


#### Story 4 - Update scorecard Documentation
Tasks:
* Document changes to CLI
camilamacedo86 (Contributor), Sep 19, 2019:

IMHO we should not have the CLI commands documented in the docs; instead we should have a --help command and some automatically generated docs, as was done here for example using a Makefile.

[Screenshot: 2019-09-18 at 21:23:30]

If we use Cobra then it is possible to get this done as well.
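
As a rough sketch, assuming the SDK keeps using Cobra (the command definitions and output directory below are illustrative), the CLI reference could be generated from the command tree itself with Cobra's doc package:

```go
package main

import (
	"log"

	"github.com/spf13/cobra"
	"github.com/spf13/cobra/doc"
)

func main() {
	root := &cobra.Command{Use: "operator-sdk"}
	scorecard := &cobra.Command{
		Use:   "scorecard",
		Short: "Run scorecard tests against an operator",
	}
	root.AddCommand(scorecard)

	// Writes one markdown file per command into ./doc/cli (path is illustrative).
	if err := doc.GenMarkdownTree(root, "./doc/cli"); err != nil {
		log.Fatal(err)
	}
}
```

A Makefile target could invoke something like this so the generated reference stays in sync with --help.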

Member:

I think it should be both, document the changes but we need to make sure the help text is accurate.

Member:

I believe the CLI library we use supports generating docs from the CLI configuration, and I think it's in our backlog to investigate using that to keep our SDK CLI reference up-to-date. I could see this being something we contribute to upstream and then inherit as part of our Kubebuilder integration plan.

* Document changes to CLI
* Document new output formats
* Document changes to configuration
* Document breaking changes and removals
Contributor:

Suggested change
* Document breaking changes and removals
* update the CHANGELOG and Migration files with breaking changes and removals

Tasks:
* Document changes to CLI
* Document new output formats
* Document changes to configuration
Contributor:

* Update doc X of the scorecard with the changes (also, IMHO this should be done for each change performed, I mean in the same PR, instead of having a task to do it all at the end. Maybe we can have a task to review/improve the doc after everything is implemented.)


### User Stories

#### Story 1 - Show pass/fail in Scorecard Output
camilamacedo86 (Contributor), Sep 19, 2019:

For all user stories, IMHO we need to update them in order to follow the common definitions, such as:

As a <user> of the product, ## title
I can <capability>,
So that <benefit>.

Following are some examples of actions:

  • A user can post her resume to the web site.
  • A user can search for jobs.
  • A company can post new job openings.
  • A user can limit who can see her résumé.

A few refs: [1](https://www.berteig.com/how-to-apply-agile/summary-of-user-stories-the-three-cs-and-invest/) and 2

Also, note that the tasks for each user story are already broken down under "Tasks for this story include:". I would replace that with "Acceptance Criteria" instead.

Member:

@dmesser has been pushing this point as well. I don't think we should require this method of writing stories in the proposals this time around because it wasn't discussed as a team, and there are still some kinks to work out with how these things are translated into JIRA issues.

Let's talk about this as a team for the 4.4+ epics and planning.


## Design Details

### Test Plan
camilamacedo86 (Contributor), Sep 19, 2019:

Do we not have a default process to implement all features? Should it really be added here in the proposal doc? IMHO it seems out of the scope of this doc. Also, it seems not to be part of this topic. I'd add things like that in the "### Observations and open questions" section of its template.

Member:

This is part of the enhancement proposal template, hence its inclusion. I think test planning is a relevant and important aspect of incorporating new features.

These are generalized examples to consider, in addition to the aforementioned
maturity levels.

##### Dev Preview -> Tech Preview
camilamacedo86 (Contributor), Sep 19, 2019:

IMHO this doc recommends we change the features of the scorecard, so:

  1. I'd adopt an incremental development process where each deliverable change, with its docs, would be merged into master. (In this way we add value to users, and if the scope changes in the middle we will not need to start from scratch / agile process.)
  2. In the end, if we still think that it should be another tool, etc., we could extract the implementation into a new repo/project.

subproject, repos requested, github details, and/or testing infrastructure.

Listing these here allows the community to get the process for these resources
started right away.
Contributor:

Suggested change
started right away.

I think it is not required to be added here either.

Member:

This is boilerplate from the template. Once we have consensus on the contents of the proposal, I think it's safe to remove the sections that are not applicable to this proposal.

**For non-optional features moving to GA, the graduation criteria must include
end to end tests.**

##### Removing a deprecated feature
camilamacedo86 (Contributor), Sep 19, 2019:

I think it is not required.

  • operator-sdk is still in initial development. See that its tag is < 1.0.0.
  • In my POV most users have not been using it anyway.
  • The CI in the community-operators repo will be affected as well, but after a new release with the changes it should still work fine by using the previous version until we are able to update it there. Maybe we need to signal our intention to whoever is responsible for that repo, so they are able to follow up and plan the changes.

Member:

While we're not yet to 1.0.0, the CVP has automation around scorecard that would break suddenly if we didn't have a deprecation and removal plan that was communicated in advance. In this case, I think it's important to document what we're doing:

  1. Adding a new --version flag
  2. Adding a new result output spec, v1alpha2, based on the changes described in this proposal.
  3. Setting the default --version to v1alpha1 to avoid breaking existing CLI users.
  4. Deprecating v1alpha1, and informing users to move to v1alpha2 by setting --version=v1alpha2 on the command line.
  5. In a subsequent release, removing support for v1alpha1.
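
A minimal sketch of steps 1, 3, and 4 above, assuming a Cobra/pflag-style flag; the flag wiring and warning text are illustrative, not the actual scorecard code:

```go
package main

import (
	"fmt"
	"os"

	"github.com/spf13/pflag"
)

func main() {
	// Default to v1alpha1 so existing CLI users are not broken.
	version := pflag.String("version", "v1alpha1", "scorecard output version (v1alpha1 or v1alpha2)")
	pflag.Parse()

	switch *version {
	case "v1alpha1":
		// Deprecation notice pointing users at the new output spec.
		fmt.Fprintln(os.Stderr, "WARNING: v1alpha1 output is deprecated; set --version=v1alpha2")
	case "v1alpha2":
		// New pass/fail output described in this proposal.
	default:
		fmt.Fprintf(os.Stderr, "unknown --version %q\n", *version)
		os.Exit(1)
	}
}
```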

Contributor (Author):

I'll update this section in a new version of the proposal.




### Implementation Details/Notes/Constraints
Member:

This feels like the proposal is not implementable.

  1. How are we going to create a labeling system? Does a user provide their own list of labels for a test? Do we provide that list? Do we provide an initial list?

  2. Are we planning on writing a DSL of some sort to parse this (testtype in (runtime,static))? Are we worried that we are signing up for more than we bargained for?

Member:

The idea is to:

  1. Allow tests to declare a set of labels that can be introspected by the scorecard at runtime
  2. Allow users to filter the set of tests to run based on a label selector string
  3. Reuse apimachinery for parsing and matching.

Member:

Can we add that info to the Implementation Details of the proposal?

Contributor (Author):

I'll update the proposal and include that list in Implementation Details.

#### Story 1 - Show pass/fail in Scorecard Output
Today, the scorecard output shows the percentage of tests that were successful to the end user. This story is to change the scorecard output to show a pass or fail for each test that is run, instead of a success percentage.

The exit code of scorecard would be 0 if all tests passed. The exit code would be non-zero if tests failed. With this change scores are essentially replaced with a list of pass/fail(s).
Member:

Can you expand on this to document the discussion we had about how the exit code should only apply to required tests?

https://jira.coreos.com/browse/OSDK-519?focusedCommentId=129787&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-129787

Contributor (Author):

will update in a new version of the proposal.
