Collect system-level tests and start removing Ruby tests #4853

abailly · 2024-11-21T10:02:19Z

This PR is a first step in an effort to rationalise our testing approach as outlined in #4852.

list existing integration tests and e2e tests side-by-side
remove misc Ruby tests which are not testing cardano-wallet

paweljakubas

👍

If I understand well we will need to remember whenever we add/remove any integration test we need to update tests.md

abailly · 2024-11-21T10:18:59Z

Yes, that's more or less the idea.

abailly · 2024-11-21T13:11:09Z

@paweljakubas Given that you seem to be the one most aware about those integration tests, could you have a look at the ruby tests and mark which ones, in your opinion we should preserve?

paweljakubas · 2024-11-22T08:14:24Z

@abailly not a question I can give 100% confidence answer. why? Those tests were added long time ago by our QA engineer (that happened to know Ruby) and were supposed to replicate some integration tests and run them against preprod (previously testnet) and check scenarios engaging wallets with HISTORY. Ocassionally, they were able to surface edge case bugs. As we are not adding logic and integration tests so much and Ruby tests were run many times and found what they can detect, AND as we are not so eager to work with this codebase, I would tentatively propose to abandon them. If we are to start adding integration tests + logic we might return to the subject and think how to test against preprod with wallet with history but with codebase we are into. @HeinrichApfelmus @paolino @Anviking make sense?

paweljakubas · 2024-11-22T08:15:49Z

to sum up: I would remove them all as we speak as they are burden now and not adding value imho.

abailly · 2024-11-22T10:46:32Z

Thanks @paweljakubas, I am happy to remove those, but let's wait for feedback from the rest of the team

tests.md

HeinrichApfelmus · 2024-11-25T11:30:19Z

tests.md

+         it 'Delegating address with both pub key credentials' do
+         it 'Delegating address - payment from script, stake from key' do
+         it 'Delegating address - payment from key, stake from script' d
+       it 'Malformed payload when tx is not binary' do


It appears that this test exercises the POST /proxy/transactions endpoint of cardano-wallet.

tests.md

Anviking · 2024-11-25T11:52:14Z

tests.md

+        it 'ping-pong' do
+        it 'game' do
+        it 'mint-burn' do
+        it 'withdrawal' do
+        it 'currency' do


These are currently no longer tested with integration tests: https://github.com/cardano-foundation/cardano-wallet/blob/master/lib/integration/scenarios/Test/Integration/Scenario/API/Shelley/TransactionsNew.hs#L2987-L3045 (withdrawal not there, the others exist but are marked pending)

HeinrichApfelmus · 2024-11-25T11:53:56Z

I would like to note that — most unfortunately — I find the Ruby E2E tests easier to read and understand than the Haskell integration tests.

One of the main reasons for the increased complexity of the Haskell integration tests is that the they are mostly based on the Cardano.Wallet.Api.Link module, which allows sending malformed data to endpoints.

cardano-wallet/lib/api/src/Cardano/Wallet/Api/Link.hs

Lines 16 to 23 in a9f941c

    
           -- Type-safe endpoint accessors for the wallet API. Under normal circumstances, 
        
           -- one would prefer to use 'WalletClient' from 'Cardano.Wallet.Api.Client' and 
        
           -- not to bother with endpoints at all. 
        
           -- 
        
           -- Yet, in some cases (like in black-box testing), one could want to purposely 
        
           -- send malformed requests to specific endpoints. Thus, this module facilitates 
        
           -- the construction of valid endpoints that'd be accepted by the server, and for 
        
           -- which, users are free to send all sort of data as payload, valid or invalid.

I'm not sure that it's a good idea to base a large test suite on this — ideally, the concern of malformed data is resolved by unit testing decoder / encoders.

In the absence of specialized types for requests and response, good syntactic support for general JSON types would be helpful to make the tests readable. Ruby has good syntactic support for inspecting JSON types, e.g. res['network_info']['protocol_magic'], at the usual price of partial functions. However, we never invested into similar syntactic amenities for Haskell.

To summarize: Unfortunately, our Haskell integration tests contain technical debt to the point where I think that they do not compete well against the Ruby E2E tests.

abailly · 2024-11-25T12:00:11Z

@HeinrichApfelmus From what I have seen, I am tempted to agree and I am against building such a large integration test suite as it's a great source of tech debt, complexity, flaky tests, etc.
But we need to do something as we probably don't want to maintain 2 set of mostly overlapping slow-to-run-hard-to-debug test suite.

HeinrichApfelmus · 2024-11-25T12:27:07Z

But we need to do something as we probably don't want to maintain 2 set of mostly overlapping slow-to-run-hard-to-debug test suite.

Yes. The question is of course: "Which something?"

The way I see is that we have different options with different amounts of effort:

Maintain 2 sets of mostly overlapping slow-to-run-hard-to-debug test suites.
Rewrite Ruby E2E tests to disappear.
Rewrite Haskell integration tests to become more readable and competitive with the Ruby E2E tests.
Rewrite Haskell integration tests to become unit tests and remove matching Ruby E2E tests.
…

It is not clear to me that 1. is a priori more effort than 2, 3, or 4. In addition, we have to consider the opportunity cost of not investing into option

extend the "fresh start" of the "Deposit Wallet" to include features from the legacy wallet, mostly related to delegation.

The modular our codebase is, the cheaper 6. becomes. Option 4. would move us to a more modular codebase, so my gut feeling is that a combination of 4 and 6 plus down-payments on 1 is probably the route that requires less engineering effort than the others.

abailly · 2024-11-25T13:07:34Z

My comments (but I really think we need to spend more time discussing f2f in front of whiteboard than through GH clumsy PR):

Maintain 2 sets of mostly overlapping slow-to-run-hard-to-debug test suites.

I don't mind having Ruby code, I just don't want to maintain 2 different things which serve the same purpose and each take time to run, delay feedback, are hard to run locally, and ultimately hamper delivery speed.
So we definitely want to remove the overlap.

Rewrite Haskell integration tests to become more readable and competitive with the Ruby E2E tests.

Significant effort, and I already said I would rather reduce the amount of integration tests we have as I am 100% confident we are testing things which would be much better tested at a finer grained level.

Rewrite Haskell integration tests to become unit tests and remove matching Ruby E2E tests.

That has my favour.

extend the "fresh start" of the "Deposit Wallet" to include features from the legacy wallet, mostly related to delegation.

I don't understand what this has to do with the discussion.

This PR's goal is to spark the discussion and start the ball rolling, but what I am sure is that statu quo is undesirable. It's unclear to me what value do the E2E tests bring us given the apparently huge amount of overlap they have with Haskell integration tests and the latter seem much more actively maintained (and perhaps that's a problem too).

Quick stats:

% git log --oneline  --since 2024-01-01 lib/integration | wc -l
     194
% git log --oneline --since 2024-01-01 test/e2e | wc -l
      44

I would like to also understand what's the payoff for all these system-level tests, and I understand this is hard to tell: How many times did the E2E test suite prevented a change from introducing a bug? Same question goes for Haskell integration tests?

HeinrichApfelmus · 2024-11-25T14:41:56Z

My comments (but I really think we need to spend more time discussing f2f in front of whiteboard than through GH clumsy PR):

Sure. Perhaps put it on the agenda for our Dev Meeting?

So we definitely want to remove the overlap.

I don't disagree, but I think that the "definitely" needs to be weighted by effort compared to effort of alternatives.

extend the "fresh start" of the "Deposit Wallet" to include features from the legacy wallet, mostly related to delegation.

I don't understand what this has to do with the discussion.

What I'm trying to say is that the effort of maintaining a test suite only pays off it the software under test is still subject to meaningful changes. At this point, it's not clear to me that this is still the case. Hence I'm bringing up the alternative of freezing the old in cardano-wallet and its test suite in place (I'm not necessarily in favor of this, but do think that it's realistic enough to bring up.)

I would like to also understand what's the payoff for all these system-level tests

In the past, these have been the only tests for features such as delegating voting power to a DRep.

How many times did the E2E test suite prevented a change from introducing a bug?

Pawel and Johannes know better, but I am told that historically, Piotr's E2E tests have caught at least three bugs.

abailly · 2024-12-09T17:15:01Z

Here is a quick summary of current situation:

I don't have the depth of knowledge of c-w to be confident in my judgment but I would be tempted to kill those tests
@HeinrichApfelmus is unsure whether or not this tests are properly replaced by other tests and is in favour of statu quo and not invest time on this tests given our current focus on deposit wallet
@paweljakubas is in favour of removing Ruby ETE tests altogether
@Anviking is not sure

What should we do?

abailly · 2024-12-10T08:25:54Z

It seems attempting a smooth and incremental migration is not really worth it, so we are left with 2 options:

Keep them as is
Completely remove them

These tests are actually testing other software, namely cardano-address and SMASH which are outside our control. And we know for a fact the former is thoroughly tested.

HeinrichApfelmus · 2024-12-14T12:52:43Z

I just realized that the Ruby E2E tests are our only tests that exercise the Windows platform. If we remove them now, we don't have any integration test on Windows before a release. 🤔

abailly · 2024-12-16T07:07:03Z

That's a fair point @HeinrichApfelmus

paweljakubas approved these changes Nov 21, 2024

View reviewed changes

abailly requested review from paolino, Anviking, HeinrichApfelmus and paweljakubas November 25, 2024 10:09

HeinrichApfelmus reviewed Nov 25, 2024

View reviewed changes

Anviking reviewed Nov 25, 2024

View reviewed changes

abailly added 5 commits December 13, 2024 13:41

Raw collection of existing tests in a document

bc05ed6

Remove 'misc' E2E tests

ee82271

These tests are actually testing other software, namely cardano-address and SMASH which are outside our control. And we know for a fact the former is thoroughly tested.

Remove wrong statement about what ruby e2e tests are testing

d34dc19

Remove ruby e2e tests and corresponding CI actions

fabbdef

Remove reference to ruby bundle

107d7bc

abailly force-pushed the abailly/collect-e2e-tests branch from 8cfde86 to 107d7bc Compare December 13, 2024 12:45

Remove E2E GHA workflows

bf1b0c1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Collect system-level tests and start removing Ruby tests #4853

Collect system-level tests and start removing Ruby tests #4853

abailly commented Nov 21, 2024

paweljakubas left a comment

abailly commented Nov 21, 2024

abailly commented Nov 21, 2024

paweljakubas commented Nov 22, 2024

paweljakubas commented Nov 22, 2024

abailly commented Nov 22, 2024

HeinrichApfelmus Nov 25, 2024

Anviking Nov 25, 2024

HeinrichApfelmus commented Nov 25, 2024 •

edited

Loading

abailly commented Nov 25, 2024

HeinrichApfelmus commented Nov 25, 2024 •

edited

Loading

abailly commented Nov 25, 2024

HeinrichApfelmus commented Nov 25, 2024

abailly commented Dec 9, 2024

abailly commented Dec 10, 2024

HeinrichApfelmus commented Dec 14, 2024

abailly commented Dec 16, 2024

Collect system-level tests and start removing Ruby tests #4853

Are you sure you want to change the base?

Collect system-level tests and start removing Ruby tests #4853

Conversation

abailly commented Nov 21, 2024

paweljakubas left a comment

Choose a reason for hiding this comment

abailly commented Nov 21, 2024

abailly commented Nov 21, 2024

paweljakubas commented Nov 22, 2024

paweljakubas commented Nov 22, 2024

abailly commented Nov 22, 2024

HeinrichApfelmus Nov 25, 2024

Choose a reason for hiding this comment

Anviking Nov 25, 2024

Choose a reason for hiding this comment

HeinrichApfelmus commented Nov 25, 2024 • edited Loading

abailly commented Nov 25, 2024

HeinrichApfelmus commented Nov 25, 2024 • edited Loading

abailly commented Nov 25, 2024

HeinrichApfelmus commented Nov 25, 2024

abailly commented Dec 9, 2024

abailly commented Dec 10, 2024

HeinrichApfelmus commented Dec 14, 2024

abailly commented Dec 16, 2024

HeinrichApfelmus commented Nov 25, 2024 •

edited

Loading

HeinrichApfelmus commented Nov 25, 2024 •

edited

Loading