autopilot fallback address #1039

sampocs · 2023-12-28T22:00:58Z

Closes: #XXX

Context and purpose of the change

We now use a hashed address as the sender of the outbound transfer during autopilot liquid stake and forward. However, we need to gracefully hand ack failures and timeouts.

In the event of an ack failure, we should send the tokens to a fallback address (which will be the original receiver address of the autopilot memo). In the event of a timeout, we just retry infinitely.

Brief Changelog

Added keeper functions to store the fallback address
Added OnAckPacket for autopilot that will handle the bank send to the fallback address during an ack failure
Added OnTimeoutPacket for autopilot that will handle the infinite retries
Added relevant unit tests

Testing

Timeout

Hard coded the timeout timestamp in the original transfer submission code to be 1 second pass the block time (to force a timeout)
Started dockernet and sent an autopilot LS and forward tx
Confirmed via logging that the original packet timed out and the retry was submitted
Confirmed via balance checks that the retry landed

Ack Error

Started dockernet and sent an autopilot tx with an invalid IBC reciever address (to force an ack error)
Confirmed via balance queries that
- (1) uatom left the gaia address
- (2)stuatom appeared in escrow address (after the forwarding transfer was submitted)
- (3) the stuatom moved from the escrow address to the fallback address (during the ack error handling)

…de/pull/771/files

x/autopilot/types/keys.go

x/autopilot/keeper/ibc.go

asalzmann

I have two main comments

why not use icacallbacks to bank send to the fallback address?
on second thought, the infinite retry approach seems bad because it could lead to full stride blocks (with a queue of messages that is impossible to process and relayers running out of funds)

x/autopilot/keeper/ibc.go

x/autopilot/keeper/fallback.go

x/autopilot/module_ibc.go

x/autopilot/keeper/ibc.go

riley-stride

Reviewed with particular attention to ibc.go and fallback.go.

Looking very good. Thanks for the thorough comments and replies on the PR threads, they helped with the review.

The main thing I'd want to see to get more confidence is some integration tests that test the various fallback address code paths. Two cases that come to mind: (1) LS&forward + ack failure => check tokens landed in fallback addr (2) LS&forward + timeout => check tokens are retrying infinitely (not sure how to check this, maybe see the balances oscillating at the freq of the retry period). Of these, (1) seems by far most important to integration test.

x/autopilot/keeper/fallback.go

x/autopilot/keeper/ibc.go

sampocs · 2024-01-06T01:10:10Z

@asalzmann @riley-stride

I took a very rough first pass at using callbacks in #1047; however, it's far from functional right now.

You can see the PR description for details, but the TLDR is it seems like it would be a pretty big effort to add callbacks to autopilot since it's in the transfer stack.

The PR should help illustrate the trade off of what the solution would look like with callbacks vs how it's implemented in this PR. _I'm going to pause on this for now until you get back to me on which approach you'd prefer_.

My two cents atm is that the callbacks approach does seem to maybe clarify things a tad, but it's mostly just a swapping of keeper boilerplate for callbacks boilerplate, and I'm not sure it simplifies things enough to justify the effort required to overcome the challenge described above.

Also, the PR description is probably quite confusing, but if any of you are planning to work on this this weekend, I can put a loom together for ya to help clarify.

riley-stride · 2024-01-07T20:54:33Z

@asalzmann @riley-stride

I took a very rough first pass at using callbacks in #1047; however, it's far from functional right now.

You can see the PR description for details, but the TLDR is it seems like it would be a pretty big effort to add callbacks to autopilot since it's in the transfer stack.

The PR should help illustrate the trade off of what the solution would look like with callbacks vs how it's implemented in this PR. __I'm going to pause on this for now until you get back to me on which approach you'd prefer__.

My two cents atm is that the callbacks approach does seem to maybe clarify things a tad, but it's mostly just a swapping of keeper boilerplate for callbacks boilerplate, and I'm not sure it simplifies things enough to justify the effort required to overcome the challenge described above.

Also, the PR description is probably quite confusing, but if any of you are planning to work on this this weekend, I can put a loom together for ya to help clarify.

Thanks for the loom, that was very helpful.

I lean towards keeping this approach for now, and potentially refactoring in a future upgrade.

I'm a bit further from the middleware stack so can't speak on this confidently, but seems like sifting through the assumptions baked into the middleware stack (outlined in your video) to re-wire middleware stack three could be dangerous to do on a fast timeline.

It _does_ however feel like the more technically correct solution, so long term we should probably move toward it.

Don't feel strongly though, @sampocs and @asalzmann's opinions should carry much more weight here as they both understand the middleware stack more deeply.

sampocs · 2024-01-09T00:43:51Z

Discussed offline and decided on the same as above. Callbacks approach seems too complex atm, but we can revist our mdidleware stack later

riley-stride

Nice! Removing CheckAcknowledgementStatus and having OnTimeoutPacket send to the Fallback address simplifies the PR meaningfully imo.

…k-address

asalzmann

lgtm! main changes I reviewed

use the icacallbacks ack parsing function
use a timeout and don't retry forwards

asalzmann and others added 20 commits November 28, 2023 15:44

pull in LS + forward changes from https://github.com/Stride-Labs/stri…

d3e0501

…de/pull/771/files

integration test working

1c6ed0b

fix unittest

596250b

restrict to either autopilot or pfm

220c8f1

added hash receiver helper

528994a

added bank keeper to autopilot

264cc28

first pass at hashed recipient implementation

45b1465

cleaned up variable names

99db724

cleanup and docs

f9d5d09

moved types to autopilot

3bdfcd7

walked back almost all the changes from above :(

eb408d2

cleanup again

8ee8ef2

added bank keeper again

b1d4503

added bank send to hashed sender

208eac7

renamed hashed address function

2c22d17

added keepers to store fallback address

c0ca4f5

implemented onAck and onTimeout

a119b8d

added unit test for helpers

f102574

added unit tests for on ack packet

619c480

added unit tests for full callbacks

baaeb50

sampocs marked this pull request as ready for review December 29, 2023 01:12

sampocs requested review from shellvish, asalzmann and riley-stride December 29, 2023 01:12

info -> error log

c8288dc

riley-stride reviewed Jan 3, 2024

View reviewed changes

x/autopilot/types/keys.go Show resolved Hide resolved

x/autopilot/keeper/ibc.go Outdated Show resolved Hide resolved

autopilot liquid stake and forward unit tests (#1041)

299f585

asalzmann requested changes Jan 4, 2024

View reviewed changes

x/autopilot/keeper/ibc.go Show resolved Hide resolved

x/autopilot/keeper/fallback.go Show resolved Hide resolved

x/autopilot/module_ibc.go Show resolved Hide resolved

x/autopilot/keeper/ibc.go Outdated Show resolved Hide resolved

riley-stride reviewed Jan 5, 2024

View reviewed changes

x/autopilot/keeper/fallback.go Show resolved Hide resolved

x/autopilot/keeper/fallback.go Outdated Show resolved Hide resolved

x/autopilot/keeper/fallback.go Show resolved Hide resolved

x/autopilot/keeper/ibc.go Show resolved Hide resolved

sampocs added 6 commits January 8, 2024 18:46

renamed key function

1bfa7ca

replaced CheckAckStatus with helper from icacallbacks

cd7ce0b

removed retry and replaced with send to fallback address

8958d80

fixed unit tests

55635d0

updated timeout to 3 hours

2bf291f

Merge branch 'main' into sam/autopilot-hash-sender-2

bd61e0a

riley-stride approved these changes Jan 9, 2024

View reviewed changes

sampocs added 3 commits January 9, 2024 10:02

updated timeout comments

a814f92

updated checkmes

5042669

Merge branch 'sam/autopilot-hash-sender-2' into sam/autopilot-fallbac…

158c77d

…k-address

asalzmann approved these changes Jan 10, 2024

View reviewed changes

sampocs changed the base branch from sam/autopilot-hash-sender-2 to main January 10, 2024 23:32

Merge branch 'main' into sam/autopilot-fallback-address

158ccec

github-actions bot added C:app-wiring C:stakeibc labels Jan 10, 2024

sampocs added the A:automerge Automatically merge PR once checks pass label Jan 10, 2024

sampocs and others added 2 commits January 10, 2024 22:10

fixed unit test after merge

ba9330c

Merge branch 'main' into sam/autopilot-fallback-address

2bec06a

mergify bot merged commit b9be32b into main Jan 11, 2024
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

autopilot fallback address #1039

autopilot fallback address #1039

sampocs commented Dec 28, 2023 •

edited

Loading

asalzmann left a comment

riley-stride left a comment •

edited

Loading

sampocs commented Jan 6, 2024 •

edited

Loading

riley-stride commented Jan 7, 2024 •

edited

Loading

sampocs commented Jan 9, 2024

riley-stride left a comment

asalzmann left a comment

autopilot fallback address #1039

autopilot fallback address #1039

Conversation

sampocs commented Dec 28, 2023 • edited Loading

Context and purpose of the change

Brief Changelog

Testing

Timeout

Ack Error

asalzmann left a comment

Choose a reason for hiding this comment

riley-stride left a comment • edited Loading

Choose a reason for hiding this comment

sampocs commented Jan 6, 2024 • edited Loading

riley-stride commented Jan 7, 2024 • edited Loading

sampocs commented Jan 9, 2024

riley-stride left a comment

Choose a reason for hiding this comment

asalzmann left a comment

Choose a reason for hiding this comment

sampocs commented Dec 28, 2023 •

edited

Loading

riley-stride left a comment •

edited

Loading

sampocs commented Jan 6, 2024 •

edited

Loading

riley-stride commented Jan 7, 2024 •

edited

Loading