[🛠️] Change Command Scheduler to fix iterator invalidation bugs and get rid of "hacks" in C++ and Java #6593

kytpbs · 2024-05-07T00:11:34Z

scheduledCommands.erase() was being called in for(Command* command : scheduledCommands)

this now gets put into a temporary vector (at the size of the set, I do not think all the commands will end in the same for loop but nevertheless it's the size of the set) and erases them after the for loop is done.

Found together with @PeterJohnson.

Starlight220

Wasn't the iteration being done via an iterator (and then the removal also being done through the iterator)?

There's already endingCommands, and this adds another toRemove; I feel we're hacking this implementation patch on patch and it's not great.

This diverges from Java.

This changes behavior, as with this isScheduled will return true until the following iteration and not immediately when isFinished returns false. As an example of practical change, proxies will now end the iteration after the proxied command (which tbh might be better than the current situation where it depends on the map iteration order).

Add unit tests based on the code used to find this.

PeterJohnson · 2024-05-07T04:56:42Z

Wasn't the iteration being done via an iterator (and then the removal also being done through the iterator)?

Sort of. It used a range for, which has an internal iterator. That iterator is invalidated by SmallSet when the current object is removed, which means the for loop next iteration is incrementing from an invalid iterator (UB). Even manually managing the iterators doesn't work with SmallSet, because it can use a vector implementation, which invalidates all iterators, not just the current one. So the only other possible fix would be to use std::set AND manually manage the iterators instead of using range-for.

Java doesn't have this issue--iterator.remove() updates itself in such a way that hasNext() on it is safe for the next loop iteration.

Starlight220 · 2024-05-07T05:24:15Z

SmallSet doesn't have an Iterator.remove like Java?

PeterJohnson · 2024-05-07T05:24:52Z

No. Neither does std::set.

kytpbs · 2024-05-07T12:15:20Z

/format

KangarooKoala

Good catch!

However, considering that this certainly does have a change in behavior, Java should be changed to match. (We've had enough issues with differences between Java and C++, so it's best to keep them in sync as much as possible.) I'll note that robotpy's command scheduler avoids this issue by iterating over a (shallow) copy of the map/dictionary, but it still has the scheduling and canceling queues (presumably for compatibility?).

Additionally, I'm nervous about partially delaying ending the command- It feels like at least one person will run into an edge case where a "scheduled" command not being linked to any requirements causes strange and difficult-to-debug behavior.

wpilibNewCommands/src/main/native/cpp/frc2/command/CommandScheduler.cpp

kytpbs · 2024-05-16T02:08:11Z

The way robot.py did it seems like a better way to mitigate this issue, and SmallSet<Command*>(scheduledCommands) should create a copy of the smallSet, and I don't think the smallSet will be big enough that this will have a performance impact.

What do you guys think?

KangarooKoala · 2024-05-16T16:29:42Z

I agree that copying scheduledCommands would be the cleaner solution. Some things to remember for whoever implements that:

Make the same change to Java (I think iterating over m_scheduledCommands.toArray(new Command[0]) would be the most performant way to copy the set)
Add a check to the start of the run loop to skip commands that are no longer scheduled
Remove the inRunLoop, toSchedule, toCancelCommands, and toCancelInterruptors member variables

TheTripleV · 2024-05-16T17:49:31Z

Whatever behavior is being chosen, if it's going to being a part of the "spec", there should be tests that fail currently and pass afterwards added.

kytpbs · 2024-05-21T22:41:12Z

/format

KangarooKoala

Looks good to me! (For what it's worth)
I'll just note down the behavioral differences for posterity, but I think these are acceptable (even positive) changes:

command.isScheduled() is false when command.end(boolean) and the finish or interrupt callbacks are called.
When scheduling a command from the run loop, command.initialize() and the schedule callbacks are called immediately, not after the run loop. It remains the case that command.execute() will only be called from the next call to scheduler.run().
When cancelling a command from the run loop , command.end(true) and the finish or interrupt callbacks are called immediately, not after the run loop (and the queued scheduled commands are processed). Notably, if a command (call it A) cancels another command (call it B) that was scheduled after A (and therefore will be processed after A), B is not be processed (execute() called and isFinished() checked) in the same run loop.

wpilibNewCommands/src/main/java/edu/wpi/first/wpilibj2/command/CommandScheduler.java

kytpbs · 2024-05-22T00:02:56Z

/format

exact same changes from wpilibsuite/allwpilib#6593

PeterJohnson · 2024-05-23T14:03:54Z

Set.copyOf is memory allocation heavy; this is what the implementation is:

return (Set<E>)Set.of(new HashSet<>(coll).toArray());

So it first creates a HashSet, then creates an array of its elements, then creates a Set from that array.

For how we're using it here, toArray() would be more efficient. As a further optimization we could even avoid allocations most of the time by smartly reusing the array.

KangarooKoala · 2024-05-23T19:50:47Z

Note that toArray(T[]) will automatically try to reuse the passed array, so a simple way to reduce allocations would be to update the array with m_composedCommandsCopy = m_composedCommands.toArray(m_composedCommandsCopy). (There's some options we could explore such as the initial array size and growing the array by more than is initially required, but I don't know if it would necessarily be worth it)

Starlight220

This needs so many tests.
SchedulingRecursionTest passing is a good sign (or a very bad one, if the tests there aren't working).

All behavior changes, edge cases, and so on need to be tested. Rigorously.

PeterJohnson · 2024-05-24T17:57:19Z

Note that toArray(T[]) will automatically try to reuse the passed array, so a simple way to reduce allocations would be to update the array with m_composedCommandsCopy = m_composedCommands.toArray(m_composedCommandsCopy). (There's some options we could explore such as the initial array size and growing the array by more than is initially required, but I don't know if it would necessarily be worth it)

Note also toArray() will null-fill the end elements, not shrink the array, so you will need to look for the first null and exit the loop early if doing this.

this is also to retrigger workflows

Co-authored-by: Joseph Eng <91924258+KangarooKoala@users.noreply.github.com>

kytpbs · 2024-10-10T23:16:12Z

synced with main with rebase.

Will write new tests and better optimize currently working on it

wpilibNewCommands/src/test/native/cpp/frc2/command/CommandScheduleTest.cpp

Co-authored-by: Joseph Eng <91924258+KangarooKoala@users.noreply.github.com>

kytpbs · 2024-10-11T14:59:20Z

fix formatting with rebase + force-push: (I forget to run wpiformat all the time...)

kytpbs · 2024-10-11T17:45:54Z

Update the checking for null and isScheduled as discussed in discord with a amend + force-push:

kytpbs · 2024-10-11T18:44:22Z

oops did command.isScheduled() instead of isScheduled(command) what a rookie mistake, fixing with force push now

kytpbs · 2024-10-12T02:56:38Z

I have added 3 different tests testing weird cases of using the scheduler inside the commands. all 3 fail on the main branch and pass on this branch

What other tests should I add? @Starlight220?

KangarooKoala

Open for discussion- Should we move some of these tests to SchedulingRecursionTest?

wpilibNewCommands/src/test/java/edu/wpi/first/wpilibj2/command/CommandScheduleTest.java

wpilibNewCommands/src/test/native/cpp/frc2/command/CommandScheduleTest.cpp

Co-Authored-By: Joseph Eng <91924258+KangarooKoala@users.noreply.github.com>

kytpbs · 2024-10-12T21:26:23Z

@KangarooKoala: Should we move some of these tests to SchedulingRecursionTest?

I say we move test cancelNextCommandTest for sure and also scheduleCommandInCommand since they are both similar to #4259 but keep commandKnowsWhenEndedTest since it feels more like a CommandScheduler test more than a scheduling recursion.
Any objections?

KangarooKoala

I say we move test cancelNextCommandTest for sure and also scheduleCommandInCommand since they are both similar to #4259 but keep commandKnowsWhenEndedTest since it feels more like a CommandScheduler test more than a scheduling recursion.
Any objections?

Sounds good!

wpilibNewCommands/src/test/java/edu/wpi/first/wpilibj2/command/CommandScheduleTest.java

kytpbs requested a review from a team as a code owner May 7, 2024 00:11

Starlight220 reviewed May 7, 2024

View reviewed changes

KangarooKoala reviewed May 7, 2024

View reviewed changes

wpilibNewCommands/src/main/native/cpp/frc2/command/CommandScheduler.cpp Outdated Show resolved Hide resolved

kytpbs requested a review from PeterJohnson May 21, 2024 22:42

kytpbs changed the title ~~[🛠️] fix "modify set in its loop" bug in C++ Command Scheduler~~ [🛠️] change Command Scheduler to fix iterator invalidation bugs and get rid of "hacks" in C++ and Java May 21, 2024

kytpbs changed the title ~~[🛠️] change Command Scheduler to fix iterator invalidation bugs and get rid of "hacks" in C++ and Java~~ [🛠️] Change Command Scheduler to fix iterator invalidation bugs and get rid of "hacks" in C++ and Java May 21, 2024

kytpbs requested a review from KangarooKoala May 21, 2024 23:27

KangarooKoala approved these changes May 21, 2024

View reviewed changes

wpilibNewCommands/src/main/java/edu/wpi/first/wpilibj2/command/CommandScheduler.java Outdated Show resolved Hide resolved

kytpbs added a commit to kytpbs/robotpy-commands that referenced this pull request May 22, 2024

remove concurrent avoiding modification member variables

f028bf5

exact same changes from wpilibsuite/allwpilib#6593

kytpbs mentioned this pull request May 22, 2024

remove concurrent avoiding modification member variables (Python) robotpy/robotpy-commands-v2#66

Open

Starlight220 requested changes May 24, 2024

View reviewed changes

KangarooKoala mentioned this pull request May 28, 2024

Update Commands.java Add looseSequence() method #6639

Closed

This was referenced Sep 18, 2024

Fix iterator invalidation bugs exposed by LLVM 19 #7092

Open

[upstream_utils] Upgrade to LLVM 19.1.3 #7101

Open

kytpbs force-pushed the Fix-cpp-commandScheduler branch from 3dc304c to c85c91a Compare October 10, 2024 13:09

kytpbs and others added 3 commits October 11, 2024 02:11

fix scheduledCommands.erase() being called in the same sets iterator

184566c

added comment explaning why we defer from java

9c94322

this is also to retrigger workflows

Formatting fixes

cd753b1

kytpbs and others added 5 commits October 11, 2024 02:11

remove concurrent avoiding modification member variables (C++)

35786d8

remove concurrent avoiding modification member variables (Java)

a905f7b

Formatting fixes

ede024e

remove iterator as we don't need it

a7702d0

Co-authored-by: Joseph Eng <91924258+KangarooKoala@users.noreply.github.com>

Formatting fixes

2f5c5c4

kytpbs force-pushed the Fix-cpp-commandScheduler branch from c85c91a to 2f5c5c4 Compare October 10, 2024 23:11

kytpbs force-pushed the Fix-cpp-commandScheduler branch 2 times, most recently from 4db4256 to 2d54082 Compare October 11, 2024 03:24

KangarooKoala reviewed Oct 11, 2024

View reviewed changes

wpilibNewCommands/src/test/native/cpp/frc2/command/CommandScheduleTest.cpp Outdated Show resolved Hide resolved

add a test to cancel the next command from the first

7149e43

Co-authored-by: Joseph Eng <91924258+KangarooKoala@users.noreply.github.com>

kytpbs force-pushed the Fix-cpp-commandScheduler branch from 2d54082 to 7149e43 Compare October 11, 2024 13:48

add better error messages to C++ tests

7c7ca2e

kytpbs force-pushed the Fix-cpp-commandScheduler branch from 96b0bb9 to 68d669d Compare October 11, 2024 14:59

kytpbs force-pushed the Fix-cpp-commandScheduler branch from 68d669d to 896d784 Compare October 11, 2024 17:46

Optimize copying m_scheduledCommands

9154940

kytpbs force-pushed the Fix-cpp-commandScheduler branch from 896d784 to 9154940 Compare October 11, 2024 18:44

kytpbs added 2 commits October 12, 2024 01:37

add a test checking if command is scheduled on end() call

35b02b9

Add test for scheduling a command within a command

0f5c8e6

KangarooKoala reviewed Oct 12, 2024

View reviewed changes

Refactor tests according to reviews

93ad7b8

Co-Authored-By: Joseph Eng <91924258+KangarooKoala@users.noreply.github.com>

KangarooKoala reviewed Oct 13, 2024

View reviewed changes

wpilibNewCommands/src/test/java/edu/wpi/first/wpilibj2/command/CommandScheduleTest.java Outdated Show resolved Hide resolved

This comment was marked as duplicate.

Sign in to view

move some tests to schedulingRecursionTest & improve comments

ada142a

kytpbs requested a review from Starlight220 October 31, 2024 14:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[🛠️] Change Command Scheduler to fix iterator invalidation bugs and get rid of "hacks" in C++ and Java #6593

[🛠️] Change Command Scheduler to fix iterator invalidation bugs and get rid of "hacks" in C++ and Java #6593

kytpbs commented May 7, 2024

Starlight220 left a comment

PeterJohnson commented May 7, 2024 •

edited

Loading

Starlight220 commented May 7, 2024 •

edited

Loading

PeterJohnson commented May 7, 2024

kytpbs commented May 7, 2024

KangarooKoala left a comment

kytpbs commented May 16, 2024

KangarooKoala commented May 16, 2024

TheTripleV commented May 16, 2024

kytpbs commented May 21, 2024

KangarooKoala left a comment

kytpbs commented May 22, 2024

PeterJohnson commented May 23, 2024 •

edited

Loading

KangarooKoala commented May 23, 2024

Starlight220 left a comment

PeterJohnson commented May 24, 2024

kytpbs commented Oct 10, 2024

kytpbs commented Oct 11, 2024 •

edited

Loading

kytpbs commented Oct 11, 2024

kytpbs commented Oct 11, 2024

kytpbs commented Oct 12, 2024 •

edited

Loading

KangarooKoala left a comment

kytpbs commented Oct 12, 2024

KangarooKoala left a comment

This comment was marked as duplicate.

[🛠️] Change Command Scheduler to fix iterator invalidation bugs and get rid of "hacks" in C++ and Java #6593

Are you sure you want to change the base?

[🛠️] Change Command Scheduler to fix iterator invalidation bugs and get rid of "hacks" in C++ and Java #6593

Conversation

kytpbs commented May 7, 2024

Starlight220 left a comment

Choose a reason for hiding this comment

PeterJohnson commented May 7, 2024 • edited Loading

Starlight220 commented May 7, 2024 • edited Loading

PeterJohnson commented May 7, 2024

kytpbs commented May 7, 2024

KangarooKoala left a comment

Choose a reason for hiding this comment

kytpbs commented May 16, 2024

KangarooKoala commented May 16, 2024

TheTripleV commented May 16, 2024

kytpbs commented May 21, 2024

KangarooKoala left a comment

Choose a reason for hiding this comment

kytpbs commented May 22, 2024

PeterJohnson commented May 23, 2024 • edited Loading

KangarooKoala commented May 23, 2024

Starlight220 left a comment

Choose a reason for hiding this comment

PeterJohnson commented May 24, 2024

kytpbs commented Oct 10, 2024

kytpbs commented Oct 11, 2024 • edited Loading

kytpbs commented Oct 11, 2024

kytpbs commented Oct 11, 2024

kytpbs commented Oct 12, 2024 • edited Loading

KangarooKoala left a comment

Choose a reason for hiding this comment

kytpbs commented Oct 12, 2024

KangarooKoala left a comment

Choose a reason for hiding this comment

This comment was marked as duplicate.

PeterJohnson commented May 7, 2024 •

edited

Loading

Starlight220 commented May 7, 2024 •

edited

Loading

PeterJohnson commented May 23, 2024 •

edited

Loading

kytpbs commented Oct 11, 2024 •

edited

Loading

kytpbs commented Oct 12, 2024 •

edited

Loading