Reduce sleep() in CAP library code #2189

rajan-chari · 2024-03-28T14:02:53Z

Why are these changes needed?

Related issue number

Resolves #2088 Roadmap item: (Remove Sleeps from Actor creation, ActorSender, ActorConnector in the CAP framework)

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://microsoft.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

kinnym

Typically we dont want to wait infinitely.. generally not a good idea. To fix it we can modify the existing code to this. Your timeout is your worst case scenario.
// Assuming _start_event is already defined somewhere
// Create an instance of threading.Event() if it's not already created
self._start_event = threading.Event()

//Wait for the event with a timeout of 5 seconds
if not self._start_event.wait(timeout=5):
// If the event didn't occur within 5 seconds
// Do something else or raise an exception
print("Event didn't occur within 5 seconds. Proceeding with other actions.")
else:
// If the event occurred within 5 seconds
// Proceed with the rest of the code
print("Event occurred. Proceeding with other actions.")

kinnym

Otherwise the changes look good.

codecov-commenter · 2024-03-29T00:38:44Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 37.94%. Comparing base (32fbfa2) to head (9f0bbcc).
Report is 1 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2189   +/-   ##
=======================================
  Coverage   37.94%   37.94%           
=======================================
  Files          77       77           
  Lines        7780     7780           
  Branches     1666     1666           
=======================================
  Hits         2952     2952           
  Misses       4579     4579           
  Partials      249      249

Flag	Coverage Δ
unittests	`37.93% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

rajan-chari · 2024-03-31T17:19:15Z

Typically we dont want to wait infinitely.. generally not a good idea. To fix it we can modify the existing code to this. Your timeout is your worst case scenario. // Assuming _start_event is already defined somewhere // Create an instance of threading.Event() if it's not already created self._start_event = threading.Event()

//Wait for the event with a timeout of 5 seconds if not self._start_event.wait(timeout=5): // If the event didn't occur within 5 seconds // Do something else or raise an exception print("Event didn't occur within 5 seconds. Proceeding with other actions.") else: // If the event occurred within 5 seconds // Proceed with the rest of the code print("Event occurred. Proceeding with other actions.")

Thanks for the suggestion, Kiran. I see where you are coming from in case there is an error in signaling logic, this would prevent Actor startup from freezing.

I tried the suggested approach. Here are my observations:

In debugger, even when there are no issues, the waiting threads exit. This makes the Actor classes brittle and difficult to debug. In this case, attempting to avoid possible incorrect states, puts the code in incorrect states. Since the 5 seconds is just a heuristic, it's hard to guarantee correct behavior.
My preference here is to fix issues instead of providing a path for possibly incorrect behavior in the core framework. I try to ensure that all path's in the signaling thread are correct. Please let me know if you see any logic issues, for example, exceptions might cause things to not be signaled.

This change would make it harder to use the framework (at least under Debug). I concluded that the best approach here is to rely on correct event signaling behavior and correct logic in threading code, so I rolled back the changes.

rajan-chari · 2024-03-31T17:49:58Z

Merge seems to be prevented by checking a file that should be excluded. I created a PR to disable the file check. This has been merged to main. Not sure how to proceed here. The exclusion has also been committed to this branch.

sonichi · 2024-03-31T22:55:15Z

Could you fix the code formatting error?

rajan-chari · 2024-04-01T13:23:36Z

Could you fix the code formatting error?

Running into this issue: #2190

…P can wait a certain amount and give up. In order to reconcile the two, AutoGenConnector is set to wait indefinitely.

…ps://github.com/rajan-chari/autogen into rajan/reduce-sleeps

rajan-chari · 2024-04-01T16:31:45Z

Could you fix the code formatting error?

All set.

* 1) Removed most framework sleeps 2) refactored connection code * pre-commit fixes * pre-commit * ignore protobuf files in pre-commit checks * Fix duplicate actor registration * refactor change * Nicer printing of Actors * 1) Report recv_multipart errors 4) Always send 4 parts * AutoGen generate_reply expects to wait indefinitely for an answer. CAP can wait a certain amount and give up. In order to reconcile the two, AutoGenConnector is set to wait indefinitely. * pre-commit formatting fixes * pre-commit format changes * don't check autogenerated proto py files

rajan-chari added 3 commits March 28, 2024 09:37

1) Removed most framework sleeps 2) refactored connection code

f14c382

pre-commit fixes

ffac84b

pre-commit

d093cda

rajan-chari changed the title ~~Rajan/reduce sleeps~~ Reduce sleep() in CAP library code Mar 28, 2024

Merge branch 'main' into rajan/reduce-sleeps

596aa8d

rajan-chari requested review from ekzhu and kinnym March 28, 2024 17:39

kinnym reviewed Mar 28, 2024

View reviewed changes

ignore protobuf files in pre-commit checks

916a1ec

Merge branch 'main' into rajan/reduce-sleeps

d1c438e

ekzhu approved these changes Mar 31, 2024

View reviewed changes

rajan-chari closed this Mar 31, 2024

rajan-chari reopened this Mar 31, 2024

rajan-chari force-pushed the rajan/reduce-sleeps branch from d602400 to d1c438e Compare March 31, 2024 17:35

Merge branch 'main' into rajan/reduce-sleeps

fea2cfb

rajan-chari mentioned this pull request Mar 31, 2024

[Bug]: pre-commit checks are failing for protobuf generated files #2190

Closed

rajan-chari added 3 commits March 31, 2024 17:05

Fix duplicate actor registration

c8e0fc6

refactor change

2daed6a

Nicer printing of Actors

e9eea78

1) Report recv_multipart errors 4) Always send 4 parts

08dc6b6

rajan-chari added 4 commits April 1, 2024 09:23

Merge branch 'main' into rajan/reduce-sleeps

40124f5

AutoGen generate_reply expects to wait indefinitely for an answer. CA…

dbdbb4d

…P can wait a certain amount and give up. In order to reconcile the two, AutoGenConnector is set to wait indefinitely.

Merge branches 'rajan/reduce-sleeps' and 'rajan/reduce-sleeps' of htt…

51a8cfa

…ps://github.com/rajan-chari/autogen into rajan/reduce-sleeps

pre-commit formatting fixes

eb90cd4

rajan-chari added 3 commits April 1, 2024 12:07

pre-commit format changes

77d9488

don't check autogenerated proto py files

431865c

Merge branch 'main' into rajan/reduce-sleeps

9f0bbcc

Merge branch 'main' into rajan/reduce-sleeps

2112cf7

ekzhu added this pull request to the merge queue Apr 2, 2024

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Apr 2, 2024

ekzhu added this pull request to the merge queue Apr 2, 2024

Merged via the queue into microsoft:main with commit db30ec8 Apr 2, 2024
23 checks passed

rajan-chari deleted the rajan/reduce-sleeps branch April 2, 2024 11:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce sleep() in CAP library code #2189

Reduce sleep() in CAP library code #2189

rajan-chari commented Mar 28, 2024 •

edited

Loading

kinnym left a comment •

edited

Loading

kinnym left a comment

codecov-commenter commented Mar 29, 2024 •

edited

Loading

rajan-chari commented Mar 31, 2024 •

edited

Loading

rajan-chari commented Mar 31, 2024 •

edited

Loading

sonichi commented Mar 31, 2024

rajan-chari commented Apr 1, 2024

rajan-chari commented Apr 1, 2024

Reduce sleep() in CAP library code #2189

Reduce sleep() in CAP library code #2189

Conversation

rajan-chari commented Mar 28, 2024 • edited Loading

Why are these changes needed?

Related issue number

Checks

kinnym left a comment • edited Loading

Choose a reason for hiding this comment

kinnym left a comment

Choose a reason for hiding this comment

codecov-commenter commented Mar 29, 2024 • edited Loading

Codecov Report

rajan-chari commented Mar 31, 2024 • edited Loading

rajan-chari commented Mar 31, 2024 • edited Loading

sonichi commented Mar 31, 2024

rajan-chari commented Apr 1, 2024

rajan-chari commented Apr 1, 2024

rajan-chari commented Mar 28, 2024 •

edited

Loading

kinnym left a comment •

edited

Loading

codecov-commenter commented Mar 29, 2024 •

edited

Loading

rajan-chari commented Mar 31, 2024 •

edited

Loading

rajan-chari commented Mar 31, 2024 •

edited

Loading