replay a GitHub event locally at a bot instance #178

jonas-lq · 2023-04-26T08:35:05Z

Fixes #94

PR for being able to replay events stored in the file system.

Triggered by commenting "replay " on the pr (Your GitHub app must subscribe to the event "Issue comment")

There were already some support for replaying events (see line 200 in eessi_bot_event_handler.py), but this is outdated and would not work even if the specified file contained both the header and body (PyGHee logs events as separate files for body and header). Should I change the code here to utilise my newly implemented function?

trz42

Looks already quite nice. The PR does more than what #94 asked for. #94 just suggested to have a means to replay an event locally (where a bot instance is running). A couple of suggestions:

remove the ability to send the replay command from GitHub (it will be impractical to use or worse lead to unintended consequences)
think of improving the efficiency, as of now os.walk() will traverse the whole directory of the bot instance and access 100k or even millions of entries
- one could/should make use of the directory structure starting at events_log
- one could also think of improving PyGHee such that it creates symlinks, eg, event_id -> directory that contains the header & body json
  - thus there need to be no lookup/traversal of the file system
there could be a new small script, say replay_event.py that imports the replay_event method
- this probably requires to move the core of the method replay_event out of the EESSIBotEventHandler class to another module, say tools/replay_events.py
- thus we could have a tool that replays an event and if we later decide to use that from the bot (event handler) we can just reuse it
a follow-up PR could improve the replay_event tool to limit the effects of replaying an event, for example, only replay it for certain architectures or job ids
- this might use code developed for support for sending commands to bot instances via PR comments #172

trz42 · 2023-04-27T07:04:23Z

eessi_bot_event_handler.py

+        if comment_txt.lower()[:6] == "replay":
+            self.replay_event(comment_txt[7:])


I suggest to remove the ability to trigger the replay from GitHub. #94 only intended to replay an event locally, that is, where a bot instance is running. Someone who only has access to GitHub may also have difficulties to know/obtain the ID of an event.

trz42 · 2023-04-27T07:26:38Z

eessi_bot_event_handler.py

+        """
+        event = namedtuple('Request', ['headers', 'json', 'data'])
+
+        for (dir, _subdirs, files) in os.walk("."):


This traverses the whole directory tree under .. Benefit would be that even if the directory structure changes, it could still find the event. Disadvantage could be that it does not find the event because they are not stored under . (it's just a relative reference to the file system) and it checks many files/directories.

Could this be made more efficient and explicitly use the directory that stores event information?

. could be replaced with what app.cfg defines for the directory that contains jobs OR the directory PyGHee uses to log events. For example, for the current NESSI bot instances on Fram, eX3 and AWS, there are 15159, 15161 and 15129 entries under events_log, while under . there are 49037, 53289, 284738 entries, respectively.

trz42 · 2023-04-27T08:45:32Z

eessi_bot_event_handler.py

+            if any([event_id in file for file in files]):
+                with open(f"{dir}/{event_id}_headers.json", 'r') as jf:
+                    headers = json.load(jf)
+                    event.headers = CaseInsensitiveDict(headers)
+                with open(f"{dir}/{event_id}_body.json", 'r') as jf:
+                    body = json.load(jf)
+                    event.json = body
+
+        event_info = get_event_info(event)


This looks very good. Clean and concise. No immediate requests for changes. Maybe need to be updated if the os.walk() is replaced with something more efficient.

trz42 · 2023-11-26T14:08:15Z

Closing this one. We may revisit it at a later time.

jonas-lq added 3 commits April 25, 2023 13:14

Events can now be replayed with pr comment

168acdd

Event does not need to be labeled pr now

f4559e7

Fixed hound issues

600cb72

trz42 changed the title ~~Issue#94~~ replay a GitHub event locally at a bot instance Apr 27, 2023

trz42 requested changes Apr 27, 2023

View reviewed changes

jonas-lq added 7 commits April 27, 2023 16:13

Replay func in separate file, starts search from events_log

1252670

Fixed over-indentation

1339c4c

Added dir with symlinks to events logged by PyGHee

7afc2f7

Added new author on files changed across my PR's

d908185

Added event_links_path in app.cfg

122d4d3

Removed unused imports

b788bac

Merge branch 'main' into issue#94

e2c2b0f

trz42 closed this Nov 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replay a GitHub event locally at a bot instance #178

replay a GitHub event locally at a bot instance #178

jonas-lq commented Apr 26, 2023 •

edited by trz42

Loading

trz42 left a comment

trz42 Apr 27, 2023

trz42 Apr 27, 2023

trz42 Apr 27, 2023

trz42 commented Nov 26, 2023

		if comment_txt.lower()[:6] == "replay":
		self.replay_event(comment_txt[7:])

replay a GitHub event locally at a bot instance #178

replay a GitHub event locally at a bot instance #178

Conversation

jonas-lq commented Apr 26, 2023 • edited by trz42 Loading

trz42 left a comment

Choose a reason for hiding this comment

trz42 Apr 27, 2023

Choose a reason for hiding this comment

trz42 Apr 27, 2023

Choose a reason for hiding this comment

trz42 Apr 27, 2023

Choose a reason for hiding this comment

trz42 commented Nov 26, 2023

jonas-lq commented Apr 26, 2023 •

edited by trz42

Loading