transports/tcp: Remove sleep_on_error #2849

mxinden · 2022-08-26T08:11:56Z

Description

The sleep_on_error mechanism in libp2p-tcp would delay the next poll on the
listener stream when an error happens. This mechanism was introduced in
#402, based on the tk_listen crate (https://docs.rs/tk-listen/latest/tk_listen/),
which was also referenced in the tokio documentation.

When running out of file descriptors, the listening socket would return an
EMFILE. Rather than polling the socket again right away, which would likely
return another EMFILE and could result in a busy loop, the listener would wait
for 100ms.
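
To make the mechanism concrete, here is a minimal blocking sketch. It is not the actual libp2p-tcp code, which is poll-based. The names `SLEEP_ON_ERROR`, `pause_after` and `accept_with_pause` are hypothetical, and pausing on every error kind is a simplifying assumption.

```rust
use std::io;
use std::net::{TcpListener, TcpStream};
use std::thread;
use std::time::Duration;

/// Pause applied after a failed accept; 100ms as in the description above.
const SLEEP_ON_ERROR: Duration = Duration::from_millis(100);

/// Returns how long to wait before the next accept attempt, if at all.
/// Assumption: every error triggers the pause.
fn pause_after<T>(accept_result: &io::Result<T>) -> Option<Duration> {
    match accept_result {
        Ok(_) => None,
        // E.g. EMFILE: retrying immediately would likely fail the same way.
        Err(_) => Some(SLEEP_ON_ERROR),
    }
}

/// Accepts one connection, sleeping after an error instead of retrying at once.
/// The error is still returned to the caller so it can be logged or reported.
fn accept_with_pause(listener: &TcpListener) -> io::Result<TcpStream> {
    let result = listener.accept().map(|(stream, _addr)| stream);
    if let Some(pause) = pause_after(&result) {
        thread::sleep(pause);
    }
    result
}
```

Removing the mechanism corresponds to dropping the `thread::sleep` call, so a caller that loops on `accept_with_pause` would retry immediately after each error.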

Modern operating systems should run with high file descriptor limits and thus
this error should not happen in the wild. In addition, delaying the next poll
only covers up the issue, but does not solve it.

Lastly, rust-libp2p gives incoming connections the lowest priority. Thus, while
this could result in a busy loop, it only does in case there is no other work left.

With the above in mind, this pull request removes the delay.

(The mention of tk_listen has since been removed from tokio.)

Links to any relevant issues

Recent discussion on sleep_on_error: transports/tcp: simplify IfWatcher integration #2813 (comment)

Open Questions

What do folks think? Is it safe to assume that rust-libp2p runs in environments with high file descriptor limits?
How do other projects handle these cases? I have not found other examples yet.

Change checklist

I have performed a self-review of my own code
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
A changelog entry has been made in the appropriate crates

mxinden · 2022-08-26T08:18:28Z

As far as I can tell, go-libp2p does not pause when witnessing an error on accept:

https://github.com/libp2p/go-libp2p/blob/423eab209791f1b1864096371c1b3d76a2bc88c3/p2p/transport/tcp/tcp.go#L79-L91

Maybe @marten-seemann @MarcoPolo or @julian88110 can confirm?

marten-seemann · 2022-08-26T08:29:39Z

No we don't wait.

Menduist · 2022-08-26T09:29:37Z

> this error should not happen in the wild.

We've seen it a few times, for instance on macos or whatever

> Thus, while this could result in a busy loop, it only does in case there is no other work left.

You assume this is the only program running on the server, imo it's never ok to burn a thread for no reason

> In addition, delaying the next poll only covers up the issue, but does not solve it.

What would solving it look like? You either raise the fd limit, or lower the maximum number of peers

> How do other projects handle these cases?

In nimbus, we show a warning at startup if the fd limit is too close / below the maximum peer amount (if we can, it's not straightforward to get the actual limit). Otherwise, we sleep after a failed accept to avoid the busy loop
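
The startup check described above could look roughly like the following sketch. It is not nimbus or libp2p code. It assumes a Linux host where the limit can be read from `/proc/self/limits`. The function names and the `headroom` parameter (descriptors reserved for files, sockets and so on that are not peer connections) are hypothetical.

```rust
/// Extracts the soft "Max open files" limit from the text of
/// /proc/self/limits (Linux only). Returns `None` if the line is missing
/// or the limit is not a number (e.g. "unlimited").
fn parse_soft_fd_limit(limits: &str) -> Option<u64> {
    const NAME: &str = "Max open files";
    let line = limits.lines().find(|l| l.starts_with(NAME))?;
    // Columns after the name: soft limit, hard limit, unit.
    line[NAME.len()..].split_whitespace().next()?.parse().ok()
}

/// Returns a warning if the soft limit cannot cover `max_peers` connections
/// plus `headroom` descriptors for everything else.
fn fd_limit_warning(soft_limit: u64, max_peers: u64, headroom: u64) -> Option<String> {
    let needed = max_peers.saturating_add(headroom);
    if soft_limit < needed {
        Some(format!(
            "file descriptor limit {soft_limit} is below the {needed} needed for {max_peers} peers"
        ))
    } else {
        None
    }
}
```

At startup one would pass the result of `std::fs::read_to_string("/proc/self/limits")` to `parse_soft_fd_limit` and log the warning, if any. When the limit cannot be determined, the check is skipped.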

mxinden · 2022-08-30T06:34:44Z

Thanks @marten-seemann and @Menduist.

> > Thus, while this could result in a busy loop, it only does in case there is no other work left.
>
> You assume this is the only program running on the server, imo it's never ok to burn a thread for no reason

Good point.

> > In addition, delaying the next poll only covers up the issue, but does not solve it.
>
> What would solving it look like? You either raise the fd limit, or lower the maximum number of peers

Those are the only solutions I can think of as well. My thought was that a busy loop would surface the issue (e.g. low file descriptor count), while the delay would cover the issue up when running in production. Unless one pays attention to the additional logged errors, which I doubt is happening in most setups.

> > How do other projects handle these cases?
>
> In nimbus, we show a warning at startup if the fd limit is too close / below the maximum peer amount (if we can, it's not straightforward to get the actual limit). Otherwise, we sleep after a failed accept to avoid the busy loop

For the record, corresponding logic in Substrate: https://github.com/paritytech/substrate/blob/00cc5f104176fac6f5a624bced22a2192c7c0470/client/cli/src/config.rs#L652-L660

Will give this more thought, but leaning towards keeping and better documenting the delay with this additional input.

Menduist · 2022-08-30T11:50:19Z

> Unless one pays attention to the additional logged errors, which I doubt is happening in most setups.

That's a good point
I guess in the end it depends on the end application, libp2p can't really decide what makes more sense. Maybe we should just explore how to convey the issue to the end application

MarcoPolo · 2022-09-12T23:56:51Z

It might be a good idea to record this as a metric, and make this delay configurable. An operator may want to alert when this happens since it may mean they should increase their FD limits or something is exhausting their FDs (leak?).
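
A rough sketch of what a configurable delay with an error counter might look like. `AcceptErrorPolicy` and its fields are hypothetical and not part of libp2p. The plain `u64` counter stands in for a real metrics counter.

```rust
use std::time::Duration;

/// Hypothetical policy object, not part of libp2p.
#[derive(Debug)]
struct AcceptErrorPolicy {
    /// Pause after a failed accept; `None` disables the pause entirely.
    delay: Option<Duration>,
    /// Number of accept errors seen so far (stand-in for a metrics counter).
    accept_errors: u64,
}

impl AcceptErrorPolicy {
    fn new(delay: Option<Duration>) -> Self {
        Self { delay, accept_errors: 0 }
    }

    /// Records one accept error and returns the pause the listener should apply.
    fn on_accept_error(&mut self) -> Option<Duration> {
        self.accept_errors += 1;
        self.delay
    }
}
```

An operator could then alert on `accept_errors` increasing, independently of whether a delay is configured.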

thomaseizinger · 2022-09-14T03:28:36Z

> It might be a good idea to record this as a metric, and make this delay configurable. An operator may want to alert when this happens since it may mean they should increase their FD limits or something is exhausting their FDs (leak?).

We can return an error via Transport::poll. The docs for TransportEvent::ListenerError say "event is for informational purposes only" so we can use this to report it back up and later integrate it with libp2p-metrics.

mxinden · 2022-09-29T14:43:43Z

I am in favor of @thomaseizinger's suggestion above.

That said I will not get to this any time soon. In case someone wants to take this over, let us know.

thomaseizinger · 2022-09-30T01:37:20Z

The error is already reported as of today. Unless another sub-task is woken, this shouldn't result in a busy loop?

thomaseizinger · 2022-09-30T02:42:21Z

I had a play around with this and pushed a little test harness with a docker container to test the behaviour for when we run out of file descriptors.

In the current implementation, the error is already returned but it is a busy loop. Can we perhaps change the implementation that, once we return this particular error once, we return Poll::Pending instead?

mxinden · 2022-10-04T09:59:26Z

> I had a play around with this and pushed a little test harness with a docker container to test the behaviour for when we run out of file descriptors.

🚀 neat! Thank you.

> In the current implementation, the error is already returned but it is a busy loop. Can we perhaps change the implementation that, once we return this particular error once, we return Poll::Pending instead?

Given the Delay it is currently not a busy loop. Am I missing something? Returning Poll::Pending without registration of a Waker could result in a stall as nothing will wake us up once a new connection comes in, right?
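
The trade-off between a stall and a busy loop can be illustrated with a std-only sketch. `StalledListener` and `BusyListener` are hypothetical stand-ins for a listener whose accept keeps failing; they are not libp2p types. `CountingWaker` simply counts how often an executor would be asked to poll again.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

/// Counts wake-ups instead of scheduling a task.
struct CountingWaker(AtomicUsize);

impl Wake for CountingWaker {
    fn wake(self: Arc<Self>) {
        self.0.fetch_add(1, Ordering::SeqCst);
    }
}

/// Returns `Poll::Pending` without registering the waker: nothing will ever
/// cause this to be polled again, so it stalls.
struct StalledListener;

impl Future for StalledListener {
    type Output = ();
    fn poll(self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<()> {
        Poll::Pending
    }
}

/// Wakes itself on every poll: it is re-polled immediately, which is a busy
/// loop when the underlying error persists.
struct BusyListener;

impl Future for BusyListener {
    type Output = ();
    fn poll(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<()> {
        cx.waker().wake_by_ref();
        Poll::Pending
    }
}

/// Polls `fut` once and reports how many wake-ups were requested.
fn wakeups_after_one_poll<F: Future + Unpin>(mut fut: F) -> usize {
    let counter = Arc::new(CountingWaker(AtomicUsize::new(0)));
    let waker = Waker::from(counter.clone());
    let mut cx = Context::from_waker(&waker);
    let _ = Pin::new(&mut fut).poll(&mut cx);
    counter.0.load(Ordering::SeqCst)
}
```

A timer-based delay sits between the two: it registers the waker with a timer, so the task is polled again, but only after the delay has elapsed.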

thomaseizinger · 2022-10-04T11:28:34Z

> > In the current implementation, the error is already returned but it is a busy loop. Can we perhaps change the implementation that, once we return this particular error once, we return Poll::Pending instead?
>
> Given the Delay it is currently not a busy loop. Am I missing something? Returning Poll::Pending without registration of a Waker could result in a stall as nothing will wake us up once a new connection comes in, right?

I was referring to the implementation in this PR!

I am not sure about the stall. Would have to experiment. If you are running out of FD you can't accept new connections anyway?

mxinden · 2022-10-09T19:10:22Z

> If you are running out of FD you can't accept new connections anyway?

Even though you might not be able to accept new connections in such a case, I think the issue here is that the syscall will still attempt to do so. Thus there is a busy loop.

mergify · 2023-03-22T15:23:55Z

This pull request has merge conflicts. Could you please resolve them @mxinden? 🙏

mxinden · 2023-03-22T19:32:22Z

Closing here since stale and not needed.

mxinden mentioned this pull request Aug 26, 2022 in transports/tcp: simplify IfWatcher integration #2813 (merged)

mxinden added the difficulty:moderate, help wanted, and getting-started (issues that can be tackled if you don't know the internals of libp2p very well) labels Sep 29, 2022

thomaseizinger added 3 commits September 30, 2022 12:32

Add infrastructure to test low file system limit

decdffc

Merge branch 'master' into tcp-no-sleep

b7a6320

Adjust to new implementation

6960ab4

thomaseizinger marked this pull request as draft November 2, 2022 03:53

mxinden closed this Mar 22, 2023
