net: add mechanism to wait for readability on a TCPConn #15735

bradfitz · 2016-05-18T21:51:04Z

EDIT: this proposal has shifted. See #15735 (comment) below.

Old:

The net/http package needs a way to wait for readability on a TCPConn without actually reading from it. (See #15224)

http://golang.org/cl/22031 added such a mechanism, making Read(0 bytes) do a wait for readability, followed by returning (0, nil). But maybe that is strange. Windows already works like that, though. (See new tests in that CL)

Reconsider this for Go 1.8.

Maybe we could add a new method to TCPConn instead, like WaitRead.

bradfitz · 2016-05-18T21:51:11Z

/cc @ianlancetaylor @rsc

gopherbot · 2016-05-18T22:00:26Z

CL https://golang.org/cl/23227 mentions this issue.

Updates #15735 Change-Id: I42ab2345443bbaeaf935d683460fc2c941b7679c Reviewed-on: https://go-review.googlesource.com/23227 Reviewed-by: Ian Lance Taylor <iant@golang.org>

Updates #15735. Fixes #15741. Change-Id: Ic4ad7e948e8c3ab5feffef89d7a37417f82722a1 Reviewed-on: https://go-review.googlesource.com/23199 Run-TryBot: Mikio Hara <mikioh.mikioh@gmail.com> TryBot-Result: Gobot Gobot <gobot@golang.org> Reviewed-by: Brad Fitzpatrick <bradfitz@golang.org>

RalphCorderoy · 2016-05-20T09:56:55Z

read(2) with a count of zero may be used to detect errors. Linux man page confirms, as does POSIX's read(3p) here. Mentioning it in case it influences this subverting of a Read(0 bytes) not calling syscall.Read.

bradfitz · 2016-10-21T08:56:29Z

I found a way to do without this in net/http, so punting to Go 1.9.

bradfitz · 2016-12-12T22:30:47Z

Actually, the more I think about this, I don't even want my idle HTTP/RPC goroutines to stick around blocked in a read call. In addition to the array memory backed by the slice given to Read, the goroutine itself is ~4KB of wasted memory.

What I'd really like is a way to register a func() to run when my *net.TCPConn is readable (when a Read call wouldn't block). By analogy, I want the time.AfterFunc efficiency of running a func in a goroutine later, rather than running a goroutine just to block in a time.Sleep.

My new proposal is more like:

package net

// OnReadable runs f in a new goroutine when c is readable;
// that is, when a call to c.Read will not block.
func (c *TCPConn) OnReadable(f func()) {
   // ...
}

Yes, maybe this is getting dangerously into event-based programming land.

Or maybe just the name ("OnWhatever") is offensive. Maybe there's something better.

I would use this in http, http2, and grpc.

/cc @ianlancetaylor @rsc

ianlancetaylor · 2016-12-12T23:07:49Z

Sounds like you are getting close to #15021.

I'm worried that the existence of such a method will encourage people to start writing their code as callbacks rather than as straightforward goroutines.

bradfitz · 2016-12-12T23:17:07Z

Yeah. I'm conflicted. I see the benefits and the opportunity for overuse.

dvyukov · 2017-01-06T09:10:52Z

If we do OnReadable(f func()), won't we need to fork half of standard library for async style? Compress, io, tls, etc readers all assume blocking style and require a blocked goroutine.
I don't see any way to push something asynchronously into e.g. gzip.Reader. Does this mean that I have to choose between no blocked goroutine + my own gzip impl and blocked goroutine + std lib?

dvyukov · 2017-01-06T09:14:54Z

Re 0-sized reads.
It should work with level-triggered notifications, but netpoll uses epoll in edge-triggered mode (and kqueue iirc). I am concerned if cl/22031 works in more complex cases: waiting for already ready IO, double wait, wait without completely draining read buffer first, etc?

bradfitz · 2017-01-06T18:17:57Z

@dvyukov, no, we would only use OnReadable in very high-level places, like the http1 and http2 servers where we know the conn is expected to be idle for long periods of time. The rest of the code underneath would remain in the blocking style.

dvyukov · 2017-01-06T18:56:46Z

This looks like a half-measure. An http connection can halt in the middle of request...

bradfitz · 2017-01-06T18:57:32Z

@dvyukov, but not commonly. This would be an optimization for the common case.

dvyukov · 2017-01-07T11:38:03Z

An alternative interface can be to register a channel that will receive readiness notifications. The other camp wants this for packet-processing servers, and there starting a goroutine for every packet will be too expensive. However, if at the end you want a goroutine, then the channel will introduce unnecessary overhead.
Channel has a problem with overflow handling (netpoll can't block on send, on the other hand it is not OK to lose notifications).
For completeness, this API should also handle writes.

DemiMarie · 2017-01-10T19:56:22Z

We need to make sure that this works with Windows IOCP as well.

rsc · 2017-01-10T20:25:05Z

Not obvious to me why the API has to handle writes. The thing about reads is that until the data is ready for reading, you can use the memory for other work. If you're waiting to write data, that memory is not reusable (otherwise you'd lose the data you are waiting to write).

dvyukov · 2017-01-11T10:50:01Z

@rsc If we do just 0-sized reads, then write support is not necessary. However, if we do Brad's "My new proposal is more like": func (c *TCPConn) OnReadable(f func()), then this equally applies to writes as well -- to avoid 2 blocked goroutines per connection.

noblehng · 2017-02-21T11:50:58Z

If memory usage is the concern, it is possible to make long parked G use less memory instead of changing programming style? One main selling point of Go to me is high efficiency network servers without resorting to callbacks.

Something like shrink the stack or move the stack to heap by the GC using some heuristics, that will be littile different from spinning up a new goroutine on callback memory usage wise, and scheduling wise a callback is not much different than goready(). Also I assume the liveness change in Go1.8 could help here too.

For the backing array, if it is preallocated buffer, than a callback doesn't make much different than Read(), maybe it will make some different if it is allocated per-callback and use a pool.

Edit:
Actually we could have some GC deadline or gopark time in runtime.pollDesc, so we could get a list of long parked G from the poller, then GC can kick in, but more dance is still needed to avoid race and make it fast.

noblehng · 2017-02-22T11:12:29Z

How about a epoll like interface for net.Listener:

type PollableListener interface {
   net.Listener
   // Poll will block till at least one connection been ready for read or write
   // reads and writes are special net.Conn that will not block on EAGAIN
   Poll() (reads []net.Conn, writes []net.Conn)
}

Then the caller of Poll() can has a small number of goroutines to poll for readiness and handle the reads and writes. This should also works well for packet-processing servers.

Note that this only needs to be implemented in the runtime for those Listeners that multiplexed in the kernel, like the net.TCPListener. For other protocol that multiplex in the userspace and doesn't attached to the runtime poller directly, like udp listener or multiplexing streams in a tcp connection, can be implemented outside the runtime. For example, for multiplexing in a tcp connection, we can implemented the epoll like behavior by read from/write to some buffers then poll from them or register callbacks on buffer size changed.

Edit:
To implement this, we can let users of the runtime poller, like socket and os.File, provide a callback function pointer when open the poller for a fd, to notify them the readiness of I/O. The callback should
looks like:

type IOReadyNotify func(mode int32)

And we store this in the runtime.pollDesc, then the runtime.netpollready() function should also call this callback if not nil besides give out the pending goroutine(s).

aajtodd · 2017-02-27T23:46:35Z

I'm fairly new to Go but seeing the callback interface is a little grating given the blocking API exposed everywhere else. Why not expose a public API to the netpoll interfaces?

Go provides no standard public facing event loop (correct me if I'm wrong please). I have need to wait for readability on external FFI socket(s) (given through cgo). It would be nice to re-use the existing netpoll abstraction to also spawn FFI sockets onto rather than having to wrap epoll/IOCP/select. Also I'm guessing wrapping (e.g) epoll from the sys package does not integrate with the scheduler which would also be a bummer.

mjgarton · 2017-03-15T15:21:54Z

For a number of my use cases, something like this :

package net

// Readable returns a channel which can be read from whenever a call to c.Read
// would not block.
func (c *TCPConn) Readable() <-chan struct{} {
        // ...
}

.. would be nice because I can select on it. I have no idea whether it's practical to implement this though.

Another alternative (for some of my cases at least) might be somehow enabling reads to be canceled by using a context.

lesismal · 2022-09-14T03:33:20Z

I mean, gnet equals in performance the fastest code out there in c/cpp/rust/... so the performance is there, in Go

@ivanjaros
That's not right.
For HTTP, gnet uses a simple parser that does not implement full features of the HTTP protocol, so its testing code cost much less of cpu than a full-featured HTTP server. Here are the testing code: parser, response encoder
It seems not only gnet makes the test like this, but many frameworks join the github.com/TechEmpower/FrameworkBenchmarks like this. That's not the real performance reports, and it misleads many people about different frameworks' performance.

As we know, golang can't get the same performance as c/cpp/rust in most scenarios, but can get near to c/cpp/rust in some scenarios such as IO, and the most important is: goroutine and chan make us write code easier.

d2gr · 2022-09-14T04:17:30Z

Also I bet proxy like Traefik would get a nice performance gain from this as well. I mean, gnet equals in performance the fastest code out there in c/cpp/rust/... so the performance is there, in Go. It just needs to be unlocked.

@ivanjaros
Yeah, but the problem with frameworks like gnet or evio is that they are not usable in production. There might be cases where you can use it, even with codecs. They are too complicated to be used in serious environments. A pro-actor pattern library would be more suitable (like boost::asio).

Standard Go is quite performant, just take a look at fasthttp. It is built using standard Go. The only problem with Go is the amount of sync mechanisms that it requires. That's why I said that a pro-actor pattern might be faster for TLS & HTTP/2. Or any protocol that uses streams instead of one single stateless connection.

d2gr · 2022-09-14T04:32:46Z

As we know, golang can't get the same performance as c/cpp/rust in most scenarios, but can get near to c/cpp/rust in some scenarios such as IO, and the most important is: goroutine and chan make us write code easier.

@lesismal
I mean... Go is quite fast. I benchmarked fasthttp vs boost::beast quite a lot (in AWS 2xc5n.4xlarge, client and server in a placement group) and Go is able to handle 200K QPS below 5ms (100th percentile) and boost::beast was able to do 200K sometimes below 4.8ms other times below 5.5ms. So there's a lot of variance in boost::beast, whilst in Go I got the same result in all the benchmarks.
Now, the only problem with Go is that the more the connections the less performant the I/O becomes. And that's a limitation you should be aware of while building a system.

So it depends on how your Go program is structured. If it has locks, it doesn't. If uses channels, etc... In my benchmarks I had no locks, just plain HTTP (no TLS) with some caching system in both fasthttp & boost::beast.

I wouldn't say that Go is less performant than C++ and Rust. It also depends on the library that you use. boost::beast is ok, it does scale well, but it is not as latency sensitive as you might think (truth is that you can easily plug in solarflare's onload in boost::asio and get a performance improvement), and Rust is not as different from Go. They also use coroutines and they also need to lock in some scenarios but they have the advantage of not having a GC ("advantage").

Bechmarks are mostly a lie. People prepare their programs for the benchmark in question (like gnet's case). Production ready environments don't need to lie in their benchmarks (fasthttp)

lesismal · 2022-09-14T05:30:09Z

@d2gr
I think proactor is not the key point, but are the num of goroutines, and blocking or non-blocking.
I've tried a lot about a non-blocking based HTTP server, I think we can't gain both high performance and high online together using golang by now:

For std-based frameworks that use net.Conn(blocking IO interface), including fasthttp, the cost of hardware grows fast as the num of connections increases, because they all use at least one goroutine for each connection, including fasthttp. It's hard to reduce the cost for gc, memory and schedule.
For non-blocking frameworks, no matter whether it's reactor or proactor, we need to handle IO and logic in different goroutine pools. That needs more heap escape, more complex async parser, and we can not optimize the buffers as fasthttp does.

d2gr · 2022-09-14T05:43:12Z

@lesismal
I mention pro-actor because it's the easiest way to handle async (unlike async/await, or reactor). I agree that we need less heap and more stack-based structures. I don't know what do you mean with async parser.

ivanjaros · 2022-09-14T06:06:06Z

That's not right. For HTTP, gnet uses a simple parser that does not implement full features of the HTTP protocol, so its testing code cost much less of cpu than a full-featured HTTP server.

you are mixing apples and oranges here. nobody is talking about http servers here. gnet is merely networking framework(like ALL the projects mentioned before). what you build on top of it is up to you.

lesismal · 2022-09-14T06:35:01Z

@d2gr
We need to cache half-packet bytes because we can't use ReadFull. The parser and buffer usage logic are much more complex than net.Conn based frameworks.

@ivanjaros Please refer to the reasons I've mentioned here and in previous comments.

Here are some benchmark reports, you can run the test in your own env:
lesismal/go-net-benchmark#1
I get the same level of performance between nbio and gnet, but I support a lot more in nbio than gnet. I tried a lot to optimize the performance and reduce the cost, but I can achieve only balance between performance and cost reduce, can't gain both of them together.
That's not only about HTTP!
For simple IO logic, we do gain a good performance, but for product env, for the complex logic, the reasons I mentioned make the performance down.
You can try it, I will be glad to see if you can find some way to make more promotions.

bcmills · 2022-09-14T17:51:25Z

@lesismal, @ivanjaros, @d2gr: it isn't clear to me how the above discussion relates to the feature proposed in this issue. For off-topic performance discussions, please start a thread on the golang-dev mailing list or a similar venue outside of the issue tracker.

d2gr · 2022-09-14T17:53:54Z

@bcmills Sorry for spamming a bit, but my comments where related to the issue. At least this one #15735 (comment)

lesismal · 2022-09-15T02:51:23Z

@bcmills
Sorry about that.
But I think the CanRead does related to non-blocking interfaces and performance. If the new feature doesn't consider these points, the new feature will not be useful and should not be added.
If the new feature provides only a CanRead but still blocking Read/Write, there will be problems like @ivanjaros 's for-loop or gobwas/ws.

My previous related comments:

My opinion is that if provide Readable, non-blocking Read/Write interface should be provided together, else Readable will not be useful.

If conn.CanRead() is blocking interface, one connection's blocking makes all the other connections wait for long in the loop, even if they have been readable already. If conn.CanRead() is non-blocking interface, that for loop will cause cpu 100%.

lesismal · 2022-09-15T03:42:45Z

@bcmills
Actually, CanRead interface is just the same thing that 1m-go-websockets and gobwas/ws did. That's the smallest import for event-driven on std's TCPConn, but as we've discussed, it leads to the problem:
gobwas/ws#143
That's the same paroblem as the for-loop-block we discussed in previous comments.
To solve this problem in gobwas/ws, we still need to serve each connection with at least 1 goroutine, which come back to the solution of the current std but is more complex. Then, there's no benefit, seems and performs even worse.
So, if Read/Write interfaces are still blocking mod, I would prefer you maintainers just keep it not changed rather than add a new CanRead interface.

Also that's why I hope if you add CanRead, please add non-blocking Read/Write interfaces together. And then, one more thing, the non-blocking interfaces and separate goroutine pool make the performance worse than the current std if there are not a lot of connections, and that needs a lot of changes to the current TCPConn, I think that should be also considered before this proposal is accepted.
But all right, I will stop discussing performance, I just focus on the CanRead, blocking or non-blocking.

AnimusPEXUS · 2023-07-26T21:17:41Z

just add those functions to net.Conn:

ReadAvailable() bool
WriteAvailable() bool
ReadNonBlocking(b []byte) (n int, err error)
WriteNonBlocking(b []byte) (n int, err error)

ianlancetaylor · 2023-07-26T21:43:17Z

@AnimusPEXUS Thanks, but that doesn't address this issue. This issue is about putting a goroutine to sleep until there is something to read.

AnimusPEXUS · 2023-07-26T22:28:59Z

@ianlancetaylor determining "if ther's something to read" is system-to-system specific question. for NIX* it's 'select' function. maximum you can do here for nix is make callback or push signal after select returned and if we want select non-blocking, then we have to put socket into non-blocking state. for non-socket/non-unix there only two options: endless loop, constantly checking readiness, or, again, callback if peer can somehow separately indicate 'I have something to read'

DemiMarie · 2023-07-27T22:33:10Z

I don’t think Go as it stands today is the best choice for this kind of code.

Optimized Rust and C++ servers use a combination of compiler-generated state machines (async/await and co_async/co_await, respectively) and manual memory management to achieve very high performance. Go does not support either of these features, so achieving similar performance in Go would require essentially writing C with Go syntax.

ianlancetaylor · 2023-07-27T22:36:40Z

@AnimusPEXUS There is a reason that this issue is still open.

bradfitz added this to the Go1.8 milestone May 18, 2016

bradfitz self-assigned this May 18, 2016

gopherbot pushed a commit that referenced this issue May 19, 2016

net: don't return io.EOF from zero byte reads

5bcdd63

Updates #15735 Change-Id: I42ab2345443bbaeaf935d683460fc2c941b7679c Reviewed-on: https://go-review.googlesource.com/23227 Reviewed-by: Ian Lance Taylor <iant@golang.org>

quentinmit added the NeedsDecision Feedback is required from experts, contributors, and/or the community before a change can be made. label Oct 7, 2016

bradfitz modified the milestones: Go1.9, Go1.8 Oct 21, 2016

garyburd mentioned this issue Dec 15, 2016

write buffer pooling gorilla/websocket#192

Closed

bradfitz mentioned this issue Jan 6, 2017

os: use non-blocking I/O for pollable files automatically #18507

Closed

ALTree mentioned this issue Feb 2, 2017

net: skip excessive read calls #8891

Closed

lesismal mentioned this issue Oct 16, 2022

GoPool's performance in comparison to other goroutine pool implementations bytedance/gopkg#144

Open

atollena mentioned this issue Oct 31, 2022

High memory footprint per connection grpc/grpc-go#5751

Closed

jackc mentioned this issue Dec 6, 2022

Custom dial func causes pgx to hang indefinitely jackc/pgx#1413

Closed

tebruno99 mentioned this issue Dec 13, 2022

Examples to save memory using write buffer pool and freeing net.http default buffers ElasticPerch/epws#8

Open

jackc mentioned this issue Feb 27, 2023

net: TCP connection erroneously duplicates message on Windows #58764

Open

This was referenced Jun 21, 2023

"closing bad idle connection: unexpected read from socket" errors on MySQL 8.0.24 go-sql-driver/mysql#1392

Open

RFC: Disable CheckConnLiveness by default. go-sql-driver/mysql#1451

Closed

lesismal mentioned this issue Jul 28, 2023

proposal: syscall: extend the interface definition of RawConn to support non-blocking operations #61628

Open

rentziass mentioned this issue Sep 30, 2024

Pool TLS connections aren't long-lived anymore redis/go-redis#3137

Closed

ignoramous mentioned this issue Oct 19, 2024

Pool DoT connections celzero/firestack#105

Closed

net: add mechanism to wait for readability on a TCPConn #15735

net: add mechanism to wait for readability on a TCPConn #15735

Comments

bradfitz commented May 18, 2016 • edited Loading

bradfitz commented May 18, 2016

gopherbot commented May 18, 2016

RalphCorderoy commented May 20, 2016

bradfitz commented Oct 21, 2016

bradfitz commented Dec 12, 2016

ianlancetaylor commented Dec 12, 2016

bradfitz commented Dec 12, 2016

dvyukov commented Jan 6, 2017

dvyukov commented Jan 6, 2017

bradfitz commented Jan 6, 2017

dvyukov commented Jan 6, 2017

bradfitz commented Jan 6, 2017

dvyukov commented Jan 7, 2017

DemiMarie commented Jan 10, 2017

rsc commented Jan 10, 2017

dvyukov commented Jan 11, 2017

noblehng commented Feb 21, 2017 • edited Loading

noblehng commented Feb 22, 2017 • edited Loading

aajtodd commented Feb 27, 2017

mjgarton commented Mar 15, 2017

lesismal commented Sep 14, 2022 • edited Loading

d2gr commented Sep 14, 2022 • edited Loading

d2gr commented Sep 14, 2022 • edited Loading

lesismal commented Sep 14, 2022

d2gr commented Sep 14, 2022

ivanjaros commented Sep 14, 2022 • edited Loading

lesismal commented Sep 14, 2022

bcmills commented Sep 14, 2022

d2gr commented Sep 14, 2022

lesismal commented Sep 15, 2022

lesismal commented Sep 15, 2022 • edited Loading

AnimusPEXUS commented Jul 26, 2023

ianlancetaylor commented Jul 26, 2023

AnimusPEXUS commented Jul 26, 2023

DemiMarie commented Jul 27, 2023

ianlancetaylor commented Jul 27, 2023

bradfitz commented May 18, 2016 •

edited

Loading

noblehng commented Feb 21, 2017 •

edited

Loading

noblehng commented Feb 22, 2017 •

edited

Loading

lesismal commented Sep 14, 2022 •

edited

Loading

d2gr commented Sep 14, 2022 •

edited

Loading

d2gr commented Sep 14, 2022 •

edited

Loading

ivanjaros commented Sep 14, 2022 •

edited

Loading

lesismal commented Sep 15, 2022 •

edited

Loading