Opt in PanicHandler #646

edoshor · 2019-09-03T12:34:54Z

After recover() band-aids were removed (commit). Users were left with no possibility to act on a panic in a goroutine started inside fasthttp code. They can recover inside their own code. However, that might turn to be repetitive and cumbersome as they have to it on every handler, callback, custom interface implementation (ReadCloser) etc...

The discussion on this issue suggests that panic handling is the responsibility of the user. This PR allow the user to provide an opt-in callback which is called on a non nil return value from recover(). Users are encouraged to re-panic in their implementation to allow the process to crash completely and not leave fasthttp internals in some unexpected state.

Main use cases are error reporting and proper cleanup of resources upon an unexpected process termination.

Note: Currently, not all places creating goroutines are handled. I can add them later on if the general idea is accepted by the maintainers.

erikdubbelboer · 2019-09-07T07:59:32Z

I'm still not in favor of merging this. Have you ever had a panic caused by fasthttp crash your process? All panics I have seen came from the user code in the handler. It shouldn't be up to fasthttp to always catch this for you. There is nothing wrong with adding a defer recover at the top of your handler if you want to catch these panics.

What do you think @kirillDanshin?

edoshor · 2019-09-08T07:38:18Z

@erikdubbelboer I truly understand the lack of motivation for merging this. Having the user take responsibility is the correct way.

Our handlers are just like you recommended, i.e. defer recover at the top. However, we had panics inside a ReadCloser Close method (body stream). This method is invoked outside the scope of the handler so we can't recover there. We can recover inside the Close method but it feels cumbersome, repetitive and error prone. Moreover, these ReadClosers may be outside our control *(think some 3rd party package).

This opt-in callback would ease our job controlling the teardown of the process. Implementing one global function once, instead of constantly checking all flows and states beforehand.

erikdubbelboer · 2019-09-10T14:49:35Z

I'm wondering if Response.SetBodyStream and Response.SetBodyStreamWriter are the only functions that have this issue or if there are other things that I'm missing. (I sometimes which we didn't have such a HUGE api surface to maintain).

We already have Server.ErrorHandler, so I'm wondering if we should just use this instead of adding yet another property to Server. Of course we would then instead have to introduce a new error type that wraps the value returned by panic(). So in the end it would expose the same amount of new API to maintain.

kirillDanshin · 2019-09-17T23:58:19Z

server.go

+	if s.PanicHandler != nil {
+		defer func() {
+			s.cleanAfterHijackConn(r, c, hjc)
+			if r := recover(); r != nil {


this r shadows io.Reader defined above. I think we should rename this variable

kirillDanshin · 2019-09-18T00:55:44Z

workerpool.go

+	wp.workersCount--
+	wp.lock.Unlock()
+}
+
 func (wp *workerPool) workerFunc(ch *workerChan) {


I don't really like that this change will silently affect users with PanicHandler == nil as well.

if wp.PanicHandler != nil for == nil would at least execute redundant JZ, and the if wp.PanicHandler == nil below implies something like

JNZ FUNCRET; JMP WORKER_DONE; FUNCRET:

and that's all while the user doesn't even use this feature.

while I understand what (*wp).workerDone() is easier to read in the context of this feature, I'm not sure that this feature is really required, or at least that this implementation is optimal for both users who want the PanicHandler and who don't.

Again, I use fasthttp for several years now and I still didn't find a case when fasthttp would panic.

I'd like to hear some more thoughts about this PR.

I also think it's better if we just handle panics from Response.SetBodyStream and Response.SetBodyStreamWriter and use s.logger().Printf("... and return without writing anything more in the response as we don't know what's written already.

For people who want more control it's not that hard to wrap the reader and handle these panics themselves.

This way we don't expose any extra API again.

I agree with the above.

Moreover, #687 is definitely a step in the right direction for us. So closing this one.

edoshor · 2020-03-17T15:25:50Z

Closing as not sure it's the right way to go.

edoshor added 3 commits August 29, 2019 18:17

RecoverHandler - workerpool.go

caabce4

rename to PanicHandler

0690ed3

recover in hijackConnHandler

89af8d6

erikdubbelboer added pending/investigation pending/submitter-response labels Sep 7, 2019

erikdubbelboer removed the pending/submitter-response label Sep 10, 2019

kirillDanshin requested changes Sep 18, 2019

View reviewed changes

erikdubbelboer added pending/submitter-response and removed pending/investigation labels Sep 18, 2019

Bobochka mentioned this pull request Nov 3, 2019

Recover from panic in body write #687

Merged

edoshor closed this Mar 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Opt in PanicHandler #646

Opt in PanicHandler #646

edoshor commented Sep 3, 2019

erikdubbelboer commented Sep 7, 2019

edoshor commented Sep 8, 2019

erikdubbelboer commented Sep 10, 2019

kirillDanshin Sep 17, 2019

kirillDanshin Sep 18, 2019

erikdubbelboer Sep 18, 2019

edoshor Mar 17, 2020

edoshor commented Mar 17, 2020

Opt in PanicHandler #646

Opt in PanicHandler #646

Conversation

edoshor commented Sep 3, 2019

erikdubbelboer commented Sep 7, 2019

edoshor commented Sep 8, 2019

erikdubbelboer commented Sep 10, 2019

kirillDanshin Sep 17, 2019

Choose a reason for hiding this comment

kirillDanshin Sep 18, 2019

Choose a reason for hiding this comment

erikdubbelboer Sep 18, 2019

Choose a reason for hiding this comment

edoshor Mar 17, 2020

Choose a reason for hiding this comment

edoshor commented Mar 17, 2020