nsqd: support deferred PUB #293

mreiferson · 2015-05-01T18:07:51Z

This came up on the mailing list: https://groups.google.com/forum/#!topic/nsq-users/HkoOMygaHK0 - just opening the issue here for others to weigh in and gauge interest.

This would allow you to PUB a message that goes directly to the deferred priority queue with the specified delay.

Also, It's complicated a bit by #34.

mreiferson · 2014-01-25T17:48:11Z

@jayridge always wanted this

jayridge · 2014-01-25T19:44:33Z

this feature might persuade me to use nsq for my current project.

mreiferson · 2014-01-26T00:48:02Z

😈

There are many questions. Would be curious to hear about the use case @jayridge.

Given the fact that you PUB to a topic and topics do not currently have any concept of a deferred queue, what would be the expected behavior?

Option 1

Add a deferred queue to topics.
If a message is published with a defer it goes directly to said queue.
Every channel for that topic would receive the message delayed.

Is this desirable?

Option 2

Specify a defer delay at the channel level (ala #ephemeral)
All messages on that channel are deferred
No way for a publisher to specify a defer
Some channels on a topic could receive the message immediately

The problem with this approach is that, when publishing, there is no concept of channel and thus no way for a publisher to be able to say, on a per-channel level, to defer. This leaves channel creation as the only obvious implementation.

jayridge · 2014-01-26T17:45:35Z

The publisher should be able to specify defer timeout per message.

Primary use case is checking on a tasks progress where task is created by other party. I have also used this feature ( visibility? ) in sms where I needed a one shot after.

If there are only two options, I like option un.

deedubs · 2014-02-21T05:24:24Z

@mreiferson A use case:

We have tasks we assign to people, I'd like to be able to push a message onto the queue to check if the user has completed it in the allotted time or if it should be unlocked and made available again.

mreiferson · 2014-02-21T05:34:25Z

thanks @deedubs

For the record, you can already implement what you describe "outside the core" by embedding your defer timeout in your message body and having your consumer read that field and, if the deadline hasn't expired, REQ the message with the appropriate defer.

The downside of this approach is the overhead of having to receive the message the first time in order to defer it (and because it is consumer controlled rather than producer controlled).

In my mind, this issue is about justifying making this functionality "first class"... i.e. are the downsides I just listed really that bad vs. the complexity of the proposed changes inside nsqd?

tj · 2014-03-17T18:12:07Z

+1 we could use this as well, I was planning on going with the REQ technique. That should work fine I guess the only upside to adding official support is some visibility in nsqadmin, not sure if it's worth it or not but thought I would mention we'll be using it for deferred work as well :D

AlekSi · 2014-05-19T09:26:17Z

+1

chaosue · 2014-07-31T06:53:57Z

+1 we currently works with beanstalkd thats has this feature which is very useful in our production.

victorquinn · 2014-09-03T17:17:03Z

FYI a workaround we used was to create a delay topic, push messages into that delay topic with the name of the topic we actually want it to end up in along with the data it needs and the time at which it should be delayed until. For example, we would send a message like this to our delay topic:

{
    "timestamp": 1409763037301,
    "topic": "<actual_topic_for_message>",
    "data": "<data_for_this_message>"
}

The timestamp is an absolute timestamp, in milliseconds calculated at the time we put the item into the queue which represents the time at which it is supposed to be delayed until. This timestamp is in GMT so it is agnostic of timezone and is necessary up front because the data in the queued item is immutable so we must calculate it at time of publish (in other words there is no good way to have it be a relative timestamp, e.g. do this in 15000ms).

When a channel listening on the delay topic gets one of these messages, the first thing it does is inspect the body to see if the timestamp for this message is later than the current time. If so, it triggers a requeue for the difference. If not, it publishes the new message to the end topic.

Perhaps best illustrated by example. So if it is currently 2:45pm and I want to send an email in 15 minutes (by sending a message to the email topic), I actually send the following message to the delay topic:

{
    "timestamp": 1409763037301,
    "topic": "email",
    "data": {
        "to": "victor@socialradar.com",
        "subject": "Test",
        "message": "Hey, this is a test"
    }
}

Where the timestamp is 3:00pm in GMT, the time until which this message is supposed to be delayed.

The consumer will get it, inspect it, notice that it is currently 2:45pm and the message is meant to go out at 3:00pm so the consumer will requeue the message for 15 minutes in the future. Then at 3:00pm when that requeued message is picked up again by this consumer, it will publish it to the appropriate topic, in this case email.

So a simplified version of our code that listens to the delay topic looks a bit like this (this happens to be Node.js, using the nsq.js module):

reader.on("message", function(message) {
    var body = message.json(), now = new Date().getTime();

    if (body.timestamp > now) {
        message.requeue(body.timestamp - now);
    } else {
        writer.publish(body.topic, body.data);
    }
});

This is how we implemented an "initial delay" and it may be useful for anyone else trying to do the same unless or until it arrives in core NSQ. This is basically a more concrete example of what @mreiferson described above.

zygis · 2014-11-25T19:58:20Z

+1

jnankin · 2015-01-27T03:42:27Z

+1

mreiferson · 2015-05-01T18:10:20Z

Here goes nothing... thoughts on this approach?

mreiferson · 2015-05-01T18:18:15Z

(the tests are failing because of a dependency on go-nsq changes, which is unfortunate)

jehiah · 2015-05-01T19:02:48Z

actually comes out pretty cleanly. 👍

mreiferson · 2015-05-02T23:34:16Z

@jehiah added HTTP support. Note, we're explicitly not exposing deferred pub for MPUB varieties. These tests should pass.

jehiah · 2015-05-03T18:38:46Z

👍 thoughts on if/how this should be exposed in nsq_to_nsq?

nsqd: support deferred PUB

mreiferson · 2015-05-03T18:54:20Z

thoughts on if/how this should be exposed in nsq_to_nsq?

Do we need to support it there? Arguably, it's already got too many knobs. Supporting this at all was already a question, having to write your own consumer to use it when copying streams around is too much to ask? 😁

jehiah · 2015-05-03T19:03:35Z

Yeah i hear ya. I mean the obvious answer is it's the one spot where you would want to have a --defer-msg=60s type option.

Since this isn't a "reader option" it would need to be exposed directly, and we don't (yet?) support config files there so ... I'm in favor of adding there.

jayridge · 2015-05-03T19:15:58Z

its like christmas boys

mreiferson · 2015-05-03T19:36:11Z

we've been had

zygis · 2015-05-06T15:13:45Z

How about MDPUB or DMPUB (Multi Deferred Publish)?

mreiferson · 2015-05-06T17:10:49Z

@zygis I alluded to this in my comment above, but I intentionally didn't want to expose a means to defer en masse a large set of messages. In the current implementation, nsqd keeps deferred messages entirely in memory (i.e. they do not overflow to disk).

zygis · 2015-05-07T09:54:16Z

Well...
1000xDPUB - same result with ~1000 network calls
1xMDPUB - same result with ~1 network call

When deferred duration is few seconds maybe this limitation makes sence, but if duration is minutes or even hours I think this limitation solves nothing.

mreiferson · 2015-05-07T14:16:37Z

@zygis I realize that. I'm concerned with the behaviors that we encourage here. I can be convinced, it just didn't seem like the obvious use case.

jehiah · 2015-05-07T15:29:56Z

@zygis if the client connection buffers the socket there is little network difference between those two different protocol messages. You can pipeline a thousand DPUB just like a MDPUB would effectively provide. (there is minimal difference in command overhead between the two approaches)

zygis · 2015-05-07T16:22:52Z

@mreiferson I just do lot of writes to DB and via nsq notifying rest part of system do some computations. The main problem is that reads are done from slaves in other continent, and some times written data still not available, because of cross-continent replication lag. So DPUB is acceptable solution. But few weeks ago switched from PUB(client nsq.js) to MPUB(client go-nsq) because of networking problems between servers... MDPUB could be perfect solution.

@jehiah not sure how it can be done with go-nsq client

jehiah · 2015-05-07T16:28:58Z

@zygis What go-nsq needs to expose for you is the equivlant of Producer.DeferredPublishAsync. The *Async commands allow you to pipeline multiple requests for throughput.

zygis · 2015-05-07T16:32:35Z

@jehiah Thanks, I will try this.

jonathannorris · 2015-09-17T21:19:55Z

I've been interested in using nsq for a while, we have a potential use case coming up that requires queuing of millions of deferred messages for up to a day or so. I was wondering if there are any potential performance pitfalls with this, other then nsq storing millions of messages in memory (and not backing them up on disk). It's probably not the right solution for the problem, but I'm interested to hear from @jehiah @mreiferson on this.

ploxiln · 2015-09-17T21:41:26Z

just a general purpose PSA: For scheduling functionality, you could use a database, with an "expires" field/column, with an index on that field/column. Have a process query the database once a minute or so for "expires < $now", process those results, then delete those rows/documents. You can limit the query to return, say, 100 results, and you can re-run the query immediately after processing&deleting if there were any results.

That's roughly what's done for a few parts of @jayridge's current project, and he grudgingly uses nsq as well anyway ;)

mreiferson · 2015-09-20T15:39:55Z

@jonathannorris there aren't any other performance implications, there's just the durability question. Whether it's the right solution to the problem depends on your requirements... i.e. can you afford to lose messages that are in memory?

mreiferson · 2015-09-20T15:40:52Z

That's roughly what's done for a few parts of @jayridge's current project, and he grudgingly uses nsq as well anyway ;)

I think that's a valid statement for any technology @jayridge uses.

jayridge · 2015-09-20T15:50:26Z

true that

captainblue2013 · 2016-08-05T07:19:26Z

So ? Can anyone tell me how to use DPUB ?
There is no concept of DPUB in http://nsq.io/components/nsqd.html#post-pub

mreiferson · 2016-08-05T14:45:27Z

@captainblue2013 we should obviously document this - just pass the ?defer=x param to /pub, see https://github.com/nsqio/nsq/blob/master/nsqd/http.go#L226-L237

captainblue2013 · 2016-08-08T07:18:06Z

@mreiferson I did it after read the source code

kgdev · 2017-01-12T09:23:37Z

I found a potential bug, I switched our code from using PUB to DPUB recently, and the message got handled by the client within the specified period successfully, which was good.
However, when I check the status in localhost:4171, the "Requeued" count for that channel keeps increasing (which is not the case for PUB).

Can someone take a look into it?

sakop · 2017-01-12T09:43:31Z

@kgdev It is very easy to reproduce this bug, just exec this

curl -d "aDa" 'http://127.0.0.1:4151/pub?defer=1000&topic=aa'
and you will see Requeued keeps increasing while Messages remains 0

ploxiln · 2017-01-12T16:02:56Z

This makes sense because nsqd treats it just like a message on which the consumer called "requeue(delay=...)". It could be improved - but for that you should definitely open a new separate issue.

(Also, nsqd was not designed as a scheduling service, and thus has a few other problems with this use case.)

kgdev · 2017-01-12T16:17:06Z

Agree. I prefer not using "deferred" enqueue.

…

On Thu, Jan 12, 2017 at 6:03 PM, Pierce Lopez ***@***.***> wrote: This makes sense because nsqd treats it just like a message on which the consumer called "requeue(delay=...)". It could be improved - but for that you should definitely open a new separate issue. (Also, nsqd was not designed as a scheduling service, and thus has a few other problems with this use case.) — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#293 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AGDeqEL55dxFE7GtPJ4pIyK9VReYtGj6ks5rRk67gaJpZM4BciNL> .

mreiferson · 2017-01-12T16:18:27Z

@kgdev I believe this accounting issue was resolved in #805

mreiferson mentioned this pull request Feb 6, 2014

nsqd: message TTL #302

Open

mreiferson mentioned this pull request May 20, 2014

Add ability to delay new message #351

Closed

AlekSi mentioned this pull request May 27, 2014

nsqd: arbitrary 1 hour limit for REQ timeout #348

Closed

jehiah mentioned this pull request Jun 6, 2014

Ability to delay new messages? dudleycarr/nsqjs#25

Closed

saidimu mentioned this pull request Feb 18, 2015

Add deferred message support to NSQ library saidimu/newscuria#25

Open

mreiferson mentioned this pull request May 2, 2015

DPUB nsqio/go-nsq#139

Merged

mreiferson force-pushed the dpub_293 branch from 1ca005e to 4342d36 Compare May 2, 2015 23:33

mreiferson force-pushed the dpub_293 branch 3 times, most recently from ba691d7 to 287d57b Compare May 3, 2015 14:23

mreiferson added 2 commits May 3, 2015 07:25

nsqd: add DPUB

c28d188

build: bump to go-nsq v1.0.5-alpha for DPUB

2fe831c

mreiferson force-pushed the dpub_293 branch from 287d57b to 2fe831c Compare May 3, 2015 14:25

jehiah added a commit that referenced this pull request May 3, 2015

Merge pull request #293 from mreiferson/dpub_293

a63ec33

nsqd: support deferred PUB

jehiah merged commit a63ec33 into nsqio:master May 3, 2015

mreiferson deleted the dpub_293 branch May 3, 2015 18:51

wtolson mentioned this pull request May 3, 2015

Add support for DPUB wtolson/gnsq#5

Closed

deoxen0n2 mentioned this pull request Oct 9, 2016

docs: DPUB #795

Closed

nsqd: support deferred PUB #293

nsqd: support deferred PUB #293

Conversation

mreiferson commented May 1, 2015

mreiferson commented Jan 25, 2014

jayridge commented Jan 25, 2014

mreiferson commented Jan 26, 2014

Option 1

Option 2

jayridge commented Jan 26, 2014

deedubs commented Feb 21, 2014

mreiferson commented Feb 21, 2014

tj commented Mar 17, 2014

AlekSi commented May 19, 2014

chaosue commented Jul 31, 2014

victorquinn commented Sep 3, 2014

zygis commented Nov 25, 2014

jnankin commented Jan 27, 2015

mreiferson commented May 1, 2015

mreiferson commented May 1, 2015

jehiah commented May 1, 2015

mreiferson commented May 2, 2015

jehiah commented May 3, 2015

mreiferson commented May 3, 2015

jehiah commented May 3, 2015

jayridge commented May 3, 2015

mreiferson commented May 3, 2015

zygis commented May 6, 2015

mreiferson commented May 6, 2015

zygis commented May 7, 2015

mreiferson commented May 7, 2015

jehiah commented May 7, 2015

zygis commented May 7, 2015

jehiah commented May 7, 2015

zygis commented May 7, 2015

jonathannorris commented Sep 17, 2015

ploxiln commented Sep 17, 2015

mreiferson commented Sep 20, 2015

mreiferson commented Sep 20, 2015

jayridge commented Sep 20, 2015

captainblue2013 commented Aug 5, 2016

mreiferson commented Aug 5, 2016

captainblue2013 commented Aug 8, 2016

kgdev commented Jan 12, 2017 • edited Loading

sakop commented Jan 12, 2017 • edited Loading

ploxiln commented Jan 12, 2017

kgdev commented Jan 12, 2017 via email

mreiferson commented Jan 12, 2017

kgdev commented Jan 12, 2017 •

edited

Loading

sakop commented Jan 12, 2017 •

edited

Loading