
Replace Celery with RQ (except for execute_query tasks) #4093

Merged: 74 commits merged into master on Oct 15, 2019

Conversation

@rauchy (Contributor) commented Aug 21, 2019

What type of PR is this? (check all applicable)

  • Refactor

Description

As a first probe towards migrating to RQ, I'm interested in trying to swap out Celery beat and all non-execute-query tasks and replace them with RQ.

Related Tickets & Documents

Towards #4092

build: .
command: dev_rq_worker
deploy:
  replicas: 2
Contributor Author

Left this here for discussion: to allow multiple workers running simultaneously with docker-compose, one would have to run docker-compose --compatibility up.

Member

Isn't deploy only for Docker Swarm deployments? Have you considered using the scale option?

Contributor Author

If I understand correctly, the scale option is deprecated. One could use the --scale CLI option, which has the same effect as using --compatibility. I just saw the deploy.replicas option as better documentation for anyone who wants to scale through docker-compose.

Member

You're right. They deprecated it in v3 😒 I fail to see the logic behind this move, especially when they recommend not using --compatibility in production.

Considering we have no intention of using Swarm -- maybe we should switch to v2?

Contributor Author

We have at least one thing we need from v3.2 - bind volumes (for proper code reloading), so v2 is not an option. I guess it'll be best to recommend the --scale option outside this file. Where do you think would be the best place to do so?

Member

Do you remember what the difference is between how we define the volume for server and worker? I know you added it when working on auto restart, but from looking at the docs I fail to understand the difference.

If it is required, we can consider downgrading the production compose file to v2, as the scale configuration is more important there.

Contributor Author

server uses the default volume type, because Flask's auto reloading works fine with it. For worker (or anything we use watchmedo for) we need a bind volume type for FSEvents to play nicely with macOS.

We could definitely use v2 for production compose files though.

Member

Ok, as this is all interim -- let's move on and revisit before migrating everything to RQ.

Contributor Author

OK, for now I've removed the deploy.replicas key.

from rq import Connection, Worker

def worker(queues='default'):
    # redis_connection is the app's existing Redis connection object
    with Connection(redis_connection):
        w = Worker(queues)
        w.work()
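
For context, a minimal sketch of how jobs would reach a worker like this, using the standard rq API (the queue name and dotted task path below are illustrative, not redash's actual code):

from redis import Redis
from rq import Queue

# Push a job onto a named queue; a worker started as above and listening
# on the same queue name will pick it up and execute it.
queue = Queue('default', connection=Redis())
queue.enqueue('myapp.tasks.send_report', report_id=42)  # dotted path is hypothetical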
Member

I should note that there is flask-rq2, which provides a Flask-friendly API for this.

Member

(I'm its author)
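
For reference, a rough sketch of what the Flask-RQ2 approach would look like, based on its documentation (not what this PR ended up using):

from flask import Flask
from flask_rq2 import RQ

app = Flask(__name__)
rq = RQ(app)  # reads the RQ_REDIS_URL setting from the Flask config

@rq.job
def add(x, y):
    return x + y

add.queue(1, 2)  # enqueues the decorated function as an RQ job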

(Review threads on redash/__init__.py, requirements.txt, and redash/cli/rq.py were marked resolved.)
Omer Lachish added 2 commits September 22, 2019 13:24
…le using docker-compose would be the --scale CLI option, which will be described in the knowledge base
@arikfr (Member) commented Oct 10, 2019

@NicolasLM We're planning on merging this soon into master. Anything we can do to make your life easier post the merge?

@NicolasLM (Contributor)

> @NicolasLM We're planning on merging this soon into master. Anything we can do to make your life easier post the merge?

Thank you for the heads up. I don't expect much trouble with running this code on Python 3; the bulk of the work will probably be resolving conflicts.

@rauchy (Contributor Author) commented Oct 15, 2019

> If we can't just directly use the function, why not return that definitions dict in this function and call schedule in redash.schedule?

@arikfr yeah that makes sense. I've used the opportunity to schedule standard jobs in the same way to make both invocations appear the same in 4fb49c0.
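
A rough, self-contained sketch of the pattern being discussed, i.e. describing periodic jobs as plain data and scheduling them in one place with rq-scheduler (the function names and intervals here are illustrative, not the PR's actual code):

from datetime import datetime

from rq_scheduler import Scheduler


def version_check():
    pass  # placeholder for a real periodic task


def send_aggregated_errors():
    pass  # placeholder for a real periodic task


def periodic_job_definitions():
    # Each entry describes one periodic job as plain data.
    return [
        {"func": version_check, "interval": 86400},
        {"func": send_aggregated_errors, "interval": 3600},
    ]


def schedule_periodic_jobs(connection):
    # Schedule everything in one place (e.g. redash.schedule).
    scheduler = Scheduler(connection=connection, queue_name="periodic")
    for job in periodic_job_definitions():
        scheduler.schedule(
            scheduled_time=datetime.utcnow(),
            func=job["func"],
            interval=job["interval"],
        )

In this shape, something like schedule_periodic_jobs would presumably be called once at startup with the app's Redis connection, and both the standard and dynamically defined jobs go through the same code path.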

@arikfr (Member) commented Oct 15, 2019

When we merge this we should post an update on Development mentioning that master is in a hybrid state at the moment and not recommended for general consumption.

@rauchy rauchy merged commit 5a5fdec into master Oct 15, 2019
cleanup_query_results,
version_check, send_aggregated_errors)

rq_scheduler = Scheduler(connection=redis_connection,
Member

In Python 3 we use decode_responses=True, while rq-scheduler expects it to be False and uses decode on values it receives:

https://github.com/rq/rq-scheduler/blob/061488e79b82af215f7f98fa1554a0aa84dfd62f/rq_scheduler/scheduler.py#L321

When porting this to Python 3, we should probably create a Redis connection for it without the decode_responses change 🤔 We can create a "factory" function for creating the Redis connection and move the logic of mangling the Redis URL there (and make it optional).

Another option to consider is to revert to not decoding responses and do it where needed. It just felt redundant, so I was hoping to avoid it.
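
A minimal sketch of the factory idea mentioned above, assuming redis-py's from_url (the function and variable names are illustrative, not redash's actual helpers):

import redis


def create_redis_connection(url, decode_responses=True):
    # One place to mangle the Redis URL if needed; decoding is optional so
    # rq-scheduler can get a connection with decode_responses=False.
    return redis.from_url(url, decode_responses=decode_responses)


# Application code keeps decoded (str) responses...
redis_connection = create_redis_connection("redis://localhost:6379/0")
# ...while rq-scheduler gets raw bytes, matching what it calls .decode() on.
rq_scheduler_connection = create_redis_connection(
    "redis://localhost:6379/0", decode_responses=False)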

@rauchy @NicolasLM
