[#2977] Basic implementation of background jobs via python-rq #3165
Conversation
You rock @torfsen 🍻

Don't put anything on

Are you envisaging Redis being a hard requirement?

I think we all agreed to accept Redis as a hard requirement when this work is merged.

Yup. It will also allow us to use Flask-Cache/plain redis as a general cache in core CKAN instead of stuffing everything into Solr.

@amercader I've moved the connection test to
Force-pushed from f68d434 to c6fcb06.
To Do:
@TkTech You mentioned that you want to use Redis for other parts of CKAN, too. How should we let the user configure the Redis connection? Since we're going to use 3rd-party code that uses Redis we probably want to put each distinct usage (background jobs, Flask cache) into a separate Redis database to make sure that they don't interfere with each other. Hence I suggest that we only add a single configuration option,

We need to support it all in a single database. People using hosted redis, or odd hosting setups may not be able to use

There's no problem allowing all our uses to live in one database so long as everything is prefixed.

@TkTech Good point regarding hosted Redis. RQ prefixes its keys, so interference should be avoidable. It also seems to empty queues intelligently. However, it seems that the prefix is not configurable, so we cannot add the CKAN site ID. This means that two CKAN instances can share a Redis instance but must use separate databases. We still have to be cautious, though: for example, ckanext-harvest simply flushes the whole Redis database to empty its queues. This is a trivial example, since we control that code, but we must be careful not to miss something like that in 3rd-party code.

https://github.com/nvie/rq/blob/766bb600060c85e8a39579541798930bbb508ec8/rq/queue.py#L31 Looks like we can "configure" the prefix this way. We should definitely update the plugins we control (no good plugin should be wiping the entire keyspace! Always use prefixes in redis), but for 3rd-party plugins we shouldn't consider that our problem. Document it and assume good behavior.
@TkTech Unfortunately there doesn't seem to be such a setting for the prefix of job keys, that seems to be hard-coded. Update: There's a PR for that feature: rq/rq#685. |
It just occurred to me that we can also simply use the CKAN site ID as the queue name for RQ. |
Regarding compatibility with ckanext-harvest: See ckan/ckanext-harvest#257 and the corresponding PR ckan/ckanext-harvest#258. |
Do we need support for multiple queues? #2706 had them, and ckanext-harvest also uses two separate queues (gather and fetch). This PR currently only supports a single queue, but it would be easy to support an arbitrary number of named queues instead. Workers could then listen on all or a certain subset of the queues. @rossjones and @amercader, what was the rationale behind the multiple-queue functionality in #2706 and ckanext-harvest, respectively?

I had multiple queues in that PR because on data.gov.uk we use different queues for different-priority tasks. That way we can have a busy low-priority queue, but still get the small tasks out pretty quickly. FWIW I'd support multiple queues if it wasn't much work, but I wouldn't let it hold you up.
Force-pushed from 917fe7b to 3f5b46f.
Multiple queues are now supported. By default there is just a single queue. Queue names are automatically prefixed with the site ID so multiple CKAN instances can now share a Redis database.
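The prefixing scheme can be sketched roughly like this. Note that this is an illustrative sketch only: the exact prefix format and the hard-coded site ID are assumptions, and in CKAN the site ID would come from the `ckan.site_id` configuration option. The function names mirror `add_queue_name_prefix` and `remove_queue_name_prefix` from the PR's docs.

```python
# Sketch of site-ID queue-name prefixing (prefix format is an assumption).
# Namespacing every queue name by the site ID lets two CKAN instances
# share one Redis database without their queues colliding.

SITE_ID = u'my_ckan_site'  # in CKAN this comes from `ckan.site_id`


def add_queue_name_prefix(name, site_id=SITE_ID):
    # Turn a plain queue name into its site-specific Redis name.
    return u'ckan:{}:{}'.format(site_id, name)


def remove_queue_name_prefix(name, site_id=SITE_ID):
    # Inverse of add_queue_name_prefix; fails loudly on foreign names.
    prefix = u'ckan:{}:'.format(site_id)
    if not name.startswith(prefix):
        raise ValueError(u'{!r} is not prefixed with {!r}'.format(name, prefix))
    return name[len(prefix):]


assert remove_queue_name_prefix(add_queue_name_prefix(u'default')) == u'default'
```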
4472541
to
2aadcb6
Compare
This output doesn't go into the CKAN logs but into a separate worker log (`/var/log/ckan-worker.log` if the default Supervisor configuration is used).
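A Supervisor program section for such a worker might look like the following. This is a hypothetical sketch: the paster binary path and config file location are assumptions, while the `paster jobs worker` command and the `/var/log/ckan-worker.log` path come from the setup described above.

```ini
; Hypothetical Supervisor section for a CKAN background job worker.
; Adjust the virtualenv and config paths to match your installation.
[program:ckan-worker]
command=/usr/lib/ckan/default/bin/paster --plugin=ckan jobs worker --config=/etc/ckan/default/production.ini
stdout_logfile=/var/log/ckan-worker.log
stderr_logfile=/var/log/ckan-worker.log
autostart=true
autorestart=true
```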
In most cases one can simply use `tempfile.NamedTemporaryFile` instead.
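For example, `NamedTemporaryFile` used as a context manager removes the file automatically, so tests need no manual cleanup logic:

```python
import os
import tempfile

# The file is deleted automatically when the with-block exits,
# even if the code inside raises an exception.
with tempfile.NamedTemporaryFile(mode='w+', suffix='.log') as f:
    f.write(u'job output\n')
    f.flush()
    path = f.name
    assert os.path.exists(path)

assert not os.path.exists(path)  # gone after the with-block
```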
Mention that it's usually best to simply use the `ckan.tests.recorded_logs` context manager.
The documentation previously talked about sharing databases which could be misunderstood as the possibility of sharing PostgreSQL databases between CKAN instances (instead of sharing Redis databases, which was the intended meaning).
A previous commit for ckan#2977 removed `ckan/config/celery-supervisor.conf` as part of deprecating the old Celery background task system. However, the old documentation told people to copy *or link* that file, so removing it could break existing installations. Hence this commit restores the file; it should be kept around until support for the Celery system is removed completely.
Force-pushed from 5a21108 to 18e140b.
Thanks for the review and good idea regarding

I've also restored
the name of your extension. For example:

* The names of *configuration settings* introduced by your extension should
  have the form ``ckan.my_extension.my_config_setting``.
The two patterns I've seen for configuration options are `ckanext.my_extension.my_config_setting = ...` and `my_extension.my_config_setting = ...`. I choose the latter for my extensions because I find the configuration options are really long with the former. Anything starting with `ckan.` should belong to ckan core.
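The two patterns side by side, as they would appear in a CKAN ini file (the extension and setting names are placeholders):

```ini
; Longer form, prefixed with "ckanext":
ckanext.my_extension.my_config_setting = some value

; Shorter form, preferred here -- just the extension name:
my_extension.my_config_setting = some value

; Avoid the "ckan." prefix: it should be reserved for CKAN core, e.g.:
; ckan.site_id = default
```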
I've changed it to `my_extension.my_config_setting`.
The docs now suggest to use `my_extension.my_setting` instead of the previously suggested `ckan.my_extension.my_setting`.
prefixed names. Use the functions ``add_queue_name_prefix`` and
``remove_queue_name_prefix`` to manage queue name prefixes.

.. versionadded:: 2.6
please update these to 2.7
Fixed.
import logging

from pylons import config
Since #3163 was merged we should never import `config` directly from Pylons anymore. Use `from ckan.common import config`. My fault for not documenting / announcing it.
No problem, fixed.
@torfsen This looks really, really good. Thanks for being so thorough with the tests and docs. Apart from the minor comments added it looks good to me. Can't wait to try it with ckanext-harvest!

Any objections to merging this now?

No, let's get it in!

Yay, great work everybody!

@torfsen I wrote a tiny stub of an extension that uses this code https://github.com/wardi/ckanext-geocodejob but it seems rq's forking is interfering with ckan's sqlalchemy state. My first job works but jobs after that fail with

Can we wrap the job running with code that cleanly closes our database connections?
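One possible shape for such a wrapper is sketched below. This is purely illustrative: the `dispose_connections` stand-in plays the role that something like SQLAlchemy's `Engine.dispose()` would play in CKAN, dropping pooled connections inherited from the parent process so each forked job starts with fresh ones.

```python
# Sketch of wrapping job execution so stale database connections are
# discarded first. `dispose_connections` is a stand-in for a real
# cleanup call such as SQLAlchemy's Engine.dispose().

connection_state = {'open': True}


def dispose_connections():
    # Stand-in: drop all pooled connections so the next database use
    # creates fresh ones instead of reusing sockets from the parent.
    connection_state['open'] = False


def run_job_safely(job, *args, **kwargs):
    # Discard connections inherited across fork() before running the job.
    dispose_connections()
    return job(*args, **kwargs)


result = run_job_safely(lambda x: x * 2, 21)
assert result == 42
```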
This PR is WIP for #2977, a new implementation of background jobs using python-rq.
(Note that I'm not interested in the bounty)
It is loosely based on #2706 by @rossjones.
The implementation has a minimal feature set on purpose: I'm fine with adding more features but I wanted to avoid introducing things that nobody needs. So if you feel that this PR is missing an important feature please speak up.
Likewise, documentation and tests will not be written until there is an agreement regarding the general architecture.
There is now the `ckan.lib.jobs` module which most importantly provides the `init_queue` and `enqueue` functions. The former is called during CKAN startup and establishes the connection to Redis. I wanted to make sure that we fail early when Redis is not available (and not later at some random time when a job is enqueued). However I'm not sure whether `ckan/websetup.py` is the right place for that -- feedback appreciated!

Once you have enqueued a job using `ckan.lib.jobs.enqueue` you can use the `job_*` API functions to manage the queue:

- `job_list`: List all enqueued jobs
- `job_show`: Show details for a certain job
- `job_cancel`: Cancel a certain enqueued job
- `job_clear`: Clear the whole queue

Similar functionality is provided by `paster jobs`, see its help text. In particular, `paster jobs worker` starts a worker that fetches jobs from the queue and executes them. If you want to play around you can use `paster jobs test` to enqueue test jobs (which you then can list, cancel, execute, etc.).
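To make the semantics of these four functions concrete, here is a minimal in-memory sketch. It is illustrative only: the real implementation stores jobs in Redis via python-rq, and the function bodies here are simplified stand-ins that merely mirror the described behavior.

```python
# Toy in-memory model of the job_* API semantics (not the real
# implementation, which uses python-rq and Redis).
import itertools

_queue = {}
_ids = itertools.count(1)


def enqueue(fn, args=None):
    # Add a job to the queue and return its ID.
    job_id = next(_ids)
    _queue[job_id] = {u'id': job_id, u'fn': fn, u'args': args or []}
    return job_id


def job_list():
    # List the IDs of all enqueued jobs.
    return sorted(_queue)


def job_show(job_id):
    # Show details for a certain job.
    return _queue[job_id]


def job_cancel(job_id):
    # Cancel a certain enqueued job.
    del _queue[job_id]


def job_clear():
    # Clear the whole queue.
    _queue.clear()


a = enqueue(sorted, [[3, 1, 2]])
b = enqueue(len, [[1, 2]])
assert job_list() == [a, b]
job_cancel(a)
assert job_list() == [b]
job_clear()
assert job_list() == []
```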