python.d.plugin: use separate process for initial module checking #5552

ilyam8 · 2019-03-05T08:38:52Z

Summary

This PR makes (major) changes to the python.d.plugin file only.

python.d.plugin imports a lot of additional packages during initial module initialization, job creation and checking, and there is no way to unimport them, even if they aren't needed afterwards. This consumes a relatively large amount of RAM.

Memory utilization before/after the PR (one job, example module, py3.7.2):

21.1 => 8.8 MiB

Component Name

collectors/python.d.plugin

Additional Information

This PR adds a separate process for initial module checking.

Logic:

- main process spawns the checker process
- checker process loads every module, loads the module config, creates jobs and runs job.check() for every job; if the check succeeds, it adds the job to the list
- checker process returns the list of modules and jobs
- main process loads only the active modules, etc.
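The steps above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `FAKE_MODULES`, `checker` and `find_active_jobs` are hypothetical names, the stdlib `wave` module merely stands in for the extra packages a real module would import, and the `fork` start method is an assumption that limits the sketch to POSIX systems.

```python
import importlib
import multiprocessing
import sys

# Hypothetical stand-in for collector modules: module name -> {job name: check() result}.
FAKE_MODULES = {
    "example": {"local": True},
    "nginx": {"local": False},
    "redis": {"socket": True, "tcp": False},
}


def checker(queue):
    """Runs in the child: 'load' every module, check every job, report the survivors."""
    # Stands in for the extra packages real modules pull in while being checked.
    importlib.import_module("wave")
    active = {}
    for module, jobs in FAKE_MODULES.items():
        passed = [name for name, check_ok in jobs.items() if check_ok]
        if passed:
            active[module] = passed
    queue.put({"active": active, "child_imported_wave": "wave" in sys.modules})


def find_active_jobs(timeout=30):
    """Runs in the main process: spawn the checker and wait for its report."""
    ctx = multiprocessing.get_context("fork")  # assumption: POSIX only
    queue = ctx.Queue()
    proc = ctx.Process(target=checker, args=(queue,))
    proc.start()
    report = queue.get(timeout=timeout)  # read before join() so the child can exit
    proc.join()
    return report


if __name__ == "__main__":
    report = find_active_jobs()
    print(report["active"])
```

Whatever the child imports while checking is released when the child exits, so the main process only pays for the modules that ended up in the returned list.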

ilyam8 · 2019-03-05T16:11:58Z

ok, i tested it on Manjaro (py 3.7.2) and on CentOS 6 (py 2.6.6), and it appears to be working correctly

ilyam8 · 2019-03-05T17:16:40Z

Hey @Ferroin, if you could find the time to test the PR a bit, that would be very nice

ilyam8 · 2019-03-06T11:02:29Z

ok, i don't think we need to run a job if job.check() returns False in main. That can be error prone. Instead we can treat all jobs in main as autodetection jobs, so if a job's check() for some reason succeeds in the child process and fails in main, we will retry the check again and again
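A rough sketch of that retry idea, under stated assumptions: `FlakyJob` and `start_with_retries` are hypothetical names, and the `max_attempts` and `retry_delay` parameters are invented for the illustration (the comment above does not specify a retry budget or interval).

```python
import time


class FlakyJob:
    """Hypothetical job whose check() fails a fixed number of times, then succeeds."""

    def __init__(self, name, failures_before_success):
        self.name = name
        self.remaining_failures = failures_before_success

    def check(self):
        if self.remaining_failures > 0:
            self.remaining_failures -= 1
            return False
        return True


def start_with_retries(jobs, max_attempts=5, retry_delay=1.0, sleep=time.sleep):
    """Re-run check() on failing jobs instead of starting them unchecked."""
    pending = list(jobs)
    started = []
    for attempt in range(1, max_attempts + 1):
        still_failing = []
        for job in pending:
            if job.check():
                started.append(job.name)  # only jobs that pass in main are started
            else:
                still_failing.append(job)
        pending = still_failing
        if not pending:
            break
        if attempt < max_attempts:
            sleep(retry_delay)  # wait before the next round of checks
    return started, [job.name for job in pending]
```

A job that passed in the child but fails in main is simply kept in the pending list and checked again on the next round, rather than being run without a successful check.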

ilyam8 · 2019-03-06T11:40:14Z

@ktsaou if you are ok with the changes this is ready.

ilyam8 · 2019-03-06T14:21:16Z

@cakrit please approve

ilyam8 · 2019-03-07T10:51:12Z

ok, here we go, i am on a high alert ⏰

Fixes: netdata#5525

ilyam8 added 11 commits March 5, 2019 01:28

powerdns minor fix

fcb6416

add new func to loaders

8b01c33

change pythond log fmt

fd18220

remove threading from SimpleService

df1da35

python.d.plugin refactor wip

e08552d

minor SimpleService

a6b9d8b

minor

c292aba

heartbeat and minor

f5d51be

heartbeat fix

253d816

minor

9993826

minor

43b1650

ilyam8 added the area/external/python label Mar 5, 2019

netdatabot added the area/collectors Everything related to data collection label Mar 5, 2019

ilyam8 added 3 commits March 5, 2019 18:17

fixes

a1d0e6d

fixes

560ce6a

fixes

d45d3cd

netdata deleted a comment Mar 5, 2019

ilyam8 changed the title ~~[wip] python.d.plugin: use separate process for initial module checking~~ python.d.plugin: use separate process for initial module checking Mar 5, 2019

already served jobs fix

c340fe0

netdata deleted a comment Mar 6, 2019

handle all jobs in main as autodetection jobs

4c67771

minor

b640274

netdata deleted a comment Mar 6, 2019

ilyam8 mentioned this pull request Mar 6, 2019

python SocketService: lack of connect timeout, python.d.plugin hangs #5541

Closed

cakrit self-requested a review March 7, 2019 09:11

cakrit approved these changes Mar 7, 2019


ilyam8 merged commit 2175673 into netdata:master Mar 7, 2019

ilyam8 deleted the pythond_plugin_refactor branch March 12, 2019 13:48
