import plugin: append logs for bulk import rather than overwriting #223

sbesson · 2020-06-23T15:49:18Z

Fixes #214

Rather than unconditionally open log files in w mode, this allows to pass the opening mode via a key/value parameter.
In the case of a bulk import, the first iteration will write logs in w mode but all subsequent iterations should append to the log files.

joshmoore · 2020-06-24T07:21:55Z

src/omero/plugins/import.py

@@ -583,7 +583,11 @@ def bulk_import(self, command_args, xargs):
                            # FIXME: this assumes 'omero'
                            print(sys.argv[0], "import", rv, file=o)
                else:
-                    self.do_import(command_args, xargs)
+                    if incr == 0:
+                        mode = "w"


Overwriting on the first of multiple imports and subsequently appending is certainly an improvement.
I wonder a bit if one ever wants to overwrite, though that may be part of a larger refactoring. Two other options would be: (1) failing if an overwrite would occur or (2) auto-incrementing the file name.

I like (2) as an alternate behavior. It is inline with the classical behavior of distributed tooling like GNU parallel. One implementation would be to support regexp arguments like bin/omero import --bulk bulk --file log_%d.log --err log_%d.err, try to substitute the increment number and fallback to writing to single files in appending mode. Thoughts?

How about merge this as it's definitely an improvement, then think about whether it's worth the additional complexity? We've had bulk import for ages but the issue was only discovered last month, so either no-one uses it which suggests it's not worth adding more complexity, or no-one's noticed it in which case this fix is fine.

MSTM. (I think I often opted to used parallel rather than bulk which perhaps is what left bulk in an incomplete state.)

Interesting. I never used that, just redirect out and err to a "log" file, assuming everything important is printed out on the command line.

src/omero/plugins/import.py

Co-authored-by: Simon Li <orpheus+devel@gmail.com>

manics

Tested locally, LGTM

import plugin: append logs for bulk import rather than overwriting

f06a9a0

sbesson mentioned this pull request Jun 23, 2020

Add integration test covering bulk import with log flags ome/openmicroscopy#6240

Merged

joshmoore reviewed Jun 24, 2020

View reviewed changes

manics reviewed Jun 29, 2020

View reviewed changes

src/omero/plugins/import.py Outdated Show resolved Hide resolved

Fix initial increment

f32fe91

Co-authored-by: Simon Li <orpheus+devel@gmail.com>

manics approved these changes Jun 29, 2020

View reviewed changes

sbesson merged commit 59cf539 into ome:master Jun 29, 2020

sbesson deleted the bulk_file_errs branch June 29, 2020 20:41

sbesson mentioned this pull request Nov 18, 2020

append logs for omero import #270

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

import plugin: append logs for bulk import rather than overwriting #223

import plugin: append logs for bulk import rather than overwriting #223

sbesson commented Jun 23, 2020 •

edited

Loading

joshmoore Jun 24, 2020

sbesson Jun 24, 2020

manics Jun 24, 2020

joshmoore Jun 24, 2020 •

edited

Loading

dominikl Jun 29, 2020

manics left a comment

import plugin: append logs for bulk import rather than overwriting #223

import plugin: append logs for bulk import rather than overwriting #223

Conversation

sbesson commented Jun 23, 2020 • edited Loading

joshmoore Jun 24, 2020

Choose a reason for hiding this comment

sbesson Jun 24, 2020

Choose a reason for hiding this comment

manics Jun 24, 2020

Choose a reason for hiding this comment

joshmoore Jun 24, 2020 • edited Loading

Choose a reason for hiding this comment

dominikl Jun 29, 2020

Choose a reason for hiding this comment

manics left a comment

Choose a reason for hiding this comment

sbesson commented Jun 23, 2020 •

edited

Loading

joshmoore Jun 24, 2020 •

edited

Loading