Allow bulk conversion to safe PDFs #77

micahflee · 2020-04-23T16:33:13Z

A few different people have requested this feature: the ability to convert documents to safe PDFs in bulk.

This shouldn't be too difficult to implement, but I think the biggest issue is how the user interface should work. One idea is to allow the user to browse for a folder, and then try to automatically convert all documents it can in that folder to save versions (appending -safe.pdf to the end). It probably makes sense to convert them one at a time, and also display a report of all unsupported files in the folder that couldn't get converted.

What happens if there's a file called test.pdf and another called test-safe.pdf in the same folder though? What would the safe version of test.pdf get named? It could maybe detect this edge case, and tell you that you must not have any files that end in -safe.pdf in the folder to convert.

The text was updated successfully, but these errors were encountered:

haplo · 2020-07-22T09:04:32Z

It probably makes sense to convert them one at a time, and also display a report of all unsupported files in the folder that couldn't get converted.

Running multiple conversions in parallel would improve running times and make use of multiple CPU cores. Just run a process pool bound to the number of CPUs (ideally would be a configurable setting), queue all PDF to be converted (using multiprocessing.Queue and have the worker processes pull from the queue and do the conversion. I can provide an implementation for this.

What happens if there's a file called test.pdf and another called test-safe.pdf in the same folder though? What would the safe version of test.pdf get named? It could maybe detect this edge case, and tell you that you must not have any files that end in -safe.pdf in the folder to convert.

I think a good UX would be to identify all PDF files that already have a -safe version and skip them, telling the user that they seem to be already converted. User can then manually rename them if they want Dangerzone to work on them.

pettitjr · 2020-10-22T19:00:34Z

I would also like to see this feature!

JesseKrembsNYT · 2020-10-22T19:04:54Z

What James said..

tzmnyt · 2020-10-22T19:45:55Z

Bulk conversion of PDFs is very much needed.

RLburrito · 2020-10-26T12:52:42Z

Bulk conversion is desperately needed here as well.

DirtyNoob · 2020-11-21T22:03:13Z

I agree to the consensus here, this is much needed!

anarcat · 2021-06-10T20:00:22Z

as part of #110, i implemented a webdav processor to do batch processing. the idea is that you dump your files in a webdav folder (e.g. on nextcloud), share that folder with the dangerzone bot, which pulls the files, processes them in docker, and pushes the sanitized files back.

see https://gitlab.torproject.org/tpo/tpa/dangerzone-webdav-processor/ for details.

ninavizz · 2021-06-11T02:01:09Z

Hey hey! I've got some hours to spare to get a nice UX together for this, and to potentially improve the single-file experience. Can do & share next week.

deeplow · 2022-08-02T09:54:43Z

An extra thing to consider is that a user may want to add extra files while some are already processing.

eloquence · 2022-09-15T17:18:25Z

Tentatively adding to 0.4.0 milestone; initially, this may only include backend changes to support bulk conversion + CLI support.

Fixes #77

micahflee added the enhancement New feature or request label Apr 23, 2020

micahflee added this to the 0.2 milestone Jun 3, 2020

anarcat mentioned this issue Apr 12, 2021

Run dangerzone from the command line #100

Closed

ninavizz mentioned this issue Jun 11, 2021

Improve Overall UX #117

Open

micahflee modified the milestones: 0.2, 0.3 Jun 15, 2021

micahflee removed this from the 0.3 milestone Dec 8, 2021

eloquence added this to the 0.4.0 milestone Sep 15, 2022

deeplow mentioned this issue Sep 19, 2022

Refactor application to acommodate bulk doc conversion #209

Closed

deeplow added a commit that referenced this issue Nov 3, 2022

Changelog: add multi-document support

96e08f2

Fixes #77

deeplow added a commit that referenced this issue Nov 9, 2022

Changelog: add multi-document support

cb313f3

Fixes #77

deeplow mentioned this issue Nov 10, 2022

GUI multi-document support #247

Merged

deeplow added a commit that referenced this issue Nov 10, 2022

Changelog: add multi-document support

564495b

Fixes #77

deeplow added a commit that referenced this issue Nov 10, 2022

Changelog: add multi-document support

37557ae

Fixes #77

deeplow added a commit that referenced this issue Nov 14, 2022

Changelog: add multi-document support

9d2fbe7

Fixes #77

deeplow added a commit that referenced this issue Nov 18, 2022

Changelog: add multi-document support

c8efb6f

Fixes #77

deeplow closed this as completed in 2aa329d Nov 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow bulk conversion to safe PDFs #77

Allow bulk conversion to safe PDFs #77

micahflee commented Apr 23, 2020

haplo commented Jul 22, 2020

pettitjr commented Oct 22, 2020

JesseKrembsNYT commented Oct 22, 2020

tzmnyt commented Oct 22, 2020

RLburrito commented Oct 26, 2020

DirtyNoob commented Nov 21, 2020

anarcat commented Jun 10, 2021

ninavizz commented Jun 11, 2021

deeplow commented Aug 2, 2022

eloquence commented Sep 15, 2022

Allow bulk conversion to safe PDFs #77

Allow bulk conversion to safe PDFs #77

Comments

micahflee commented Apr 23, 2020

haplo commented Jul 22, 2020

pettitjr commented Oct 22, 2020

JesseKrembsNYT commented Oct 22, 2020

tzmnyt commented Oct 22, 2020

RLburrito commented Oct 26, 2020

DirtyNoob commented Nov 21, 2020

anarcat commented Jun 10, 2021

ninavizz commented Jun 11, 2021

deeplow commented Aug 2, 2022

eloquence commented Sep 15, 2022