Parallel execution fontforge in docker #1508

nobk · 2024-02-04T17:00:02Z

Description

Improve the docker execution method, optimize from single process to parallel execution, and increase the speed of font patching.
see issue 1507

Requirements / Checklist

Read the Contributing Guidelines
Verified the license of any newly added font, glyph, or glyph set

What does this Pull Request (PR) do?

Speed up docker patcher

How should this be manually tested?

Any background context you can provide?

What are the relevant tickets (if any)?

Screenshots (if appropriate or helpful)

Finii

Thanks for the nice PR!

I did never put much effort in the docker patcher ;-) because I do not use it at all.
But than means that suggestions like yours are so valuable!

I think this is a very good improvement. The gotta-patch-em also can parallel-patch (but with a hand-rolled parallelism and a hard wired number of jobs).

I would allow people to specify the -j option, as not all want parallel patching.
Maybe also helpful would be some output if find turned up with nothing.

Tell me if you want to expand this or if you prefer me merging as-is.

Finii · 2024-02-04T17:24:07Z

bin/scripts/docker-entrypoint.sh

@@ -23,6 +23,6 @@ done
 printf "Running with options:\n%s\n" "$args"

 # shellcheck disable=SC2086
-for f in /in/*.otf /in/*.ttf /in/*.woff /in/*.eot /in/*.ttc; do [ -f "$f" ] && fontforge -script /nerd/font-patcher -out /out $args "$f"; done
+find /in -type f \( -name "*.otf" -o -name "*.ttf" -o -name "*.woff" -o -name "*.eot" -o -name "*.ttc" \) | parallel fontforge -script /nerd/font-patcher -out /out $args {}


I like find much more than the loop. We could use -iname here, as some fonts out there have their name and extension in caps. (Which would be an additional commit of course ;)

Probably I would use -iregex but that is because I'm a regex person and hate the -o style in find ;-)

Suggested change

find /in -type f $ -name "*.otf" -o -name "*.ttf" -o -name "*.woff" -o -name "*.eot" -o -name "*.ttc" $ | parallel fontforge -script /nerd/font-patcher -out /out $args {}

find /in -type f -iregex ".*\.$otf|ttf|woff|eot|ttc$" | parallel fontforge -script /nerd/font-patcher -out /out $args {}

untested

Dockerfile's find is a BusyBox multi-call, which is not support -iregex, I will change it with -iname.

Finii · 2024-02-04T17:39:17Z

@allcontributors please add @nobk for code

allcontributors · 2024-02-04T17:39:27Z

@Finii

I've put up a pull request to add @nobk! 🎉

Finii · 2024-02-04T17:46:15Z

As a side note, I did never check if the debug log will be even avaiable when docker is used.
Note to self: Check that and maybe add moving the logfile to /out.

nobk · 2024-02-04T19:55:03Z

I would allow people to specify the -j option, as not all want parallel patching. Maybe also helpful would be some output if find turned up with nothing.

Tell me if you want to expand this or if you prefer me merging as-is.

I will add an option for -j PN, with docker option -e "PN=1", to disable parallel execute.

docker run --rm -v /path/to/fonts:/in:Z -v /path/for/output:/out:Z -e "PN=1" nerdfonts/patcher [OPTIONS]

nobk · 2024-02-04T22:09:38Z

I think docker patcher is for users, not for developers, so I am not test logfile output.
When I set -e "PN=20" , all hyper threads of i7-12700K CPU is used, up to 4.9GB RAM used, speed up max.

Finii · 2024-02-05T13:30:40Z

Thank you! Appreciate your work 💚

Finii · 2024-02-05T13:47:52Z

Fixing

Finii · 2024-02-05T14:05:27Z

$ shellcheck docker-entrypoint.sh

In docker-entrypoint.sh line 28:
	  -exec fontforge -script /nerd/font-patcher -out /out $args {} \;
                                                               ^---^ SC2086 (info): Double quote to prevent globbing and word splitting.

Did you mean: 
	  -exec fontforge -script /nerd/font-patcher -out /out "$args" {} \;


In docker-entrypoint.sh line 34:
	  | parallel $njob fontforge -script /nerd/font-patcher -out /out $args {}
                     ^---^ SC2086 (info): Double quote to prevent globbing and word splitting.
                                                                          ^---^ SC2086 (info): Double quote to prevent globbing and word splitting.

Did you mean: 
	  | parallel "$njob" fontforge -script /nerd/font-patcher -out /out "$args" {}

nobk · 2024-02-05T14:10:14Z

no, that warnning caused by BusyBox multi-call sh. If the environment variable PN is not defined, this warning will appear.

docker run -it --rm alpine:latest sh
/ # [ "$PN" -eq 1 ]
sh: out of range
/ # which sh
/bin/sh
/ # /bin/sh --version
/ # ls -l /bin/sh
lrwxrwxrwx    1 root     root            12 Jan 26 17:53 /bin/sh -> /bin/busybox
/ #

nobk · 2024-02-05T14:18:05Z

Adding one more line of code eliminates those two warnings.
[[ -z "$PN" ]] && PN=0
need me submit a PR for this?

Finii · 2024-02-05T14:31:36Z

no, thanks :-)

nobk · 2024-02-05T14:35:25Z

My fix is like this, and not submit that then, it just warning.

git diff
diff --git a/bin/scripts/docker-entrypoint.sh b/bin/scripts/docker-entrypoint.sh
index acd931a..bb852d3 100644
--- a/bin/scripts/docker-entrypoint.sh
+++ b/bin/scripts/docker-entrypoint.sh
@@ -23,6 +23,7 @@ done
 printf "Running with options:\n%s\n" "$args"

 # shellcheck disable=SC2086
+[[ -z "$PN" ]] && PN=0
 if [ "$PN" -eq 1 ]; then
        find /in -type f \

nobk · 2024-02-05T14:56:53Z

You can execute sudo docker system prune and answer y to reclaim the disk space occupied by the outdated docker patcher <none> images.

Finii · 2024-02-05T16:25:09Z

I added two more commits, hope you find them ok.

1fccd8a docker: Allow blancs in filenames
7ebbc4e docker: Include logfile in output

That also fixes the shellcheck warning and the runtime warning.

This is a very good addition! Thanks again for the idea and implementation/PR!

Finii · 2024-02-05T16:31:22Z

I think docker patcher is for users, not for developers, so I am not test logfile output.

Well, that output helps (me) when those users have problems with some particular font.
They can report with the debug messages and I can possible see what goes wrong.
This is important for fonts that are not free, i.e. I can not download and test myself.

For example

"After patching the gap is too wide - I can not share the font file"
"Please run with --debug 1"
In the debug log: Gap detected ...

nobk · 2024-02-05T16:53:22Z

As you changed the default value of PN to 1, you need modify readme.md docker usage section, default single process, -e "PN=0" let parallel decide the number of tasks.
But I fell default value of PN as 0 will be better because all docker user can benefit by speed up from this upgrade.
Unless you need debugging or there is insufficient memory, you need to set PN to 1.

Finii · 2024-02-05T17:03:03Z

I didnt intend to change your default, though 😬
Rewrote that several times, maybe got confused ;-)
Fixing.

Thanks for reporting. I believe -j0 is the better default.

Finii · 2024-02-05T17:20:18Z

96497b4 docker: Run parallel by default

nobk · 2024-02-05T17:41:48Z

Thanks for reporting. I believe -j0 is the better default.

I am not using -j0 in my PR, I just omitted the -j parameter in shell script by default or PN=0, parallel generate 8 tasks for me, I can see them in htop.
Your latest commit with parallel -j0 I just tested, parallel generate 32 tasks for me, because my custom fonts have totally 32 .ttf files, it use 7.8GB memory.
If users have more font files, I think -j0 will cause out of memory or too much tasks let system lag.

nobk · 2024-02-05T17:50:51Z

omitted the -j parameter means -j 100%
man parallel

       --jobs num
       -j num
       --max-procs num
       -P num
           Number of jobslots on each machine.

           Run up to num jobs in parallel. Default is 100%.

           num    Run up to num jobs in parallel.

           0      Run as many as possible (this can take a while to determine).

                  Due to a bug -j 0 will also evaluate replacement strings twice up  to  the  number  of
                  joblots:

                    # This will not count from 1 but from number-of-jobslots
                    seq 10000 | parallel -j0   echo '{= $_ = $foo++; =}' | head
                    # This will count from 1
                    seq 10000 | parallel -j100 echo '{= $_ = $foo++; =}' | head

           num%   Multiply  the  number  of  CPU threads by num percent. E.g. 100% means one job per CPU
                  thread on each machine.

           +num   Add num to the number of CPU threads.

           -num   Subtract num from the number of CPU threads.

           expr   Evaluate expr. E.g. '12/2' to get 6, '+25%' gives  the  same  as  '125%',  or  complex
                  expressions  like  '+3*log(55)%'  which means: multiply 3 by log(55), multiply that by
                  the number of CPU threads and divide by 100, add this to the number of CPU threads.

                  An expression that evalutates to less that 1 is replaced with 1.

           procfile
                  Read parameter from file.

                  Use the content of procfile as parameter for  -j.  E.g.  procfile  could  contain  the
                  string 100% or +2 or 10.

                  If procfile is changed when a job completes, procfile is read again and the new number
                  of  jobs is computed. If the number is lower than before, running jobs will be allowed
                  to finish but new jobs will not be started until the wanted number of  jobs  has  been
                  reached.   This  makes  it  possible to change the number of simultaneous running jobs
                  while GNU parallel is running.

           If the evaluated number is less than 1 then 1 will be used.

           If --semaphore is set, the default is 1 thus making a mutex.

           See also: --use-cores-instead-of-threads --use-sockets-instead-of-threads

Finii · 2024-02-05T18:25:53Z

omitted the -j parameter means -j 100% man parallel

But that would require an (additional) if and/or code duplication.

In principle we could

-parallel --verbose --null "--jobs=${PN}" fontforge -script ../../font-patcher -out ../../out_fonts $args {}
+[-n "$PN"] && jobs=-j${PN}
+parallel --verbose --null "${jobs}" fontforge -script ../../font-patcher -out ../../out_fonts $args {}

The problem is that we can not use double qoutes around jobs then, because that passes an empty option (instead of no option, which would be passed without quotes).

Your original PR had the if and code duplication (find twice and font-patcher twice), which is not good in the long run.

Not having the params double quoted raisesshellcheck warnings and should be avoided as much as possible (imho).

That's the reason I used an explicit 100% (i.e. -j0) to get it - branchless programming ;-)

nobk · 2024-02-05T20:00:13Z

KISS = keep it simple stupid
I do not think use if else and write find twice not good.
Make it work is important than make it looks well.
If you need debug, find -exec is better than find | parallel -j1 .

Finii · 2024-02-05T20:10:05Z

Debugging is not about debugging the code but debugging the font.

Having code that works (supposedly) but is hard to read does not help anyone in the long run. So looking well IS important.

Sorry that you do not like the changes.

nobk · 2024-02-05T20:14:55Z

The -j0 make max task as many as total font files can not as default, need avoid.

nobk · 2024-02-05T20:30:48Z

If you want keep the double qoutes, you can write like this

jobs="-j 100%"
[[ -n "$PN" ]] && [ "$PN" -gt 0 ] && jobs="-j${PN}"

nobk · 2024-02-05T20:56:02Z

Sorry that you do not like the changes.

I like correct code refactoring, not the introduction of bugs.

Finii · 2024-02-05T21:19:08Z

That still bugs with an obscure error if someone does -e "PN=test". That is not an improvement but making the error feedback worse.

I like ... not the introduction of bugs

But still you suggest the above code. That will end up as

parallel --verbose "-j 0" ...

and that will raise an error because the allowed syntax is only "-j0" or "-j=0" (for single arguments, and keeping the quotes exactly means that: 1 argument) (and the two argument version "-j" "0".)

Parallel execution fontforge in docker

86b38e4

Finii approved these changes Feb 4, 2024

View reviewed changes

allcontributors bot mentioned this pull request Feb 4, 2024

docs: add nobk as a contributor for code #1509

Merged

Finii added the ✨ enhancement label Feb 4, 2024

nobk added 2 commits February 5, 2024 04:15

Case insensitive fonts find

45e1ad9

Docker add option -e PN=1 disable parallel fontforge

abc7f34

nobk force-pushed the pdckr branch from f639f0e to ed14632 Compare February 4, 2024 21:56

Update Docker usage in readme.md

51b7fc6

nobk force-pushed the pdckr branch from ed14632 to 51b7fc6 Compare February 4, 2024 21:57

Finii merged commit e633e5f into ryanoasis:master Feb 5, 2024
1 check passed

Repository owner locked as too heated and limited conversation to collaborators Feb 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel execution fontforge in docker #1508

Parallel execution fontforge in docker #1508

nobk commented Feb 4, 2024

Finii left a comment

Finii Feb 4, 2024

Finii Feb 4, 2024

Finii Feb 4, 2024

nobk Feb 4, 2024

Finii commented Feb 4, 2024

allcontributors bot commented Feb 4, 2024

Finii commented Feb 4, 2024

nobk commented Feb 4, 2024

nobk commented Feb 4, 2024

Finii commented Feb 5, 2024

Finii commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024 •

edited

Loading

Finii commented Feb 5, 2024 •

edited

Loading

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024 •

edited

Loading

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024 •

edited

Loading

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024

nobk commented Feb 5, 2024 •

edited

Loading

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024

	find /in -type f \( -name ".otf" -o -name ".ttf" -o -name ".woff" -o -name ".eot" -o -name "*.ttc" \) \| parallel fontforge -script /nerd/font-patcher -out /out $args {}
	find /in -type f -iregex ".*\.\(otf\|ttf\|woff\|eot\|ttc\)" \| parallel fontforge -script /nerd/font-patcher -out /out $args {}

Parallel execution fontforge in docker #1508

Parallel execution fontforge in docker #1508

Conversation

nobk commented Feb 4, 2024

Description

Requirements / Checklist

What does this Pull Request (PR) do?

How should this be manually tested?

Any background context you can provide?

What are the relevant tickets (if any)?

Screenshots (if appropriate or helpful)

Finii left a comment

Choose a reason for hiding this comment

Finii Feb 4, 2024

Choose a reason for hiding this comment

Finii Feb 4, 2024

Choose a reason for hiding this comment

Finii Feb 4, 2024

Choose a reason for hiding this comment

nobk Feb 4, 2024

Choose a reason for hiding this comment

Finii commented Feb 4, 2024

allcontributors bot commented Feb 4, 2024

Finii commented Feb 4, 2024

nobk commented Feb 4, 2024

nobk commented Feb 4, 2024

Finii commented Feb 5, 2024

Finii commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024 • edited Loading

Finii commented Feb 5, 2024 • edited Loading

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024 • edited Loading

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024 • edited Loading

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024

nobk commented Feb 5, 2024 • edited Loading

nobk commented Feb 5, 2024

Finii commented Feb 5, 2024

nobk commented Feb 5, 2024 •

edited

Loading

Finii commented Feb 5, 2024 •

edited

Loading

nobk commented Feb 5, 2024 •

edited

Loading

Finii commented Feb 5, 2024 •

edited

Loading

nobk commented Feb 5, 2024 •

edited

Loading