Consider precomputing a DAFSA or perfect hash for the list #35

sleevi · 2015-12-02T07:33:47Z

A common strategy used by many PSL implementations is to optimize the PSL for both size and speed, given that it contains a complete set of the IANA Root Zone Database, in addition to public and private registerable domains.

Firefox currently uses a simple hash, Chromium previously used a gperf-generated hash tree, and presently uses a DAFSA, while Guava uses a trie

These solutions each make different trade-offs with respect to size (in memory) and speed, but all offer better performance than the current implementation, and are using relatively friendly licensing.

rockdaboot · 2015-12-02T12:01:19Z

Thanks for your investigation.

The built-in data uses a sorted array, the lookup is a binary search, lookup performance is O(log n).
The good thing is startup time - no startup code has to be executed, no pointers have to be resolved.
For short-running tools like curl or wget, library startup time matters - the common use case either doesn't use cookies / libpsl at all or need just a handful of lookups. So introducing a library startup (building a hashmap) would contradict my assumptions when I designed libpsl (builtin data).

But just have a look at current speed: on a Intel i3 using 1 core, I can do 5-10 million lookups per second (taken the PSL data as input). The libpsl code is reentrant, so you could use any number of cores in parallel - if that matters to you. But of course, a hashmap could boost the lookups by a factor of ~10 (roughly).
This in mind, what kind of bottleneck do you have resp. what is your use case where speed matters that much ?

For dynamic loaded PSL data I keep this in mind - initialization is roughly the same for a hashmap and for a vector. Code complexity is bit higher, but nothing to worry.

sleevi · 2015-12-02T12:12:06Z

The good thing is startup time - no startup code has to be executed, no pointers have to be resolved.

Well, both the gperf and DAFSA code of Chromium share this property, so I don't think it is unique to your solution.

The naive solution (which is similar to what you're using now) was roughly an order of magnitude larger than the gperf solution (which traded a one-time cost at build for faster load times), which itself was about 8X larger than what we currently use for the DAFSA.

In Chrome, we found that library size (as in, the size of the .so/dll) to understandably contribute to load times and page faults, which is why we reoptimized things. Opera contributed the DAFSA code to make our PSL implementation smaller for mobile - 200K vs 35K is a BIG difference for small apps.

My hope is that libpsl will optimize to use the smallest amount of memory possible (that is compatible with your complexity and performance goals), to make it more suitable for use in smaller devices. I filed the issue since it seemed like an easy win, low-hanging fruit, and friendly licensing of existing solutions :)

rockdaboot · 2015-12-02T14:29:43Z

I read about DAFSA - very nice concept, indeed (thanks for pointing out). I'll implement that definitely, i am just pretty busy right now. Hope, I can do that within the next weeks.

rockdaboot · 2015-12-09T09:38:53Z

DAFSA is in branch 'develop' now, thanks again for your request.
The dictionary size (and thus the library size) has been dramatically reduced by DAFSA.

Some measurements:
Average DAFSA lookup (see tests/test-is-public-all.log after a 'make check') is ~3 times slower than binary search in static array (as before DAFSA). I don't care much, this still means ~1.5 million lookups per second on a 3.1GHz i3 (on a single core).

Size of the library (stripped) here (amd64) is now 46816 vs 435352 before, a factor of 9.3 !!!

rockdaboot · 2016-01-02T13:46:02Z

Released in libpsl-0.12.0

Thanks for pointing out DAFSA !

rockdaboot closed this as completed Jan 2, 2016

rockdaboot mentioned this issue Jan 30, 2016

New test suite & linter publicsuffix/list#123

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider precomputing a DAFSA or perfect hash for the list #35

Consider precomputing a DAFSA or perfect hash for the list #35

sleevi commented Dec 2, 2015

rockdaboot commented Dec 2, 2015

sleevi commented Dec 2, 2015

rockdaboot commented Dec 2, 2015

rockdaboot commented Dec 9, 2015

rockdaboot commented Jan 2, 2016

Consider precomputing a DAFSA or perfect hash for the list #35

Consider precomputing a DAFSA or perfect hash for the list #35

Comments

sleevi commented Dec 2, 2015

rockdaboot commented Dec 2, 2015

sleevi commented Dec 2, 2015

rockdaboot commented Dec 2, 2015

rockdaboot commented Dec 9, 2015

rockdaboot commented Jan 2, 2016