portmatching

Fast pattern matching on port graphs.

This crate is designed to find embeddings of port graphs fast, in the case of many patterns. Patterns are preprocessed to make matching faster and compiled into a graph trie data structure similar to the finite automata used for string matching and regular expressions.

The crate exports several Matcher instances that can be used for pattern matching. The principal object of interest in this crate is the ManyMatcher object. The other Matchers serve as baselines for benchmarking purposes. SinglePatternMatcher matches a single pattern in time O(nm) (size of the input times size of the pattern). The NaiveManyMatcher uses k instances of the SinglePatternMatcher to find matches of any of k patterns in time O(kmn).

Benchmarks

For all benchmarks, inputs and patterns are random graphs with quantum circuit-like structure. Inputs have up to 2000 nodes (gates) and 400 qubits, patterns have up to 30 nodes and between 2 and 5 qubits. For pattern matching on weighted graphs, the weights for each node n are chosen at random in the set {(d, 0), (d, 1), (d, 2)}, where d is the degree (arity) of the node n.

Note: weighted pattern matching is much easier than unweighted, especially as weights fix the arity of the nodes. In a sense, unweighted pattern matching corresponds to the worst case. Optimisations to the automaton could be made to significantly speed up unweighted matching, but this has not been the focus.

Comparison to baseline [unweighted]

Pattern matching times for 0 ... 1000 patterns, NaiveManyMatcher vs automaton-based ManyMatcher. The plot measures matching time (ms) as a function of the number of patterns.

Pattern matching scaling (on-line) [weighted]

Matching time (ms) as a function of the number of patterns for ManyMatcher only, up to 10k patterns. The patterns are categorised by the number of qubits.

Automaton construction time (off-line) [unweighted]

On top of the running time plotted above, there is also a one-time cost to construct the automaton from the set of patterns. This is plotted here, again as a function of the number of patterns.

Example

TODO

Features

serde: Enable serialization and deserialization via serde.
datagen: Necessary for the data_generation binary, for benchmarking. Currently not useful to the end user of the crate.

License

Distributed under the MIT License. See LICENSE for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 422 Commits
.github/workflows		.github/workflows
benches		benches
examples		examples
proptest-regressions		proptest-regressions
src		src
tests		tests
.gitignore		.gitignore
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

portmatching

Benchmarks

Comparison to baseline [unweighted]

Pattern matching scaling (on-line) [weighted]

Automaton construction time (off-line) [unweighted]

Example

Features

License

About

Releases 5

Packages

Contributors 3

Languages

License

lmondada/portmatching

Folders and files

Latest commit

History

Repository files navigation

portmatching

Benchmarks

Comparison to baseline [unweighted]

Pattern matching scaling (on-line) [weighted]

Automaton construction time (off-line) [unweighted]

Example

Features

License

About

Resources

License

Stars

Watchers

Forks

Releases 5

Packages 0

Contributors 3

Languages

Packages