Practical String Searching

This is a mirror of my library hosted at https://create.stephan-brumme.com/practical-string-searching/

There are a huge number of scientific papers on fast, faster and super-fast string searching algorithms. They usually prove theoretical performance in big-O notation, and most papers cover memory consumption as well.

However, theoretical performance isn't always the same as practical performance. That's why I always want to measure real-world throughput: this article presents hopefully understandable C implementations of the most common generic string search algorithms.

I also wrote a simple tool called mygrep that prints all lines of a file where a search phrase is found. It doesn't come with all the bells and whistles of the Unix tool grep but achieves similar or sometimes even better speed.
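To make the idea behind such a tool concrete, here is a minimal sketch of a line filter. It is not the code of mygrep itself: the function name `grep_stream`, the fixed 4096-byte line buffer and the use of the standard `strstr` are all assumptions made for illustration. Lines longer than the buffer would be processed in pieces.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not part of the library): copies every line of `in`
   that contains `phrase` to `out` and returns the number of lines copied.
   Assumption: lines fit into the 4096-byte buffer; longer lines are split. */
size_t grep_stream(FILE* in, const char* phrase, FILE* out)
{
  char line[4096];
  size_t matches = 0;

  /* fgets keeps the trailing newline, so fputs reproduces the line as-is */
  while (fgets(line, sizeof line, in) != NULL)
  {
    if (strstr(line, phrase) != NULL)
    {
      fputs(line, out);
      matches++;
    }
  }

  return matches;
}
```

Calling `grep_stream(stdin, "phrase", stdout)` from a `main` function would turn this into a tiny command-line filter. Swapping `strstr` for one of the other search functions is the obvious place to experiment with speed.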

Algorithms

simple loop / brute force
memchr/memcmp
memmem
strstr
Knuth-Morris-Pratt
Boyer-Moore-Horspool
Bitap aka Baeza-Yates-Gonnet
Rabin-Karp
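
As an illustration of one entry in the list above, here is a minimal sketch of Boyer-Moore-Horspool. It is written for this readme and is not the library's implementation; the name `horspool_sketch` is hypothetical, and treating an empty needle as a match at the start of the haystack is an assumption. The idea is to precompute, for every byte value, how far the search window may jump when that byte is the last one in the window.

```c
#include <limits.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical name; a sketch of Boyer-Moore-Horspool on binary data.
   Returns a pointer to the first match or NULL if there is none. */
const char* horspool_sketch(const char* haystack, size_t haystackLength,
                            const char* needle,   size_t needleLength)
{
  if (haystack == NULL || needle == NULL)
    return NULL;
  if (needleLength == 0)
    return haystack;                 /* assumption: empty needle matches at start */
  if (needleLength > haystackLength)
    return NULL;

  /* default shift: the whole needle length (byte does not occur in needle) */
  size_t skip[UCHAR_MAX + 1];
  for (size_t i = 0; i <= UCHAR_MAX; i++)
    skip[i] = needleLength;

  /* bytes that occur in the needle (except its last position) shift less */
  for (size_t i = 0; i + 1 < needleLength; i++)
    skip[(unsigned char)needle[i]] = needleLength - 1 - i;

  size_t pos = 0;
  while (pos <= haystackLength - needleLength)
  {
    if (memcmp(haystack + pos, needle, needleLength) == 0)
      return haystack + pos;

    /* shift depends only on the last byte of the current window */
    pos += skip[(unsigned char)haystack[pos + needleLength - 1]];
  }

  return NULL;
}
```

The shift table costs one entry per possible byte value, which is why this approach tends to pay off for longer needles, where the jumps are larger.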

Interface

All C functions share the same interface:

for strings: const char* search(const char* haystack, const char* needle);
for binary data: const char* search(const char* haystack, size_t haystackLength, const char* needle, size_t needleLength);
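
The two forms of the interface can be sketched with a brute-force search. C has no function overloading, so the two variants need distinct names; `search_string` and `search_binary` below are hypothetical names chosen for this example and are not taken from search.h. Returning the haystack for an empty needle is also an assumption.

```c
#include <stddef.h>
#include <string.h>

/* Binary-data variant (hypothetical name): explicit lengths, so zero bytes
   inside haystack or needle are fine.
   Returns a pointer to the first match or NULL if there is none. */
const char* search_binary(const char* haystack, size_t haystackLength,
                          const char* needle,   size_t needleLength)
{
  if (haystack == NULL || needle == NULL)
    return NULL;
  if (needleLength == 0)
    return haystack;                 /* assumption: empty needle matches at start */
  if (needleLength > haystackLength)
    return NULL;

  /* try every start position where the needle still fits */
  size_t last = haystackLength - needleLength;
  for (size_t i = 0; i <= last; i++)
    if (memcmp(haystack + i, needle, needleLength) == 0)
      return haystack + i;

  return NULL;
}

/* String variant (hypothetical name): zero-terminated input,
   lengths are computed with strlen and the work is delegated. */
const char* search_string(const char* haystack, const char* needle)
{
  if (haystack == NULL || needle == NULL)
    return NULL;

  return search_binary(haystack, strlen(haystack), needle, strlen(needle));
}
```

A caller would check the result against NULL and, if needed, compute the match offset as `result - haystack`.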

More ...

See my website https://create.stephan-brumme.com/practical-string-searching/ for a live demo, code examples and benchmarks.

