finite-state-rice-decoder

Java implementation of a byte-level finite-state decoder for a stream of Rice codes, with constant space complexity with respect to alphabet size.

The finite-state machine is implemented with enum types (see State.java in the 'original' branch). The conventional bit-level decoding procedure is also implemented (see BitLevelDecoder.java in the 'original' branch). Finite-state decoding processes variable-length Rice codes a byte at a time. It is faster than bit-level decoding, and its relative advantage grows as the mean value of the encoded integers increases. Speedups of up to a factor of 2 have been observed empirically in an inverted index compression task, where document gaps are approximately geometrically distributed under the assumption that documents are randomly ordered.
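To give a rough feel for why bytewise processing helps with long codes, here is a minimal sketch of a two-state decoder that consumes as many bits of the current byte as it can in one step. It is not the repository's State.java and not the construction from the paper. The class name `BytewiseRiceSketch`, the two states, the MSB-first bit packing, the unary-then-remainder layout, and the explicit `count` argument are all assumptions made for illustration.

```java
final class BytewiseRiceSketch {

    /** Hypothetical two-state machine; the real State.java may differ. */
    enum State { UNARY, REMAINDER }

    /**
     * Decodes 'count' Rice codes with parameter k from 'data'.
     * Assumes bits are packed MSB-first and each code is a run of 0s,
     * a terminating 1, then k remainder bits.
     */
    public static int[] decode(byte[] data, int k, int count) {
        int[] result = new int[count];
        State state = State.UNARY;
        int produced = 0;
        int quotient = 0;   // zeros counted so far in the current unary part
        int remainder = 0;  // remainder bits collected so far
        int needed = 0;     // remainder bits still missing
        int index = 0;      // index of the byte being consumed
        int avail = 8;      // unread bits left in data[index]

        while (produced < count) {
            if (state == State.REMAINDER && needed == 0) {
                // Code complete: emit the value and start the next one.
                result[produced++] = (quotient << k) | remainder;
                quotient = 0;
                state = State.UNARY;
                continue;
            }
            if (avail == 0) {
                index++;
                avail = 8;
            }
            // Unread bits of the current byte, right-aligned.
            int bits = data[index] & ((1 << avail) - 1);
            if (state == State.UNARY) {
                if (bits == 0) {
                    // The rest of the byte is all zeros: count them in one step.
                    quotient += avail;
                    avail = 0;
                } else {
                    int zeros = Integer.numberOfLeadingZeros(bits) - (32 - avail);
                    quotient += zeros;
                    avail -= zeros + 1; // skip the zeros and the terminating 1
                    remainder = 0;
                    needed = k;
                    state = State.REMAINDER;
                }
            } else {
                // Take as many remainder bits as this byte can supply.
                int take = Math.min(needed, avail);
                remainder = (remainder << take) | (bits >>> (avail - take));
                avail -= take;
                needed -= take;
            }
        }
        return result;
    }
}
```

In this sketch a long unary run costs one loop iteration per byte rather than one per bit, which is one plausible reading of why the benefit grows with the mean encoded value. The decoder state is a handful of integers regardless of how large the decoded values are.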

In this implementation, the unary part of each code is written as a run of 0s terminated by a single 1.
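The convention can be illustrated with a small bit-level encoder and decoder. This is a sketch under stated assumptions, not the repository's BitLevelDecoder.java: the class name `RiceBitLevelSketch`, the Rice parameter name `k`, the MSB-first packing, the placement of the k remainder bits after the unary part, and the explicit `count` argument are all hypothetical choices.

```java
import java.util.ArrayList;
import java.util.List;

final class RiceBitLevelSketch {

    /** Encodes non-negative integers: quotient in unary (0s then a 1), then k remainder bits. */
    public static byte[] encode(int[] values, int k) {
        List<Integer> bits = new ArrayList<>();
        for (int value : values) {
            int quotient = value >>> k;
            for (int i = 0; i < quotient; i++) {
                bits.add(0);                     // unary run of zeros
            }
            bits.add(1);                         // terminating 1
            for (int i = k - 1; i >= 0; i--) {
                bits.add((value >>> i) & 1);     // remainder, most significant bit first
            }
        }
        // Pack MSB-first; unused trailing bits stay 0.
        byte[] out = new byte[(bits.size() + 7) / 8];
        for (int i = 0; i < bits.size(); i++) {
            if (bits.get(i) == 1) {
                out[i / 8] |= (byte) (0x80 >>> (i % 8));
            }
        }
        return out;
    }

    /** Decodes 'count' values one bit at a time. */
    public static int[] decode(byte[] data, int k, int count) {
        int[] result = new int[count];
        int pos = 0; // index of the next unread bit
        for (int n = 0; n < count; n++) {
            int quotient = 0;
            while (bitAt(data, pos++) == 0) {
                quotient++;                      // count zeros until the terminating 1
            }
            int remainder = 0;
            for (int i = 0; i < k; i++) {
                remainder = (remainder << 1) | bitAt(data, pos++);
            }
            result[n] = (quotient << k) | remainder;
        }
        return result;
    }

    private static int bitAt(byte[] data, int pos) {
        return (data[pos >>> 3] >>> (7 - (pos & 7))) & 1;
    }
}
```

Under these assumptions, the value 9 with k = 2 has quotient 2 and remainder 1, so it is written as the bits 00101, and `encode(new int[]{9}, 2)` yields the single byte 0x28 after zero padding.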

The paper that describes and evaluates the finite-state decoder can be found at https://www.researchgate.net/publication/344635127_Efficient_Finite-State_Decoding_of_Rice_Codes_for_Large_Alphabets
