#                                              Petite
#                                              ------
#                         Storing key-value strings in zip files since 2017

import petite
dbpath='path/my_dbm_file'

# Open an existing dbm file, or create a new one with that name
pdict= petite.zdbm(dbpath)

# Use it like a regular dictionary
pdict['1st_entry']='my_value1'
pdict['2nd_entry']='my_value2'

# As expected in a dictionary
pdict['1st_entry']='new_value'         # my_value1 is overrwritten

# pdict.keys() to get a list of keys
for key in pdict:
  print (key)

# pdict.pop(), pdict.update(another_dictionary), pdict.clear() are quite available
del pdict['2nd_entry']

# Deleting or modifying entries makes them outdated, but they remain within the database.
# Call compact() or purge() when desired to remove them permanently.
pdict.compact()

# Don't forget to close. context manager notation (with .. :) is also available.
pdict.close()

# On our next visit..
pdict= petite.zdbm(dbpath)
pdict['1st_entry']                     #outputs 'new_value'

# The codebase isn't large and I daresay easy on the eyes, peek around for fun and profit

Petite

A simple key-value storage
Single interface across platforms
Strives to reduce hard-disk footprint*.

Python offers persistent key-value storage (saving dictionaries to disk) via the dbm module. However, I was unsatisfied with dbm -

The API to all backends but the deprecated Berkeley DB (dbhash) is not complete - no iteration, no clear() method.
While Berkeley DB offered a complete API, and was available on both Windows and Linux, it was deprecated on Python 3.
Thus, as of Python 3, the supported backends (gdbm/ndbm) exist only on some Unix systems, and need to be handled differently than dumbdbm.
The only cross-platform dbm option is Python's own dumdbm, which is self proclaimedly lame.

Solutions I've tried:

LMDB - cross platform, has a Python package, used by SQLite, data scientists and many more. This is a well acclaimed k-v solution that uses transactions, thus complicating the approach somewhat. Most importantly (to my needs), [link]it does not support compression.
semidbm - a much better python dbm implementation. while fast and pure python (pip install always works, no need for the 3Gb Visual 2014 monster thingy on Windows), it too offers no built-in compression.

What does Petite offer?

A persistent dictionary interface
no need for sync() / commit()
Small and portable - A dime over 300 lines total.

Drawbacks:

- Zip does not offer solid-block compression, so not much space can be saved.
While adding entries is quick, compacting entails rebuilding the database, akin to semidbm.

To-do:

Find if it's somehow possible to regenerate the database/zipfile without de/recompression, perhaps using ZipFile's compress_size, header_offset
The small footprint goal is not currently achieved. Therefore, add support for solid-block compression, currently looking at LZMA or 7z.
benchmark against other Python solutions.

Public Domain.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.rst

README.rst

Petite

Solutions I've tried:

What does Petite offer?

Drawbacks:

To-do:

Files

README.rst

Latest commit

History

README.rst

File metadata and controls

Petite

Solutions I've tried:

What does Petite offer?

Drawbacks:

To-do: