Skip to content

Latest commit

 

History

History
94 lines (71 loc) · 2.23 KB

README.md

File metadata and controls

94 lines (71 loc) · 2.23 KB

Business Registry Download

This is a tool to download business registry data from Estonian, Finnish, Latvian, Lithuanian and Czech business registers. These files are usually updated daily by the registers and contain information about companies, their officers, and other relevant information.

Happy to take PRs for other countries!

It's easy to load these files with dlt, duckdb or transform them with dbt and integrate this into your data pipelines.

The downloads are all done in parrallel and async so it's pretty fast.

To use from command line

Make sure you have uv installed

curl -LsSf https://astral.sh/uv/install.sh | sh

Run with default settings:

uvx registry_downloader

Or override options for download directory, countries, and override URL:

uvx registry_downloader --download-dir "./downloads" --countries ee --override-url ee=https://avaandmed.ariregister.rik.ee/et/avaandmete-allalaadimine

To use as a library

You can install the registry_downloader package using either pip or uv. Here are examples for both:

Using pip:

pip install registry_downloader

Using uv:

uv add registry_downloader

Run with default settings:

import asyncio
from registry_downloader import run_downloader

async def main() -> None:
    await run_downloader()

if __name__ == "__main__":
    asyncio.run(main()) 

Or override options for download directory, countries, and override URL:

import asyncio
from registry_downloader import run_downloader

async def main() -> None:
    await run_downloader(
        download_dir="./downloads",
        countries=["ee"],
        override_url=["ee=https://avaandmed.ariregister.rik.ee/et/avaandmete-allalaadimine"]
    )

if __name__ == "__main__":
    asyncio.run(main()) 

To develop

  1. Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
  1. Create a virtual environment and activate it
uv venv && source .venv/bin/activate
  1. Install dependencies and ensure the virtual environment is in sync
uv sync
  1. Build the project or run it locally with defaults
uv build

or

uv run src/registry_downloader