It parses files from https://developer.imdb.com/non-commercial-datasets/ in C++ and saves it to a postgres Database
Inspired by https://github.com/Totto16/imdb-dataset / https://github.com/Totto16/imdb-dataset-parser/ / https://github.com/andreivinaga/imdb-dataset
This preseeds a postgresql database with the whole data to a docker image, that can be used easily.
Use the docker images on GitHub as base database image, e.g. latest or also specific dates 20241126. Use it like a normal postgres:16-alpine
docker image, and you have the table imdb
, which already has the data.
The code under this repo is under MIT License, but the data from IMDb is under a Non-Commercial License, see here