A Dockerfile setting up OpenRefine 2.6 with some useful extensions, among which:
- RDF extension to bring Linked Data capabilities to OpenRefine
- NER extension to allow Named Entity Recognition on texts
- Geo extension that adds geo utilities both on conversion and visualization
- extra ctu extension to extract email addresses, urls and phone numbers from texts
This docker is hosted on the official docker.io hub. Running it is as simple as:
docker run -p 80:3333 spaziodati/openrefine
If you want refine projects to be persistent, you must mount /mnt/refine
as follows:
docker run -p 80:3333 -v /path-to-host:/mnt/refine spaziodati/openrefine
You can also increase the max size of the heap, by specifying the REFINE_MEMORY environment variable:
docker run -p 80:3333 -e REFINE_MEMORY=24G spaziodati/openrefine