Skip to content

Using the database snapshots

Luna McNulty edited this page Feb 23, 2024 · 1 revision

Exports of the full database content (minus user information) are available at https://incidentdatabase.ai/research/snapshots/. These are automatically generated from a backup script.

The snapshots contain CSV summaries for some parts of the data – everything else is contained in BSON files under /mongodump_full_snapshot/aiidprod/<collection-name>.bson. You can read those with bsondump. You can also load the data into your own Atlas instance.

Snapshot contents as of 2024-02-23:

mongodump_full_snapshot
├── aiidprod
│   ├── candidates.bson
│   ├── candidates.metadata.json
│   ├── checklists.bson
│   ├── checklists.metadata.json
│   ├── classifications.bson
│   ├── classifications.metadata.json
│   ├── duplicates.bson
│   ├── duplicates.metadata.json
│   ├── entities.bson
│   ├── entities.metadata.json
│   ├── incidents.bson
│   ├── incidents.metadata.json
│   ├── migrations.bson
│   ├── migrations.metadata.json
│   ├── quickadd.bson
│   ├── quickadd.metadata.json
│   ├── reports.bson
│   ├── reports.metadata.json
│   ├── submissions.bson
│   ├── submissions.metadata.json
│   ├── taxa.bson
│   └── taxa.metadata.json
├── classifications_cset.csv
├── duplicates.csv
├── incidents.csv
├── license.txt
├── quickadd.csv
├── reports.csv
├── submissions.csv
└── translations
    ├── reports_en.bson
    ├── reports_en.metadata.json
    ├── reports_es.bson
    ├── reports_es.metadata.json
    ├── reports_fr.bson
    └── reports_fr.metadata.json

See also this comment on issue #2663.