Skip to content

pdssp/pds_crawler

Repository files navigation

pds-crawler

image

image

image

ETL to index PDS data to pdssp

graph TD
  A[PDS ODE Web Service - collection] --> |JSON| D(Extraction)
  B[PDS ODE Web Service - records] --> |JSON| E(Extraction)
  C[PDS ODE Web Site] --> |REFERENCE_CATALOG, MISSION_CATALOG,<br>PERSONNEL_CATALOG, INSTRUMENT_CATALOG,<br>INSTRUMENT_HOST_CATALOG,DATA_SET_CATALOG,<br>VOL_DESC, DATA_SET_MAP_PROJECTION_CATALOG| F(Extraction)
  E(Extraction) --> |Files| H[Storage File System]
  F(Extraction) --> |Files| M[Storage File System]
  D(Extraction) --> |JSON PdsRegistryModel| I[HDF5]
  I[HDF5] --> |PdsRegistryModel| N[Transform]
  M[Storage File System] --> |PdsRecordsModel, DataSetMapProjectionModel,<br>MissionModel, ReferencesModel,<br>PersonnelsModel, VolumeModel,<br>InstrumentModel, InstrumentHostModel,<br>DataSetModel| L[Transform]
  H[Storage File System] --> |PdsRecordModel| N[Transform]
  I[HDF5] --> |PdsRegistryModel| L[Transform]
  N[Transform] --> |STAC Item, STAC Collection, STAC Catalog| O[STAC repository]
  L[Transform] --> |STAC Collection, STAC Catalog| O[STAC repository]
Loading

Stable release

To install pds-crawler, run this command in your terminal:

$ pip install git+https://github.com/pdssp/pds_crawler.git

From sources

The sources for pds-crawler can be downloaded from the Github repo.

You can either clone the public repository:

$ git clone git://github.com/pdssp/pds_crawler

Or download the tarball:

$ curl -OJL https://github.com/pdssp/pds_crawler/tarball/master

Once you have a copy of the source, you can install it with:

$ make  # install in the system root
$ make user # or Install for non-root usage

Development

$ git clone https://github.com/pdssp/pds_crawler
$ cd pds_crawler
$ make prepare-dev
$ source .pds_crawler
$ make install-dev

To get more information about the preconfigured tasks:

$ make help

Usage

To use pds-crawler in a project:

import pds_crawler

Run tests

$make tests

Author

👤 Jean-Christophe Malapert

🤝 Contributing

Contributions, issues and feature requests are welcome!<br />Feel free to check [issues page](https://github.com/pdssp/pds_crawler/issues). You can also take a look at the [contributing guide](https://github.com/pdssp/pds_crawler/blob/master/CONTRIBUTING.rst)

📝 License

This project is [GNU Lesser General Public License v3](https://github.com/pdssp/pds_crawler/blob/master/LICENSE) licensed.