Intelcomp's Evaluation Workbench (EWB) API Dockers

Intelcomp's Evaluation Workbench (EWB) API Dockers

Overview

The Evaluation Workbench (EWB) API Dockers comprise a multi-container application that includes essential components like the Solr cluster and REST APIs for Topic Modeling, Inference, and Classification services. This multi-container application is orchestrated using a docker-compose script, connecting all services through the ewb-net network.

Main components

Topic Modeling Service

This service comprises a RESTful API that utilizes the Solr search engine for data storage and retrieval. It enables the indexing of logical corpora and associated topic models, formatted according to the specifications provided by the topicmodeler. Additionally, it facilitates information retrieval through a set of queries.

This system relies on the following services:

ewb-tm: This service hosts the Topic Modeling's RESTful API server. It is constructed using the Dockerfile located in the ewb-tm directory. It has dependencies on the Solr service and requires access to the following mounted volumes: ./data/source, ./data/inference, and ./ewb_config. These volumes are crucial for accessing necessary data from the ITMT (the project folder containing the topic models) and for delivering results obtained through the EWB or generated via the Inference service. The ewb_config volume also houses some important configuration variables.
ewb-solr: This service operates the Solr search engine. It employs the official Solr image from Docker Hub and relies on the zoo service. The service mounts several volumes, including:
- The Solr data directory (./db/data/solr:/var/solr) for data persistence.
- Two custom Solr plugins:
  - solr-ewb-jensen-shanon-distance-plugin for utilizing the Jensen–Shannon divergence as a vector scoring method.
  - solr-ewb-jensen-sims for retrieving documents with similarities within a specified range.
- The Solr configuration directory (./solr_config:/opt/solr/server/solr) to access the specific Solr schemas for EWB.
ewb-solr-initializer: This service is temporary and serves the sole purpose of initializing the mounted volume /db/data with the necessary permissions required by Solr.
ewb-zoo: This service runs Zookeeper, which is essential for Solr to coordinate cluster nodes. It employs the official zookeeper image and mounts two volumes for data and logs.
ewb-solr-config: This service handles Solr configuration. It is constructed using the Dockerfile located in the solr_config directory. This service has dependencies on the Solr and zoo services and mounts the Docker socket and the bash_scripts directory, which contains a script for initializing the Solr configuration for EWB.

Inference Service

This service serves as a Topic Model Inferencer, constructed using the Dockerfile found in the ewb-inferencer directory. It relies on access to mounted volumes at ./data/source, ./data/inference, and ./ewb_config.

Its primary purpose is to be used internally by the Topic Modeling Service, although it can also function as a standalone component.

Classification Service

This service serves as an inference system for hierarchical classification, built on top of the clf-inference-intelcomp library, that allows to classify texts based on a given hierarchy of language models. It relies on access to mounted volumes at ./data/classifier and ./ewb_config.

Requirements

Python requirements files (ewb-tm, ewb-inferencer and ewb-classifier).

Note that the requirements are directly installed in their respective services at the building-up time.

Sample data to start using the EWB API Dockers

A sample corpus and model can be downloaded from here.

Name		Name	Last commit message	Last commit date
Latest commit History 150 Commits
aux_files/similarities_calculation		aux_files/similarities_calculation
ewb-classifier		ewb-classifier
ewb-inferencer		ewb-inferencer
ewb-tm		ewb-tm
ewb_config		ewb_config
solr_config		solr_config
solr_plugins		solr_plugins
static		static
uc3m_deployment		uc3m_deployment
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Intelcomp's Evaluation Workbench (EWB) API Dockers

Overview

Main components

Topic Modeling Service

Inference Service

Classification Service

Requirements

Sample data to start using the EWB API Dockers

About

Releases

Packages

Contributors 3

Languages

License

IntelCompH2020/EWB

Folders and files

Latest commit

History

Repository files navigation

Intelcomp's Evaluation Workbench (EWB) API Dockers

Overview

Main components

Topic Modeling Service

Inference Service

Classification Service

Requirements

Sample data to start using the EWB API Dockers

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages