CTUAvastLab/datasets

Datasets

This repository contains datasets used at CTUAvastLab. Currently it contains only the Mutagenesis dataset.

Mutagenesis dataset:

Summary

The dataset comprises 230 molecules trialed for mutagenicity on Salmonella typhimurium. A subset of 188 molecules is learnable using linear regression. This subset was later termed the "regression friendly" dataset. The remaining subset of 42 molecules is named the "regression unfriendly" dataset. (Taken from relational.fit.cvut.cz/.)

Currently, this repository contains only Mutagenesis_188.

Website

relational.fit.cvut.cz/, where the original data is hosted as an SQL database.

Original source

See separate file.

Data structure

mutagenesis/data.json contains the Mutagenesis_188 data as a JSON list of 188 structures, each representing one molecule.

mutagenesis/meta.json contains metadata about the dataset, as JSON.
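
As a minimal sketch of how the two files might be read, the Python snippet below loads both with the standard json module. The function name load_mutagenesis and its root argument are hypothetical and not part of this repository. The only structure assumed is what is stated above: data.json holds a JSON list with one entry per molecule. The fields inside each entry and inside meta.json are not described here, so the sketch does not inspect them.

```python
import json
from pathlib import Path


def load_mutagenesis(root="mutagenesis"):
    """Read data.json and meta.json from `root` and return (molecules, meta).

    Hypothetical helper: the name and signature are illustrative only.
    """
    root = Path(root)

    # data.json: a JSON list, one entry per molecule (per the README).
    with open(root / "data.json", encoding="utf-8") as f:
        molecules = json.load(f)

    # meta.json: dataset metadata; its schema is not assumed here.
    with open(root / "meta.json", encoding="utf-8") as f:
        meta = json.load(f)

    # Guard against a file that does not match the documented layout.
    if not isinstance(molecules, list):
        raise ValueError("expected data.json to hold a JSON list of molecules")

    return molecules, meta
```

Called from the repository root as `molecules, meta = load_mutagenesis()`, `len(molecules)` should match the 188 structures described above if the files are as documented.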
