Skip to content

Accelerate Molecular Biology Research with Machine Learning

License

AGPL-3.0, Unknown licenses found

Licenses found

AGPL-3.0
license.md
Unknown
license-faq.md
Notifications You must be signed in to change notification settings

DLS5-Omics/multimolecule

Repository files navigation

Tip

Accelerate Molecular Biology Research with Machine Learning

Codacy - 代码质量 Codacy - Coverage CodeCov - Coverage

PyPI - Version PyPI - Python Version Downloads Statistics

License: AGPL v3

Introduction

Welcome to MultiMolecule (浦原), a foundational library designed to accelerate scientific research in molecular biology through machine learning. MultiMolecule provides a comprehensive yet flexible set of tools for researchers aiming to leverage AI with ease, focusing on biomolecular data (RNA, DNA, and protein).

Overview

MultiMolecule is built with flexibility and ease of use in mind. Its modular design allows you to utilize only the components you need, integrating seamlessly into your existing workflows without adding unnecessary complexity.

  • data: Smart [Dataset][multimolecule.data.Dataset] that automatically infer tasks—including their level (sequence, token, contact) and type (classification, regression). Provides multi-task datasets and samplers to facilitate multitask learning without additional configuration.
  • datasets: A collection of widely-used biomolecular datasets.
  • module: Modular neural network building blocks, including embeddings, heads, and criterions for constructing custom models.
  • models: Implementation of state-of-the-art pre-trained models in molecular biology.
  • tokenisers: Tokenizers to convert DNA, RNA, protein and other sequences to one-hot encodings.

Installation

Install the most recent stable version on PyPI:

pip install multimolecule

Install the latest version from the source:

pip install git+https://github.com/DLS5-Omics/MultiMolecule

Citation

If you use MultiMolecule in your research, please cite us as follows:

@software{chen_2024_12638419,
  author    = {Chen, Zhiyuan and Zhu, Sophia Y.},
  title     = {MultiMolecule},
  doi       = {10.5281/zenodo.12638419},
  publisher = {Zenodo},
  url       = {https://doi.org/10.5281/zenodo.12638419},
  year      = 2024,
  month     = may,
  day       = 4
}

License

We believe openness is the Foundation of Research.

MultiMolecule is licensed under the GNU Affero General Public License.

For additional terms and clarifications, please refer to our License FAQ.

Please join us in building an open research community.

SPDX-License-Identifier: AGPL-3.0-or-later

About

Accelerate Molecular Biology Research with Machine Learning

Topics

Resources

License

AGPL-3.0, Unknown licenses found

Licenses found

AGPL-3.0
license.md
Unknown
license-faq.md

Stars

Watchers

Forks

Packages

No packages published

Languages