Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
dtreai authored Jan 3, 2025
1 parent a76eb2a commit 8a40bf0
Showing 1 changed file with 1 addition and 34 deletions.
35 changes: 1 addition & 34 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,17 +1,11 @@
<p align="center">
<img src="./assets/original-31669f20cced100c1e16a68a6974b8b4.jpg" width="500"/>
</p>
<h1 align="center">SynthToT</h1>
<h3 align="center">Generate synthetic dataset for your training dataset through deliberate problem-solving</h3>
<h6 align="center"><a href="https://discord.gg/5TesGau62q">Join Mathematics & AI Institute Discord!</a></h6>

<hr>

## Table of Content

- [Introduction](#introduction)
- [Features](#features)
- [Enterprise Grid](#enterprise-grid)
- [Usage](#usage)
- [Initial setup](#initial-setup)
- [Preparing Input Data List](#preparing-input-data-list)
Expand All @@ -24,14 +18,7 @@

## Introduction

SynthToT is an simple AI agent system powered by [Langchain](https://www.langchain.com/). SynthToT agent developed by [Mathematics and AI Institute.](https://www.matyz.org/en/) It is specifically designed to facilitate the automated generation of synthetic datasets, which are crucial for the training of large language models. SynthToT Agent utilize the renowned [Tree of Thoughts: Deliberate Problem Solving with Large Language Models](https://arxiv.org/abs/2305.10601) et al. Shunyu Yao, Dian Yu. Tree-of-Thoughts prompting strategy, *"which generalizes over the popular Chain of Thought approach to prompting language models, and enables exploration over coherent units of text (thoughts) that serve as intermediate steps toward problem solving. ToT allows LMs to perform deliberate decision making by considering multiple different reasoning paths and self-evaluating choices to decide the next course of action, as well as looking ahead or backtracking when necessary to make global choices."*

<hr>

> ### About Mathematics & AI Institute
> At the Math and AI Institute, our mission is to bring state-of-the-art research at the intersection of AI and mathematics. We specialize in generative AI technologies, leveraging their potential to empower industries with innovative projects. Generative AI, a cutting-edge field, enables the creation of diverse content, including text, images, and videos, while adapting seamlessly to new data. While we acknowledge the potential risks associated with the rapid evolution of Generative AI, our team is dedicated to ensuring the safety and interpretability of these tools. Our core areas of focus include AI Safety, Industry Integration, Fundamental Research, and Tailored AI Training for various sectors.
> For more information about us, please check out website at [matyz.org/en](https://matyz.org/en)
SynthToT is an simple AI agent system powered by [Langchain](https://www.langchain.com/). It is specifically designed to facilitate the automated generation of synthetic datasets, which are crucial for the training of large language models. SynthToT Agent utilize the renowned [Tree of Thoughts: Deliberate Problem Solving with Large Language Models](https://arxiv.org/abs/2305.10601) et al. Shunyu Yao, Dian Yu. Tree-of-Thoughts prompting strategy, *"which generalizes over the popular Chain of Thought approach to prompting language models, and enables exploration over coherent units of text (thoughts) that serve as intermediate steps toward problem solving. ToT allows LMs to perform deliberate decision making by considering multiple different reasoning paths and self-evaluating choices to decide the next course of action, as well as looking ahead or backtracking when necessary to make global choices."*

<hr>

Expand Down Expand Up @@ -87,26 +74,6 @@ You can view example input_data and output_data from under the `examples` folder

- **Chain Assembly:** Generates a list of LLM chained instances based on the predefined templates and output keys.

## Enterprise Grid

At Mathematics and AI Institute, we're proud to offer the enterprise edition of SynthToT, tailored specifically for enterprise users. SynthToT Enterprise Edition provides advanced features designed to deliver a streamlined, scalable, and production-grade system synthetic data generation agent for your organization's needs. With SynthToT Enterprise, you can meet the demands of large-scale data products and enable a complex synthetic dataset generation system for safety-critical applications.

- Autotransformers: Integrate autotransformers for automated data transformation, enabling efficient and seamless preprocessing of input data for synthetic dataset generation.

- User Interface (UI): Access SynthToT Enterprise Edition through an intuitive user interface, providing a user-friendly experience for configuring settings, monitoring processes, and accessing generated datasets.

- Scalability: Scale SynthToT Enterprise Edition effortlessly to accommodate growing datasets and increased computational demands, ensuring seamless performance under heavy workloads.

- Integration Capabilities: Integrate SynthToT Enterprise Edition seamlessly with existing event-driven data processing systems, and MLOps pipelines for enhanced interoperability and continous data flow.

- Automated Quality Assurance: Utilize automated quality assurance mechanisms to ensure the accuracy, consistency, and reliability of generated synthetic datasets, reducing manual intervention and error rates.

- Plugable Thought Templates: Customize the synthetic data generation process by plugging in thought templates, allowing users to define and utilize their own templates tailored to specific use cases and domains.

### Get Started with SynthToT Enterprise Edition

To get started with SynthToT Enterprise Edition, [contact our team](mailto:info@matyz.org) for a demo or trial.

## Usage

### Initial setup
Expand Down

0 comments on commit 8a40bf0

Please sign in to comment.