In this page, you can train a GMM on the encoded HT-SELEX data.
Navigate to the GMM Trainer by clicking the GMM Trainer
link in the top menu or the navigation bar.
Begin by clicking the + Add a New Training Job
button.
Set the preprocessing parameters for your training job, then click the Next
button.
Parameters
- ① Model Type:
- Select the VAE model registered in the database.
- ② GMM name:
- Assign a name to your GMM model.
- ③ Minumum number of GMM components:
- Set the lower limit for GMM components.
- ④ Maximum number of GMM components:
- Set the upper limit for GMM components.
- ⑤ Step size of the search:
- Define the increment for searching GMM components.
- ⑥ Number of trials on each number of components:
- Specify the number of iterations for each number of GMM commponents.
The GMM training is runned multiple times to select the best BIC model among the models trained with the different number of GMM components. The number of the training is determined by the following formula:
More trials increase accuracy, but it takes longer to train.
After submission, the training begins, and progress is displayed in the Running
job list.
Once training completes, the Add to Viewer Dataset
button becomes active. Click it to add the model to the dataset.
By default, the best number of components is selected based on the BIC score. You can change this using the dropdown menu.
When you click the button, the following dialog will be shown.
Fill in the name of the model and click the Add to Viewer Dataset
button.
Proceed to the Bayesian Optimization page to optimize aptamer sequence on latent space.