Skip to content
#

datageneration

Here are 16 public repositories matching this topic...

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

  • Updated Dec 19, 2024
  • Python

The aim of this study is to determine the machine failure by construction of classifier model on predictive maintenance dataset. The class imbalance data compromise the performance of the constructed model and this is addressed by assessing the oversampling methods with Multi-Task Learning (MTL)architecture. Also, to gauge the performance of aux…

  • Updated Sep 22, 2022
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the datageneration topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the datageneration topic, visit your repo's landing page and select "manage topics."

Learn more