Synthetic data generators for tabular and time-series data
Updated Dec 10, 2024 - Jupyter Notebook
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for testing, POCs, and other uses in Databricks environments, including Delta Live Tables pipelines.
Targeted Data Generation with Large Language Models
PulseBat dataset for retired battery reuse and recycling decision making. We open-source the collected PulseBat dataset for pulse voltage response generation of retired batteries across random retirement conditions, i.e., state of charge (SOC) conditions, facilitating downstream state of health (SOH) estimation tasks.
NextBrain's data Anonymizer tool ensures top-tier privacy by irreversibly obscuring personal identifiers without storing any data. Ideal for businesses prioritizing data security and compliance, it offers a reliable solution for safeguarding sensitive information.
Documentation for Data Caterer
This sample shows how to use a materialized view in Spring Framework.
An OCR system trained on manually generated handwriting datasets to classify handwritten text into digital text.
The aim of this study is to predict machine failure by building a classifier model on a predictive maintenance dataset. Class imbalance in the data compromises the performance of the constructed model; this is addressed by assessing oversampling methods with a Multi-Task Learning (MTL) architecture. Also, to gauge the performance of aux…
Visual data generation for Ogden's Basic English words.
NIP research project working with NIP-RW, DIP, and subcellular location data from Uniprot.org. Work in progress.
Codebase for extracting banks and ATMs from OSM
Mini project on synthetic data generation, implementing the CTGAN algorithm on tabular data.
Detecting fraudulent transactions in a synthetic dataset with machine learning.
RandomUserGenerator is a web application that uses a random user data API to generate and display detailed user profiles.
Repository for publishing a website about the data management practices of the Momentum project.