Data-Imputation

Handling the missing data problems.

Impute the data by calculating the maximum likelihood iteratively.
Imputation accuracy is verified with different Machine Learning classifiers and Neural Networks.
Scikit-learn and Pytorch packages need to be installed before running the Python code.

We use the algorithm to handle the missing data problems in electronic health records (EHRs). Due to the privacy, we cannot release the original EHR dataset.

inputDataPreprocessing.py: preprocess the EHRs data and save them as *.npy files.

generate_training_X_Y.py: impute the data from the preprocessed *.npy files.

missingData.py: imputation module.

classifier.py: train and test the imputed data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Data-Imputation

Files

README.md

Latest commit

History

README.md

File metadata and controls

Data-Imputation