Normalization stage is checking for aggregate_code_metadata/codes.parqet columns and metadata/codes.parquet columns in data/codes.parquet #147
Labels
bug
Something isn't working
MEDS-Transform
Issues for the data pre-processing transformations in MEDS_transforms
priority:critical
A critical priority issue that should be solved and pushed to a new minor version release ASAP.
Testing
The normalization stage is failing for me because there is no
data/codes.parquet
file.When I try to copy over the metadata/codes/parquet file:
cp "${MEDS_DIR}/data/metadata/codes.parquet" "${MEDS_DIR}/data/codes.parquet"
I get an error that there is no
values/sum
columnAnd when I try to copy over the aggregate_code_metadata/codes.parquet:
cp "${MEDS_DIR}/aggregate_code_metadata/codes.parquet" "${MEDS_DIR}/data/codes.parquet"
I get an error that there is no "code/vocab_index" column.
What worked for me as a temporary solution was to spin up a simple hydra script to generate a code/vocab_index column:
This issue exists on the
dev
branch and on release 0.0.4The text was updated successfully, but these errors were encountered: