-
Notifications
You must be signed in to change notification settings - Fork 304
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sub-100% Data Validity #1899
Comments
Hi there @prupireddy it looks like the reason the score isn't 100% is because the PARSynthesizer model isn't adhering to the min and max values in your original dataset column. I have 2 questions:
|
Thank you |
Hi @prupireddy and @srinify there is currently a known issue that I would start by confirming whether this column (
|
Hi @prupireddy I noticed you closed the issue. Does that mean you were able to come up with a resolution? For our knowledge (and perhaps to help others running into the same problem), you could clarify what the issue was? |
Since the true data type was supposed to be numerical, I followed your first suggestion and just hardcoded it to numerical (as opposed to having it get detected which gave categorical). This resolved the issue. |
Great. Appreciate the confirmation! |
Environment Details
Please indicate the following details about the environment in which you found the bug:
Error Description
I have a PAR model running on a health dataset. I noticed that my Data Validity drops from 100% when I include a Days_Supplied feature. The website said to contact you if that occurs.
DataValidityIssue.xlsx
Steps to reproduce
I've attached an excel file that has the true values on the left and the synthetic on the right. For privacy reasons, I cannot send the full data and code.
The text was updated successfully, but these errors were encountered: