Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incorporate result calculation and output control #71

Open
matthiasgomolka opened this issue May 10, 2021 · 2 comments
Open

Incorporate result calculation and output control #71

matthiasgomolka opened this issue May 10, 2021 · 2 comments
Labels
enhancement New feature or request question Further information is requested

Comments

@matthiasgomolka
Copy link
Owner

Would it be useful to enhance the functionality of sdc_descriptives() such that the function calculates and checks a result in a single step?

Right now, users have to calculate results and then need to show that these results are fine.

This might be comparatively difficult to program as it needs to be really flexible.

@matthiasgomolka matthiasgomolka added enhancement New feature or request question Further information is requested labels May 10, 2021
@tbecker2511
Copy link
Collaborator

@matthiasgomolka I remember we thought about it in the first place, but I don't know exactly why we decided not to do it. It could work in a similar way as sdc_extreme(). If the data resp. the descriptive statistics comply with the rules, then (grouped) descriptive statistics could be output automatically. It could work like the function summary() and output e.g. mean, median, sd, quartiles.

I think we had seen the problem in that descriptive statistics might be calculated too customized. Therefore the function could not capture all variants that are desired

@matthiasgomolka
Copy link
Owner Author

@tbecker2511 Yes, I think your last paragraph captures our reasoning quite well. We opted against it because it's hard to provide a function which is flexible enough but still covers our needs in terms of safety.

Maybe this will become a little easier with the next release of data.table: Rdatatable/data.table#4247

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants