Added validation function for `serodata` and `fit_seromodel`. Fixes #148 #154

jpavlich · 2024-01-26T21:13:58Z

Please check that the validations I do for each column in serodata are correct.

R/modelling.R

codecov-commenter · 2024-02-02T21:58:20Z

Codecov Report

Attention: 12 lines in your changes are missing coverage. Please review.

Comparison is base (df2322a) 68.26% compared to head (e0d7d1b) 68.63%.

❗ Current head e0d7d1b differs from pull request most recent head 2a5820a. Consider uploading reports for the commit 2a5820a to get more accurate results

Files	Patch %	Lines
R/modelling.R	84.61%	12 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #154      +/-   ##
==========================================
+ Coverage   68.26%   68.63%   +0.37%     
==========================================
  Files          10       10              
  Lines        1736     1779      +43     
==========================================
+ Hits         1185     1221      +36     
- Misses        551      558       +7

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

jpavlich · 2024-02-03T00:24:01Z

@ntorresd I just added type checking for serodata. Do not merge just yet. As soon as you confirm me the missing checks and the type checks are correct, I will write tests and method documentations, then we can merge this PR.

ntorresd · 2024-02-06T13:29:12Z

@ntorresd I just added type checking for serodata. Do not merge just yet. As soon as you confirm me the missing checks and the type checks are correct, I will write tests and method documentations, then we can merge this PR.

It seems like all the necessary checks are in place, thanks @jpavlich!

…anged `validate_serodata` to `validate_prepared_serodata` within fit_seromodel

… `prepare_serodata`

`validate_prepared_serodata` also calls `validate_serodata`

jpavlich · 2024-02-07T22:10:03Z

@ntorresd Please let me know if validate_prepared_serodata is ok. After merging into main I will open another branch to add documentation and tests

…im_data`

…alidate_prepared_serodata`

…data`

ntorresd · 2024-02-09T13:27:52Z

R/modelling.R

+    missing <- optional_cols[which(!(optional_cols %in% colnames(serodata)))]
+    warning(
+      "The following optional columns in `serodata` are missing.",
+      "Consider including them to get more information from this analysis",


This is currently printing the following when country, test and antibody are missing:
The following optional columns in serodata are missing.Consider including them to get more information from this analysiscountry, test, antibody
Please add a line break at the end of line 56.

ntorresd · 2024-02-09T13:42:46Z

R/modelling.R

+  ) {
+    missing <- must_have_cols[which(!(must_have_cols %in% colnames(serodata)))]
+    stop(
+      "The following mandatory columns in `serodata` are missing.",


Please add a line break at the end of this message too.

ntorresd · 2024-02-09T13:58:08Z

R/modelling.R

+
+  # Check that the serodata has the necessary columns to fully
+  # identify the age groups
+  stopifnot(


Thinking about the implementation of the age-varying models and the additional time-varying models I realized that this kind of validation may not be necessary given that we will need age_min and age_max anyway to correctly specify the chunks on which the FoI values are estimated, meaning that both of these should be validated in validate_serodata as mandatory columns, whereas age_mean_f should be validated on validate_prepared_serodata. This way the data validation will be simpler.

ntorresd · 2024-02-09T13:58:37Z

R/modelling.R

+
+
+validate_serodata <- function(serodata) {
+  col_types <- list(


Please add both age_min and age_max to this list

ntorresd · 2024-02-09T14:08:41Z

R/modelling.R

+}
+
+validate_prepared_serodata <- function(serodata) {
+  col_types <- list(


total, counts and tsur are already validated by validate_serodata(serodata) in line 113. Here we should just make sure that both age_mean_f and birth_year had been added to the data.

For the time being we can also add prev_obs, prev_obs_lower and prev_obs_upper (which should be numeric) for consistency with the current version of prepare_serodata. Although they're not needed for modelling, they're currently used for plotting purposes, so to simplify data validation for those functions I think it's worth adding them here. In the future we may refactor prepare_serodata for it just to prepare the data for modelling and compute the prevalence with its binomial confidence interval internally in the plotting functions (if we decide to keep the visualization module in the package), but we can decide this later.

jpavlich · 2024-03-20T19:03:45Z

Fixes #152 since group_sim_data calls prepare_serodata. The latter performs the required validations that are implemented in this PR

jpavlich requested a review from ntorresd January 26, 2024 21:13

jpavlich linked an issue Jan 26, 2024 that may be closed by this pull request

Input validation for fit_seromodel #148

Closed

ntorresd requested changes Feb 1, 2024

View reviewed changes

R/modelling.R Outdated Show resolved Hide resolved

jpavlich requested a review from ntorresd February 2, 2024 21:37

jpavlich added 3 commits February 7, 2024 16:11

Added validation function for serodata and fit_seromodel. Fixes #148

2323e1a

rewrote validate_serodata, to be used inside prepare_serodata. Ch…

f10c0ba

…anged `validate_serodata` to `validate_prepared_serodata` within fit_seromodel

added type checking functions for serodata before and after calling…

ef879d7

… `prepare_serodata`

jpavlich force-pushed the 148-input-validation-for-fit_seromodel branch from 4977060 to ef879d7 Compare February 7, 2024 21:27

Updated parameter names that are validated in fit_seromodel.

b7752df

`validate_prepared_serodata` also calls `validate_serodata`

jpavlich and others added 5 commits February 12, 2024 16:01

Added a simple "simulation" of age_min and age_max to `generate_s…

a066e39

…im_data`

Removed age_min and age_max mutations to sim_data

b8bbe17

fixed newlines and added missing checks to validate_serodata and `v…

2c42e9e

…alidate_prepared_serodata`

fixed newlines and added missing checks to validate_serodata and `v…

1d9612b

…alidate_prepared_serodata`

Minor correction to age_min, age_max generation in `generate_sim_…

2a5820a

…data`

ntorresd approved these changes Feb 13, 2024

View reviewed changes

ntorresd merged commit a790a1f into main Feb 13, 2024
7 checks passed

ntorresd deleted the 148-input-validation-for-fit_seromodel branch February 13, 2024 00:06

ntorresd mentioned this pull request Mar 8, 2024

Serodata requirements #66

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added validation function for `serodata` and `fit_seromodel`. Fixes #148 #154

Added validation function for `serodata` and `fit_seromodel`. Fixes #148 #154

jpavlich commented Jan 26, 2024

codecov-commenter commented Feb 2, 2024 •

edited

Loading

jpavlich commented Feb 3, 2024

ntorresd commented Feb 6, 2024

jpavlich commented Feb 7, 2024

ntorresd Feb 9, 2024

ntorresd Feb 9, 2024

ntorresd Feb 9, 2024

ntorresd Feb 9, 2024

ntorresd Feb 9, 2024

jpavlich commented Mar 20, 2024

Added validation function for serodata and fit_seromodel. Fixes #148 #154

Added validation function for serodata and fit_seromodel. Fixes #148 #154

Conversation

jpavlich commented Jan 26, 2024

codecov-commenter commented Feb 2, 2024 • edited Loading

Codecov Report

jpavlich commented Feb 3, 2024

ntorresd commented Feb 6, 2024

jpavlich commented Feb 7, 2024

ntorresd Feb 9, 2024

Choose a reason for hiding this comment

ntorresd Feb 9, 2024

Choose a reason for hiding this comment

ntorresd Feb 9, 2024

Choose a reason for hiding this comment

ntorresd Feb 9, 2024

Choose a reason for hiding this comment

ntorresd Feb 9, 2024

Choose a reason for hiding this comment

jpavlich commented Mar 20, 2024

Added validation function for `serodata` and `fit_seromodel`. Fixes #148 #154

Added validation function for `serodata` and `fit_seromodel`. Fixes #148 #154

codecov-commenter commented Feb 2, 2024 •

edited

Loading