README

2024-10-07

MoBa reproducibility project

The MoBa reproducibility project assesses current open science practices in the Norwegian Mother, Father, Child cohort (MoBa) and provides guidelines on being as transparent science practices given the limitations of epidemiological research.

This MoBaRepro repository is a supplement to the working example provided in the paper.

What are the independent effects of age and breastfeeding on height?

In this hypothetical research project, researchers are interested in testing the independent effects of age and breastfeeding duration on height during childhood in MoBa. Additionally, they also intend to test for variation in these effects according to child sex.

Hypotheses

H1: Higher age predicts taller height across childhood. H2: Longer breastfeeding duration will be associated with taller height across childhood. H3: The effect of age on height across childhood differs between males and females. H4: The effect of breastfeeding duration on height across childhood differs between males and females.

Data

The data included in this repository is completely simulated, and bears no resemblance to actual MoBa data other than in structure.

Scripts

All scripts can be found under the scripts subdirectory.

data_sim.R

data_sim.R generates the simulated “original” MoBa dataset with variables for height at 6 months, 18 months, 3 years, 5 years, 7 years, and 8 years, as well as for breastfeeding duration, sex, and birth length. This dataset is subsequently saved in the data subdirectory as simdata.RData.

01_dataprep.R

The 01_dataprep.R script loads the simulated dataset, checks the descriptive statistics and distributions, cleans and recodes the data, and converts the data to long form before the analysis. The processed dataset is saved as data/longdata.RData.

02_mixedmodels.R

The 02_mixedmodels.R script loads the longdata.RData dataset generated in script 01, which is used to create a single multilevel model to test all hypotheses. It also performs two sensitivity analyses: one using a different method of categorising the breastfeed_dur variable, and one using the height variable where values were deemed outliers if they were more than 2 SD away from the mean, and were thus removed. Finally, this script collates the results from all models.

03_synthesis.R

Because MoBa data contains potentially identifiable information, it is not possible to share the actual data alongside analysis scripts. One way of handling this issue is to create a synthetic dataset based on the original dataset. The 03_synthesis.R script provides an example of how to generate such a synthetic dataset and how to assess its preservation of the relationships found in the original dataset. It loads the longdata.RData dataset and creates a synthetic dataset, data/syndata.RData.

04_validate.R

The 04_validate.R script performs the same analyses as in script 02, but on the synthetic dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
MoBaRepro.Rproj		MoBaRepro.Rproj
README.Rmd		README.Rmd
README.md		README.md
simdata.RData		simdata.RData

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

MoBa reproducibility project

What are the independent effects of age and breastfeeding on height?

Hypotheses

Data

Scripts

data_sim.R

01_dataprep.R

02_mixedmodels.R

03_synthesis.R

04_validate.R

About

Releases

Packages

Languages

License

berntgl/MoBaRepro

Folders and files

Latest commit

History

Repository files navigation

README

MoBa reproducibility project

What are the independent effects of age and breastfeeding on height?

Hypotheses

Data

Scripts

data_sim.R

01_dataprep.R

02_mixedmodels.R

03_synthesis.R

04_validate.R

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages