PySCeS ThinLayer: Fit multiple replicates, plot fit result #46

jmrohwer · 2022-06-14T14:57:55Z

This pull request implements 4 features/bug fixes:

The PySCeS TL can now fit datasets with multiple replicates
A function has been added to plot the experimental data together with the fitted model
Various cases where parameters with a zero value were not loaded/saved, have been fixed (if self.value: -> if self.value is not None).
The PySCeS loading of the model has been optimized, the conversion to PSC is only done if the EnzymeMLDoc is newer (this process takes time), and a bug with compartment volumes not equal to 1 has been fixed.

The new functionality is demonstrated with an example notebook that I am sharing with @JR-1991 via Google Colab. Any other reviewers, let me know and I'll be happy to share it. It takes the Model 3 from Scenario 4 from the Lauterbach 2022 paper and fits this with PySCeS. The fitted parameters are the same (within small errors) as obtained with direct fitting in the Jupyter notebook using lmfit and odeint.

This change is

jmrohwer

Reviewable status: 0 of 3 files reviewed, 2 unresolved discussions (waiting on @JR-1991)

pyenzyme/thinlayers/TL_Pysces.py line 288 at r1 (raw file):

        return pd.DataFrame(np.hstack(output).T, columns=self.model.species)

    def _get_replicate_info(self, time):

@JR-1991 I'd be interested to know if there is a better way of doing this. The point is that in the experimental data all the replicates are vertically stacked above each other, yet each replicate requires only a single simulation (we don't have to repeat the simulation), and this is what I'm trying to achieve here.

pyenzyme/thinlayers/TL_Pysces.py line 301 at r1 (raw file):

        return (time[:i].values, num_reps)

    def plot_fit(self):

This is quite useful and if there is more than one reagent for which there is data, they are plotted with different colours. It is of course not as elegant as EnzymeMLDocument.visualize()but that one can use seaborn.FacetGrid because it is only a single dataframe. Here we are plotting data from two different dataframes, and moreover we want to plot the modelling data with more time-points than the experimental data to give smooth curves. So I think the only way to do this is directly with matplotlib, which gives less flexibility for interactive visualization. I have set this to plot 3 columns and the number of rows to be adjusted automatically depending number of measurements. However, I am open for suggestions for improving this. I think a method to quickly visualize a fit with data and model is very important in the library.

JR-1991

Reviewed 3 of 3 files at r1, 1 of 1 files at r2, all commit messages.
Reviewable status: all files reviewed, 2 unresolved discussions (waiting on @jmrohwer)

pyenzyme/thinlayers/TL_Pysces.py line 288 at r1 (raw file):

Previously, jmrohwer (Johann Rohwer) wrote…

@JR-1991 I'd be interested to know if there is a better way of doing this. The point is that in the experimental data all the replicates are vertically stacked above each other, yet each replicate requires only a single simulation (we don't have to repeat the simulation), and this is what I'm trying to achieve here.

Thats a great solution! I was wondering if it might be an option to modify the exportData function to not stack replicates vertically, but to provide a Dataframe for each species. This way everything is stacked horizontally. I have a feeling that this might also be more handy for other use cases. This would then be another optional argument to the function to change behaviour. What do you think?

JR-1991

As we previously discussed, the extension is great. Will merge the changes into main.

Reviewable status: all files reviewed, 2 unresolved discussions (waiting on @jmrohwer)

JR-1991

Reviewable status: all files reviewed, 2 unresolved discussions (waiting on @jmrohwer)

jmrohwer added 4 commits June 8, 2022 13:51

fix fitting of datasets with multiple replicates

2b6b20a

add function to plot data with fitted model

b4a2869

improve loading of PySCeS model, work around compartment volume bug

b55972e

fix SBML export of model with parameter values equal to zero

6fb0709

jmrohwer marked this pull request as ready for review June 14, 2022 15:11

jmrohwer requested a review from JR-1991 June 14, 2022 15:11

jmrohwer commented Jun 14, 2022

View reviewed changes

JR-1991 approved these changes Oct 22, 2022

View reviewed changes

Added missing dependency

c8518d1

JR-1991 approved these changes Oct 22, 2022

View reviewed changes

JR-1991 and others added 2 commits October 22, 2022 10:48

Fixed merge conflict

e64b599

Merge branch 'main' into fit_replicates

8471ea6

JR-1991 merged commit 2efeb21 into main Oct 22, 2022

JR-1991 deleted the fit_replicates branch June 25, 2024 08:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PySCeS ThinLayer: Fit multiple replicates, plot fit result #46

PySCeS ThinLayer: Fit multiple replicates, plot fit result #46

jmrohwer commented Jun 14, 2022 •

edited

Loading

jmrohwer left a comment

JR-1991 left a comment

JR-1991 left a comment

JR-1991 left a comment

PySCeS ThinLayer: Fit multiple replicates, plot fit result #46

PySCeS ThinLayer: Fit multiple replicates, plot fit result #46

Conversation

jmrohwer commented Jun 14, 2022 • edited Loading

jmrohwer left a comment

Choose a reason for hiding this comment

JR-1991 left a comment

Choose a reason for hiding this comment

JR-1991 left a comment

Choose a reason for hiding this comment

JR-1991 left a comment

Choose a reason for hiding this comment

jmrohwer commented Jun 14, 2022 •

edited

Loading