Do not copy datamodels when opening an already open datamodel. #232

schlafly · 2023-07-05T21:04:09Z

This PR starts to try to reduce memory usage in romancal by changing the default behavior of rdm.open(datamodel) to just return the existing datamodel rather than to make a copy. This likely has important downstream consequences that I haven't fully sussed out, though!

codecov · 2023-07-05T21:07:37Z

Codecov Report

Patch coverage: 100.00% and project coverage change: +0.13 🎉

Comparison is base (4e6db1e) 96.74% compared to head (2ec418c) 96.88%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #232      +/-   ##
==========================================
+ Coverage   96.74%   96.88%   +0.13%     
==========================================
  Files          28       29       +1     
  Lines        2399     2406       +7     
==========================================
+ Hits         2321     2331      +10     
+ Misses         78       75       -3

Impacted Files	Coverage Δ
src/roman_datamodels/datamodels/_core.py	`91.57% <100.00%> (+1.57%)`	⬆️
src/roman_datamodels/datamodels/_utils.py	`93.75% <100.00%> (ø)`

... and 1 file with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

WilliamJamieson · 2023-07-05T22:12:42Z

We should be a bit careful here, the changes currently fail this check:

roman_datamodels/tests/test_open.py

Lines 98 to 100 in ce423ee

    
           # It's essential that we get a new instance so that the original 
        
           # model can be closed without impacting the new model. 
        
           assert reopened_model is not original_model

The note is absolutely correct, in that if we don't do the copy being removed, then calling close on some upstream reference to the model object here will cause downstream problems because then inadvertently one can end up trying to access information from a closed model. Indeed, this behavior stretches all the way back to PR #1.

Currently, we are a bit caviler about sometimes calling close and sometimes not calling close on models. Especially since rdm_open is used extensively as a context manager. This cavalierness may cause a couple of issues for us:

We accidentally close a model which was "opened" and then modified by a step when the step exits. This is east to cause when using rdm_open as a context manager. More generally this can be caused via directly calling close somewhere on the model (which will be annoying to debug).
We leave the backing files to a model open and loose reference to those files due to loosing reference to the model, which can cause all sorts of subtle problems. In theory, the __del__ method on the model should clean up the open files for us as it closes the model when the reference count drops to zero, but as the python documentation for that method indicates __del__ is not guaranteed to always be called; moreover, the data model documentation
states:

Objects are never explicitly destroyed; however, when they become unreachable they may be garbage-collected. An implementation is allowed to postpone garbage collection or omit it altogether — it is a matter of implementation quality how garbage collection is implemented, as long as no objects are collected that are still reachable.

In effect this means we cannot guarantee that our models will be properly cleaned up by __del__. Indeed, that is what was causing #150 (see #231 fix).

These two issues are both solved by the copy being removed here because it disconnects models "opened" during a given step from models passed in. So we both never accidentally close a model we need open, and can properly close the files for models when needed.

I personally agree with the changes here as it is silly to copy things for no real reason. However, we need to approach this very carefully or otherwise we might cause ourselves very difficult to solve bugs.

schlafly · 2023-07-06T14:02:13Z

Yes. The way I think of this, if we open a file, we're responsible for cleaning it up. If we accept an object, the person who made the object is responsible for cleaning it up (or we wait for it to be garbage collected).

The current pipeline practice of using
with rdm.open(in) as data
as a generic "open a file or ~do nothing" seems problematic to me. In the former case we often want to clean up files. In the latter case we don't really want to do anything, but we don't want to clean up the object, since it was sent to us by someone else. We currently have a uniform interface by making a copy and then cleaning that copy up (unless opening with the target argument, which I doubt we handle well?), so that the caller's copy isn't cleaned up. But that's an unacceptable overhead here.

Maybe a better solution would be for rdm.open(in) to return a clone of in. That looks like it's by default a shallow copy that will ~turn off context handling and might be exactly what we want?

…object.

…amodels into reduce-copies

…object.

Clean up the shallow copy

schlafly · 2023-07-12T17:41:35Z

I think this is ready for review. This changes the behavior of rdm.open(...) to return a shallow rather than full copy. rdm.open(...) is used throughout romancal and stcal to open already opened data models, and the previous behavior of making copies for this purpose wasted a lot of memory. This has some implications in terms of what we're allowed to return from the pipeline, which leads to me making lazy_loading the default in the romancal pipeline. That's discussed in more detail in the related romancal PR here: spacetelescope/romancal#774

PaulHuwe · 2023-07-14T23:40:58Z

Can you add a plwishmaster link (presumably with the RCAL-774 ticket) to show this passing regression tests?

schlafly · 2023-07-15T00:54:10Z

romancal regression tests with this PR here: https://plwishmaster.stsci.edu:8081/job/RT/job/Roman-Developers-Pull-Requests/281/
I don't think I ran separate datamodels regression tests and would have to look at this again next week.

schlafly · 2023-07-15T00:56:28Z

(and note that I don't remember but that I don't think regression tests on romancal would pass with only this PR. One needs the other work on romancal too.)

ddavis-stsci · 2023-07-17T13:05:15Z

I'm confused, what other romancal work does the memory work depend on? Are you saying that the steps don't run with these changes?

…

On 7/14/23 8:56 PM, Eddie Schlafly wrote: (and note that I don't remember but that I don't think regression tests on romancal would pass with /only/ this PR. One needs the other work on romancal too.) — Reply to this email directly, view it on GitHub <https://urldefense.com/v3/__https://github.com/spacetelescope/roman_datamodels/pull/232*issuecomment-1636597248__;Iw!!CrWY41Z8OgsX0i-WU-0LuAcUu2o!zMLH15idQb4luRN53nt2X1bYVWQSva2rLSzygvpNuFEgrWHUQW5Ovgu175XySVqCd8skqi05FesMuV9IkjPuvNRr$>, or unsubscribe <https://urldefense.com/v3/__https://github.com/notifications/unsubscribe-auth/ALXCXWLCWZETG3HP22FFBPTXQHTEPANCNFSM6AAAAAAZ7PD5RQ__;!!CrWY41Z8OgsX0i-WU-0LuAcUu2o!zMLH15idQb4luRN53nt2X1bYVWQSva2rLSzygvpNuFEgrWHUQW5Ovgu175XySVqCd8skqi05FesMuV9IkoLRt1O7$>. You are receiving this because your review was requested.Message ID: ***@***.***>

schlafly · 2023-07-17T13:27:25Z

I have two PRs. This one on roman_datamodels and spacetelescope/romancal#774 on romancal. Both are required. I was worried by Paul's questions that I had missed some roman_datamodels specific regression tests, but it looks like there are only the romancal regression tests, for which I have linked a run here:
https://plwishmaster.stsci.edu:8081/job/RT/job/Roman-Developers-Pull-Requests/281/

ddavis-stsci · 2023-07-17T13:47:26Z

You can run the cal regression tests using your branch of roman_datamodels. Look at https://plwishmaster.stsci.edu:8081/job/RT/job/Roman-Developers-Pull-Requests/ Well at least you will be able to once they fix the missing "Configure" and "Build with Parameters" options. Soon I hope...

…

On 7/17/23 9:27 AM, Eddie Schlafly wrote: I have two PRs. This one on roman_datamodels and spacetelescope/romancal#774 <https://urldefense.com/v3/__https://github.com/spacetelescope/romancal/pull/774__;!!CrWY41Z8OgsX0i-WU-0LuAcUu2o!21eUOOz_B67K3_i8WeshngIZ4fa4aB2OCC1XvLw2L2lQKYXZnkQx-8A_sotN401sOFJB6rfViZQRYE0YwlU2jSzB$> on romancal. Both are required. I was worried by Paul's questions that I had missed some roman_datamodels specific regression tests, but it looks like there are only the romancal regression tests, for which I have linked a run here: https://plwishmaster.stsci.edu:8081/job/RT/job/Roman-Developers-Pull-Requests/281/ — Reply to this email directly, view it on GitHub <https://urldefense.com/v3/__https://github.com/spacetelescope/roman_datamodels/pull/232*issuecomment-1638143646__;Iw!!CrWY41Z8OgsX0i-WU-0LuAcUu2o!21eUOOz_B67K3_i8WeshngIZ4fa4aB2OCC1XvLw2L2lQKYXZnkQx-8A_sotN401sOFJB6rfViZQRYE0YwnOo0nJq$>, or unsubscribe <https://urldefense.com/v3/__https://github.com/notifications/unsubscribe-auth/ALXCXWOT7DH4HNZ4KV42UZDXQU4UPANCNFSM6AAAAAAZ7PD5RQ__;!!CrWY41Z8OgsX0i-WU-0LuAcUu2o!21eUOOz_B67K3_i8WeshngIZ4fa4aB2OCC1XvLw2L2lQKYXZnkQx-8A_sotN401sOFJB6rfViZQRYE0YwogWdy1N$>. You are receiving this because your review was requested.Message ID: ***@***.***>

schlafly · 2023-07-17T14:22:34Z

I did this in
https://plwishmaster.stsci.edu:8081/job/RT/job/Roman-Developers-Pull-Requests/281/
by making romancal depend on this branch of roman_datamodels.
https://github.com/spacetelescope/romancal/pull/774/files#diff-50c86b7ed8ac2cf95bd48334961bf0530cdc77b5a56f852c5c61b89d735fd711R28
That should be adequate? I agree we'd want to take that out after merging the datamodels PR.

ddavis-stsci · 2023-07-17T14:27:31Z

That is fine. I just misread your reply to Paul. Sorry for the confusion.

…

On 7/17/23 10:22 AM, Eddie Schlafly wrote: I did this in https://plwishmaster.stsci.edu:8081/job/RT/job/Roman-Developers-Pull-Requests/281/ by making romancal depend on this branch of roman_datamodels. https://github.com/spacetelescope/romancal/pull/774/files#diff-50c86b7ed8ac2cf95bd48334961bf0530cdc77b5a56f852c5c61b89d735fd711R28 <https://urldefense.com/v3/__https://github.com/spacetelescope/romancal/pull/774/files*diff-50c86b7ed8ac2cf95bd48334961bf0530cdc77b5a56f852c5c61b89d735fd711R28__;Iw!!CrWY41Z8OgsX0i-WU-0LuAcUu2o!xd2tr8yyZZoR5Jl_okLBZVldFdq1tLttKPZIZ6lURmRfCafq_c4V6EZozr-x6pAeIZwF4VNBG0DbIoBIShsLkfw_$> That should be adequate? I agree we'd want to take that out after merging the datamodels PR. — Reply to this email directly, view it on GitHub <https://urldefense.com/v3/__https://github.com/spacetelescope/roman_datamodels/pull/232*issuecomment-1638255408__;Iw!!CrWY41Z8OgsX0i-WU-0LuAcUu2o!xd2tr8yyZZoR5Jl_okLBZVldFdq1tLttKPZIZ6lURmRfCafq_c4V6EZozr-x6pAeIZwF4VNBG0DbIoBISihT1k8o$>, or unsubscribe <https://urldefense.com/v3/__https://github.com/notifications/unsubscribe-auth/ALXCXWOYB6DAFC7YCHWYXFLXQVDDNANCNFSM6AAAAAAZ7PD5RQ__;!!CrWY41Z8OgsX0i-WU-0LuAcUu2o!xd2tr8yyZZoR5Jl_okLBZVldFdq1tLttKPZIZ6lURmRfCafq_c4V6EZozr-x6pAeIZwF4VNBG0DbIoBISval9Qqj$>. You are receiving this because your review was requested.Message ID: ***@***.***>

PaulHuwe

LGTM

…amodels into reduce-copies

schlafly · 2023-07-21T20:34:19Z

Merged in updates from main, reran regression tests; merging.
https://plwishmaster.stsci.edu:8081/job/RT/job/Roman-Developers-Pull-Requests/285/

schlafly and others added 2 commits July 5, 2023 17:00

Do not copy datamodels when opening an already open datamodel.

45548d5

Merge branch 'main' into reduce-copies

fccdf9f

schlafly mentioned this pull request Jul 5, 2023

Try to avoid copying data models. spacetelescope/romancal#774

Merged

5 tasks

schlafly added 6 commits July 6, 2023 10:20

Make rdm.open(object) return a shallow copy rather than the original …

2f59495

…object.

Merge branch 'reduce-copies' of https://github.com/schlafly/roman_dat…

02ea82a

…amodels into reduce-copies

Pass **kwargs on to asdf in asdf_open.

8cc322e

Do not copy datamodels when opening an already open datamodel.

6e24f64

Make rdm.open(object) return a shallow copy rather than the original …

26d1335

…object.

Pass **kwargs on to asdf in asdf_open.

476848b

WilliamJamieson force-pushed the reduce-copies branch from 8cc322e to 476848b Compare July 8, 2023 00:25

Clean up shallow copy ability

25e6529

WilliamJamieson mentioned this pull request Jul 8, 2023

Clean up the shallow copy schlafly/roman_datamodels#1

Merged

schlafly and others added 4 commits July 11, 2023 13:52

Merge.

8972c19

Merge pull request #1 from WilliamJamieson/clean_up_copy

97ccacc

Clean up the shallow copy

Add changelog entry.

d7c3ac4

Merge branch 'main' into reduce-copies

8db2ef3

schlafly marked this pull request as ready for review July 12, 2023 17:39

schlafly requested review from a team and WilliamJamieson as code owners July 12, 2023 17:39

stscijgbot-rstdms mentioned this pull request Jul 13, 2023

romancal memory profiling spacetelescope/romancal#752

Closed

PaulHuwe approved these changes Jul 20, 2023

View reviewed changes

schlafly added 2 commits July 21, 2023 12:01

Merge changes from upstream.

e2d98a6

Merge branch 'reduce-copies' of https://github.com/schlafly/roman_dat…

2ec418c

…amodels into reduce-copies

schlafly merged commit 357a364 into spacetelescope:main Jul 21, 2023

schlafly deleted the reduce-copies branch July 21, 2023 20:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not copy datamodels when opening an already open datamodel. #232

Do not copy datamodels when opening an already open datamodel. #232

schlafly commented Jul 5, 2023

codecov bot commented Jul 5, 2023 •

edited

Loading

WilliamJamieson commented Jul 5, 2023 •

edited

Loading

schlafly commented Jul 6, 2023

schlafly commented Jul 12, 2023

PaulHuwe commented Jul 14, 2023

schlafly commented Jul 15, 2023

schlafly commented Jul 15, 2023

ddavis-stsci commented Jul 17, 2023 via email

schlafly commented Jul 17, 2023

ddavis-stsci commented Jul 17, 2023 via email

schlafly commented Jul 17, 2023

ddavis-stsci commented Jul 17, 2023 via email

PaulHuwe left a comment

schlafly commented Jul 21, 2023

Do not copy datamodels when opening an already open datamodel. #232

Do not copy datamodels when opening an already open datamodel. #232

Conversation

schlafly commented Jul 5, 2023

codecov bot commented Jul 5, 2023 • edited Loading

Codecov Report

WilliamJamieson commented Jul 5, 2023 • edited Loading

schlafly commented Jul 6, 2023

schlafly commented Jul 12, 2023

PaulHuwe commented Jul 14, 2023

schlafly commented Jul 15, 2023

schlafly commented Jul 15, 2023

ddavis-stsci commented Jul 17, 2023 via email

schlafly commented Jul 17, 2023

ddavis-stsci commented Jul 17, 2023 via email

schlafly commented Jul 17, 2023

ddavis-stsci commented Jul 17, 2023 via email

PaulHuwe left a comment

Choose a reason for hiding this comment

schlafly commented Jul 21, 2023

codecov bot commented Jul 5, 2023 •

edited

Loading

WilliamJamieson commented Jul 5, 2023 •

edited

Loading