Add documentation for long-lived Postgres pools #273

Open · wants to merge 1 commit into master

Conversation

seanlinsley

At pganalyze we run a long-lived Postgres pool, and with the help of bytehound we have discovered that a certain query with large bind params appears to leak memory. While it may be possible to fix this leak, there may be leaks elsewhere, or future changes may introduce them, so it's probably best for deadpool to recommend that users with long-lived pools call retain to prune old database connections.

I've both added an example to the README that's similar to our configuration, and updated the retain function docs to highlight this issue.
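
A minimal sketch of that retain-based pruning pattern follows; it assumes deadpool's Pool::retain method and the Metrics::age helper, and the interval and age threshold are placeholders rather than the values from the actual README example.

use std::time::Duration;

// Illustrative sketch: every minute, drop pooled connections older than
// 30 minutes so any per-connection memory growth stays bounded.
async fn prune_old_connections(pool: deadpool_postgres::Pool) {
    let mut interval = tokio::time::interval(Duration::from_secs(60));
    loop {
        interval.tick().await;
        // Keep only connections younger than the threshold.
        pool.retain(|_client, metrics| metrics.age() < Duration::from_secs(30 * 60));
    }
}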

bikeshedder added the documentation (Improvements or additions to documentation) and A-postgres (Area: PostgreSQL support / deadpool-postgres) labels on Oct 4, 2023
@bikeshedder
Owner

I didn't know about tokio-postgres leaking that much memory. Is this maybe related to the connection staying active even after the client is dropped, if there are still queries running? The latest version of deadpool-postgres does call abort when the client wrapper is dropped:

impl Drop for ClientWrapper {
    fn drop(&mut self) {
        // Abort the background connection task so it cannot outlive the client.
        self.conn_task.abort()
    }
}

@seanlinsley
Author

seanlinsley commented Oct 4, 2023

Our workload involves a high number of queries that complete quickly and we have a short statement_timeout set, so I don't think that #237 will help. That said, we haven't updated deadpool yet.

Even after making this change we are still seeing a leak, though it's slower now. I'll keep looking into this.

Do you have any idea why CI is failing? From the other examples in the README, I would've assumed rust,no_run would prevent CI from running the code.
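
As an aside on the short statement_timeout mentioned earlier in this comment, here is a minimal sketch of one way to configure it through tokio-postgres connection options; the host, credentials, pool size, and timeout value are placeholder assumptions, not the pganalyze configuration.

use tokio_postgres::NoTls;

// Illustrative only: apply a short per-session statement_timeout to every
// pooled connection via the `options` startup parameter.
fn build_pool_with_timeout() -> Result<deadpool_postgres::Pool, Box<dyn std::error::Error>> {
    let mut pg_config = tokio_postgres::Config::new();
    pg_config
        .host("localhost")
        .user("postgres")
        .dbname("app")
        .options("-c statement_timeout=5s");
    let manager = deadpool_postgres::Manager::new(pg_config, NoTls);
    let pool = deadpool_postgres::Pool::builder(manager).max_size(16).build()?;
    Ok(pool)
}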

@seanlinsley
Author

seanlinsley commented Nov 13, 2023

The remaining apparent memory leak was solved with aws/amazon-ecs-agent#3594 (comment)

After confirming the memory usage was stable, I then disabled the prune_db_connections code path (while still using malloc_trim(0)) and confirmed there is definitely a leak in the Postgres connections:
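
For reference, a minimal sketch of what the malloc_trim(0) call mentioned above can look like from Rust follows; it assumes the libc crate on a glibc-based Linux system, and the periodic wrapper is illustrative rather than the actual pganalyze code.

use std::time::Duration;

// Illustrative sketch: periodically ask glibc to return freed heap pages to
// the OS. malloc_trim is glibc-specific and only affects memory managed by
// glibc's malloc.
async fn trim_heap_periodically() {
    let mut interval = tokio::time::interval(Duration::from_secs(60));
    loop {
        interval.tick().await;
        unsafe {
            libc::malloc_trim(0);
        }
    }
}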

[Screenshot 2023-11-11 at 7:46:29 PM: memory usage chart]

@inzanez

inzanez commented Jun 4, 2024

Is there any intention to merge this at some point?

@bikeshedder
Owner

I'm hesitant to merge this, as it points fingers at tokio-postgres and nobody else has been able to reproduce this yet.

Do you use the statement cache a lot? I mean prepare_cached, which is added by deadpool-postgres. That cache doesn't currently have any logic to remove statements, so if you use it with dynamically generated queries it's going to bloat quite a lot.

A simple manager.statement_caches.clear() does clear the caches for all connections.

The statement cache currently has no way of removing statements automatically, so you should never use it for dynamically generated queries.
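
A minimal sketch of the cache-clearing call mentioned above; it assumes a reference to the pool's deadpool_postgres::Manager is at hand, and the function wrapper is purely illustrative.

// Illustrative sketch: clear the prepared-statement caches of every pooled
// connection, e.g. after a batch of dynamically generated queries was
// prepared via prepare_cached.
fn clear_all_statement_caches(manager: &deadpool_postgres::Manager) {
    manager.statement_caches.clear();
}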

@inzanez

inzanez commented Jun 4, 2024

I see, and I generally agree. I am still working on a way to reproduce the issue, but I can say that I never use prepare_cached.

I am currently working with my Rocket web backend, as I can only reproduce the issue on a real workload; I am still not at a point where I could reproduce it in a minimal example.

What I can say so far: if my Rocket web server uses one connection pool to Postgres, I have memory issues hitting certain endpoints.
If I use a separate connection pool for the endpoint receiving huge data loads and storing them in the database, the memory consumption goes down.

So I think there might be some interaction between different statements (SELECT and INSERT).
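
To make that two-pool setup concrete, here is a minimal sketch; the shared config, pool sizes, and names are assumptions rather than details from the actual Rocket application.

use deadpool_postgres::{Config, Runtime};
use tokio_postgres::NoTls;

// Illustrative sketch: one pool for regular request traffic and a separate
// pool dedicated to the bulk-insert endpoint, so heavy INSERT work never
// shares connections with SELECT-heavy endpoints.
fn build_pools(cfg: &Config) -> Result<(deadpool_postgres::Pool, deadpool_postgres::Pool), Box<dyn std::error::Error>> {
    let general_pool = cfg.create_pool(Some(Runtime::Tokio1), NoTls)?;
    let bulk_insert_pool = cfg.create_pool(Some(Runtime::Tokio1), NoTls)?;
    Ok((general_pool, bulk_insert_pool))
}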

@inzanez

inzanez commented Jun 5, 2024

Ok, I will stop for now. I have found out that as long as no other DB operation takes place on any of the pool's connections, the large INSERT statements I run do not lead to any memory issue or overhead.
However, as soon as SELECT statements run before or during the heavy INSERT operations, I get a huge memory overhead on the container.
Unfortunately I cannot reproduce the issue in a minimal example, and I am not sure whether this is linked to rust-postgres (sfackler/rust-postgres#1081) or to deadpool.
