
Extra unnecessary(?) requests happening whenever Recurly_Pager::rewind() called. #212

Closed
nerrad opened this issue Mar 9, 2016 · 8 comments
Labels
V2 V2 Client

Comments

nerrad commented Mar 9, 2016

Introduced in this commit: e5bb1ff#diff-91098c9a075e43eeab9310ea7998d870

This appears to have changed the behaviour of rewind() so that an HTTP request is ALWAYS made when calling rewind(), instead of only when isset($this->_links['start']) is true. This means that even if one has iterated through all the pages, any rewind() will (possibly unnecessarily) make additional HTTP requests to the server instead of just resetting the pointer for the cached objects.

In our case this was a breaking change, because we built our consumer around the previous behaviour, where rewind() ONLY loaded from the API if there had been no initial request yet. We now have lots of places in our code hitting Recurly API rate limits, because rewind() now fully resets the internal cache of objects and rebuilds it on other pages as well.

I don't know if this change was intended or not, but it's fairly significant regardless.
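To make the reported difference concrete, here's a rough sketch of the two rewind() behaviours, based on my reading of the issue. The property names (_links, _objects) mirror Recurly_Pager, but the method bodies are simplified illustrations, not the library's actual code, and the HTTP call is stubbed out with a counter so the difference in request volume is visible:

```php
<?php
// Simplified sketch of the pre- and post-e5bb1ff rewind() behaviours.
// _loadFrom() stands in for the real page fetch; it only counts "requests".
class PagerSketch
{
    public $requests = 0;       // counts simulated HTTP requests
    private $_links = [];
    private $_objects = [];
    private $_position = 0;

    private function _loadFrom($url)
    {
        $this->requests++;      // stand-in for the real HTTP GET
        $this->_objects = ['a', 'b', 'c'];
        $this->_links = ['start' => $url];
    }

    // Old behaviour: only re-fetch when a 'start' link exists,
    // i.e. when iteration has already moved past the first page.
    public function rewindOld()
    {
        if (isset($this->_links['start'])) {
            $this->_loadFrom($this->_links['start']);
        }
        $this->_position = 0;
    }

    // New behaviour: every rewind() re-fetches the first page.
    public function rewindNew()
    {
        $this->_loadFrom('/first-page');
        $this->_position = 0;
    }
}

$old = new PagerSketch();
$old->rewindOld();
$old->rewindOld();   // no 'start' link was ever set, so still no requests
echo "old: {$old->requests} requests\n";   // old: 0 requests

$new = new PagerSketch();
$new->rewindNew();
$new->rewindNew();   // every call triggers a request
echo "new: {$new->requests} requests\n";   // new: 2 requests
```

Under the old scheme, repeated rewinds on an unpaged result set cost nothing; under the new one, each rewind is an API call, which is how the rate limits were hit.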


nerrad commented Mar 9, 2016

This is related to what is reported in #207


drewish commented Mar 14, 2016

@nerrad hey, so I'm curious what's causing the rewinds. Are you reusing an iterator, or is it happening inside PHP's foreach?


nerrad commented Mar 14, 2016

Yeah, I originally had the rewind() in our code inside a foreach, which I refactored because it was silly; it exposed the change, though. Also, there's still the potential for the iterator to be referenced multiple times in the same request, in which case there doesn't seem to be any need to hit HTTP when calling rewind(). I guess there's an inherent assumption that once the iterator has iterated over the objects, they're cached internally (an assumption that was broken, maybe intentionally, in the recent update).


drewish commented Mar 14, 2016

My intention with the change was that for a multi-page iteration rewinding should re-request since the first page's results would have already been discarded.

The unintended change I didn't foresee is that on an iterator with only a single page's worth of results, which would still be in memory, the results get discarded and re-requested. I could sort of make the argument that if you're rewinding you'd want fresh data, but it's definitely a change. I'm open to restoring the old behavior if there's a strong need for it.


nerrad commented Mar 14, 2016

Well, personally I think it's a pretty big leap to assume that ANY consumer of Recurly_Pager is only going to iterate through the pages once and want the results discarded afterwards. At least in our case, there are places where we might call rewind() more than once in a request.

IF you decide to keep the change as is, that will affect the assumptions we've made about the pager, and we'll likely build our own separate class for caching the objects retrieved while paging.

Regarding this:

"rewinding should re-request since the first page's results would have already been discarded."

Is that the case? After each page retrieval, are the current objects that have been retrieved in the old page(s) discarded?


drewish commented Mar 14, 2016

@nerrad Both the old and new code only maintained a single page's worth of results; for multi-page result sets, rewinding would re-request the first page. The change in the new code is that rewinding while on the first page will now also re-request it. My assumption is that the client avoids caching all records so it doesn't run out of memory if you've got hundreds of thousands of records.

If you're making multiple passes through a result set, it'd probably be best to use something like $items = iterator_to_array($pager, false); to fetch all the results and store them in an array. Note that, unfortunately, Recurly_Pager reuses keys, so you'll either want to extract an appropriate value from each item or just append it to the array.
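The suggestion above can be sketched as follows. Here $pager is a stand-in ArrayIterator; in real code it would be a Recurly_Pager subclass (the exact fetch call, e.g. a list retrieval on an account list class, depends on the client version and is not shown here):

```php
<?php
// Make one pass through the pager, store everything in a plain array,
// then iterate the array as many times as you like with no further
// HTTP requests. $pager is a stand-in for a Recurly_Pager instance.
$pager = new ArrayIterator(['acct-1', 'acct-2', 'acct-3']);

// Passing false discards the iterator's keys, which matters because
// Recurly_Pager reuses keys across pages and they would collide.
$items = iterator_to_array($pager, false);

// Multiple passes over $items are now free.
$first  = count($items);
$second = count($items);
echo "$first / $second\n";   // 3 / 3
```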


nerrad commented Mar 14, 2016

Hmm, yeah, it looks like I misread the code. I see _loadFrom() sets $this->_objects to an empty array before loading the next page, so obviously my pull request is dumb.

I'll rework what we're doing. Thanks for stepping through things with me.

nerrad closed this as completed Mar 14, 2016

drewish commented Mar 14, 2016

@nerrad Totally, yeah, sorry the pagination stuff is kind of a mess. I was trying to fix another bug and clean up as much as I could without breaking anything. There weren't any tests, so it was hard to nail down all the prior behaviors. I'm glad it wasn't a huge blocker for you.

bhelx added the V2 V2 Client label Mar 3, 2020