Simplify PackageFinder best candidate API some more #6787

cjerdonek · 2019-07-24T18:56:33Z

This PR makes a few minor changes to PackageFinder's "best candidate" API to simplify it and make it easier to read and understand. Specifically, the PR does the following:

Rename PackageFinder.find_candidates() to PackageFinder.find_best_candidate() because that is what the method is used for. There is also already a PackageFinder.find_all_candidates() method, so the rename removes the confusion of having two similarly named methods, find_all_candidates() and find_candidates().
Change the type of the return value of PackageFinder.find_best_candidate() from FoundCandidates to BestCandidateResult (by renaming FoundCandidates to BestCandidateResult), since the purpose of this class is to expose the best candidate.
Compute the best_candidate and pass it to the BestCandidateResult constructor instead of computing the best candidate lazily. The best candidate is now exposed as a best_candidate attribute of the BestCandidateResult object instead of via FoundCandidates.get_best(). This is better for a couple of reasons: (1) the applicable candidates and best candidate are now both handled the same way (namely by passing them in when instantiating instead of computing lazily), and (2) we no longer need to pass the whole CandidateEvaluator object into the FoundCandidates object, eliminating a bit of circularity that was of concern before. This completes the process of converting FoundCandidates to a simple data container class that doesn't compute anything on its own.
Rename the current CandidateEvaluator.get_best_candidate() to CandidateEvaluator.sort_best_candidate() because this is the method that gets the best candidate by sorting, and add a new CandidateEvaluator.compute_best_candidate() method responsible for filtering applicable candidates, calling sort_best_candidate(), and constructing the BestCandidateResult object.
Add to the BestCandidateResult constructor the assertion that @xavfernandez requested here: Simplify FoundCandidates #6684 (comment)
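To make the list above concrete, here is a minimal sketch of the resulting shape, not pip's actual code. The Candidate tuple, the allowed_versions filter, the get_applicable_candidates() helper, and the exact assertions are assumptions made for illustration; only the names BestCandidateResult, best_candidate, sort_best_candidate(), and compute_best_candidate() come from the description.

```python
from typing import List, NamedTuple, Optional, Set, Tuple


class Candidate(NamedTuple):
    # Hypothetical stand-in for pip's candidate objects.
    name: str
    version: Tuple[int, ...]


class BestCandidateResult:
    """Simple data container: everything is passed in, nothing is computed."""

    def __init__(
        self,
        candidates: List[Candidate],
        applicable_candidates: List[Candidate],
        best_candidate: Optional[Candidate],
    ) -> None:
        # Assumed consistency checks; the real assertions may differ.
        assert set(applicable_candidates) <= set(candidates)
        if best_candidate is None:
            assert not applicable_candidates
        else:
            assert best_candidate in applicable_candidates

        self.candidates = candidates
        self.applicable_candidates = applicable_candidates
        self.best_candidate = best_candidate


class CandidateEvaluator:
    def __init__(
        self, allowed_versions: Optional[Set[Tuple[int, ...]]] = None
    ) -> None:
        # Hypothetical filter criterion; None means "allow everything".
        self._allowed_versions = allowed_versions

    def get_applicable_candidates(
        self, candidates: List[Candidate]
    ) -> List[Candidate]:
        if self._allowed_versions is None:
            return list(candidates)
        return [c for c in candidates if c.version in self._allowed_versions]

    def sort_best_candidate(
        self, candidates: List[Candidate]
    ) -> Optional[Candidate]:
        # Pick the best candidate by sorting; here simply the highest version.
        if not candidates:
            return None
        return max(candidates, key=lambda c: c.version)

    def compute_best_candidate(
        self, candidates: List[Candidate]
    ) -> BestCandidateResult:
        # Filter, pick the best, then hand both to the data container.
        applicable = self.get_applicable_candidates(candidates)
        best = self.sort_best_candidate(applicable)
        return BestCandidateResult(candidates, applicable, best)
```

In this sketch the result object never holds a reference to the evaluator, which is the circularity the description says was removed.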

src/pip/_internal/index.py

xavfernandez · 2019-07-26T09:52:03Z

I must admit I'm getting kind of confused by the multiple refactors and the new classes: CandidateEvaluator, CandidatePreferences, LinkEvaluator, FoundCandidates, BestCandidateResult.

It might help to write down how it currently works and where we want to go?

pfmoore · 2019-07-26T10:12:52Z

Agreed - with all the various refactorings going on at the moment, I think this is a great opportunity to create additional internal documentation explaining how the various utility classes we have work.

The old classes weren't well documented, but people had over time gained a rough feeling for how things hung together. Significant refactoring without adding docs loses that implicit knowledge, so better if we can capture some details in writing while they are still fresh.

cjerdonek · 2019-07-26T12:56:37Z

For now, where would you like me to document those things: in the comments to this PR or in a separate documentation PR? These docs would only be for pip maintainers (us) and contributors, as they would document the code and not the public API. Before this refactoring started, the various responsibilities were all handled by one giant PackageFinder class. Now distinct stages of the process are handled by different classes, which makes certain things easier, such as testing individual stages, reasoning about them, adding new features to them (e.g. yanked files and preferring hash matches), and knowing which options affect which stages.

pfmoore · 2019-07-26T13:17:35Z

I'd go for comment blocks in the code (or if you get really verbose, .txt files alongside the source files :-)) But definitely in the actual sources, not in github comments.

I'd assume it's easier to include in this PR while the details are fresh in your mind (these are internal usage notes, so polishing the wording isn't necessary), but it's up to you how you prefer to organise the work - if you find it easier to do as a separate PR, I'm fine with that.

pradyunsg · 2019-07-26T13:23:39Z

Internal Documentation. \o/

Here's how I'd planned to structure it while working on the overall architecture docs, at PyCon 2019:

add development/architecture/ to the html documentation
index is a table of contents + a 1 or 2 paragraph overview + a note that pip's internals are not for external consumers but maintainers and contributors.
each component within the codebase can get its own dedicated page/section within that as necessary.
- think download, finder, vcs, build, install, configuration, uninstall, resolution etc.
- Example url: development/architecture/download.html
A contributor friendly guide that connects the dots for all these parts.

pradyunsg · 2019-07-26T13:28:23Z

I'll be happy to initiate the skeleton in a PR for the above -- I think I already have some of this stuff written.

If no one thinks that has major issues (it's something we can add to incrementally and then require updating on major refactors), I'll file the PR.

pfmoore · 2019-07-26T13:30:07Z

@pradyunsg That sounds really nice - but more than I'm talking about here, which is really just "capture the insights that resulted in the current refactoring, so that people who had a feel for the old layout can find their way round the new layout (and ideally, people who didn't know the old layout have a head start understanding the current one)".

pradyunsg · 2019-07-26T13:34:23Z

@pfmoore yep yep -- that's the kind of content I imagined the dedicated pages would contain. 🙈

cjerdonek · 2019-07-26T13:41:08Z

My preference would be if it's something I can work on / write up now rather than having to wait for an architecture document for all of pip to be written, reviewed, and merged. (I already see the scope increasing quite a bit in the few hours since the initial request.)

pradyunsg · 2019-07-26T13:46:12Z

@cjerdonek would the following work for you: add a docs/html/development/architecture/ folder and a file named whatever you want, with whatever content you deem necessary (and :orphan: at the top of the file). :)

I'll be happy to do the rest in a follow-up PR after that.

pfmoore · 2019-07-26T13:47:28Z

@cjerdonek +1 from me. Someone can move the information later if they want to make it more formal. And I apologise if it sounds like scope creep. I was only ever intending to support @xavfernandez request - not to ask for anything more than that.

cjerdonek · 2019-07-26T13:49:17Z

> @cjerdonek would the following work for you:

That would be okay, thanks.

> I was only ever intending to support @xavfernandez request - not to ask for anything more than that.

And yes, understood, @pfmoore. I appreciate it.

cjerdonek · 2019-08-20T09:18:08Z

Okay, I was finally able to finish drafting the architecture document / section for PackageFinder / index.py that a number of you requested. It includes sections for a number of the key classes in index.py (and in particular the ones mentioned in this PR). I'm sure that this can be fine-tuned, but I think it provides a good start.

I also addressed @chrahunt's review comment on the code portion (which was to add an additional assertion to parallel the one that @xavfernandez had requested earlier).

docs/html/development/architecture/index.rst

* Rename FoundCandidates to BestCandidateResult.
* Rename CandidateEvaluator's make_found_candidates() to compute_best_candidate().
* Rename CandidateEvaluator's get_best_candidate() to sort_best_candidate().
* Rename PackageFinder's find_candidates() to find_best_candidate().


cjerdonek · 2019-08-21T05:53:45Z

PR updated. I moved the architecture section to a separate file as requested.

atugushev · 2019-08-21T08:00:26Z

Thanks for the documentation! It seems we got a lot of work to do in pip-tools :)

cjerdonek · 2019-08-21T08:11:05Z

Hi, @atugushev! Most of this documentation is of what was there before. The code changes in the PR are mostly just some renames that may not even affect you.

pradyunsg · 2019-08-21T10:29:34Z

Happy to merge this.

atugushev · 2019-08-21T21:39:58Z

@cjerdonek you were right, that was easy 👍

cjerdonek · 2019-08-21T21:42:22Z

Great!

The `get_best_candidate` has been renamed to `compute_best_candidate`, which now returns an instance of `BestCandidateResult`. See pypa/pip#6787
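For downstream code that has to work on both sides of this rename, one possible approach is a small shim like the sketch below. It is illustrative only: OldEvaluator, NewEvaluator, _Result, and find_best() are hypothetical stand-ins, and the only assumptions taken from the note above are the two method names and that the new method returns an object with a best_candidate attribute.

```python
class OldEvaluator:
    # Hypothetical stand-in for an evaluator that predates the rename.
    def get_best_candidate(self, candidates):
        return max(candidates) if candidates else None


class _Result:
    # Hypothetical stand-in for BestCandidateResult.
    def __init__(self, best_candidate):
        self.best_candidate = best_candidate


class NewEvaluator:
    # Hypothetical stand-in for an evaluator after the rename.
    def compute_best_candidate(self, candidates):
        best = max(candidates) if candidates else None
        return _Result(best)


def find_best(evaluator, candidates):
    """Return the best candidate regardless of which API the evaluator has."""
    compute = getattr(evaluator, "compute_best_candidate", None)
    if compute is not None:
        # Newer API: unwrap the result object.
        return compute(candidates).best_candidate
    # Older API: the method returns the candidate directly.
    return evaluator.get_best_candidate(candidates)
```

Feature detection with getattr() avoids comparing version strings, at the cost of silently taking the old path if neither name matches what the caller expects.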

cjerdonek added the "C: finder", "skip news", and "type: refactor" labels Jul 24, 2019

cjerdonek force-pushed the best-candidate-result branch from 5c4e738 to 5952a38 on July 24, 2019 18:59

chrahunt suggested changes Jul 25, 2019

src/pip/_internal/index.py

pradyunsg mentioned this pull request Aug 4, 2019

Documentation of pip's internals #6831

Closed

cjerdonek force-pushed the best-candidate-result branch from f2ddd7a to 23fd624 on August 20, 2019 09:12

cjerdonek force-pushed the best-candidate-result branch from 23fd624 to 9de34e3 on August 20, 2019 09:19

pradyunsg reviewed Aug 21, 2019

docs/html/development/architecture/index.rst (outdated)

pradyunsg reviewed Aug 21, 2019

docs/html/development/architecture/index.rst (outdated)

pradyunsg mentioned this pull request Aug 21, 2019

Note that pip's internals can change at any time #6901

Merged

cjerdonek added 6 commits August 20, 2019 22:49

Pass the best candidate to BestCandidateResult instead of CandidateEvaluator.

6554273

Move compute_best_candidate() to the end of CandidateEvaluator.

1a8dc9c

Move BestCandidateResult before CandidateEvaluator.

a644fb0

Add some assertions to BestCandidateResult.__init__().

3eb803a

Add a test for compute_best_candidate() returning a None best candidate.

06d786d

Add initial architecture section for index.py and PackageFinder.

8db3944

cjerdonek force-pushed the best-candidate-result branch from 9de34e3 to 8db3944 on August 21, 2019 05:50

pradyunsg approved these changes Aug 21, 2019

cjerdonek merged commit 9ef8116 into pypa:master Aug 21, 2019

cjerdonek deleted the best-candidate-result branch August 21, 2019 20:13

atugushev mentioned this pull request Aug 21, 2019

Add compatibility with the pip master (upcoming pip==19.3) jazzband/pip-tools#864

Merged

4 tasks

cjerdonek mentioned this pull request Aug 23, 2019

Split out a LinkCollector class from PackageFinder #6910

Merged

lock bot added the "auto-locked" label Sep 20, 2019

lock bot locked as resolved and limited conversation to collaborators Sep 20, 2019
