Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for converting pyproject.toml-based Python3 packages. #1982

Open
wants to merge 14 commits into
base: main
Choose a base branch
from

Conversation

amdei
Copy link

@amdei amdei commented Jan 3, 2023

TL;DR:
It works! (With some limitations at the moment though, as this is work in progress currently)

python3 -m pip install -U importlib wheel setuptools packaging pkginfo

git clone ....
cd fpm
./bin/fpm --verbose -s python -t deb --python-install-lib=/usr/local/lib/python3.9/dist-packages/ --python-package-name-prefix python3 --python-bin=python3  typing_extensions

Limitations:

  1. Does not support python2 in any way (obviously)

Temporary limitations (TODOs):

1. Do not support automatic dependencies extraction to the moment - supports
2. Do not support license extraction to the moment - supports
3. Checked only on Debian 11/12 amd64/arm64 only so far
4. Not sure about applicability to 'native' packages - supports native package detection
5. A lot of @todo's in code Only a few of them left...
6. Code quality is poor made it better
7. Code-style is poor tried to made it better

I had no previous experience with ruby, so any suggestions/comments/guidelines are welcome.

@ObjatieGroba - could you please check Python part of code, and give your suggestions on how to improve it?

Fixes #1780
Fixes #1873
Fixes #1860
Fixes magma/magma#13847

@amdei
Copy link
Author

amdei commented Jan 6, 2023

After some reflections I've had a second thought.

  1. Standard importlib.metadata is better that backported importlib_metadata
  2. importlib.metadata is still not good enough to obtain all required metadata of not-yet-installed python package
  3. I don't want to build wheel out of downloaded python package
  4. But seems there is no solution without building wheel out of python package prior to converting it to some other format
  5. It is possible and it is easy to get ALL required metadata via wheel API from pkginfo package

Going to revise my solution accordingly.
Stay tuned.

@amdei
Copy link
Author

amdei commented Feb 12, 2023

It seems to be working a little bit better now.
At least in terms of metadata retrieval.

Things to do:

  • 1. Apply proper arch attribute to generated deb-package. At the moment all toml-based packages has Architecture: all;
  • 2. Instruct wheel to not download dependencies for build package wheel;
  • 3. For some packages fpm now silently dies with zero exit code. Need to get it work again;
  • 4. Deal with packages dependencies. Not easy at the first glance.
  • 5. Speed-up execution on processing toml packages. Now it 3 times slower that setup.py-based;
  • 6. Perform code de-duplication. As now it contains significant amount of redundant fragments.
  • 7. Cleanup code.
  • 8. File some issues to pip, wheel, importlib, etc.
  • 9. Obey --[no-]python-obey-requirements-txt fpm's option

Any suggestions/comments/guidelines are still welcome.
As well as package samples, where things go wrong.

@jordansissel
Copy link
Owner

Thanks for your work on this! I’ll take a look as soon as I can :)

@amdei
Copy link
Author

amdei commented Feb 14, 2023

Bottom line:

  1. Please update python wheel to latest version before trying anything: python3 -m pip install -U wheel - there was a bug, affected fpm, fixed only very recently.
  2. It is slow. Very slow. Significantly slower that setup.py-based packaging;
  3. Wasn't able to made it with architecture - it looks like there is simply no way to get arch neither from *.whl file nor from sdist via any API. But it contains in *.whl. But I'm not brave enough to parse it in that way (yet).
  4. There is no alternatives found for setup.py's --install-lib, --install-data, --install-scripts and build_scripts --executable. So all fpm's --python-install-* options will not work (and currently silently ignored).
  5. Staging with pip is broken. For example see here: Unexpected --target behavior with/without --upgrade pypa/pip#8799 Or it is intended for something else.
  6. Documentation for pip install is rather short: https://docs.python.org/3/installing/index.html#installing-index Significantly shorter than one would expect.

@amdei
Copy link
Author

amdei commented Apr 8, 2023

After a while, I hack the Wheel metadata to get information if package is pure or not.
https://peps.python.org/pep-0491/
As it seems to be impossible (at least yet) via pkginfo.Wheel API: https://bugs.launchpad.net/pkginfo/+bug/2015657

@amdei amdei marked this pull request as ready for review April 10, 2023 00:29
@amdei
Copy link
Author

amdei commented Apr 10, 2023

It reached "works for me" stage.
Let's collect feedback before starting to polish things..

lib/fpm/package/pyfpm/get_metadata.py Outdated Show resolved Hide resolved
lib/fpm/package/python.rb Outdated Show resolved Hide resolved
lib/fpm/package/python.rb Outdated Show resolved Hide resolved
"#{Shellwords.escape(toml_metadata_code)}"

# @todo FIXME!
# if attributes[:python_obey_requirements_txt?]

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Doesn't this break backward compatibility?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It can.
Do you know any cmd option for PEP517 builds to do the same thing as --load-requirements-txt does for setup.py?

::Dir.chdir(project_dir) do
flags = [ "--root", staging_path ]

# if !attributes[:python_install_lib].nil?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lot's of commented code. Is it necessary?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems there is just no alternative for such option (--prefix) in setup.py for PEP517 builds...

@JordanStopford
Copy link
Contributor

Amy update on this? A lot of packages no longer ship setup.py files so if we need to use fpm then we're stuck on old versions

@cwegener
Copy link
Contributor

cwegener commented Apr 9, 2024

I don't understand what this does? It still requires a legacy setup.py file to be generated by the python project that is supposed to be converted. However, pretty much all PEP-517 build backends disable the support for generating setup.py legacy files by default, so they will not be able to be used by fpm. (some backends don't even offer an option at all to generate legacy setup.py files)

And if you have a source python project that has the option to generate a legeacy setup.py enabled, then such a project can already be used as a source in fpm without this change.

I am a bit confused what this change is supposed to do.

What is the intention of this change?

@amdei
Copy link
Author

amdei commented Apr 9, 2024

It still requires a legacy setup.py file

No, it doesn't.

There is an option to use setup.py file, if it presented.
But if there is pyproject.toml file - it will be used instead.

@cwegener
Copy link
Contributor

cwegener commented Apr 9, 2024

It still requires a legacy setup.py file

No, it doesn't.

There is an option to use setup.py file, if it presented. But if there is pyproject.toml file - it will be used instead.

Thanks for the clarification. I'll need to read through the patch again then. 😁

@amdei
Copy link
Author

amdei commented Apr 17, 2024

Although in this MR I made attempt to solve the issue via pip wheel, not via build module from PyPA, as suggested in #2040.

@cwegener
Copy link
Contributor

Although in this MR I made attempt to solve the issue via pip wheel, not via build module from PyPA, as suggested in #2040.

That's excellent. pip wheel will actually just invoke the Python build module which takes care of performing the right build process (legacy setup.py vs new PEP-517 build backends)

I will need to test out your PR and report the results.

Specifically, I will need to test with this PEP-517 project ... https://github.com/openai/openai-python

@Stealthii Stealthii mentioned this pull request Apr 24, 2024
@gmabey
Copy link

gmabey commented May 8, 2024

Just an enthusiastic vote for more eyeballs on this merge request ... I would love ❤️ to see the project catch up to this shift in the python community.

@cwegener
Copy link
Contributor

I will need to test out your PR and report the results.

Quick feedback from my initial testing of this patch:

Testing with

  • openai package from PyPI (which uses hatchling as the PEP 517 build backend)
  • Testing on Ubuntu 22.04 with built-in Python3.10

My main issue that I'm looking into at the moment is the boot-strapping of the PEP 517 build environments for pip wheel (and to a lesser degree pip download ... which is a separate story).

I think the main concern so far is the new dependency on the pkginfo Python library. E.g. on Ubuntu 22.04, the version of python3-pkginfo is a bit older and does not yet have support for version 2.3 Metadata file formats.

But a lot of wheels will now write 2.3 Metadata file formats.

Therefore, injecting or vendoring an up-to-date version of pkginfo is important.

@amdei
Copy link
Author

amdei commented Jul 14, 2024

My main issue that I'm looking into at the moment is the boot-strapping of the PEP 517 build environments for pip wheel

@cwegener could you please elaborate, what is your issue with openai package building?
Is it difficulty with determining build dependencies?
Or something else?

@cwegener
Copy link
Contributor

My main issue that I'm looking into at the moment is the boot-strapping of the PEP 517 build environments for pip wheel

@cwegener could you please elaborate, what is your issue with openai package building? Is it difficulty with determining build dependencies? Or something else?

I think it's actually a different issue that is unrelated to the PEP-517 build support.

The issue is that all dependencies are being built from source due to a change introduced here: 77e0cf7

The --no-build :all: option forces all dependencies to be built from source, even though this is unnecessary.

The dependencies will be thrown away at the end. They don't end up in the released package that fpm creates. So there is really no benefit in building them from source.

Nevertheless, I still have not managed to build an openai package, but I think that might just be an issue on my side.

@amdei
Copy link
Author

amdei commented Jul 14, 2024

Nevertheless, I still have not managed to build an openai package, but I think that might just be an issue on my side.

Just checked - built seamlessly.

Could you please try to build it with more verbose output and share the result?
Something like this:
./bin/fpm --verbose --log debug --debug --debug-workspace -s python -t deb --python-bin=python3 --python-package-name-prefix python3 --python-install-lib=/usr/local/lib/python3.9/dist-packages/ openai

(you may want to adjust --python-install-lib option to reflect your python version)

@cwegener
Copy link
Contributor

I had another look. And I think I figured out where the issue with this approach is.
pip download will perform unwanted build steps in order to generate metadata. Those unwanted build steps will fail quite often.

Example package: jiter

python3 -m pip download --no-clean --no-deps --no-binary :all: -d . jiter

This will try to immediately build a wheel from the jiter sdist, which will fail due to the build backend (maturin) not being present on my system.

This is a pip problem as per the following references.

References:
pypa/pip#7995
pypa/pip#1884

@amdei
Copy link
Author

amdei commented Aug 28, 2024

And I think I figured out where the issue with this approach is.

Could you please elaborate, do you able to offer different approach?
Yes, maturin, for example, requires Rust in order to compile its dependencies.

May be you know the way to download and use pre-build binary packages, for all possible supported OS/architectures?

@gmabey
Copy link

gmabey commented Sep 18, 2024

@amdei This branch worked for me on ubuntu 24.04!
I did have to install some packages that probably already should have been installed: python3-pkginfo, python3-packaging, python3-setuptools and python3-wheel. It looked to me like something was trying to automatically install some of them but that effort (whatever was doing it) produced errors. However, when I just installed the distro-provided versions, everything worked!

On ubuntu 22.04, ubuntu 20.04, rockylinux 8 and rockylinux 9, I had to both upgrade the distro-provided pip as well as upgrade the above modules before the creation of a .deb from python/pip would work.

So, I conclude that the core of your patch works great, but there may be some dependency auto-foo that is lacking?

Thank you for submitting this MR!

@amdei amdei force-pushed the python2deb-great-again branch from d722b81 to c5af917 Compare November 24, 2024 02:18
@amdei
Copy link
Author

amdei commented Nov 24, 2024

Rebased on top of main.
Squashed intermediate commits.

@amdei amdei requested a review from ObjatieGroba November 24, 2024 02:23
@amdei
Copy link
Author

amdei commented Nov 24, 2024

@cwegener Guess I finally get what you've meant here:

And I think I figured out where the issue with this approach is.

I asked:

May be you know the way to download and use pre-build binary packages, for all possible supported OS/architectures?

But the answer appears quite obvious: in wast majority of cases all these packages ARE on PyPi already.
And, instead of wasting CPU poser to compile source packages all over again, we may just download pre-built wheel, and do whatever we would like to do with it.
The only thing required to be taken into account - how to fallback to building wheel from sources when binary distributions is not available.

Will try to figure it out.

@@ -0,0 +1,106 @@
import os
import sys
import pkg_resources
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The usage of pkg_resources is DEPRECATED:
https://setuptools.pypa.io/en/latest/pkg_resources.html

Users should refrain from new usage of pkg_resources and should work to port to importlib-based solutions.

@amdei
Copy link
Author

amdei commented Dec 1, 2024

The issue is that all dependencies are being built from source due to a change introduced here: 77e0cf7

@cwegener
Tried to overcome the issue with rebuilding the whole world all over again.

  1. Do pip download with --prefer-binary flag;
  2. If wheel was downloaded - do not attempt to rebuild package from sources;
  3. If wheel was NOT downloaded - fallback to rebuilding package from sources (either via setup.py or pyproject.toml).

Can not properly check p. 3 though at the moment...

Could you please try it now?
Build time expected to be significantly lower.
Something like this:
./bin/fpm -s python -t deb --python-bin=python3 --python-package-name-prefix python3 numpy

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
6 participants