Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[tests] test_poplib fails with "env changed" on Arch Linux with OpenSSL 3.4: [SYS] unknown error (_ssl.c:2634) #127257

Closed
vstinner opened this issue Nov 25, 2024 · 7 comments · Fixed by #127360
Labels
tests Tests in the Lib/test dir topic-SSL

Comments

@vstinner
Copy link
Member

vstinner commented Nov 25, 2024

First failure: https://buildbot.python.org/#/builders/484/builds/6224

Example of failure:

test_stls_context (test.test_poplib.TestPOP3Class.test_stls_context) ...

Warning -- Uncaught thread exception: SSLError
Exception in thread Thread-16:
Traceback (most recent call last):
  (...)
  File "/buildbot/buildarea/3.x.pablogsal-arch-x86_64/build/Lib/test/test_poplib.py", line 203, in handle_read
    asynchat.async_chat.handle_read(self)
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^
  File "/buildbot/buildarea/3.x.pablogsal-arch-x86_64/build/Lib/test/support/asynchat.py", line 128, in handle_read
    self.handle_error()
    ~~~~~~~~~~~~~~~~~^^
  File "/buildbot/buildarea/3.x.pablogsal-arch-x86_64/build/Lib/test/support/asynchat.py", line 124, in handle_read
    data = self.recv(self.ac_in_buffer_size)
  File "/buildbot/buildarea/3.x.pablogsal-arch-x86_64/build/Lib/test/support/asyncore.py", line 377, in recv
    data = self.socket.recv(buffer_size)
  File "/buildbot/buildarea/3.x.pablogsal-arch-x86_64/build/Lib/ssl.py", line 1285, in recv
    return self.read(buflen)
           ~~~~~~~~~^^^^^^^^
  File "/buildbot/buildarea/3.x.pablogsal-arch-x86_64/build/Lib/ssl.py", line 1140, in read
    return self._sslobj.read(len)
           ~~~~~~~~~~~~~~~~~^^^^^
ssl.SSLError: [SYS] unknown error (_ssl.c:2634)

ok

According to @encukou, the failure started to occur after a system update:

  • OpenSSL 3.3.0→3.4.0
  • Linux 5.15.0→6.8.0
  • glibc 2.39→2.40
  • GCC 13.2.1→14.2.1
  • etc.

Linked PRs

@vstinner vstinner added tests Tests in the Lib/test dir topic-SSL labels Nov 25, 2024
@vstinner
Copy link
Member Author

cc @pablogsal

@vstinner
Copy link
Member Author

Failing tests:

  • test_stls (test.test_poplib.TestPOP3Class.test_stls)
  • test_stls_context (test.test_poplib.TestPOP3Class.test_stls_context)

stls stands for "Start a TLS session":

    def stls(self, context=None):
        """Start a TLS session on the active connection as specified in RFC 2595.

                context - a ssl.SSLContext
        """

@vstinner vstinner changed the title [tests] test_poplib fails with "env changed" on AMD64 Arch Linux TraceRefs 3.x: [SYS] unknown error (_ssl.c:2634) [tests] test_poplib fails with "env changed" on Arch Linux with OpenSSL 3.4: [SYS] unknown error (_ssl.c:2634) Nov 25, 2024
@vstinner
Copy link
Member Author

First failure: https://buildbot.python.org/#/builders/484/builds/6224

That's AMD64 Arch Linux TraceRefs 3.x. test_poplib is also failing on AMD64 Arch Linux Asan 3.x, first failure: https://buildbot.python.org/#/builders/582/builds/6182

@ned-deily
Copy link
Member

Are there changes/additions in the OpenSSL 3.4.x branch to its error codes that require updating our header files (see https://github.com/python/cpython/blob/main/Tools/ssl/make_ssl_data.py)?

@encukou
Copy link
Member

encukou commented Nov 27, 2024

Thanks for the pointer, @ned-deily! Here's where it led me: #127331

@encukou
Copy link
Member

encukou commented Nov 27, 2024

Oh! The [SYS] library means that the error code is a system errno and the message should be retrieved using perror rather than ERR_reason_error_string.

I wonder if this should raise an OSError using PyErr_SetFromErrno, rather than a SSLError.

The failure is an EPIPE, raised when the server is shutting down:

Fatal Python error: Aborted

Current thread 0x00007ff46ce786c0 (most recent call first):
  File "/workspace/Lib/ssl.py", line 1140 in read
  File "/workspace/Lib/ssl.py", line 1285 in recv
  File "/workspace/Lib/test/support/asyncore.py", line 377 in recv
  File "/workspace/Lib/test/test_ftplib.py", line 416 in recv
  File "/workspace/Lib/test/support/asynchat.py", line 124 in handle_read
  File "/workspace/Lib/test/test_poplib.py", line 203 in handle_read
  File "/workspace/Lib/test/support/asyncore.py", line 426 in handle_read_event
  File "/workspace/Lib/test/test_ftplib.py", line 394 in handle_read_event
  File "/workspace/Lib/test/support/asyncore.py", line 89 in read
  File "/workspace/Lib/test/support/asyncore.py", line 156 in poll
  File "/workspace/Lib/test/support/asyncore.py", line 213 in loop
  File "/workspace/Lib/test/test_poplib.py", line 235 in run
  File "/workspace/Lib/threading.py", line 1041 in _bootstrap_inner
  File "/workspace/Lib/threading.py", line 1012 in _bootstrap

Thread 0x00007ff46e1c1740 (most recent call first):
  File "/workspace/Lib/threading.py", line 1092 in join
  File "/workspace/Lib/test/test_poplib.py", line 242 in stop
  File "/workspace/Lib/test/test_poplib.py", line 270 in tearDown
  File "/workspace/Lib/unittest/case.py", line 620 in _callTearDown
  File "/workspace/Lib/unittest/case.py", line 663 in run
  File "/workspace/Lib/unittest/case.py", line 716 in __call__
  File "/workspace/Lib/unittest/suite.py", line 122 in run
  File "/workspace/Lib/unittest/suite.py", line 84 in __call__
  File "/workspace/Lib/unittest/suite.py", line 122 in run
  File "/workspace/Lib/unittest/suite.py", line 84 in __call__
  File "/workspace/Lib/unittest/runner.py", line 240 in run
  File "/workspace/Lib/test/libregrtest/single.py", line 57 in _run_suite
  File "/workspace/Lib/test/libregrtest/single.py", line 37 in run_unittest
  File "/workspace/Lib/test/libregrtest/single.py", line 135 in test_func
  File "/workspace/Lib/test/libregrtest/single.py", line 91 in regrtest_runner
  File "/workspace/Lib/test/libregrtest/single.py", line 138 in _load_run_test
  File "/workspace/Lib/test/libregrtest/single.py", line 181 in _runtest_env_changed_exc
  File "/workspace/Lib/test/libregrtest/single.py", line 281 in _runtest
  File "/workspace/Lib/test/libregrtest/single.py", line 307 in run_single_test
  File "/workspace/Lib/test/libregrtest/main.py", line 365 in run_test
  File "/workspace/Lib/test/libregrtest/main.py", line 399 in run_tests_sequentially
  File "/workspace/Lib/test/libregrtest/main.py", line 545 in _run_tests
  File "/workspace/Lib/test/libregrtest/main.py", line 580 in run_tests
  File "/workspace/Lib/test/libregrtest/main.py", line 752 in main
  File "/workspace/Lib/test/libregrtest/main.py", line 760 in main
  File "/workspace/Lib/test/__main__.py", line 2 in <module>
  File "/workspace/Lib/runpy.py", line 88 in _run_code
  File "/workspace/Lib/runpy.py", line 198 in _run_module_as_main
Aborted (core dumped)

encukou added a commit that referenced this issue Dec 10, 2024
From the ERR_raise manpage:

    ERR_LIB_SYS

        This "library code" indicates that a system error is
        being reported.  In this case, the reason code given
        to `ERR_raise()` and `ERR_raise_data()` *must* be
        `errno(3)`.


This PR only handles ERR_LIB_SYS for the high-lever error types
SSL_ERROR_SYSCALL and SSL_ERROR_SSL, i.e., not the ones where
OpenSSL indicates it has some more information about the issue.
miss-islington pushed a commit to miss-islington/cpython that referenced this issue Dec 11, 2024
From the ERR_raise manpage:

    ERR_LIB_SYS

        This "library code" indicates that a system error is
        being reported.  In this case, the reason code given
        to `ERR_raise()` and `ERR_raise_data()` *must* be
        `errno(3)`.

This PR only handles ERR_LIB_SYS for the high-lever error types
SSL_ERROR_SYSCALL and SSL_ERROR_SSL, i.e., not the ones where
OpenSSL indicates it has some more information about the issue.
(cherry picked from commit f4b31ed)

Co-authored-by: Petr Viktorin <encukou@gmail.com>
encukou added a commit that referenced this issue Dec 12, 2024
…127812)

gh-127257: ssl: Raise OSError for ERR_LIB_SYS (GH-127361)

From the ERR_raise manpage:

    ERR_LIB_SYS

        This "library code" indicates that a system error is
        being reported.  In this case, the reason code given
        to `ERR_raise()` and `ERR_raise_data()` *must* be
        `errno(3)`.

This PR only handles ERR_LIB_SYS for the high-lever error types
SSL_ERROR_SYSCALL and SSL_ERROR_SSL, i.e., not the ones where
OpenSSL indicates it has some more information about the issue.
(cherry picked from commit f4b31ed)

Co-authored-by: Petr Viktorin <encukou@gmail.com>
miss-islington pushed a commit to miss-islington/cpython that referenced this issue Dec 13, 2024
From the ERR_raise manpage:

    ERR_LIB_SYS

        This "library code" indicates that a system error is
        being reported.  In this case, the reason code given
        to `ERR_raise()` and `ERR_raise_data()` *must* be
        `errno(3)`.

This PR only handles ERR_LIB_SYS for the high-lever error types
SSL_ERROR_SYSCALL and SSL_ERROR_SSL, i.e., not the ones where
OpenSSL indicates it has some more information about the issue.
(cherry picked from commit f4b31ed)

Co-authored-by: Petr Viktorin <encukou@gmail.com>
encukou added a commit that referenced this issue Dec 16, 2024
…127905)

gh-127257: ssl: Raise OSError for ERR_LIB_SYS (GH-127361)

From the ERR_raise manpage:

    ERR_LIB_SYS

        This "library code" indicates that a system error is
        being reported.  In this case, the reason code given
        to `ERR_raise()` and `ERR_raise_data()` *must* be
        `errno(3)`.

This PR only handles ERR_LIB_SYS for the high-lever error types
SSL_ERROR_SYSCALL and SSL_ERROR_SSL, i.e., not the ones where
OpenSSL indicates it has some more information about the issue.
(cherry picked from commit f4b31ed)

Co-authored-by: Petr Viktorin <encukou@gmail.com>
srinivasreddy pushed a commit to srinivasreddy/cpython that referenced this issue Jan 8, 2025
From the ERR_raise manpage:

    ERR_LIB_SYS

        This "library code" indicates that a system error is
        being reported.  In this case, the reason code given
        to `ERR_raise()` and `ERR_raise_data()` *must* be
        `errno(3)`.


This PR only handles ERR_LIB_SYS for the high-lever error types
SSL_ERROR_SYSCALL and SSL_ERROR_SSL, i.e., not the ones where
OpenSSL indicates it has some more information about the issue.
encukou added a commit that referenced this issue Jan 10, 2025
* ssl: Add hex error code to "unknown error" messages

To make it easier to vary the individual parts of the message,
replace the if-ladder with constant format strings by building
the string piece-wise with PyUnicodeWriter.

Use "unknown error (0x%x)" rather than just "unknown error" if we
can't get a better error message. (Hex makes sense as the error
includes two packed parts.)
@vstinner
Copy link
Member Author

Congrats for this fix @encukou !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
tests Tests in the Lib/test dir topic-SSL
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants