gh-59705: Add _thread.set_name() function #127338

vstinner · 2024-11-27T16:09:37Z

On Linux, threading.Thread now sets the thread name to the operating system.

configure now checks if pthread_setname_np() function is available.

Issue: Python should support exporting thread names to the OS #59705

On Linux, threading.Thread now sets the thread name to the operating system. configure now checks if pthread_setname_np() function is available.

vstinner · 2024-11-27T16:19:23Z

This implementation is very basic on purpose. I plan to add support for more platform in follow-up PRs.

On Linux, set_name() does nothing if the name is longer than 15 bytes. Should the function truncate silently to 15 bytes instead? I don't think that raising an exception is very convenient here.
Setting Thread.name after Thread.start() doesn't call again set_name(). set_name() is called only once per thread, at startup.
I didn't add automated tests since I don't want to add a get_name() function (use Thread.name to get a thread name).

Demo 1 (main thread):

$ ./python
>>> import os
>>> pid=os.getpid()
>>> with open(f"/proc/{pid}/task/{pid}/comm") as fp: print(f"comm = {fp.read()!r}")
... 
comm = 'python\n'

>>> import _thread; _thread.set_name("demo")
>>> with open(f"/proc/{pid}/task/{pid}/comm") as fp: print(f"comm = {fp.read()!r}")
... 
comm = 'demo\n'

Demo 2 (thread):

$ ./python
>>> import threading, os, time
>>> os.getpid()
81921
>>> t=threading.Thread(target=time.sleep, args=(60,), name="sleeper")
>>> t.start()
^Z

$ cat /proc/81921/task/81927/comm 
sleeper

See also a previous attempt to implement the feature: #14578

vstinner · 2024-11-27T16:38:13Z

I didn't add automated tests since I don't want to add a get_name() function (use Thread.name to get a thread name).

I changed my mind and added a private _thread._get_name() function for tests.

vstinner · 2024-11-28T09:56:23Z

@pitrou @encukou @serhiy-storchaka: Would you mind to review this change? It's to set the thread name in threading.Thread to the operating system.

vstinner · 2024-11-28T10:17:47Z

On Linux, set_name() does nothing if the name is longer than 15 bytes. Should the function truncate silently to 15 bytes instead? I don't think that raising an exception is very convenient here.

I modified _thread.set_name(name) to truncate name to 15 bytes on Linux.

Truncating the string in threading.Thread would be more complicated since it requires to encode the string the filesystem encoding, detect the operating system (Linux), and hardcode the 15 bytes limit there. IMO it's more convenient to truncate in _thread.set_name().

encukou

This looks great, thank you!

The truncation is not pretty with non-ASCII names. I guess codepoint-preserving truncation is not worth the effort, and Linux tools need to deal with thread names being arbitrary bytes.

But, we can test the edge cases, to ensure this quality-of-life enhancement doesn't start raising exceptions in working code.

Lib/test/test_threading.py

Modules/_threadmodule.c

Lib/threading.py

vstinner · 2024-11-28T15:00:50Z

@encukou: I addressed your reviews. Please review the updated PR.

I added tests on long names and non-ASCII names.

Lib/test/test_threading.py

Refactor also tests.

vstinner · 2024-11-28T16:13:03Z

@encukou: Maybe the "replace" error handler can be used, instead of not setting the name if the name cannot be encoded to the filesystem encoding. What do you think?

serhiy-storchaka · 2024-11-28T17:05:56Z

You can use FS_NONASCII. You can also use TESTFN_UNDECODABLE to test that it works with arbitrary bytes and TESTFN_UNENCODABLE to test for encoding error.

Is it a hard limit for the size? Is it the same on other platforms? I would prefer to use a named constant instead of magic numbers 15, 16, 17.

vstinner · 2024-11-28T17:11:58Z

@serhiy-storchaka:

Is it a hard limit for the size?

Yes. Using a longer name fails with ERANGE.

Is it the same on other platforms?

It's 16 bytes on Linux and 64 bytes on macOS, so no, it's not the same.

I would prefer to use a named constant instead of magic numbers 15, 16, 17.

I failed to find a public constant for these limits. For example, Darwin MAXTHREADNAMESIZE constant is private (I'm not 100% sure, but I don't have macOS so I cannot check manually, I only read the code).

vstinner · 2024-11-28T17:18:09Z

You can use FS_NONASCII. You can also use TESTFN_UNDECODABLE to test that it works with arbitrary bytes and TESTFN_UNENCODABLE to test for encoding error.

Ok, I added tests using FS_NONASCII and TESTFN_UNENCODABLE.

Modules/_threadmodule.c

configure.ac

Modules/_threadmodule.c

serhiy-storchaka · 2024-12-05T13:24:21Z

AFAIK, to support cross-compilation, configure is limited to compiling & linking. It shouldn't run the built code.

Well, then hardcoding limits for known platforms is okay. We could also determine it at runtime, but this would be too complicated.

On Linux, the thread name can be retrieved by reading /proc: see my examples #127338 (comment).

It is the content of the file, not its name. There is a flaw in this example: what encoding do you use to decode it? How do you read the name of the thread in other process with different locale?

vstinner · 2024-12-05T15:36:31Z

@serhiy-storchaka: I addressed your review, please review the updated PR.

@serhiy-storchaka:

It is the content of the file, not its name. There is a flaw in this example: what encoding do you use to decode it?

open() uses the current LC_CTYPE locale encoding by default. _thread.set_name() uses the filesystem encoding.

How do you read the name of the thread in other process with different locale?

You have the same problem with file content. It's not a new problem.

IMO the Python filesystem encoding is a better choice than UTF-8 for the thread name.

Modules/_threadmodule.c

Lib/test/test_threading.py

Modules/_threadmodule.c

serhiy-storchaka · 2024-12-05T17:27:31Z

You have the same problem with file content. It's not a new problem.

It is only a large problem if you use locale encoding for file content. If you use a fixed encoding (e.g. UTF-8) or write encoding as a metadata before writing thee encoded content, it is not a problem or a lesser problem.

Lib/test/test_threading.py

serhiy-storchaka

I disagree with some decisions, but do not want to block the PR. We can fix this later.

Co-authored-by: Serhiy Storchaka <[email protected]>

encukou · 2024-12-06T15:07:01Z

Modules/_threadmodule.c

+        errno = rc;
+        return PyErr_SetFromErrno(PyExc_OSError);
+    }
+


pthread_getname_np should add a trailing NUL byte, but like everything here, that's platform-specific. I suggest being defensive here.

Suggested change

name[Py_ARRAY_LENGTH(name)-1] = 0;

On what platform it does not add the null byte?

The null byte is added on all supported platforms. Before I made sure that the buffer always ended with a null byte, but @serhiy-storchaka asked me to remove it. Let's be optimistic. We can adjust the code later if needed.

encukou

I agree, ship it :)
Or make a few more changes if you want it to be more perfect.

Modules/_threadmodule.c

serhiy-storchaka · 2024-12-06T16:28:10Z

Modules/_threadmodule.c

+    size_t len = PyBytes_GET_SIZE(name_encoded);
    if (len > PYTHREAD_NAME_MAXLEN) {


You can also inline len. It is only used here.

On Linux, threading.Thread now sets the thread name to the operating system. * configure now checks if pthread_getname_np() and pthread_setname_np() functions are available. * Add PYTHREAD_NAME_MAXLEN macro. * Add _thread._NAME_MAXLEN constant for test_threading. Co-authored-by: Serhiy Storchaka <[email protected]>

kulikjak · 2025-04-02T13:24:00Z

I am sorry I didn't get to this earlier; on Solaris, the tests are green and all works as expected.

There is one small issue with Solaris detection in test_set_name and I opened #132012 to resolve that.

See python/cpython#127338

pythongh-59705: Add _thread.set_name() function

c6d324d

On Linux, threading.Thread now sets the thread name to the operating system. configure now checks if pthread_setname_np() function is available.

vstinner requested review from erlend-aasland and corona10 as code owners November 27, 2024 16:09

bedevere-app bot mentioned this pull request Nov 27, 2024

Python should support exporting thread names to the OS #59705

Closed

bedevere-app bot added the awaiting core review label Nov 27, 2024

vstinner mentioned this pull request Nov 27, 2024

gh-59705: Export threading.Thread() names to the OS #14578

Closed

vstinner added 2 commits November 27, 2024 17:21

Port to macOS

63b5d52

Add tests

9f6a8ab

Try to fix macOS _get_name()

d79e7af

Truncate to 15 bytes; add error handling

ebd9752

encukou reviewed Nov 28, 2024

View reviewed changes

Lib/test/test_threading.py Outdated Show resolved Hide resolved

Modules/_threadmodule.c Outdated Show resolved Hide resolved

Lib/threading.py Outdated Show resolved Hide resolved

vstinner added 3 commits November 28, 2024 15:36

Address review

a7f5651

Add test on non-ASCII name truncation

97ea645

Add test on non-ASCII name

78a9ab9

vstinner force-pushed the thread_set_name branch from b9f2359 to 78a9ab9 Compare November 28, 2024 14:57

Test long name on non-Linux platforms

dcf13f4

encukou reviewed Nov 28, 2024

View reviewed changes

Lib/test/test_threading.py Outdated Show resolved Hide resolved

vstinner added 2 commits November 28, 2024 16:57

macOS is limited to 63 bytes

6ea7e5a

Catch UnicodeEncodeError when seting the name

46721bb

Refactor also tests.

Add tests

6962116

Use "replace" error handler

5d27da0

serhiy-storchaka reviewed Dec 5, 2024

View reviewed changes

Address Serhiy's review

681624e

serhiy-storchaka reviewed Dec 5, 2024

View reviewed changes

Modules/_threadmodule.c Show resolved Hide resolved

Lib/test/test_threading.py Outdated Show resolved Hide resolved

Lib/test/test_threading.py Outdated Show resolved Hide resolved

Lib/test/test_threading.py Outdated Show resolved Hide resolved

Modules/_threadmodule.c Show resolved Hide resolved

picnixz reviewed Dec 5, 2024

View reviewed changes

Lib/test/test_threading.py Outdated Show resolved Hide resolved

vstinner added 3 commits December 6, 2024 10:31

Solaris always use UTF-8

f57339f

Simplify tests

3941123

Use @unittest.skipUnless

2e27043

serhiy-storchaka reviewed Dec 6, 2024

View reviewed changes

Lib/test/test_threading.py Show resolved Hide resolved

Lib/test/test_threading.py Outdated Show resolved Hide resolved

Solaris uses UTF-8

6ab2944

serhiy-storchaka reviewed Dec 6, 2024

View reviewed changes

Lib/test/test_threading.py Outdated Show resolved Hide resolved

serhiy-storchaka approved these changes Dec 6, 2024

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting core review labels Dec 6, 2024

Update Lib/test/test_threading.py

b370c49

Co-authored-by: Serhiy Storchaka <[email protected]>

encukou reviewed Dec 6, 2024

View reviewed changes

encukou approved these changes Dec 6, 2024

View reviewed changes

Modules/_threadmodule.c Outdated Show resolved Hide resolved

vstinner added 2 commits December 6, 2024 16:53

add test on embedded null character

beeae59

Optimize code truncating the name

ae956a0

vstinner enabled auto-merge (squash) December 6, 2024 16:20

vstinner merged commit 67b18a1 into python:main Dec 6, 2024
41 checks passed

vstinner deleted the thread_set_name branch December 6, 2024 16:27

bedevere-app bot removed the awaiting merge label Dec 6, 2024

serhiy-storchaka approved these changes Dec 6, 2024

View reviewed changes

xavierog mentioned this pull request Mar 15, 2025

_thread.set_name(): doubt about _PYTHREAD_NAME_MAXLEN values for BSD operating systems #131268

Closed

jamadden added a commit to gevent/gevent that referenced this pull request Apr 14, 2025

3.14: expose gevent.thread.set_name.

0a31cfd

See python/cpython#127338

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-59705: Add _thread.set_name() function #127338

gh-59705: Add _thread.set_name() function #127338

vstinner commented Nov 27, 2024 •

edited by bedevere-app bot

Loading

vstinner commented Nov 27, 2024

vstinner commented Nov 27, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

encukou left a comment

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

serhiy-storchaka commented Nov 28, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

serhiy-storchaka commented Dec 5, 2024

vstinner commented Dec 5, 2024

serhiy-storchaka commented Dec 5, 2024

serhiy-storchaka left a comment

encukou Dec 6, 2024

serhiy-storchaka Dec 6, 2024

vstinner Dec 6, 2024

encukou left a comment

serhiy-storchaka Dec 6, 2024

kulikjak commented Apr 2, 2025

		size_t len = PyBytes_GET_SIZE(name_encoded);
		if (len > PYTHREAD_NAME_MAXLEN) {

gh-59705: Add _thread.set_name() function #127338

gh-59705: Add _thread.set_name() function #127338

Conversation

vstinner commented Nov 27, 2024 • edited by bedevere-app bot Loading

vstinner commented Nov 27, 2024

vstinner commented Nov 27, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

encukou left a comment

Choose a reason for hiding this comment

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

serhiy-storchaka commented Nov 28, 2024

vstinner commented Nov 28, 2024

vstinner commented Nov 28, 2024

serhiy-storchaka commented Dec 5, 2024

vstinner commented Dec 5, 2024

serhiy-storchaka commented Dec 5, 2024

serhiy-storchaka left a comment

Choose a reason for hiding this comment

encukou Dec 6, 2024

Choose a reason for hiding this comment

serhiy-storchaka Dec 6, 2024

Choose a reason for hiding this comment

vstinner Dec 6, 2024

Choose a reason for hiding this comment

encukou left a comment

Choose a reason for hiding this comment

serhiy-storchaka Dec 6, 2024

Choose a reason for hiding this comment

kulikjak commented Apr 2, 2025

vstinner commented Nov 27, 2024 •

edited by bedevere-app bot

Loading