posix: semaphore: implement named semaphore functions #67007

ycsin · 2023-12-26T10:19:14Z

Implemented:

sem_open()
sem_close()
sem_unlink()

lib/posix/semaphore.c

jukkar · 2023-12-27T12:16:17Z

lib/posix/semaphore.c

+	sys_snode_t snode;
+	sem_t sem;
+	atomic_t ref_count;
+	char *name;


We could avoid malloc if the name were allocated like this:

char name[CONFIG_SEM_NAMELEN_MAX];

I think we have this pattern in many places in the code base.

I'm going to gather more comments on this one: doing so would allocate a fixed-size array regardless of the actual length of the name.

Alternatively, we could allocate the buffer for the nsem_obj and the name array in one go, so that the total memory allocated depends on the name length. Ideally we would be able to k_realloc the buffer once the sem is unlinked. However, k_realloc isn't available (#41151), so I opted for the current implementation, in which the name buffer can be freed separately upon sem_unlink.

I would say a good follow-up PR would be to add static allocation support.

I'm ok with dynamic going in first and static coming in later, but I think both use cases are valuable for Zephyr.

The existing pattern that doesn't use malloc should be fine for filling the static gap later. Thanks for your help ❤️
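The single-allocation alternative mentioned in this thread could look roughly like the sketch below. It is only an illustration under assumptions: `nsem_obj_flex` and `nsem_obj_alloc()` are hypothetical names, the struct is reduced to two fields, and standard `malloc()` stands in for the Zephyr kernel allocator.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical, reduced variant of nsem_obj: the name lives in the same
 * allocation as the object, so the total size tracks the name length.
 */
struct nsem_obj_flex {
	int ref_count;
	char name[]; /* flexible array member, sized at allocation time */
};

/* Allocate the object and its name in one go; returns NULL on failure. */
static struct nsem_obj_flex *nsem_obj_alloc(const char *name)
{
	assert(name != NULL);

	size_t len = strlen(name) + 1; /* include the terminating NUL */
	struct nsem_obj_flex *nsem = malloc(sizeof(*nsem) + len);

	if (nsem == NULL) {
		return NULL;
	}

	nsem->ref_count = 1;
	memcpy(nsem->name, name, len);
	return nsem;
}
```

The trade-off raised in the thread still applies: with this layout the name cannot be released on its own at sem_unlink() time, whereas a separately allocated name buffer can.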

lib/posix/semaphore.c

lib/posix/Kconfig.semaphore

lib/posix/semaphore.c

doc/services/portability/posix/option_groups/index.rst

cfriedt · 2023-12-29T14:18:06Z

MAINTAINERS.yml

@@ -2728,6 +2728,8 @@ POSIX API layer:
  status: maintained
  maintainers:
    - cfriedt
+  collaborators:
+    - ycsin


lib/posix/semaphore.c

tests/posix/common/src/semaphore.c

npitre · 2024-01-06T00:36:49Z

This code is racy.

sem_t *sem_open(const char *name, int oflags, ...)
{
       /* Check if the named semaphore exists */
       k_mutex_lock(&nsem_mutex, K_FOREVER);
       nsem = find_nsem(name);
       k_mutex_unlock(&nsem_mutex);

       /* !!! preemption point !!! */
       [...]

       if (nsem == NULL) {
                [...]
       } else {
               atomic_inc(&nsem->ref_count);
       }

When find_nsem() returns a valid non-null pointer for nsem,
nothing prevents another thread calling sem_unlink() and sem_close()
at the indicated preemption point, turning nsem into a zombie pointer
and corrupting freed and potentially reused memory.
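One way to close this kind of window is to look up the object and take the reference inside the same critical section. The sketch below is not the PR's code: `nsem_entry`, `nsem_find_locked()` and `nsem_get()` are hypothetical names, a pthread mutex stands in for `k_mutex`, and it assumes the unlink and close paths take the same lock.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a named semaphore registry entry. */
struct nsem_entry {
	const char *name;
	int ref_count;
	struct nsem_entry *next;
};

static pthread_mutex_t nsem_lock = PTHREAD_MUTEX_INITIALIZER;
static struct nsem_entry *nsem_head;

/* Walk the list; the caller must hold nsem_lock. */
static struct nsem_entry *nsem_find_locked(const char *name)
{
	for (struct nsem_entry *e = nsem_head; e != NULL; e = e->next) {
		if (strcmp(e->name, name) == 0) {
			return e;
		}
	}
	return NULL;
}

/* Look up an entry and take a reference in one critical section. */
static struct nsem_entry *nsem_get(const char *name)
{
	struct nsem_entry *e;

	assert(name != NULL);

	pthread_mutex_lock(&nsem_lock);
	e = nsem_find_locked(name);
	if (e != NULL) {
		/* Still under the lock, so no lock-taking unlink or close
		 * can free the entry between the lookup and this increment.
		 */
		e->ref_count++;
	}
	pthread_mutex_unlock(&nsem_lock);

	return e;
}
```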

npitre · 2024-01-08T17:52:59Z

The code now looks correct, the race is gone.

Here are some suggestions that aren't critical but could optimize the code
further:

1. All ref_count manipulations, except for one case, happen while
   nsem_mutex is taken. The exception is sem_close() which is unlikely
   to be on any performance-critical path. By enlarging the nsem_mutex
   coverage in sem_close() you could drop all atomic operations and convert
   ref_count to a plain int which would reduce the binary footprint, and
   possibly run faster on some architectures.
2. You could consider the presence of a name as ref_count worthy. This
   means initializing ref_count to 2 instead of 1 in sem_open() and
   decrementing ref_count in sem_unlink(). This would allow for removing
   the nsem->name == NULL test in sem_close() relying on the count alone.
3. With the above, the ref_count decrements could be factored into
   remove_nsem_if_unused() (and the function properly renamed to something
   like e.g. nsem_put_ref() or the like) which would reduce binary footprint
   further. This would open the possibility for cleanly asserting that the
   count does not go negative and there is no longer a name if the count is 0
   in only one place.

npitre · 2024-01-08T22:24:00Z

And another comment. You have:

#define SEM_FAILED (-1)

Since this is used where a pointer is normally returned, it would be nicer
to add the cast to the definition directly. Also, it is typically defined
as 0 so the error case can be seen as a null pointer return. So I'd suggest:

#define SEM_FAILED ((sem_t *) 0)
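To illustrate why the pointer-typed definition is convenient for callers, here is a small sketch. All `demo_` names are hypothetical stand-ins so the example does not clash with a real `<semaphore.h>`; it only shows that the failure value compares cleanly against the returned pointer and reads as a null pointer.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for sem_t and SEM_FAILED. */
typedef struct {
	unsigned int count;
} demo_sem_t;

#define DEMO_SEM_FAILED ((demo_sem_t *)0)

static demo_sem_t demo_pool[1];

/* Pretend open call: fails on request, otherwise hands out a pool slot. */
static demo_sem_t *demo_sem_open(int simulate_failure)
{
	return simulate_failure ? DEMO_SEM_FAILED : &demo_pool[0];
}

/* Caller side: no cast is needed to compare against the failure value. */
static int demo_caller(int simulate_failure)
{
	demo_sem_t *s = demo_sem_open(simulate_failure);

	if (s == DEMO_SEM_FAILED) {
		return -1;
	}

	assert(s != NULL); /* with a 0-based definition, success is non-NULL */
	return 0;
}
```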

ycsin · 2024-01-09T09:37:03Z

You could consider the presence of a name as ref_count worthy. This
means initializing ref_count to 2 instead of 1 in sem_open() and
decrementing ref_count in sem_unlink(). This would allow for removing
the nsem->name == NULL test in sem_close() relying on the count alone

With the above, the ref_count decrements could be factored into
remove_nsem_if_unused() (and the function properly renamed to something
like e.g. nsem_put_ref() or the like) which would reduce binary footprint
further. This would open the possibility for cleanly asserting that the
count does not go negative and there is no longer a name if the count is 0
in only one place.

Hey @npitre, thanks for the suggestions!

Regarding points 2 and 3, I've added assert tests to make sure that the ref_count doesn't underflow or overflow, but I'm not sure how to properly 'encode' the unlink function call into the ref_count while still being able to guarantee that a named semaphore is destroyed only if it is unlinked and closed. In other words, without the nsem->name == NULL test, wouldn't sem_close + sem_close be equal to sem_unlink + sem_close?

ycsin · 2024-01-09T09:42:38Z

Updated to:

- adopted suggestions from @npitre (refactored to remove the atomic operations, added under/overflow tests)
- added a small stress test for the named semaphore (one thread repeatedly calls sem_open() on a semaphore and finally sem_unlink(), while the other thread calls sem_close())
- added a few more test points that use a named semaphore in the normal semaphore test.

npitre · 2024-01-09T18:24:19Z

i.e. without the nsem->name == NULL test, wouldn't sem_close +
sem_close equals to sem_unlink + sem_close?

Consider these events:

operation                refcount change          refcount state
----------------------------------------------------------------

sem_open()               = 2 (created)            2
sem_open()               + 1 (already exists)     3
sem_close()              - 1                      2
sem_close()              - 1                      1
sem_unlink()             - 1                      0 (destroyed)

sem_open()               = 2 (created)            2
sem_open()               + 1 (already exists)     3
sem_unlink()             - 1                      2
sem_close()              - 1                      1
sem_close()              - 1                      0 (destroyed)

I'll push this change with a few misc tidbits on top of your pull request.
Feel free to fold them in your original commits or have them merged as is.
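The two sequences in the table can be replayed with a small model of the counting scheme. This is a sketch under assumptions, not the code pushed to the PR: `nsem_model` and the `model_*()` functions are hypothetical names, and locking is left out to keep the bookkeeping visible.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical model of one named semaphore's lifetime bookkeeping. */
struct nsem_model {
	int ref_count;  /* open handles, plus one while the name is linked */
	bool linked;    /* true until sem_unlink() succeeds */
	bool destroyed; /* set once the last reference is dropped */
};

/* Single place where references are dropped and invariants are checked. */
static void model_put_ref(struct nsem_model *m)
{
	assert(m->ref_count > 0); /* the count must never go negative */
	m->ref_count--;
	if (m->ref_count == 0) {
		assert(!m->linked); /* a linked name would still hold a reference */
		m->destroyed = true;
	}
}

/* sem_open() that creates: one reference for the caller, one for the name. */
static void model_create(struct nsem_model *m)
{
	m->ref_count = 2;
	m->linked = true;
	m->destroyed = false;
}

/* sem_open() on an existing name: one more reference for the new handle. */
static void model_open(struct nsem_model *m)
{
	assert(m->linked);
	m->ref_count++;
}

/* sem_close(): drop the caller's reference. */
static void model_close(struct nsem_model *m)
{
	model_put_ref(m);
}

/* sem_unlink(): remove the name and drop the reference it held. */
static void model_unlink(struct nsem_model *m)
{
	assert(m->linked);
	m->linked = false;
	model_put_ref(m);
}
```

Calling `model_create()`, `model_open()`, then two `model_close()` calls and one `model_unlink()` in either order from the table ends with the count at 0 and the object destroyed only on the last call.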

ycsin · 2024-01-10T02:54:01Z

lib/posix/semaphore.c

 	if (nsem->ref_count == 0) {
+		__ASSERT(nsem->name == NULL, "ref_count is 0 but sem is not unlinked");


@npitre This means that the following behavior is guaranteed only when CONFIG_ASSERT is enabled, is that intended?

If the semaphore has not been removed with a successful call to sem_unlink(), then sem_close() has no effect on the state of the semaphore.

i.e.: a named semaphore can be closed without being unlinked:

operation                refcount change          refcount state
----------------------------------------------------------------

sem_open()               = 2 (created)            2
sem_close()              - 1                      1
sem_close()              - 1                      0

npitre · 2024-01-10T03:25:24Z

if (nsem->ref_count == 0) {

__ASSERT(nsem->name == NULL, "ref_count is 0 but sem is not unlinked");

@npitre This means that the following behavior is guaranteed only when
CONFIG_ASSERT is enabled, is that intended?

Yes. This would happen only if the semaphore user's code is buggy.
This should never happen with well behaved code. Assertions can be configured
out to save on binary size and runtime overhead once the code is proven to
behave properly.

If the semaphore has not been removed with a successful call to
[sem_unlink()](https://pubs.opengroup.org/onlinepubs/9699919799/functions/sem_unlink.html),
then [sem_close()](https://pubs.opengroup.org/onlinepubs/9699919799/functions/sem_close.html)
has no effect on the state of the semaphore.

i.e.: a named semaphore can be closed without being unlinked

Yes, that is what the code does. The test suite exercises that condition too.

operation                refcount change          refcount state
----------------------------------------------------------------

sem_open()               = 2 (created)            2
sem_close()              - 1                      1
sem_close()              - 1                      0

That is a bad example. You normally must have balanced open() and
close(). Closing your own instance twice is buggy. Imagine what would
happen if you have an open() from two different threads and one thread
does a close() twice. The other thread is unlikely to be happy about that.

Multiple close() for a single open() is so unusual and unexpected
that the spec would mention it explicitly if that were allowed.
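The two-thread scenario described above can be replayed on paper with a few lines of C. This is a hypothetical model (`nsem_state`, `state_put_ref()` and `unbalanced_close_destroys_early()` are made-up names) that deliberately omits misuse checks, to mimic a build where assertions are configured out.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical, minimal bookkeeping for one named semaphore. */
struct nsem_state {
	int ref_count;
	bool destroyed;
};

/* Drop one reference and destroy on the last one. There are no misuse
 * checks here, mimicking a build with assertions configured out.
 */
static void state_put_ref(struct nsem_state *s)
{
	if (--s->ref_count == 0) {
		s->destroyed = true;
	}
}

/* Replays: A creates, B opens, A closes twice (the bug), then the name
 * is unlinked. Returns true if the object is gone while B never closed.
 */
static bool unbalanced_close_destroys_early(void)
{
	struct nsem_state s = { .ref_count = 2, .destroyed = false }; /* A: create */

	s.ref_count++;            /* B: open existing, count is 3 */
	assert(s.ref_count == 3);
	state_put_ref(&s);        /* A: close, count is 2 */
	state_put_ref(&s);        /* A: close again, consumes B's reference */
	state_put_ref(&s);        /* unlink drops the name's reference, count is 0 */

	return s.destroyed;       /* B still believes its handle is valid */
}
```

In this model the function returns true: the object is destroyed even though thread B never called close, which is the outcome the comment above warns about.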

ycsin · 2024-01-10T03:40:08Z

lib/posix/semaphore.c

@@ -15,7 +15,7 @@
 struct nsem_obj {
 	sys_snode_t snode;
 	sem_t sem;
-	unsigned int ref_count;
+	int ref_count;


@npitre I've changed the other parts that use ref_count to int as well, and changed the overflow check from __ASSERT_NO_MSG(nsem->ref_count != UINT_MAX) to __ASSERT_NO_MSG(nsem->ref_count != INT_MAX).

- Regroup refcount decrement and semaphore destruction by making the linked state into a counted reference for it. This allows for simplifying the code and cleanly adding a few assertions in a common location.
- Remove redundant initialization to NULL on memory about to be freed and local pointer in nsem_cleanup().

Signed-off-by: Nicolas Pitre <[email protected]>

ycsin · 2024-01-10T03:49:02Z

Feel free to fold them in your original commits or have them merged as is.

@npitre I've taken the liberty of folding some of the fixes from your commits into the original commits where they were initially introduced and rewording your commits, but left the ref_count changes as-is in your commits.

ycsin requested a review from cfriedt December 26, 2023 10:19

ycsin force-pushed the pr/posix_nsem branch from f2a19b2 to f171cf1 Compare December 26, 2023 10:35

ycsin marked this pull request as ready for review December 26, 2023 11:59

zephyrbot added area: Process area: POSIX POSIX API Library labels Dec 26, 2023

zephyrbot requested review from MaureenHelm, nashif and stephanosio December 26, 2023 12:00

zephyrbot assigned cfriedt Dec 26, 2023

ycsin force-pushed the pr/posix_nsem branch from f171cf1 to f0c0df2 Compare December 27, 2023 03:29

jukkar reviewed Dec 27, 2023

ycsin force-pushed the pr/posix_nsem branch from f0c0df2 to c90dde4 Compare December 27, 2023 12:53

jukkar reviewed Dec 28, 2023

ycsin mentioned this pull request Dec 28, 2023

RFC: Provide k_realloc() #41151

Closed

cfriedt reviewed Dec 29, 2023

cfriedt reviewed Dec 29, 2023

🥳

cfriedt mentioned this pull request Dec 29, 2023

posix: mqueue: pop mode as int with va_arg() #67071

Merged

cfriedt reviewed Dec 29, 2023

cfriedt reviewed Dec 29, 2023

ycsin force-pushed the pr/posix_nsem branch 3 times, most recently from f590fa8 to 2f76542 Compare January 2, 2024 04:04

ycsin requested review from cfriedt, jukkar and keith-packard January 2, 2024 10:34

cfriedt previously approved these changes Jan 3, 2024

ycsin dismissed cfriedt’s stale review via 5bb9aa5 January 3, 2024 17:19

ycsin force-pushed the pr/posix_nsem branch from 2f76542 to 5bb9aa5 Compare January 3, 2024 17:19

cfriedt previously approved these changes Jan 3, 2024

ycsin dismissed cfriedt’s stale review via 53128f6 January 8, 2024 04:48

ycsin force-pushed the pr/posix_nsem branch 3 times, most recently from 90b0fee to bec4c00 Compare January 8, 2024 16:29

ycsin force-pushed the pr/posix_nsem branch from bec4c00 to e19e5a7 Compare January 9, 2024 09:17

MAINTAINERS: add myself as a POSIX collaborator

89b9c87

Add myself as a collaborator in the POSIX subsystem. Signed-off-by: Yong Cong Sin <[email protected]>

ycsin force-pushed the pr/posix_nsem branch 2 times, most recently from 4767810 to 2d9c415 Compare January 9, 2024 14:59

ycsin requested a review from cfriedt January 9, 2024 16:34

ycsin commented Jan 10, 2024

ycsin force-pushed the pr/posix_nsem branch from 2bc7a21 to 60089dd Compare January 10, 2024 03:29

ycsin commented Jan 10, 2024

ycsin and others added 6 commits January 10, 2024 11:42

posix: semaphore: implement sem_open(), sem_unlink() & sem_close()

f252dda

Implements `sem_open()`, `sem_unlink()` & `sem_close()` functions and added tests for them. Updated existing tests and POSIX docs. Signed-off-by: Yong Cong Sin <[email protected]>

doc: posix: mark sem_open, sem_close & sem_unlink as supported

23788b8

The `sem_open()`, `sem_close()` & `sem_unlink()` functions are now implemented, so mark them as supported. Signed-off-by: Yong Cong Sin <[email protected]>

tests: posix: semaphore: refactor test_semaphore

eb1f8c8

Localize a few public variables and refactor the test and functions so that they are reusable. Signed-off-by: Yong Cong Sin <[email protected]>

tests: posix: semaphore: run normal semaphore test with named semaphore

9396cd1

Run the normal semaphore test with a named semaphore. Signed-off-by: Yong Cong Sin <[email protected]>

tests: posix: semaphore: assorted adjustments

b316cca

- Adjust refcount checks with regards to previous commit.
- Remove redundant zassert_not_null() on local pointers after a sem_close(). There is no implicit reference passing in C.

Signed-off-by: Nicolas Pitre <[email protected]>

ycsin force-pushed the pr/posix_nsem branch from 60089dd to b316cca Compare January 10, 2024 03:43

npitre approved these changes Jan 10, 2024

cfriedt approved these changes Jan 10, 2024

cfriedt merged commit e5b3231 into zephyrproject-rtos:main Jan 10, 2024

ycsin deleted the pr/posix_nsem branch January 10, 2024 12:30
