Clang test Analysis/live-stmts.cpp randomly fails on MacOS #126804

dyung · 2025-02-11T21:53:33Z

I have two build bots set up to build and test LLVM on macOS, and I have noticed that the test clang/test/Analysis/live-stmts.cpp seems to fail randomly on them. I have two identically configured workers running the job, and it has failed on both, so it doesn't seem specific to one machine configuration.

Here is a sample of the failing test output:

******************** TEST 'Clang :: Analysis/live-stmts.cpp' FAILED ********************
Exit Code: 1
Command Output (stderr):
--
RUN: at line 1: /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/clang -cc1 -internal-isystem /Users/buildbot/buildbot-root/aarch64-darwin/build/lib/clang/21/include -nostdsysteminc -analyze -analyzer-constraints=range -setup-static-analyzer -w -analyzer-checker=debug.DumpLiveExprs /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/Analysis/live-stmts.cpp 2>&1   | /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/FileCheck /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/Analysis/live-stmts.cpp
+ /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/clang -cc1 -internal-isystem /Users/buildbot/buildbot-root/aarch64-darwin/build/lib/clang/21/include -nostdsysteminc -analyze -analyzer-constraints=range -setup-static-analyzer -w -analyzer-checker=debug.DumpLiveExprs /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/Analysis/live-stmts.cpp
+ /Users/buildbot/buildbot-root/aarch64-darwin/build/bin/FileCheck /Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/Analysis/live-stmts.cpp
/Users/buildbot/buildbot-root/aarch64-darwin/llvm-project/clang/test/Analysis/live-stmts.cpp:239:16: error: CHECK-EMPTY: is not on the line after the previous match
// CHECK-EMPTY:
               ^
<stdin>:180:1: note: 'next' match was here
^
<stdin>:177:1: note: previous match ended here
^
<stdin>:178:1: note: non-matching line after previous match is here
ImplicitCastExpr 0x151009d78 '_Bool' <LValueToRValue>
^

<many lines skipped>

172: [ B3 (live expressions at block exit) ] 
         173:  
         174: IntegerLiteral 0x15082f800 'int' 0 
check:235     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
         175:  
empty:236     ^
         176: IntegerLiteral 0x15082f820 'int' 1 
check:237     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
         177:  
empty:238     ^
         178: ImplicitCastExpr 0x151009d78 '_Bool' <LValueToRValue> 
         179: `-DeclRefExpr 0x151009d38 '_Bool' lvalue ParmVar 0x151009bb8 'b' '_Bool' 
         180:  
empty:239     ! error: match on wrong line

Here are some recent runs of the build bot where the test failure occurred:

shafik · 2025-02-11T22:33:36Z

Maybe a duplicate of: #126619

CC @haoNoQ @necto

llvmbot · 2025-02-11T22:35:17Z

@llvm/issue-subscribers-clang-static-analyzer

Author: None (dyung)

steakhal · 2025-02-12T06:42:59Z

Duplicate of #126619

steakhal · 2025-02-12T06:44:42Z

I'll have a look at this.

steakhal · 2025-02-12T13:14:58Z

I tried running this test about half a million times, and it always passed. I'm on Linux x86_64.
The flaky failure may have something to do with ASLR and other platform-specific behavior.

After looking at the output you observed, my guess is that there is some nondeterminism somewhere that makes the ordering of the dumps unstable.

If you look at each "live expressions at block exit" section, the expectation specifies a fixed ordering, yet in your observed failure the expressions appear in a different permutation.

I think this shouldn't be too hard to fix while dumping these, but we should maybe get to the bottom of this and consider making the backing implementation deterministic instead of, or in addition to, sorting in the dumps.

I don't believe #100745 is at fault. I think it just exposed this behavior more often due to the additional tests.

Multiple people reported flaky bot failures tied to `clang/test/Analysis/live-stmts.cpp`. I tried reproducing the flaky behavior on my Linux x86_64 system, but the test appears to be stable in my context.

Only by looking at the reported failures could I formulate a potential diagnosis. The output always looked almost the same, except that the Exprs dumped per basic block were shuffled compared to my expectation. This suggests an ordering issue. The backing storage of `blocksEndToLiveness[B].liveExprs` is an `llvm::ImmutableSet<const Expr *>`. That container likely uses the pointer values as keys, so the runtime values of the addresses influence the iteration order.

To fix this, before dumping, I sort the expressions by their "beginLocs". It should be efficient enough for a debug checker, where there is no performance constraint.

This should hopefully fix the flaky behavior on systems where ASLR works differently than on (my) Linux system.

Hopefully fixes llvm#126619
Hopefully fixes llvm#126804
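The diagnosis above can be illustrated with a minimal C++ sketch. It is not the code from the fix: `ExprStub`, `names`, and `sortedByBeginLoc` are hypothetical names, `std::set` stands in for `llvm::ImmutableSet`, and an integer stands in for the begin location. It only shows that iterating a pointer-keyed set follows the addresses, while sorting by a property of the node itself does not.

```cpp
#include <algorithm>
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for clang::Expr; only what the sketch needs.
struct ExprStub {
  std::string Name;
  unsigned BeginLoc; // stand-in for a source begin location
};

// Collect the names in the order the expressions are given.
std::vector<std::string> names(const std::vector<const ExprStub *> &Exprs) {
  std::vector<std::string> Out;
  for (const ExprStub *E : Exprs)
    Out.push_back(E->Name);
  return Out;
}

// Copy the pointer-keyed set into a vector and order it by begin location,
// so the result no longer depends on where the nodes happen to live in memory.
std::vector<const ExprStub *>
sortedByBeginLoc(const std::set<const ExprStub *> &Live) {
  std::vector<const ExprStub *> Out(Live.begin(), Live.end());
  std::sort(Out.begin(), Out.end(),
            [](const ExprStub *A, const ExprStub *B) {
              return A->BeginLoc < B->BeginLoc;
            });
  return Out;
}
```

If the same three nodes are laid out in memory in two different orders, the raw set iteration differs between the two layouts, while `sortedByBeginLoc` returns the same sequence for both. This holds only as long as no two nodes share a begin location.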

steakhal · 2025-02-12T13:38:25Z

Find the fix at #126913

…2nd attempt) (#127406)

My previous attempt (#126913) at fixing the flaky case was on the right track when it used the begin locations as a stable ordering. However, I forgot to consider the case where several Exprs share the same begin location.

In an `EXPENSIVE_CHECKS` build, arrays are randomly shuffled prior to sorting them. This exposed the flaky behavior much more often, basically breaking the "stability" of the vector, as it should. Because of this, I had to revert the previous fix attempt in #127034.

To fix this, this time I use `Expr::getID` as a stable ID for an Expr.

Hopefully fixes #126619
Hopefully fixes #126804
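The tie problem described in the second attempt can be sketched in a few lines of C++. This is an illustration under stated assumptions, not the code from the patch: `TiedExpr`, `ids`, `orderByBeginLoc`, and `orderByID` are hypothetical names, and the `ID` field merely stands in for the value the commit message obtains from `Expr::getID`. A stable sort is used for the begin-location ordering so that the dependence on input order is reproducible here.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for an expression node.
struct TiedExpr {
  unsigned BeginLoc; // may be shared by several expressions
  std::int64_t ID;   // assumed unique per expression
};

// Extract the IDs in the current order of the vector.
std::vector<std::int64_t> ids(const std::vector<TiedExpr> &Exprs) {
  std::vector<std::int64_t> Out;
  for (const TiedExpr &E : Exprs)
    Out.push_back(E.ID);
  return Out;
}

// Order by begin location only: elements with equal locations keep whatever
// relative order the input had, so the result depends on the input order.
std::vector<std::int64_t> orderByBeginLoc(std::vector<TiedExpr> Exprs) {
  std::stable_sort(Exprs.begin(), Exprs.end(),
                   [](const TiedExpr &A, const TiedExpr &B) {
                     return A.BeginLoc < B.BeginLoc;
                   });
  return ids(Exprs);
}

// Order by the unique ID: there are no ties, so every permutation of the
// same input produces the same output.
std::vector<std::int64_t> orderByID(std::vector<TiedExpr> Exprs) {
  std::sort(Exprs.begin(), Exprs.end(),
            [](const TiedExpr &A, const TiedExpr &B) { return A.ID < B.ID; });
  return ids(Exprs);
}
```

With two expressions sharing a begin location, shuffling the input changes the result of `orderByBeginLoc` but not of `orderByID`, which is why a key that is unique per expression is needed once the input order itself is randomized.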

…2nd attempt) (llvm#127406) (cherry picked from commit f378e52)

llvmbot added the `clang` label (Clang issues not falling into any other category) Feb 11, 2025

EugeneZelenko added the `test-suite` and `clang:static analyzer` labels and removed the `clang` label (Clang issues not falling into any other category) Feb 11, 2025

steakhal marked this as a duplicate of #126619 Feb 12, 2025

steakhal self-assigned this Feb 12, 2025

steakhal mentioned this issue Feb 12, 2025

[clang][analysis] Fix flaky clang/test/Analysis/live-stmts.cpp test #126913

Merged

EugeneZelenko added the `clang:analysis` label and removed the `test-suite` and `clang:static analyzer` labels Feb 12, 2025

steakhal closed this as completed in #126913 Feb 12, 2025

steakhal closed this as completed in be25d61 Feb 12, 2025

steakhal mentioned this issue Feb 16, 2025

[clang][analysis] Fix flaky clang/test/Analysis/live-stmts.cpp test (2nd attempt) #127406

Merged

steakhal mentioned this issue May 12, 2025

[clang][analysis] Fix flaky clang/test/Analysis/live-stmts.cpp test (2nd attempt) (#127406) #139591

Merged
