chore(ci_visibility): move retry logic to pytest_runtest_protocol #13448

vitor-de-araujo · 2025-05-19T14:00:09Z

This is essentially #13376, recreated now that the fix in the unittest suite (#13445) has been merged, and with the previous pytest_runtest_protocol logic wrapped in a try/except to avoid breaking the pipeline in case of an internal error in CI Visibility.

Currently we let pytest's built-in pytest_runtest_protocol hook run the test, and we check whether to retry at the makereport stage. This has a number of consequences:

- We don't have access to the setup, call, and teardown reports all at once; we see them one at a time during makereport, and we have to patch them and stash information to keep state across each pass through the reports for a given test.
- pytest logs each report as it is created, so we have to patch the reports at the right time for them to be printed correctly (e.g. to change the outcome from FAILED to ATR INITIAL ATTEMPT FAILED; this affects not only printing, but also the session exit status).
- In particular, for EFD, when the initial attempt passes we run too late: the PASSED status has already been logged by the time we patch it, so the output shows PASSED instead of EFD INITIAL ATTEMPT PASSED. We also have to handle this case specially when generating the terminal summary.

This PR moves the retry logic from the pytest_runtest_makereport hook to the pytest_runtest_protocol hook, which means we replace pytest's pytest_runtest_protocol implementation with our own. We invoke pytest's internal runtestprotocol function directly from our hook, so our hook behaves much like pytest's own. The difference is that we call this function with log=False, so pytest doesn't log the setup, call, and teardown reports as they are created. Instead, we collect all the reports, patch them as needed, and then print them out. This lets us write the logic with full knowledge of the final status of a test run, instead of patching things as we see them during setup, call, and teardown.
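
The flow described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `should_retry`, `relabel_initial_attempt`, and the `dd_initial_attempt_failed` label are hypothetical names, and the actual re-run step is omitted. The `runtestprotocol` function and the `pytest_runtest_log*` hooks are pytest's own.

```python
# conftest.py: a sketch only, not the ddtrace implementation.


def should_retry(reports):
    """Hypothetical predicate: retry when any phase (setup/call/teardown) failed."""
    return any(r.outcome == "failed" for r in reports)


def relabel_initial_attempt(reports, label="dd_initial_attempt_failed"):
    """Hypothetical patch step: rewrite failed outcomes before anything is logged."""
    for report in reports:
        if report.outcome == "failed":
            report.outcome = label
    return reports


def pytest_runtest_protocol(item, nextitem):
    # Imported lazily so the helpers above stay importable without pytest.
    from _pytest.runner import runtestprotocol

    item.ihook.pytest_runtest_logstart(nodeid=item.nodeid, location=item.location)
    # log=False: pytest returns the setup/call/teardown reports without logging them.
    reports = runtestprotocol(item, nextitem=nextitem, log=False)
    if should_retry(reports):
        relabel_initial_attempt(reports)
        # A real handler would re-run the test here and log each attempt.
    # Only now are the (possibly patched) reports handed to the reporters.
    for report in reports:
        item.ihook.pytest_runtest_logreport(report=report)
    item.ihook.pytest_runtest_logfinish(nodeid=item.nodeid, location=item.location)
    return True  # tell pytest the protocol was handled for this item
```

A custom outcome string like the one above would normally be paired with a pytest_report_teststatus implementation that tells the terminal reporter how to display it.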

For retries, the responsibility for logging the statuses moves to the retry handlers themselves, so they can delay printing until after the reports have been patched. In principle, we could even decide not to print the retry results individually and only print the final status (which would make for cleaner output), but this can come in a future version.
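
As a rough illustration of a handler that owns its logging, the sketch below patches each attempt's reports before passing them to a logging callback. Everything here is an assumption made for the example: the `Report` stand-in, the `retry_until_pass` name, the `retry_passed`/`retry_failed` labels, and the injected `run_once` and `log_report` callables are not taken from the PR.

```python
from dataclasses import dataclass


@dataclass
class Report:
    """Stand-in for a pytest report, reduced to the two fields used here."""
    when: str      # "setup", "call" or "teardown"
    outcome: str   # "passed", "failed" or "skipped"


def retry_until_pass(run_once, log_report, max_retries=3):
    """Hypothetical retry handler that logs each attempt only after patching it."""
    final_outcome = "failed"
    for _ in range(max_retries):
        # In pytest, run_once could wrap runtestprotocol(item, log=False).
        reports = run_once()
        attempt_passed = all(r.outcome == "passed" for r in reports)
        for report in reports:
            if report.when == "call":
                # Patch before logging, so the printed status is already the retry label.
                report.outcome = "retry_passed" if attempt_passed else "retry_failed"
            log_report(report)
        if attempt_passed:
            final_outcome = "passed"
            break
    return final_outcome
```

Because the handler decides when `log_report` is called, printing only the final status later would be a local change inside the handler.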

For EFD, the special final outcomes (dd_efd_final_passed, etc.) are replaced with plain passed, failed, skipped states, which xdist can handle, and the final states are only used in efd_get_teststatus (called from the pytest_report_teststatus hook).
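
One way to picture this split: the report keeps a plain outcome, and the EFD-specific status is derived only when pytest asks how to display the report. The sketch below assumes a hypothetical `("efd_final", True)` entry in `user_properties` as the marker and hypothetical display strings; the real `efd_get_teststatus` may identify final reports differently.

```python
def efd_teststatus(report):
    """Hypothetical analogue of efd_get_teststatus.

    Returns (category, short letter, verbose word), or None to defer to pytest.
    """
    if report.when != "call":
        return None
    # Hypothetical marker: the report itself still carries a plain outcome.
    if ("efd_final", True) not in report.user_properties:
        return None
    category = f"dd_efd_final_{report.outcome}"
    return category, "", f"EFD FINAL STATUS: {report.outcome.upper()}"


def pytest_report_teststatus(report, config):
    # Returning None lets other plugins and pytest's default status handling run.
    return efd_teststatus(report)
```

With this shape, the special status exists only at display time, while the report's stored outcome stays one of passed, failed, or skipped.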

Future work:

- Attempt to Fix has to be modified in similar ways to EFD, but it also has to handle quarantine, so it's a bit more involved.
- xdist still prints EFD INITIAL ATTEMPT for all attempts (not just the first one).
- The whole retry logic outside of pytest should be refactored (see chore(ci_visibility): refactor test retry logic #13224), but this PR is a first step toward making the rest possible.

Checklist

PR author has checked that all the criteria below are met
The PR description includes an overview of the change
The PR description articulates the motivation for the change
The change includes tests OR the PR description describes a testing strategy
The PR description notes risks associated with the change, if any
Newly-added code is easy to change
The change follows the library release note guidelines
The change includes or references documentation updates if necessary
Backport labels are set (if applicable)

Reviewer Checklist

Reviewer has checked that all the criteria below are met
Title is accurate
All changes are related to the pull request's stated goal
Avoids breaking API changes
Testing strategy adequately addresses listed risks
Newly-added code is easy to change
Release note makes sense to a user of the library
If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
Backport labels are set in a manner that is consistent with the release branch maintenance policy

github-actions · 2025-05-19T14:02:00Z

CODEOWNERS have been resolved as:

ddtrace/contrib/internal/pytest/_atr_utils.py                           @DataDog/ci-app-libraries
ddtrace/contrib/internal/pytest/_attempt_to_fix.py                      @DataDog/ci-app-libraries
ddtrace/contrib/internal/pytest/_efd_utils.py                           @DataDog/ci-app-libraries
ddtrace/contrib/internal/pytest/_plugin_v2.py                           @DataDog/ci-app-libraries
ddtrace/contrib/internal/pytest/_retry_utils.py                         @DataDog/ci-app-libraries
ddtrace/contrib/internal/pytest/_utils.py                               @DataDog/ci-app-libraries
ddtrace/contrib/internal/pytest/plugin.py                               @DataDog/ci-app-libraries

ddtrace/contrib/internal/pytest/_plugin_v2.py

github-actions · 2025-05-19T14:23:34Z

Bootstrap import analysis

Comparison of import times between this PR and base.

Summary

The average import time from this PR is: 248 ± 4 ms.

The average import time from base is: 248 ± 5 ms.

The import time difference between this PR and base is: -0.4 ± 0.2 ms.

The difference is not statistically significant (z = -1.99).

Import time breakdown

The following import paths have shrunk:

ddtrace.auto 1.869 ms (0.75%)

ddtrace.bootstrap.sitecustomize 1.191 ms (0.48%)

ddtrace.bootstrap.preload 1.191 ms (0.48%)

ddtrace.internal.remoteconfig.client 0.582 ms (0.23%)

ddtrace 0.678 ms (0.27%)

pr-commenter · 2025-05-19T15:01:15Z

Benchmarks

Benchmark execution time: 2025-05-19 15:47:00

Comparing candidate commit dcc238e in PR branch vitor-de-araujo/SDTEST-1850/refactor-retry-logic-2 with baseline commit 1219e33 in branch main.

Found 1 performance improvements and 0 performance regressions! Performance is the same for 495 metrics, 8 unstable metrics.

scenario:iastdjangostartup-iast

🟩 execution_time [-389.081ms; -173.970ms] or [-16.350%; -7.311%]

…ocol (#13448)" This reverts commit 422d025.

…#13507) Remove `RetryTestReport` for good, as part of the effort to support `pytest-xdist`. Attempt-to-Fix was the only retry mechanism that still used it; this PR brings it in line with the recent EFD and ATR refactors (#13448 and #13288). As a bonus, this fixes the miscounting of active Attempt-to-Fix tests in the JUnit XML output. ## Checklist - [x] PR author has checked that all the criteria below are met - The PR description includes an overview of the change - The PR description articulates the motivation for the change - The change includes tests OR the PR description describes a testing strategy - The PR description notes risks associated with the change, if any - Newly-added code is easy to change - The change follows the [library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) - The change includes or references documentation updates if necessary - Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Reviewer has checked that all the criteria below are met - Title is accurate - All changes are related to the pull request's stated goal - Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes - Testing strategy adequately addresses listed risks - Newly-added code is easy to change - Release note makes sense to a user of the library - If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment - Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)

vitor-de-araujo added 30 commits May 9, 2025 14:14

ꙮ

4affa2a

unused imports

aea4991

Merge branch 'main' of github.com:DataDog/dd-trace-py into vitor-de-a…

5a9ca9b

…raujo/SDTEST-1850/refactor-retry-logic-2

do not yield

18bb443

moving retry logic to pytest_runtest_protocol

329e197

_process_result doesn't need call anymore

6bc457c

more code surgery

840d309

lints

9f6be8c

call handle_retries on call and teardown to simulate old behavior (fo…

62df4cf

…r science)

skip atr on teardown error

38511dc

style

c079843

yield

605af58

fight curses with curses

0446131

final countdown

855f5db

junit and things

39437be

a

596de26

a

f3b7969

all but the efd junit stuff passes

ba75c69

call me not if i be not summoned

4cdd370

xfail

ead0e8e

Merge branch 'main' of github.com:DataDog/dd-trace-py into vitor-de-a…

59cbcc8

…raujo/SDTEST-1850/refactor-retry-logic-2

commit it before you lose it

964c373

retry function is now responsible for its own logs; EFD uses normal o…

3c3cbfb

…utcomes

the lints

2f5977b

no need to mark it skipped, pytest_runtest_protocol already did it

308cab3

junit xml works now

0efbb6c

Merge branch 'main' of github.com:DataDog/dd-trace-py into vitor-de-a…

6d8ac64

…raujo/SDTEST-1850/refactor-retry-logic-2

oops

cbeca62

note for future self

02d925c

some beautiful consta

a7ea029

vitor-de-araujo added 6 commits May 14, 2025 12:48

moar constants

0d98f57

chore(ci_visibility): fix CI Visibility errors in unittest test suite

0d44650

them lints

6676ab9

Merge branch 'vitor-de-araujo/SDTEST-1850/fix-unittest-suite' into vi…

37c486d

…tor-de-araujo/SDTEST-1850/refactor-retry-logic-2

become more robust against internal errors

edaa613

Merge branch 'main' of github.com:DataDog/dd-trace-py into vitor-de-a…

413dc5d

…raujo/SDTEST-1850/refactor-retry-logic-2

update log message

1fce6f0

vitor-de-araujo commented May 19, 2025

ddtrace/contrib/internal/pytest/_plugin_v2.py

vitor-de-araujo added the changelog/no-changelog and CI App labels May 19, 2025

vitor-de-araujo marked this pull request as ready for review May 19, 2025 14:10

vitor-de-araujo requested a review from a team as a code owner May 19, 2025 14:10

vitor-de-araujo requested review from BSanchidrian and gnufede May 19, 2025 14:10

gnufede approved these changes May 19, 2025

a

dcc238e

vitor-de-araujo enabled auto-merge (squash) May 19, 2025 15:33

vitor-de-araujo merged commit 422d025 into main May 19, 2025
323 of 325 checks passed

vitor-de-araujo deleted the vitor-de-araujo/SDTEST-1850/refactor-retry-logic-2 branch May 19, 2025 17:22

emmettbutler added a commit that referenced this pull request May 20, 2025

Revert "chore(ci_visibility): move retry logic to pytest_runtest_prot…

feeca4d

…ocol (#13448)" This reverts commit 422d025.

vitor-de-araujo mentioned this pull request May 26, 2025

chore(ci_visibility): remove RetryTestReport, clean up Attempt-to-Fix #13507

Merged

2 tasks
