Refactor BatchLogRecordProcessor and associated tests #4535

Merged: 7 commits into open-telemetry:main on Apr 24, 2025

Conversation

@DylanRussell (Contributor) commented on Apr 9, 2025:

Description

Refactor BatchLogRecordProcessor, keeping the existing behavior mostly the same. This PR cleans up the code, including the tests, and also adds some new tests.

One exception is forceFlush, which now calls export synchronously from the main thread and waits for it to finish.

Previously, forceFlush would wait up to timeout_millis for the worker thread to make and finish an export call, and if an export call was already in progress it would wait for the subsequent export call to finish. It would return true if this export call completed in time and false otherwise. It didn't cancel the request after the timeout; it just stopped waiting for it to finish.

I think ideally forceFlush.timeout_millis (and also shutdown.timeout_millis) should be used as the time after which the export call(s) get cancelled. But for that to work we need to be able to pass a timeout to export, as proposed in #4183. Until then I think we should ignore it and document that it doesn't work.

I'm not sure what forceFlush should return; currently I have it return nothing (same as JavaScript). It could always return True, to signify that export was called until the queue was empty. Or it could return True if all export calls succeeded and False otherwise, stopping the flush after the first failed export, like Go does.

I think my proposed behavior is more in line with the spec too.
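To make the proposed behavior concrete, here is a minimal sketch (not the SDK's actual code; the class and attribute names are assumptions for illustration) of force_flush exporting synchronously from the calling thread until the queue is empty:

```python
import threading


# Minimal sketch, assuming a processor that keeps pending records in
# self._queue guarded by self._queue_lock and exports via self._exporter.
# All names are illustrative, not the SDK's actual internals.
class SketchBatchLogRecordProcessor:
    def __init__(self, exporter, max_export_batch_size: int = 512):
        self._exporter = exporter
        self._max_export_batch_size = max_export_batch_size
        self._queue = []
        self._queue_lock = threading.Lock()

    def force_flush(self, timeout_millis: int = 30000) -> None:
        # timeout_millis is accepted but ignored for now (see above); honoring
        # it would require export() to take a timeout, as proposed in #4183.
        while True:
            with self._queue_lock:
                if not self._queue:
                    return
                batch = self._queue[: self._max_export_batch_size]
                del self._queue[: self._max_export_batch_size]
            self._exporter.export(batch)
```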

Note that the default for forceFlush.timeout_millis came from the OTEL_BLRP_EXPORT_TIMEOUT environment variable, which is supposed to configure "the maximum allowed time to export data from the BatchLogRecordProcessor". I propose we leave this env var unused for now and document that it doesn't do anything. This flag seems redundant with the OTLP exporter timeout env vars anyway. Maybe in other languages the BatchLogRecordProcessor isn't the default one used for auto instrumentation, so it makes more sense for it to be configurable?

Type of change

Please delete options that are not relevant.

  • [x] Bug fix (non-breaking change which fixes an issue)

How Has This Been Tested?

Added lots of unit tests.

Does This PR Require a Contrib Repo Change?

  • [ ] Yes. Link to PR:
  • [x] No.

Checklist:

  • [x] Followed the style guidelines of this project
  • [ ] Changelogs have been updated
  • [x] Unit tests have been added
  • [ ] Documentation has been updated

@DylanRussell requested a review from a team as a code owner on April 9, 2025 at 20:06
@aabmass (Member) commented on Apr 16, 2025:

This flag seems redundant with the OTLP Exporter timeout env vars anyway. Maybe in other languages the BatchLogRecordProcessor isn't the default one used for auto instrumentation, so it makes more sense for it to be configurable?

The OTEL_BLRP_EXPORT_TIMEOUT should work with all exporters, not just OTLP. I think the intention of having a separate one for OTLP is to specifically target OTLP exporters in case there are multiple BLRP instances. It's definitely a bit clunky though.
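For reference, the two knobs apply at different layers; a rough sketch of where each environment variable would be read (the plumbing, default, and units here are assumptions for illustration, not the SDK's actual code):

```python
import os

# OTEL_BLRP_EXPORT_TIMEOUT configures the BatchLogRecordProcessor itself, so it
# applies to whichever exporter the processor wraps; OTEL_EXPORTER_OTLP_TIMEOUT
# applies only to OTLP exporters. Default and units below are assumptions.
processor_export_timeout_millis = int(
    os.environ.get("OTEL_BLRP_EXPORT_TIMEOUT", "30000")
)
otlp_exporter_timeout = os.environ.get("OTEL_EXPORTER_OTLP_TIMEOUT")  # exporter-level knob
```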

@aabmass (Member) left a comment:

This looks like a huge improvement to the complexity of the threading code 😃

I'd like to get some more eyes on this since concurrency bugs can be really subtle.

@aabmass (Member) commented on Apr 16, 2025:

I think the failing Windows run is pretty typical of what we see with sleep() in tests: https://github.com/open-telemetry/opentelemetry-python/actions/runs/14366019090/job/40279304137?pr=4535. It might pass on a future run, but please try to improve the flakiness if you can.

@pmcollins (Member) left a comment:

LGTM, thanks for these important changes!

My only feedback here is maybe it would also be helpful to try to eventually factor some classes out. For example, self._queue and self._queue_lock are often used together and perhaps could be in their own class. Also, more generally, we're doing batching for spans and logs -- could we use one generic batcher that could handle both signals?
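A very rough sketch of that direction (purely illustrative, not code from this PR): a signal-agnostic batching queue that owns the list and its lock, which both processors could reuse:

```python
import threading
from typing import Generic, List, TypeVar

T = TypeVar("T")


# Illustrative only: a generic batcher that owns the queue and its lock, so
# BatchSpanProcessor and BatchLogRecordProcessor could share the batching
# logic and differ only in what they enqueue and how they export it.
class BatchQueue(Generic[T]):
    def __init__(self, max_queue_size: int = 2048) -> None:
        self._items: List[T] = []
        self._lock = threading.Lock()
        self._max_queue_size = max_queue_size
        self.dropped = 0

    def put(self, item: T) -> bool:
        # Enqueue an item; drop it (and count the drop) if the queue is full.
        with self._lock:
            if len(self._items) >= self._max_queue_size:
                self.dropped += 1
                return False
            self._items.append(item)
            return True

    def take_batch(self, max_batch_size: int) -> List[T]:
        # Remove and return up to max_batch_size items from the front.
        with self._lock:
            batch = self._items[:max_batch_size]
            del self._items[:max_batch_size]
            return batch
```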

@DylanRussell (Contributor, Author) commented:

Added a buffer to that test that flaked, thanks for pointing that out. Hopefully it passes this time.
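For what it's worth, one common way to reduce this kind of flakiness (a general pattern, not necessarily what the test here does) is to wait on an event with a generous timeout instead of sleeping for a fixed interval:

```python
import threading

# General pattern, not the PR's actual test code: instead of time.sleep() and
# hoping the worker thread has exported by then, have a fake exporter signal
# an event and wait on it with a generous upper bound, so slow CI machines
# (e.g. the Windows runners) don't cause spurious failures.
export_called = threading.Event()


class FakeExporter:
    def export(self, batch):
        export_called.set()


# ... hand FakeExporter() to the processor and emit enough records to trigger
# an export ...

assert export_called.wait(timeout=5.0), "export was not called within 5 seconds"
```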

@DylanRussell (Contributor, Author) commented:

My only feedback here is maybe it would also be helpful to try to eventually factor some classes out. For example, self._queue and self._queue_lock are often used together and perhaps could be in their own class. Also, more generally, we're doing batching for spans and logs -- could we use one generic batcher that could handle both signals?

Sounds good! I will look into this. I was planning to fix the BatchSpanProcessor code, which works the exact same way, so some generic batch class makes a lot of sense. I think I'll do that in a separate PR though, since this one is already getting big.

@DylanRussell (Contributor, Author) commented:

Can someone add the Skip Changelog tag? I don't think this needs a changelog, since it's basically just a refactor and not changing behavior.

@DylanRussell (Contributor, Author) commented:

Alright, I think this is good to merge; it just needs the Skip Changelog tag and then for someone to push it.

@lzchen (Contributor) left a comment:

Much cleaner than before, thanks!

@lzchen added the Skip Changelog label (PRs that do not require a CHANGELOG.md entry) on Apr 23, 2025
@lzchen merged commit 00329e0 into open-telemetry:main on Apr 24, 2025
477 of 481 checks passed
DylanRussell added a commit to DylanRussell/opentelemetry-python that referenced this pull request Apr 29, 2025
DylanRussell added a commit to DylanRussell/opentelemetry-python that referenced this pull request Apr 30, 2025