Support multiprocessing metrics #179

Merged: 3 commits merged into python-microservices:master on Oct 11, 2020

Conversation

@Agalin (Contributor) commented on Oct 9, 2020

The Prometheus client does not work well in multiprocessing environments (i.e. basically all WSGI servers, including Gunicorn). It's easy to configure it in multiprocess mode, though (roughly as sketched below).
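For context, a minimal sketch of what multiprocess mode involves on the prometheus_client side, following the client's documented pattern; the endpoint function name is illustrative, not taken from this PR:

```python
# The prometheus_multiproc_dir environment variable must point at a writable
# directory shared by all workers and must be set before prometheus_client is
# imported (e.g. exported in the Gunicorn environment).
from prometheus_client import CollectorRegistry, generate_latest, multiprocess

def metrics_view():
    # Aggregate the samples written by every worker process into one exposition.
    registry = CollectorRegistry()
    multiprocess.MultiProcessCollector(registry)
    return generate_latest(registry)
```

Gunicorn deployments typically also add a `child_exit` hook that calls `multiprocess.mark_process_dead(worker.pid)` so a dead worker's gauge files are cleaned up.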

While easy to enable, it's not so easy to test. I've tried to reuse the existing metrics tests, but because the metrics are initialized at module import time I'd need to either:

  • patch self._value in each global metric (and even more than that for histograms), or
  • change those metrics to be instance-specific, initialized during Microservice instance creation (roughly sketched at the end of this comment).

Which one would you prefer? I'd definitely prefer the second option.

The existing tests are also buggy: there are inter-test dependencies, so order matters. It's probably due to those global counters. The metrics tests check for output that is generated during the execution of other tests, notably a 200 response for the URI /, which is not configured in the metrics tests and uses a completely different service name...
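For illustration only, the second option could look roughly like this; the class and metric names are hypothetical and not taken from this PR or the pyms codebase:

```python
from prometheus_client import CollectorRegistry, Counter, Histogram

class ServiceMetrics:
    """Hypothetical sketch: metrics owned by an instance instead of module globals."""

    def __init__(self, service_name, registry=None):
        # A per-instance registry avoids "Duplicated timeseries" errors when the
        # service (and therefore its metrics) is created more than once in tests.
        self.registry = registry or CollectorRegistry()
        self.service_name = service_name
        self.request_count = Counter(
            "http_server_requests_count",
            "Total HTTP requests",
            ["service", "method", "uri", "status"],
            registry=self.registry,
        )
        self.request_latency = Histogram(
            "http_server_requests_seconds",
            "HTTP request latency in seconds",
            ["service", "method", "uri"],
            registry=self.registry,
        )
```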

@coveralls commented on Oct 9, 2020

Pull Request Test Coverage Report for Build 947

  • 85 of 87 (97.7%) changed or added relevant lines in 2 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage decreased (-0.07%) to 99.082%

Changes Missing Coverage:
  File | Covered Lines | Changed/Added Lines | %
  tests/test_metrics.py | 76 | 78 | 97.44%

Totals (Coverage Status):
  Change from base Build 943: -0.07%
  Covered Lines: 1726
  Relevant Lines: 1742

💛 - Coveralls

@avara1986 requested a review from alexppg on October 9, 2020 19:31
@Agalin changed the title from "WIP: Support multiprocessing metrics" to "Support multiprocessing metrics" on Oct 10, 2020
@Agalin (Contributor, Author) commented on Oct 10, 2020

Uff. I found a way to test it in a (mostly) clean way, and it also fixes the metrics leaking into other tests. Making the metrics local wouldn't work due to name collisions, and the Jaeger config is global anyway, so it has to be reset through protected members.

BTW, it would be nice to do something about the repeated Jaeger configuration in the tests; it unnecessarily spams the log output with warnings.
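As an aside, one generic way to keep metrics from leaking between tests is to snapshot and restore the default registry around each test. This is just an illustrative sketch (it relies on protected members, as mentioned above) and is not necessarily what this PR does:

```python
import pytest
from prometheus_client import REGISTRY

@pytest.fixture(autouse=True)
def reset_prometheus_registry():
    # Remember which collectors existed before the test, then unregister
    # anything the test added so later tests see a clean default registry.
    collectors_before = set(REGISTRY._collector_to_names)  # protected member
    yield
    for collector in set(REGISTRY._collector_to_names) - collectors_before:
        REGISTRY.unregister(collector)
```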

@alexppg (Member) commented on Oct 10, 2020

Nice, thanks! It was difficult to write those tests even that way, so thanks for fixing them.

My only question: when using the custom registry, do we lose the metrics that are included by default?

# HELP python_gc_objects_collected_total Objects collected during gc
# TYPE python_gc_objects_collected_total counter
python_gc_objects_collected_total{generation="0"} 2553.0
python_gc_objects_collected_total{generation="1"} 356.0
python_gc_objects_collected_total{generation="2"} 0.0
# HELP python_gc_objects_uncollectable_total Uncollectable object found during GC
# TYPE python_gc_objects_uncollectable_total counter
python_gc_objects_uncollectable_total{generation="0"} 0.0
python_gc_objects_uncollectable_total{generation="1"} 0.0
python_gc_objects_uncollectable_total{generation="2"} 0.0
...

@Agalin

PS: BTW, I'm working on migrating to OpenTelemetry right now. When done, it will supersede this implementation, but since I don't know how long it will take, this is still good to have.

@Agalin (Contributor, Author) commented on Oct 11, 2020

Those are not supported, but not because of the registry; it's an explicit limitation of the multiprocess mode:

    Registries can not be used as normal, all instantiated metrics are exported
    Custom collectors do not work (e.g. cpu and memory metrics)
    Info and Enum metrics do not work
    The pushgateway cannot be used
    Gauges cannot use the pid label

I'd assume the Python GC metrics are included in the "cpu and memory metrics" mentioned there.
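To make the connection concrete: the python_gc_* and process_* series come from default collectors registered on prometheus_client's global REGISTRY, and a fresh CollectorRegistry like the one built for multiprocess scraping does not carry them. A small illustrative check, assuming the top-level GC_COLLECTOR/PROCESS_COLLECTOR exports of recent prometheus_client versions:

```python
from prometheus_client import (
    CollectorRegistry,
    GC_COLLECTOR,       # emits python_gc_* metrics
    PROCESS_COLLECTOR,  # emits process_* metrics (cpu, memory, fds)
    generate_latest,
)

# A fresh registry, like the one used for multiprocess scraping, starts empty.
registry = CollectorRegistry()
assert b"python_gc_objects_collected_total" not in generate_latest(registry)

# The default collectors could be re-registered, but they only ever describe
# the process doing the scraping, which is why the multiprocess docs list
# custom collectors as unsupported.
# registry.register(GC_COLLECTOR)
```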

@alexppg (Member) commented on Oct 11, 2020

I see. I've never used them, and having multi-process metrics seems more important, so LGTM.

@avara1986 merged commit 03a033a into python-microservices:master on Oct 11, 2020
@avara1986 added the labels hacktoberfest-accepted and Improvement (Not a bug but... could be better) on Oct 12, 2020