Avoid frequent subprocess starts when reporting total process memory #221
Conversation
If the process takes too long to update within the interval, the interval timer will fill its message box. System RSS memory reporting can be overloaded by constant requests if the message box contains many 'update' messages.
Getting system RSS can take up to hundreds of milliseconds and may not be parallelizable on some platforms.
Changed commits from 9979254 to 579ceb6
The monitor updates its state every second. A cache expiration period of 1 second will keep the value always cached.
src/vm_memory_monitor.erl
Outdated
@@ -225,7 +239,8 @@ start_link(MemFraction, AlarmSet, AlarmClear) ->
     [MemFraction, {AlarmSet, AlarmClear}], []).

 init([MemFraction, AlarmFuns]) ->
     TRef = start_timer(?DEFAULT_MEMORY_CHECK_INTERVAL),
     ets:new(?MODULE, [named_table, public]),
Would it be preferable to use the process dictionary instead of a public ets table?
There can (and usually will) be multiple processes that call the function.
Since we're using a 1-second cache expiration time, it makes sense to keep the value in the process state. The process is updating its state every second anyway. The question is: do we want it to be 1 second, or do we need better resolution?
@michaelklishin that makes sense. I suppose we don't want to serialize operations via call within this gen_server. I brought it up because I can only find 2 other instances of public ets tables in the stable source code.
@hairyhum 1 second sounds perfectly fine. We could use the process dictionary here, which might avoid a contention point with a large number of active queues. @hairyhum @dumbbell @dcorbacho WDYT?
@michaelklishin the process dictionary would be fine in this case; it's only a tiny amount of info.
@hairyhum any objections to refactoring this to use process dictionaries?
Why do we need the process dictionary, but not the gen_server state? All usages of this function call the server to get the memory limit (the get_memory_limit() function) anyway.
Why do we need process dictionary, but not the gen_server state?
Using the state would work well too. I didn't think of that.
Moved process memory to the gen_server state. Ready for another review.
👍
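The approach agreed on above, keeping the cached value in the gen_server state and refreshing it on the existing periodic update tick, might be sketched roughly as follows. Record and function names here are illustrative assumptions, not the actual patch:

```erlang
%% Hypothetical sketch: cache process memory in the gen_server state,
%% refreshed by the existing periodic 'update' tick.
-record(state, {proc_mem = 0 :: non_neg_integer(),
                timer     :: reference() | undefined}).

handle_info(update, State) ->
    TRef = erlang:send_after(1000, self(), update),   %% re-arm the 1 s timer
    %% erlang:memory(total) stands in for the real (expensive) RSS probe
    {noreply, State#state{proc_mem = erlang:memory(total), timer = TRef}};
handle_info(_Other, State) ->
    {noreply, State}.

%% Readers get the cached value via a call; no subprocess is started here.
handle_call(get_process_memory, _From, State = #state{proc_mem = Mem}) ->
    {reply, Mem, State}.
```

Because readers only ever see the value last sampled on the tick, no request can trigger the expensive probe directly.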
So every PR update, even if I just edit the title, results in a CLA bot comment now 😃.
@hairyhum Please sign the Contributor License Agreement! Click here to manually synchronize the status of this Pull Request. See the FAQ for frequently asked questions.
@michaelklishin I will ask about that.
@hairyhum Thank you for signing the Contributor License Agreement!
System RSS reporting functions can take some time to execute, especially when called concurrently. To optimise hot code paths, we cache the last value for 500 ms.
This won't affect the regular memory collection used for memory alarms and management stats (which happens every 1 second), only calls from file_handle_cache, which can occur concurrently. Addresses rabbitmq/rabbitmq-server#1343
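A 500 ms cache on a hot path like this could be sketched as below. This is a minimal illustration, assuming a caller-side cache in the process dictionary and a caller-supplied probe fun; the names are hypothetical and the real patch may store the value differently:

```erlang
%% Hypothetical sketch of a 500 ms cache around an expensive RSS probe,
%% using the process dictionary of the calling process.
-define(CACHE_TTL_MS, 500).

get_cached_rss(Probe) ->
    Now = erlang:monotonic_time(millisecond),
    case erlang:get(rss_cache) of
        {TS, Val} when Now - TS < ?CACHE_TTL_MS ->
            Val;                                 %% fresh enough, skip the probe
        _ ->
            Fresh = Probe(),                     %% e.g. starts the subprocess
            erlang:put(rss_cache, {Now, Fresh}),
            Fresh
    end.
```

With a 500 ms TTL, a burst of concurrent hot-path calls within the same half second results in at most one subprocess start per calling process, while the 1-second alarm/stats collection still observes a value at most 500 ms stale.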