Don't lock up 256KiB buffers when adding small files #4508

Stebalien · 2017-12-20T00:35:11Z

This is part of #4505.

~~It's WIP because we still need to use some form of buffer pool. As is, allocating and throwing away 256KiB buffers is killing GC.~~

License: MIT Signed-off-by: Steven Allen <[email protected]>

Stebalien · 2017-12-20T03:19:15Z

So, this uses the buffer pool from go-msgio. Should we break that into a separate package? It's used all over the place.

I'd use a chunker-specific buffer pool but we allow different sized chunkers...

kevina

~~This might cause a slight slow down due to the extra copying, especially when the buffer is almost pull, but in practice I don't think this will be a problem.~~

Looks like the P.R. was updated when I wrote this. Need to have another look.

whyrusleeping · 2017-12-20T03:25:22Z

It would be great if we could figure out a way to put all these buffers back into the buffer pool in the common case

License: MIT Signed-off-by: Steven Allen <[email protected]>

kevina · 2017-12-20T03:32:05Z

( @Stebalien for future reference, I find it annoying when people do a rebase that makes a major change. (In this case using the mpool from go-msgio). At least I would appreciate it if you just add new commits and save rebasing for fixups. )

Stebalien · 2017-12-20T03:44:44Z

It would be great if we could figure out a way to put all these buffers back into the buffer pool in the common case.

That would be nice but would require some form of refcounting on blocks (at least given the current interfaces).

( @Stebalien for future reference, I find it annoying when people do a rebase that makes a major change. (In this case using the mpool from go-msgio). At least I would appreciate it if you just add new commits and save rebasing for fixups. )

The new commit completely changed my approach so I figured I'd just do it over. I could have opened a new PR but I figured nobody had had looked at this one yet (that's when I start trying to preserve history). Sorry about that.

kevina · 2017-12-20T03:45:30Z

@Stebalien How is using a memory pool helping here? In the original code before the rebase you reused the 256Kib buffer unless the buffer was full. You are doing the same basic thing here. I don't think it is worth the extra dependency unless we provide (and use) a method to return the returned byte to the pool.

kevina · 2017-12-20T03:52:26Z

More to the point, it looks like you are just using the mpool as a better allocator, is the default go allocator so bad that it helps to the a custom run?

Or is it the case that NewSizeSplitter is called for each new file, in which case I withdraw my objection.

No Longer Applies.

Stebalien · 2017-12-20T06:43:34Z

Or is it the case that NewSizeSplitter is called for each new file, in which case I withdraw my objection.

It is. I'm using it for the case where we add a bunch of small files. Without this, memory usage is still unstable because the GC lags a bit.

The alternative is to re-use the splitter somehow. I toyed with a reset function but that would be a more invasive change.

kevina · 2017-12-20T07:04:55Z

It is. I'm using it for the case where we add a bunch of small files. Without this, memory usage is still unstable because the GC lags a bit.

Ok.

I was going to say what's one more dependency but:

So, this uses the buffer pool from go-msgio. Should we break that into a separate package? It's used all over the place.

Yes i think we should. I am not particularly happy with bringing in a whole package just to use one of its utility sub-packages.

Other than that it LGTM.

ghost assigned Stebalien Dec 20, 2017

ghost added the status/in-progress In progress label Dec 20, 2017

Stebalien added 3 commits December 19, 2017 19:08

Don't waste 256KiB buffers on small chunks.

aafbe65

License: MIT Signed-off-by: Steven Allen <[email protected]>

take adder by pointer, not by value...

101e1c3

License: MIT Signed-off-by: Steven Allen <[email protected]>

use DefaultSplitter function where appropriate

414b0ff

License: MIT Signed-off-by: Steven Allen <[email protected]>

Stebalien force-pushed the fix/add-small-files branch from 215d08d to 414b0ff Compare December 20, 2017 03:13

Stebalien changed the title ~~[WIP] Don't lock up 256KiB buffers when adding small files~~ Don't lock up 256KiB buffers when adding small files Dec 20, 2017

kevina previously approved these changes Dec 20, 2017

View reviewed changes

kevina self-requested a review December 20, 2017 03:22

add test for overallocation in chunker

c29f628

License: MIT Signed-off-by: Steven Allen <[email protected]>

whyrusleeping approved these changes Dec 31, 2017

View reviewed changes

whyrusleeping merged commit 8f17968 into master Dec 31, 2017

whyrusleeping deleted the fix/add-small-files branch December 31, 2017 22:29

ghost removed the status/in-progress In progress label Dec 31, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Don't lock up 256KiB buffers when adding small files #4508

Don't lock up 256KiB buffers when adding small files #4508

Uh oh!

Stebalien commented Dec 20, 2017 •

edited

Loading

Uh oh!

Stebalien commented Dec 20, 2017

Uh oh!

kevina left a comment •

edited

Loading

Uh oh!

whyrusleeping commented Dec 20, 2017

Uh oh!

kevina commented Dec 20, 2017

Uh oh!

Stebalien commented Dec 20, 2017

Uh oh!

kevina commented Dec 20, 2017

Uh oh!

kevina commented Dec 20, 2017

Uh oh!

Stebalien commented Dec 20, 2017

Uh oh!

kevina commented Dec 20, 2017

Uh oh!

Uh oh!

Uh oh!

Don't lock up 256KiB buffers when adding small files #4508

Don't lock up 256KiB buffers when adding small files #4508

Uh oh!

Conversation

Stebalien commented Dec 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Stebalien commented Dec 20, 2017

Uh oh!

kevina left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

whyrusleeping commented Dec 20, 2017

Uh oh!

kevina commented Dec 20, 2017

Uh oh!

Stebalien commented Dec 20, 2017

Uh oh!

kevina commented Dec 20, 2017

Uh oh!

kevina commented Dec 20, 2017

Uh oh!

Stebalien commented Dec 20, 2017

Uh oh!

kevina commented Dec 20, 2017

Uh oh!

Uh oh!

Stebalien commented Dec 20, 2017 •

edited

Loading

kevina left a comment •

edited

Loading