-
Notifications
You must be signed in to change notification settings - Fork 25.2k
Can not Reindex from remote even with 1 batch size - Remote responded with a chunk that was too large. Use a smaller batch size. #73261
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I don't think we would be able to reproduce this with so little information |
Pinging @elastic/es-distributed (Team:Distributed) |
If the document is very very very large reindex won't be able to store it in memory. It looks like the response from the remote host was 250mb. You probably should exclude that document. We probably should change the error message. In the past we'd resisted making the buffer size configurable. I wonder if it'd make sense to do so now. Or make it a portion of the heap or something. That buffer is part of the apache async http client - I don't think we could easily move it off heap. |
@phinzin in your reproduction example (and title of this issue), you mention:
But the batch size is set to 5000:
did you try it out with |
@henningandersen yes i tried already, the only thing strange here is that even i changed size to 1, |
@phinzin is a document of this size normal for your deployment? Would you perhaps be willing to find that document and figure out how it got in to ES to begin with? For one, our http handler has a default limit of 100MB, so plain indexing such a document should fail unless that setting has been tweaked. It could surely be updated enough times to grow that big but I would be interested in the background here before we decide on whether and/or how we will want to address this. |
some index contains incorrect data format and causing the issue |
Elasticsearch version (
bin/elasticsearch --version
): old: 5.6.3 new: 6.8.9Plugins installed: None
JVM version (
java -version
): Open JDK 1.8OS version (
uname -a
if on a Unix-like system): window 10Description of the problem including expected versus actual behavior:
{
"error": {
"root_cause": [
{
"type": "illegal_argument_exception",
"reason": "Remote responded with a chunk that was too large. Use a smaller batch size."
}
],
"type": "illegal_argument_exception",
"reason": "Remote responded with a chunk that was too large. Use a smaller batch size.",
"caused_by": {
"type": "content_too_long_exception",
"reason": "entity content is too long [255064703] for the configured buffer limit [104857600]"
}
},
"status": 400
}
Steps to reproduce:
Please include a minimal but complete recreation of the problem,
including (e.g.) index creation, mappings, settings, query etc. The easier
you make for us to reproduce it, the more likely that somebody will take the
time to look at it.
POST http://dest_host:9200/_reindex
{
"source": {
"remote": {
"host": "source_host:9200"
},
"index": "index name",
"size": 5000
},
"dest": {
"index": "index name"
}
}
Provide logs (if relevant):
The text was updated successfully, but these errors were encountered: