Stuck Write Message when there is an issue "multiple Read calls return no data or error" #426
Comments
I think it is related to using SSL and sending a lot of messages at the same time. It works fine when I only push 5 messages with the method "w.WriteMessages".
What version are you on? I recently ran into a problem where any write error caused the driver to retry all messages, while also writing every error to a dedicated error channel. The main routine read that channel and returned as soon as it saw the first error. The goroutine retrying the write then hit its next error and blocked on sending to the channel, which would never be read because the reading routine had already returned. This caused the write to hang forever.
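To make the hang described above concrete, here is a minimal sketch of the pattern in plain Go. It is not kafka-go code: `retryWrites`, `writerFinishes`, and `errWrite` are hypothetical names, and the "write" is simulated. It only shows how a sender blocks forever on an unbuffered error channel once its reader has returned, and how giving the channel enough buffer for every attempt lets the goroutine finish.

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// errWrite is a stand-in for whatever error a real write would return.
var errWrite = errors.New("simulated write failure")

// retryWrites simulates a background goroutine that fails on every attempt
// and reports each failure on errs. It closes done when it has finished.
func retryWrites(attempts int, errs chan<- error, done chan<- struct{}) {
	defer close(done)
	for i := 1; i <= attempts; i++ {
		// With an unbuffered channel this send blocks until someone receives.
		errs <- fmt.Errorf("attempt %d: %w", i, errWrite)
	}
}

// writerFinishes reads only the first error (like a caller that returns on
// the first failure) and reports whether the goroutine managed to finish.
func writerFinishes(buffer, attempts int) bool {
	errs := make(chan error, buffer)
	done := make(chan struct{})
	go retryWrites(attempts, errs, done)

	<-errs // the caller sees the first error and stops reading

	select {
	case <-done:
		return true
	case <-time.After(200 * time.Millisecond):
		return false // the goroutine is still blocked on a send
	}
}

func main() {
	fmt.Println("unbuffered:", writerFinishes(0, 3)) // prints "unbuffered: false"
	fmt.Println("buffered:", writerFinishes(3, 3))   // prints "buffered: true"
}
```

In the unbuffered case the goroutine is deliberately leaked to demonstrate the problem. Sizing the buffer to the maximum number of errors is only one possible fix; a `select` on a cancellation channel in the sender would work as well.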
The same thing just happened to me, I believe. I had a network fault where the leader data for Kafka was stale. Then I saw this error, and my writer thread seemed to hang forever.
Presumably I've encountered the same error as you have. I haven't been able to dig down into it just yet, but it looks like the same problem. Edit: It was the same problem. Got fixed by updating.
Hello, thank you all for reporting. Would you be able to mention which version of kafka-go you're experiencing the issue with?
I also encountered this problem. The version of |
@pandaadb do you have more details to share about this issue? Is the channel you are referring to managed within kafka-go?
Hard to say if it is the same issue, but I get |
This is a recurring issue for me, though I am not able to reproduce it manually. It always seems to follow the same cadence. It starts out with:
Followed by:
There seems to have been a network fault: one of the ZooKeeper nodes lost connectivity to the cluster and reconnected seconds later. This triggered a re-election, and for some reason the client seems to be unaware of it and tries to write to a follower. Upon failing, it just gives up. When I try to recreate this, the client always seems to be aware of the change and simply recovers. I can't figure out what's different. Am I missing some obvious manual action I should be performing?
I see I wrote above (August 25th) that it got fixed by updating, but apparently I just wasn't able to recreate it.
Network connectivity issues can manifest in complex ways. I think the work in #846 will help better surface the conditions that triggered those errors.
I also hit this same issue with github.com/segmentio/kafka-go v0.4.38. There is no real load on the Kafka consumer or publisher.
I have a strange issue
Reproduce Steps:
Here is my code:
Sometimes I see this error on Kafka:
Do you have any ideas?