Description
As part of the Scheduler benchmarking work, we have seen a fair bit of time still spent in Tornado. While we can now use uvloop ( #4448 ) and have optimized TCP communication ( #4506 ), this still remains. In figuring out how we might fix this, have done some reading on asyncio's Transports and Protocols, which uvloop in turn uses to provide significant performance gains.
It seem doable (though may be a bit of work) to reimplement TCP communication using this strategy, which would also allow us to benefit from uvloop's performance improvements in this area. Hoping after we do this with TCP, we can apply something similar to UCX as well (though this may need a bit of generalization).
Am curious if this is something we would be open to. Any thoughts? 🙂