TiFlash panics with Too many open files
due to grpc connection socket leak in the cloud GCP env
#9663
Labels
affects-7.5
This bug affects the 7.5.x(LTS) versions.
affects-8.1
This bug affects the 8.1.x(LTS) versions.
component/mpp
impact/panic
may-affects-5.4
may-affects-6.1
may-affects-6.5
may-affects-7.1
may-affects-8.5
severity/major
type/bug
The issue is confirmed as a bug.
Bug Report
1. Minimal reproduce step (Required)
2. What did you expect to see? (Required)
3. What did you see instead (Required)
There are too many FD under the tiflash process. The increasing number of FD caused queries to fail and eventually tiflash crashed. Most of the FD were related to sockets and a large number of sockets were still open but could not be found in "/proc/net".
After disabling mpp, the number of sockets no longer continues to grow. There may be potential bugs in the implementation about mpp.
4. What is your TiFlash version? (Required)
v7.5.3, v8.1.1
Root Cause
[RST, ACK]
tcp packet), the TCP connection is disconnected, the execution thread is destroyed, but the related socket, eventpoll, and eventfd are not destroyed.tcpdump the exception packet transmitting of method
getServerInfoByGRPC
.tcpdump the normal packet transmitting of
getServerInfoByGRPC
.The correlation between the number of leaked sockets and exception TCP packets
Feasible solution
Other
The text was updated successfully, but these errors were encountered: