-
Notifications
You must be signed in to change notification settings - Fork 599
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
performance regression (for debug & single binary) of backtrace capturing after bumping toolchain on macOS #6131
Comments
Can you collect some perf result on that? Thanks! |
I also found this regression on debug mode and Sysbench QPS dropped to almost zero. However everything works well on release mode. |
|
Probably not, seems to be a problem with frontend node. |
In multi-binary:
In single-binary:
Probably a bug with the Rust compiler, but I would still recommend not returning so many errors. |
Related function:
|
The suspicious thing is the dwarf region in the binary. On 2022-07-29:
On 2022-10-16:
|
Shall we file a bug report to rust-lang? (Though I believe this is more of an LLVM bug / LLVM upgrade) |
Maybe related: rust-lang/rust#98051 |
I would suggest keep this issue open while changing frontend to return Option instead of error. |
I think most of the errors generated in the frontend are for responding to the user, which cannot be avoided. |
Is it caused by capturing the stacktrace? |
No. The e2e test cases that timed out should not have any user-facing error. All tables exist.
Yes. |
There seems to be some new features in Rust / LLVM that generates a special kind of debug info DWARF, which contains stack unwinding region information. In debug mode it's just slow to rewind. |
It's not a big issue but still affected the developer's experience 🥲 What is our solution?
|
I don't think that's something to be fixed. It looks more like a feature that provides better backtrace.
I think a fix is going on to change them to cheaper CatalogEror. |
cc. @BowenXiao1999 |
I've opened an issue for this -> #6157. |
Backtrace capturing is not only slower, but can also lead to a segfault: #6205 |
The performance problem is much better now. It took me 200s to run |
Closing this issue in favor of #4077 and #6205. The slowness is acknowledged by the doc of The segfault is worse and tracked by #6205. |
TImeout is a separate issue. reopen. |
Describe the bug
I found risingwave becomes significantly slow on my mac. e.g.,
e2e_test/batch/basic/array.slt.part
takes 30+ seconds. Aftergit bisect
, I found the regression happens after #6025.To Reproduce
No response
Expected behavior
No response
Additional context
@skyzh can't reproduce. CI also works fine. Need someone else to confirm it.Confirmed it happens on debug & single binary mode.The text was updated successfully, but these errors were encountered: