POC to show performance improvements of not copying token #1561

alamb · 2024-11-26T16:59:14Z

This PR was meant to show that the approach described in this ticket would be effective:

Improve performance by not copying Tokens as much #1558

While I had this all loaded into my head, I wanted to bash out a PR / POC, but ran out of time.

This PR shows how we could avoid copying tokens.

Goals:

Sketch out peek_token_ref() and similar functions
Use those functions to rewrite the hot paths shown in the flamegraph (parse_prefix, etc. as shown below)
Demonstrate improved performance with the benchmark

It will take some restructuring of parse_prefix to avoid the use of the copying APIs, but I absolutely think it is possible.
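To make the idea concrete, here is a minimal sketch of a copying peek next to a borrowing one. The `Token` and `Parser` types are simplified stand-ins invented for illustration, and `next_is_word` is a hypothetical helper; none of this is the crate's actual code.

```rust
// Simplified stand-ins for illustration only; not the real sqlparser types.
#[derive(Debug, Clone, PartialEq)]
pub enum Token {
    Word(String),
    Comma,
    Whitespace,
}

pub struct Parser {
    tokens: Vec<Token>,
    index: usize,
}

impl Parser {
    pub fn new(tokens: Vec<Token>) -> Self {
        Parser { tokens, index: 0 }
    }

    /// Borrowing style: returns a reference into the token buffer,
    /// so no token (and no inner String) is copied.
    pub fn peek_token_ref(&self) -> Option<&Token> {
        self.tokens
            .iter()
            .skip(self.index)
            .find(|t| !matches!(t, Token::Whitespace))
    }

    /// Copying style: clones the peeked token, which allocates a new
    /// String every time the token is a `Token::Word`.
    pub fn peek_token(&self) -> Option<Token> {
        self.peek_token_ref().cloned()
    }

    /// Hypothetical hot-path check written against the borrowing API.
    pub fn next_is_word(&self, expected: &str) -> bool {
        matches!(
            self.peek_token_ref(),
            Some(Token::Word(w)) if w.eq_ignore_ascii_case(expected)
        )
    }
}
```

A caller that only needs to inspect the next token, such as `next_is_word`, never needs ownership, which is why the hot paths are good candidates for the `_ref` variants.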

davisp · 2024-12-10T23:41:51Z

main...davisp:datafusion-sqlparser-rs:pd/experiment/less-token-cloning

tl;dr - Roughly 28% speedup on the microbenchmarks compared to main. 18% of that came from tweaking a few of the main token methods in the last commit.

I ended up poking at this today and have managed to get about a 28% improvement on the benchmarks over main. Roughly 10% of that is checking for hot paths in the flame graphs and slowly converting those functions over to avoid cloning tokens.

On a whim, I also poked at optimizing the Token::make_word method that does a binary search over all keywords. That appears to have saved about 400ms (of 1.2s total originally attributed to it). I skimmed the docs for phf but I'm not certain what this project's appetite is for new dependencies. It currently seems quite lean so I didn't pursue that just yet.
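For readers unfamiliar with the lookup being discussed, here is a rough sketch of a keyword lookup via binary search. The keyword table is a tiny made-up subset, both function names are hypothetical, and the allocation-free variant is only one possible micro-optimization; the comment above does not say which change was actually made.

```rust
/// Hypothetical, heavily truncated keyword table.
/// It must stay sorted for the binary search to be valid.
const KEYWORDS: &[&str] = &["AS", "BY", "FROM", "GROUP", "ORDER", "SELECT", "WHERE"];

/// Straightforward lookup: uppercase the word (allocating a String),
/// then binary search the sorted table.
pub fn lookup_keyword(word: &str) -> Option<usize> {
    let upper = word.to_ascii_uppercase();
    KEYWORDS.binary_search(&upper.as_str()).ok()
}

/// Same lookup without the allocation: compare bytes lazily,
/// uppercasing the input one byte at a time.
pub fn lookup_keyword_no_alloc(word: &str) -> Option<usize> {
    KEYWORDS
        .binary_search_by(|kw| kw.bytes().cmp(word.bytes().map(|b| b.to_ascii_uppercase())))
        .ok()
}
```

Both functions return the index of the keyword in the table, or `None` when the word is an ordinary identifier.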

Lastly, the biggest single commit was the latest that updates a few of the token handling methods themselves to avoid clones. This was the biggest jump at 18% improvement for a total just around 28% (based on my very non-scientific measurements).

Also, one thing I did differently from you, @alamb: I noticed CI flagged your EOF_TOKEN, so I just made mine a static since it doesn't have any initialization.
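As an illustration of the static approach, here is a small sketch. The `Token` enum is a simplified stand-in and `token_at` is a hypothetical helper, so treat this as an assumption about the shape of the code rather than the actual change.

```rust
// Simplified stand-in for illustration only.
#[derive(Debug, Clone, PartialEq)]
pub enum Token {
    Word(String),
    EOF,
}

// A `static` is a single instance with a fixed address, so a
// `&'static Token` pointing at it can be returned from any function.
static EOF_TOKEN: Token = Token::EOF;

/// Hypothetical helper: returns the token at `index`, or a reference to
/// the shared EOF token when `index` is past the end of the buffer.
pub fn token_at(tokens: &[Token], index: usize) -> &Token {
    tokens.get(index).unwrap_or(&EOF_TOKEN)
}
```

Because the fallback is a reference to a static, running off the end of the buffer costs no clone or allocation.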

All in all, this approach certainly seems feasible as well as actually productive. It turns out the hardest part is accounting for error reporting with the expected("message", token) API. I ended up adding an expected_current which doesn't take a token parameter and uses the current index for reporting. That works for now, but going forward I think we're gonna want to figure out a much different error reporting API as it's quite fragile (you have to pay attention to which token is being reported and whether or not the token index has been modified). There are also a number of places where the token being reported in errors is incorrect.
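A rough sketch of the two error-reporting styles follows. The types, signatures, and message format are simplified assumptions for illustration, and `expect_comma` is a hypothetical caller; the real methods in the branch may look different.

```rust
// Simplified stand-ins for illustration only.
#[derive(Debug, Clone, PartialEq)]
pub enum Token {
    Word(String),
    Comma,
    EOF,
}

static EOF_TOKEN: Token = Token::EOF;

#[derive(Debug, PartialEq)]
pub struct ParserError(pub String);

pub struct Parser {
    tokens: Vec<Token>,
    index: usize,
}

impl Parser {
    pub fn new(tokens: Vec<Token>) -> Self {
        Parser { tokens, index: 0 }
    }

    /// Token-taking style: the caller has to pass in the offending token.
    pub fn expected<T>(&self, expected: &str, found: &Token) -> Result<T, ParserError> {
        Err(ParserError(format!("Expected: {}, found: {:?}", expected, found)))
    }

    /// Index-based style: reports whatever token sits at `self.index`.
    pub fn expected_current<T>(&self, expected: &str) -> Result<T, ParserError> {
        let found = self.tokens.get(self.index).unwrap_or(&EOF_TOKEN);
        self.expected(expected, found)
    }

    /// Hypothetical caller: consumes a comma or reports the current token.
    pub fn expect_comma(&mut self) -> Result<(), ParserError> {
        match self.tokens.get(self.index) {
            Some(Token::Comma) => {
                self.index += 1;
                Ok(())
            }
            // The index has not been advanced here, so the reported
            // token is the one that failed to match.
            _ => self.expected_current("a comma"),
        }
    }
}
```

The fragility mentioned above shows up in the last arm: if the index had already been advanced before the call, `expected_current` would blame the wrong token.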

alamb · 2024-12-11T15:57:10Z

> On a whim, I also poked at optimizing the Token::make_word method that does a binary search over all keywords. That appears to have saved about 400ms (of 1.2s total originally attributed to it). I skimmed the docs for phf but I'm not certain what this project's appetite is for new dependencies. It currently seems quite lean so I didn't pursue that just yet.

I agree we try to keep the dependency set down to a minimum.

I wonder if we could automatically generate a jumptable with some trickery (maybe we could make a custom match block keyed on individual letters).

Something like

match word[0] {
  'a' | 'A' => match word[1] {
    'b' | 'B' => // match the third letter here

We could then make a script, etc. to generate this jumptable from the list of keywords 🤔
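For illustration, a compilable version of that idea for three keywords might look like the sketch below. The `Keyword` enum and `lookup_keyword` function are hypothetical, hand-written stand-ins for what a generator script would emit. It matches on bytes with slice patterns instead of indexing, since a Rust `str` cannot be indexed by position and slice patterns cannot go out of bounds.

```rust
// Hypothetical keyword enum covering only three keywords.
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Keyword {
    As,
    Asc,
    By,
}

/// Case-insensitive keyword lookup written as nested matches on bytes.
/// A generator script would emit one arm per letter at each depth.
pub fn lookup_keyword(word: &str) -> Option<Keyword> {
    match word.as_bytes() {
        // First letter 'a': dispatch on the remaining bytes.
        [b'a' | b'A', rest @ ..] => match rest {
            [b's' | b'S'] => Some(Keyword::As),
            [b's' | b'S', b'c' | b'C'] => Some(Keyword::Asc),
            _ => None,
        },
        // First letter 'b': only "BY" exists in this tiny table.
        [b'b' | b'B', b'y' | b'Y'] => Some(Keyword::By),
        _ => None,
    }
}
```

Whether the compiler turns this into an actual jump table, and whether it beats the binary search, would need to be checked with the benchmark.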

alamb · 2024-12-11T15:58:01Z

This sounds really neat @davisp

Would you be willing to start making some smaller PRs with parts of your findings?

davisp · 2024-12-11T18:23:15Z

> Would you be willing to start making some smaller PRs with parts of your findings?

I've opened #1587 for the token cloning work and #1588 for investigating ways to speed up Token::make_word. Looks like I broke an error message though so time to fix that.

alamb · 2024-12-12T15:28:59Z

> > Would you be willing to start making some smaller PRs with parts of your findings?

> I've opened #1587 for the token cloning work and #1588 for investigating ways to speed up Token::make_word. Looks like I broke an error message though so time to fix that.

AMAZING!!! Thank you

alamb added 2 commits November 26, 2024 11:34

Sketch out no copy parse_token, peek

62001ba

Add more peek

5a7aca0

alamb mentioned this pull request Nov 26, 2024

Improve performance by not copying Tokens as much #1558

Open

davisp mentioned this pull request Dec 7, 2024

Reorganize the Parser module #1581

Draft

davisp mentioned this pull request Dec 11, 2024

Improve parsing performance by reducing token cloning #1587

Open
