Decode AVX512 UTF8 #34

Open · wants to merge 6 commits into base: main


Nick-Nuon (Collaborator) commented Sep 10, 2024

This PR is for the AVX512 UTF8 decode. The tests pass, but unfortunately the performance is so-so:

| Method | FileName | Mean | Error | StdDev | Speed (GB/s) |
|---|---|---|---|---|---|
| AVX2DecodingRealDataUTF8 | data/email/ | 308.9 us | 0.69 us | 0.46 us | 6.40 |
| AVX512DecodingRealDataUTF8 | data/email/ | 3,408.5 us | 10.40 us | 6.19 us | 0.58 |
| SSEDecodingRealDataUTF8 | data/email/ | 409.3 us | 0.24 us | 0.13 us | 4.83 |
| DotnetRuntimeSIMDBase64RealDataUTF8 | data/email/ | 648.1 us | 0.80 us | 0.42 us | 3.05 |

Of particular note, I couldn't find a reference to _mm512_maskz_compress_epi8 in the documentation, so I assume it's not exposed. To replace it, I used the same compress routine as in the SSE and AVX2 versions.

Still working on tuning it but I'll probably ping soon.

lemire (Member) commented Sep 10, 2024

Ok. Good. :-)

lemire (Member) commented Sep 10, 2024

I think I see why it is not faster. I should be able to fix that!!!

lemire (Member) commented Sep 10, 2024

Ah. No. We can't do it yet because they have not exposed the necessary functionality:

dotnet/runtime#88946
