Optimize FPU save and restore #1235

wkozaczuk · 2023-06-02T03:57:51Z

Currently, we save and restore the FPU state in the following places:

linux.cc - syscall
core/elf.cc - elf_resolve_pltgot
arch/x64/exceptions.cc - interrupt and general protection
arch/x64/mmu.cc - page fault
arch/x64/signal.cc - when calling signal handler

Typically it is achieved by calling xsave and xrstor which are pretty expensive operations. Its total cost can be measured indirectly by commenting out FPU lock code in linux.cc syscall function and running misc-syscall-perf test. The difference between them is on average 70 ns.

It would be nice to figure out how we can take advantage of xsaveopt and other instructions to speed it up. For details please see this excellent article.

Unfortunately, it is not clear how exactly we would need to change FPU saving/restoring code to take advantage of these instructions. I kind of understand the FPU state needs to be saved in the same memory locations for xsaveopt to work correctly. But how exactly would this work across multiple threads?

Also, I am not sure how much performance gain would we see in practical terms. I did hack the FPU save/restore code in syscall function in linux.cc to use some global FPU state variable and I could see 70ns reduced to ~50ns.

Some other relevant notes:

https://groups.google.com/g/osv-dev/c/a_XxZbb7vng/m/Sh3G57N8BKsJ - handle AVX instructions
https://groups.google.com/g/osv-dev/c/w_fuxsYla-M/m/dx3b6il-ywkJ - better FPU save/restore (has some good explanation about the cost of it by Avi)
- the fpu state used to be part of the thread state
- possibly corresponding commit - 202b2cc (fpu: early save/restore in interrupt/exception context)

The text was updated successfully, but these errors were encountered:

wkozaczuk added the optimization label Jun 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize FPU save and restore #1235

Optimize FPU save and restore #1235

wkozaczuk commented Jun 2, 2023 •

edited

Loading

Optimize FPU save and restore #1235

Optimize FPU save and restore #1235

Comments

wkozaczuk commented Jun 2, 2023 • edited Loading

wkozaczuk commented Jun 2, 2023 •

edited

Loading