uses_fp proves problematic when the compiler decides to generate SSE code in a thread that declares uses_fp=0: that code silently corrupts the FP / SSE state. This only affects RTAI.
I propose getting rid of realtime kernel threads that declare they do not use FP, so that every thread is created with uses_fp=1. Saving and restoring the FP context shouldn't measurably increase latency or reduce the maximum achievable thread frequency on contemporary CPUs.
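
A rough sketch of what the proposal could look like at the call site, not an actual patch. create_fp_thread and stub_hal_create_thread are hypothetical names; the stub only mirrors the documented hal_create_thread(name, period_nsec, uses_fp) signature so the snippet compiles outside the LinuxCNC tree.

```c
#include <assert.h>
#include <stdio.h>

/* Records the uses_fp value that reached the (stubbed) HAL call. */
static int last_uses_fp = -1;

/* Stand-in for hal_create_thread() from hal.h, same argument order.
 * Assumption: used here only so the sketch builds on its own. */
static int stub_hal_create_thread(const char *name,
                                  unsigned long period_nsec,
                                  int uses_fp)
{
    last_uses_fp = uses_fp;
    printf("thread %s: period=%lu ns, uses_fp=%d\n",
           name, period_nsec, uses_fp);
    return 0;
}

/* Hypothetical wrapper: ignores a uses_fp=0 request and always creates
 * the thread with FP context save/restore enabled. */
static int create_fp_thread(const char *name,
                            unsigned long period_nsec,
                            int requested_uses_fp)
{
    if (!requested_uses_fp) {
        fprintf(stderr, "%s: uses_fp=0 requested, forcing uses_fp=1\n", name);
    }
    return stub_hal_create_thread(name, period_nsec, 1);
}
```

With something like this, a caller that still asks for a non-FP thread, e.g. create_fp_thread("base-thread", 25000UL, 0), would get a thread with FP save/restore anyway, so compiler-generated SSE code in that thread can no longer clobber another context's state.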
https://linuxcnc.org/docs/html/man/man9/motion.9.html
https://linuxcnc.org/docs/html/man/man3/hal_create_thread.3hal.html