Avoid using invoke to call a worklet on an ArrayHandleRecombineVec. Doing so can cause very long CUDA compile times (CUDA 12.x). More information can be found in this issue.