Reduce device algorithm template complexity

Reduces the size of output libraries and executables by shortening symbol names. In rough testing, this reduces file size by about 5-6%.
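
To illustrate why simpler templates tend to produce shorter symbols, here is a minimal sketch. It assumes the code base is C++, and every type name in it (`DeviceTagSerial`, `StorageBasic`, `ArrayPortal`, `Square`, `NestedKernel`, `SquareKernel`) is hypothetical rather than taken from this merge request. It uses the string returned by `typeid(...).name()` as a rough proxy for symbol length. That string is implementation-defined, and wrapping a nested type in a named struct is only one possible way to shorten names, not necessarily the one applied here.

```cpp
#include <cstddef>
#include <cstring>
#include <typeinfo>

// Hypothetical building blocks, for illustration only.
struct DeviceTagSerial {};
struct StorageBasic {};

template <typename T, typename Storage>
struct ArrayPortal {};

template <typename T>
struct Square {};

template <typename Functor, typename InPortal, typename OutPortal, typename Device>
struct NestedKernel {};

// Deeply nested: every template argument becomes part of the type's name,
// and therefore part of any symbol that mentions this type.
using VerboseKernel = NestedKernel<Square<double>,
                                   ArrayPortal<double, StorageBasic>,
                                   ArrayPortal<double, StorageBasic>,
                                   DeviceTagSerial>;

// Flattened: a named, non-template struct carries the same arguments
// through inheritance but exposes only its own short name.
struct SquareKernel : VerboseKernel {};

// Length of the implementation-defined type name (mangled on GCC and Clang).
std::size_t name_length(const std::type_info& info)
{
  return std::strlen(info.name());
}
```

Comparing `name_length(typeid(VerboseKernel))` with `name_length(typeid(SquareKernel))` on common compilers shows the nested type carrying a noticeably longer name. Multiplied across many instantiations, longer names enlarge the symbol tables of the resulting binaries.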
