Skip to content

Fixes an issue that causes AMD GPUs to fail when using numerous LUT variables.

This happens because the temporary LUT allocation is done inside the BB of the executed threads. Moving LUT allocation to the beginning of the inner loop's BB (instead of only in the BB of the threads that effectively run) addresses the issue.

Closes #163 (closed)

Merge request reports