Open
Description
With the current DPC++, in order to use local memory in a device function called inside a kernel, we have to pass it as argument to the device function. Could DPC++ support static local memory (e.g., something like __local__ a[256]
) anywhere in device code? Could it also support dynamic local memory (e.g., something like extern __local__ a[]
) with the amount of local memory specified at runtime during kernel launch?
Activity