Opened 7 years ago

Last modified 5 years ago

#972 new defect

explore -cl-uniform-work-group-size compiler option

Reported by: pkienzle Owned by:
Priority: minor Milestone: SasView WishList
Component: SasView Keywords:
Cc: Work Package: SasModels Infrastructure

Description

Docs says:

This requires that the global work-size be a multiple of the work-group size specified to clEnqueueNDRangeKernel. Allow optimizations that are made possible by this restriction.

I believe we are already doing this since some drivers were failing when the length of the q array was not a multiple of 32. Instead we check whether we are beyond the end of the array when we enter the kernel. If other drivers can benefit from this with a speed improvement or smaller code size it should be easy to implement.

Change History (1)

comment:1 Changed 5 years ago by butler

  • Work Package changed from SasModels Redesign to SasModels Infrastructure
Note: See TracTickets for help on using tickets.