What about benchmarks and tracking performance improvements, regressions, etc.? I'm sure the CUDA to OpenCL switch was a performance hit in at least some cases (compared to the equivalent CUDA code). Has anybody worked on something comprehensive?
It would be good to convince the Phoronix guy to add it to his standard set of OpenCL benchmarks. He has the hardware and a good benchmarking/reporting tool, but somebody will need to pester him to do the benchmarks right.