Support for cuStreamQuery() method

description

Hello,

I am trying to use streams to run several kernels concurrently. With the current version of CUDAfy.NET, work that the host code queues on GPU streams does not appear to be submitted to the GPU until SynchronizeStream() is called. Nsight performance analysis confirmed that this leaves the GPU underused. After some research online I learned about the cuStreamQuery method, which supposedly flushes the queued requests: http://www.hybriddsp.com/portals/0/cudafy/html/f7f2297c-caec-c5e4-373c-ee0da1799b62.htm

This method is currently not exposed by the CudaGPU class. Could it be added in the next version?
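
For illustration, the pattern being asked for looks roughly like this in plain CUDA C++ (runtime API, not CUDAfy.NET). It is only a sketch under assumptions: the kernel `scale`, the stream count and the buffer sizes are made up for the example, and the idea that a non-blocking `cudaStreamQuery()` call pushes queued work to the GPU is a commonly reported side effect (mostly on Windows WDDM drivers), not a documented guarantee.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel used only for this example.
__global__ void scale(float* data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

int main() {
    const int kStreams = 4;   // assumed number of concurrent streams
    const int n = 1 << 20;    // assumed elements per buffer
    cudaStream_t streams[kStreams];
    float* buffers[kStreams];

    for (int s = 0; s < kStreams; ++s) {
        cudaStreamCreate(&streams[s]);
        cudaMalloc(&buffers[s], n * sizeof(float));

        // Queue asynchronous work on this stream.
        cudaMemsetAsync(buffers[s], 0, n * sizeof(float), streams[s]);
        scale<<<(n + 255) / 256, 256, 0, streams[s]>>>(buffers[s], n, 2.0f);

        // Non-blocking status check. It returns cudaSuccess or
        // cudaErrorNotReady without waiting. Assumption: on some drivers
        // this call also nudges the queued work onto the GPU early.
        cudaStreamQuery(streams[s]);
    }

    // Wait for all streams to finish and report any error.
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        std::printf("CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    for (int s = 0; s < kStreams; ++s) {
        cudaFree(buffers[s]);
        cudaStreamDestroy(streams[s]);
    }
    std::printf("all streams finished\n");
    return 0;
}
```

In CUDAfy.NET terms, the request is for an equivalent of the `cudaStreamQuery()` call above that can be invoked per stream right after launching, without blocking the way SynchronizeStream() does.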

Thanks!

comments