2018-03-05 21:45:51 UTC
assume you create a separate context for each thread.
If I want to use CUDA 4.0+'s one-context-per-process model instead,
how would I do that in PyCUDA?
I think you'd call cudaSetDevice instead of cuCtxCreate? Does the
equivalent exist in PyCUDA? If it does not, can I add it?
I need to share memory between threads.