现在的位置: 首页 > 综合 > 正文

CUDA cuPrintf

2014年01月02日 ⁄ 综合 ⁄ 共 1382字 ⁄ 字号 评论关闭

CUDA cuPrintf

February 8th, 2010 by Jeremy

I finally got an Nvidia developer account a few days ago which gave me access to a very useful library to use with CUDA.

cuPrintf allows printf equivalent statements to be placed inside CUDA kernels without the need for -deviceemu.

The following example demonstrates a simple use for cuPrintf and displays the current thread ID. 

演示了打印当前线程的ID.

  1. #include <cuda.h>
  2. #include "cuPrintf.cu"
  3.  
  4. __global__ void cuPrintfExample()
  5. {
  6.  int tid;
  7.  tid = blockIdx.x* blockDim.x
    + threadIdx.x;
  8.  cuPrintf("%d\n", tid);
  9. }
  10.  
  11. int main()
  12. {
  13.  cudaPrintfInit();
  14.  cuPrintfExample <<<
    5
    , 2 >>>
    (
    );
  15.  cudaPrintfDisplay(stdout,true);
  16.  cudaPrintfEnd();
  17.  return 0;
  18. }

cudaPrintfInit and cudaPrintfEnd only need be called once throughout your entire project.

Output is not automatically displayed on the screen, but stored in a buffer which is cleared and displayed when cudaPrintfDisplay is called. The size of the buffer can be specified with the optional argument cudaPrintfInit(size_t bufferLen).

cudaPrintfEnd simply frees the memory allocated by cudaPrintfInit.

When cudaPrintfDisplay is called, output stored in the buffer is displayed to the console. The second argument in this call either displays the current thread (true) or doesn’t (false). The first arguemnt, specified by stdout in this example, simply defines
the descriptor where the cuPrintf log is sent.

On another note, I’ve found that using cuPrintf impacts on the performance of my kernels, presumably due to the data transfer performed every time cuPrintfDisplay() is called.

http://www.jeremykemp.co.uk/08/02/2010/cuda-cuprintf/

抱歉!评论已关闭.