QTrk
List of CUDA references
General introductions:
David A. Patterson and John L. Hennessy. Computer Organization and Design, Appendix A: Graphics and Computing GPUs. Morgan Kaufmann, 5th edition, 2013.
https://devblogs.nvidia.com/parallelforall/
https://devblogs.nvidia.com/parallelforall/easy-introduction-cuda-c-and-c/
http://www.graphics.stanford.edu/~hanrahan/talks/why/walk001.html
Specific optimizations:
http://on-demand.gputechconf.com/gtc/2014/presentations/S4158-cuda-streams-best-practices-common-pitfalls.pdf
http://cuda-programming.blogspot.nl/2013/02/texture-memory-in-cuda-what-is-texture.html
https://devblogs.nvidia.com/parallelforall/how-access-global-memory-efficiently-cuda-c-kernels/
https://devblogs.nvidia.com/parallelforall/using-shared-memory-cuda-cc/
https://devblogs.nvidia.com/parallelforall/efficient-matrix-transpose-cuda-cc/
https://devblogs.nvidia.com/parallelforall/how-optimize-data-transfers-cuda-cc/
Reference:
http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html
http://docs.nvidia.com/cuda/cufft/
CUDA version related:
https://devtalk.nvidia.com/default/topic/858507/cuda-setup-and-installation/fft-libraries-for-win32-cuda-7-0-missing-/
https://devblogs.nvidia.com/parallelforall/gpu-pro-tip-cuda-7-streams-simplify-concurrency/
http://docs.roguewave.com/totalview/8.14.1/html/index.html#page/User_Guides/totalviewug-about-cuda.31.4.html