Jump to content

Nvidia CUDA Compiler

From Wikipedia, the free encyclopedia

This is the current revision of this page, as edited by Jerryobject (talk | contribs) at 06:06, 5 August 2024 (Template:Infobox software parameters: update-standardize-conform, underscores > spaces, fills. WP:EXTernal links: update, add. WP:LINKs: needless WP:PIPEs > WP:NOPIPEs, update-standardizes, adds. MOS:FIRSTABBReviations clarify, define before WP:ABBRs in parentheses. Small WP:COPYEDITs WP:EoS WP:TERSE. Nonacronym proper noun MOS:ALLCAPS > Title case.). The present address (URL) is a permanent link to this version.

(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)
Developer(s)Nvidia
Initial releaseJune 2007; 17 years ago (2007-06)
Stable release
12.6.0 / August 2024; 3 months ago (2024-08)
Available inEnglish
TypeCompiler
LicenseProprietary
Websitedocs.nvidia.com/cuda/cuda-compiler-driver-nvcc

Nvidia CUDA Compiler (NVCC) is a compiler by Nvidia intended for use with CUDA. It is proprietary software.

Compiler

[edit]

CUDA code runs on both the central processing unit (CPU) and graphics processing unit (GPU). NVCC separates these two parts and sends host code (the part of code which will be run on the CPU) to a C compiler like GNU Compiler Collection (GCC) or Intel C++ Compiler (ICC) or Microsoft Visual C++ Compiler, and sends the device code (the part which will run on the GPU) to the GPU. The device code is further compiled by NVCC. NVCC is based on LLVM.[1] According to Nvidia provided documentation, nvcc in version 7.0 supports many language constructs that are defined by the C++11 standard, and a few from C99. In version 9.0, several more constructs from the C++14 standard are added.[2]

Any source file containing CUDA language extensions (.cu) must be compiled with nvcc. NVCC is a compiler driver which works by invoking all the necessary tools and compilers like cudacc, g++, cl, etc. NVCC can output either C code (CPU Code) that must then be compiled with the rest of the application using another tool or Parallel Thread Execution (PTX) or object code directly. An executable with CUDA code requires: the CUDA core library (cuda) and the CUDA runtime library (cudart).

Other widely used libraries:

  • CUBLAS: BLAS implementation
  • CUFFT: FFT implementation
  • CUDPP (Data Parallel Primitives): Reduction, Scan, Sort.
  • Thrust: Reduction, Scan, Sort.

See also

[edit]

References

[edit]
  1. ^ "CUDA LLVM Compiler". Nvidia Developer. Retrieved Apr 6, 2016.
  2. ^ "CUDA C++ Programming Guide". NVIDIA Documentation Hub. Retrieved 2019-06-28.

General

[edit]
  1. David B. Kirk, and Wen-mei W. Hwu. Programming massively parallel processors: a hands-on approach. Morgan Kaufmann, 2010.
  2. "Nvidia CUDA Compiler Driver NVCC". Nvidia Documentation Hub. Archived from the original on Oct 13, 2023.
  3. "CUDPP". GPGPU. Archived from the original on Nov 17, 2018.
[edit]