cuda-0.11.0.1: FFI binding to the CUDA interface for programming NVIDIA GPUs
Copyright[2009..2023] Trevor L. McDonell
LicenseBSD
Safe HaskellSafe-Inferred
LanguageHaskell98

Foreign.CUDA.Runtime.Exec

Description

Kernel execution control for C-for-CUDA runtime interface

Synopsis

Kernel Execution

type Fun = FunPtr () Source #

A global device function.

Note that the use of a string naming a function was deprecated in CUDA 4.1 and removed in CUDA 5.0.

data FunAttributes Source #

Constructors

FunAttributes 

Fields

data FunParam where Source #

Kernel function parameters. Doubles will be converted to an internal float representation on devices that do not support doubles natively.

Constructors

IArg :: !Int -> FunParam 
FArg :: !Float -> FunParam 
DArg :: !Double -> FunParam 
VArg :: Storable a => !a -> FunParam 

attributes :: Fun -> IO FunAttributes Source #

Obtain the attributes of the named global device function. This itemises the requirements to successfully launch the given kernel.

setCacheConfig :: Fun -> CacheConfig -> IO () Source #

On devices where the L1 cache and shared memory use the same hardware resources, this sets the preferred cache configuration for the given device function. This is only a preference; the driver is free to choose a different configuration as required to execute the function.

Switching between configuration modes may insert a device-side synchronisation point for streamed kernel launches

launchKernel Source #

Arguments

:: Fun

Device function symbol

-> (Int, Int)

grid dimensions

-> (Int, Int, Int)

thread block shape

-> Int64

shared memory per block (bytes)

-> Maybe Stream

(optional) execution stream

-> [FunParam] 
-> IO () 

Invoke a kernel on a (gx * gy) grid of blocks, where each block contains (tx * ty * tz) threads and has access to a given number of bytes of shared memory. The launch may also be associated with a specific Stream.