cuew is wrapping most/all cuda and nvrtc api.
Woud it also make sens to wrap cudnn, at least its main api :
- cudnnCreate, cudnnDestroy
- cudnnSetTensorNdDescriptor, cudnnSetTensor4dDescriptor
- cudnnCreateTensorDescriptor, cudnnCreatePoolingDescriptor, cudnnCreateReduceTensorDescriptor
- cudnnSetReduceTensorDescriptor
- cudnnGetReductionWorkspaceSize
- cudnnSetPoolingNdDescriptor, cudnnSetPooling2dDescriptor
- cudnnPoolingForward, cudnnPoolingBackward
- cudnnReduceTensor
?
Kind
WT
cuew is wrapping most/all cuda and nvrtc api.
Woud it also make sens to wrap cudnn, at least its main api :
?
Kind
WT