-
- Downloads
(1) Modifying visc_gemm_ptx.ll to also compile the kernel.ll file to produce ptx
binary (2) Several hacks in visc-rt to estimate the work required to get MatrixMultiplication to work. 3 issues need to be resolved i) The metadata for kernel followed by number of kernels has to be one single metadata ii) Get Kernel name and work group sizes in NVPTX pass iii) Major: Cannot use device memory pointer as cl_mem. Fundamentally this is a design compatibility issue with VISC and OpenCL 1.1. OpenCL 2.0 might resolve it
Loading
Please register or sign in to comment