Skip to content
Snippets Groups Projects
Commit b770deb6 authored by Prakalp Srivastava's avatar Prakalp Srivastava
Browse files

(1) Modifying visc_gemm_ptx.ll to also compile the kernel.ll file to produce ptx

    binary
(2) Several hacks in visc-rt to estimate the work required to get
    MatrixMultiplication to work. 3 issues need to be resolved
    i)    The metadata for kernel followed by number of kernels has to be one
          single metadata
    ii)   Get Kernel name and work group sizes in NVPTX pass
    iii)  Major: Cannot use device memory pointer as cl_mem. Fundamentally
          this is a design compatibility issue with VISC and OpenCL 1.1.
          OpenCL 2.0 might resolve it
parent 4434c2e8
No related branches found
No related tags found
No related merge requests found
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment