Skip to content
Snippets Groups Projects
  1. Nov 14, 2014
  2. Nov 11, 2014
  3. Nov 09, 2014
  4. Nov 06, 2014
    • Prakalp Srivastava's avatar
      (1) Modifying visc_gemm_ptx.ll to also compile the kernel.ll file to produce ptx · b770deb6
      Prakalp Srivastava authored
          binary
      (2) Several hacks in visc-rt to estimate the work required to get
          MatrixMultiplication to work. 3 issues need to be resolved
          i)    The metadata for kernel followed by number of kernels has to be one
                single metadata
          ii)   Get Kernel name and work group sizes in NVPTX pass
          iii)  Major: Cannot use device memory pointer as cl_mem. Fundamentally
                this is a design compatibility issue with VISC and OpenCL 1.1.
                OpenCL 2.0 might resolve it
      b770deb6
  5. Nov 05, 2014
  6. Nov 04, 2014
  7. Nov 03, 2014
  8. Oct 31, 2014
  9. Oct 30, 2014
  10. Oct 19, 2014
  11. Oct 16, 2014
  12. Oct 12, 2014
  13. Oct 07, 2014
    • Prakalp Srivastava's avatar
      (1) Modified the launch and wait intrinsics to a different format. No need to pass · 667d659d
      Prakalp Srivastava authored
      the graphID as reference. It is returned by the launch functions. Makes it
      easier to write code
      (2) BuildDFG support for data transfer to PTX nodes
      (3) ClearDFG bugs fixed
      (4) visc-rt modified
      (5) Test case 3level.ll modified to match the new launch and wait intrinsic
      format
      
      M    llvm/test/VISC/unitTests/3level.ll
      M    llvm/include/llvm/IR/DFGraph.h
      M    llvm/include/llvm/IR/IntrinsicsVISC.td
      M    llvm/projects/visc-rt/visc-rt.cpp
      M    llvm/lib/Transforms/BuildDFG/BuildDFG.cpp
      M    llvm/lib/Transforms/DFG2LLVM_X86/DFG2LLVM_X86.cpp
      M    llvm/lib/Transforms/ClearDFG/ClearDFG.cpp
      667d659d
  14. Sep 11, 2014
  15. Sep 04, 2014
  16. Aug 13, 2014
  17. Aug 12, 2014
  18. Aug 11, 2014
  19. Aug 10, 2014
    • Prakalp Srivastava's avatar
      (1) Added ClearDFG pass that deletes all Internal nodes. Later modify it to · a2cd9b1a
      Prakalp Srivastava authored
      delete only nodes for which code has been generated.
      (2) Modified DFG2LLVM_X86 to not delete Launch Intrinsic (now that is part of
      clearDFG) and fixed freed the memory allocated by CodeGenTraversal
      M    llvm/lib/Transforms/DFG2LLVM_X86/DFG2LLVM_X86.cpp
      A    llvm/lib/Transforms/ClearDFG
      A    llvm/lib/Transforms/ClearDFG/ClearDFG.cpp
      A    llvm/lib/Transforms/ClearDFG/ClearDFG.exports
      A    llvm/lib/Transforms/ClearDFG/CMakeLists.txt
      A    llvm/lib/Transforms/ClearDFG/Makefile
      a2cd9b1a
    • Prakalp Srivastava's avatar
      (1) Code to generate pthread create, join, exit calls · 157d3bdb
      Prakalp Srivastava authored
      (2) Have to generate code for a new Launch function which can be called by
      pthread_create and which extracts the root inputs from a memory location i8*.
      (3) Formatted the different files to be more consisitent with LLVM coding
      principles
      M    llvm/include/llvm/BuildDFG/BuildDFG.h
      M    llvm/include/llvm/IR/DFGraph.h
      M    llvm/lib/Transforms/BuildDFG/BuildDFG.cpp
      M    llvm/lib/Transforms/DFG2LLVM_X86/DFG2LLVM_X86.cpp
      157d3bdb
  20. Aug 06, 2014
Loading