Commit bc478ebe authored by Prakalp Srivastava

(1) Modified the X86 pass to skip code generation if any visited node has a
    genFunc set. This prevents a failure in the 2-level PTX kernel case,
    where the intermediate node does not have a gen function, but would pass
    because its parent has one.
(2) Modified the PTX pass to:
    i.   Correct the mapping of arguments between the intermediate node and
         the kernel launch node
    ii.  Allow both constants and arguments as node limit values
    iii. Generate get_group_id and get_global_id correctly
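
For context on item (2)(iii), here is a minimal sketch of how the two IDs relate in standard OpenCL, assuming a 1-D range with a zero global offset. It is not taken from the pass itself; the helper names (global_id, group_id_of, local_id_of) are hypothetical and only model the arithmetic on the host side.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical host-side model of the OpenCL work-item ID relation
// (1-D, zero global offset):
//   get_global_id(0) == get_group_id(0) * get_local_size(0) + get_local_id(0)
constexpr std::size_t global_id(std::size_t group_id,
                                std::size_t local_size,
                                std::size_t local_id) {
    return group_id * local_size + local_id;
}

// Inverse mappings: recover the work-group and the position inside it.
constexpr std::size_t group_id_of(std::size_t gid, std::size_t local_size) {
    return gid / local_size;
}

constexpr std::size_t local_id_of(std::size_t gid, std::size_t local_size) {
    return gid % local_size;
}
```

With this relation, a work-item with local ID 1 in work-group 2 of size 4 has global ID 9, and dividing a global ID by the work-group size recovers the group ID.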

M    Transforms/DFG2LLVM_X86/DFG2LLVM_X86.cpp
M    Transforms/DFG2LLVM_NVPTX/DFG2LLVM_NVPTX.cpp
parent 08b3da48