Skip to content
Snippets Groups Projects
Commit 13e445d6 authored by Prakalp Srivastava's avatar Prakalp Srivastava
Browse files

sgemm shared memory execution time reduced by running unroll and scalarrepl pass

on the kernel. This promotes the private memory allocation in each thread to
registers.
parent ad899278
No related branches found
No related tags found
No related merge requests found
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment