Skip to content
Snippets Groups Projects

Repository graph

You can move around the graph by using the arrow keys.
Select Git revision
  • hpvm-release-epochs
  • hpvm-release-epochs0
  • main default
  • v2.0
  • v2.0rc
  • v1.0
  • v0.5
7 results
Created with Raphaël 2.2.010Apr98765432130Mar292827262423222119181716151312116543228Feb27262524232019181716151413121110742131Jan30292827252423222120191817161514131098743231DecAdd changes to profiler interfaces to avoid breaking coseadded test for dominance chain and extra check for loop guard branchAdding support for dumping CPU runtime configsmergignAdding baseline configuration to autotuning outputssplit unroll & jam to two passes so that we can run cse, simplifycfg, and loop simplify between themAdding CPU support in profiler and making adding devices more easyAdding CPU tensor operations and approximationsMerge branch 'approx_hpvm' of https://gitlab.engr.illinois.edu/llvm/hpvm into approx_hpvmAdding build support of OpenMP for CPU tensor runtimeUpdating Global Knobs file to include 33% samplingAdding Sampling test for 1*1 filterFix a bug in indexing for skip_offset == 4 for filter samplingModifying tensorConvSampSim to handle 33% samplingPushing latest unit test source - with tests for baseline FP32 FP16Merge branch 'approx_hpvm' of https://gitlab.engr.illinois.edu/llvm/hpvm into approx_hpvmFixed baseline convolution baseline and removed redundant tensorsAdding detailed unit test for 3*3 filter samplingmergingremoving stray printsAdd FP16 support for baseline convolutionAdd regular convolution and bug fixAdding unit tests for sampling and baseline for 1*1 filterAdd fixes to filter samplingUpdating sampling unit test that uncovers more bugsAdding failing sampling test on 1*1 filtersModifying buildRtConfig.py to use global_knobs.txtMergingAdding automatic extraction of devtime knobsFixes to row and column perforation routinesReading Samp Params from global_knobs.txtReading Perf Flag mappings from global_knobs.txtCreating Autotuner initializer for one-time initializationsBetter organization of SampParamSet and PerfParamSetremoving unnecessary printRemoving unnecessary printMoving print msg to DEBUG macroMoving to 2K images for ProfilingUpdating VGG16 tensorRT source to use 1K imgs for ProfilingUdating resnet to 1K images
Loading