Commit a16411e7 authored by hsharif3

Update TensorRT README.md

parent 6bba7427
@@ -4,29 +4,21 @@
 ### Dependencies
-- CUDA-9.1 or above
-  - Your device must have a CUDA-enabled nVidia GPU
-  - CUBLAS-9.1 or above - included with CUDA by default
+- CUDA >= 9.1
-- cuDNN-7.0 or above
+- cuDNN >= 7
-- `cmake >= 3.18`
-- `make >= 4`
-- `gcc < 8` or `3.2 <= clang < 9`
-  - We have an upperbound for compiler version because CUDA doesn't support too recent compilers
-### Building the Tensor Runtime
-The following commands will compile the tensor runtime library (`build/libtensor_runtime.a`)
-as well as a number of exemplary benchmarks (DNN models):
-```shell
-mkdir build && cd build
-cmake ../
-make -j
-```
+### Building Tensor Runtime
+Tensor Runtime and the DNN sources using the Tensor runtime are built with the unified HPVM build system. These
+can also be separately built. HPVM Tensor Runtime can be built as:
+```
+make -j ${NUM_THREADS} tensor_runtime
+```
+The tensor runtime is built as a static library under `build/lib/libtensor_runtime.a`
 ### Tensor Runtime APIs
@@ -38,26 +30,10 @@ make -j
 - For examples of using `tensor_runtime` functions, see `dnn_sources/src/alexnet_cifar10.cc`.
 - Also, try running `build/alexnet_cifar10` which is compiled from that file and runnable out of the box.
-## Developer Notes
 ### Directory Structure
-- ./tensor_runtime:
-  - ./tensor_runtime/include/: Include files for Tensor Runtime
-  - ./tensor_runtime/include/tensor_signatures.cc: Include file with Tensor RT signatures
-    - NOTE: UPDATE this with updated API
-  - ./tensor_runtime/src/: HPVM TensorRT sources
+- `tensor_runtime`:
-- ./dnn_sources:
-  - ./dnn_sources/src/${BENCH}.cc: Per Bench FULL-precision source
-  - ./dnn_sources/src/half/${BENCH}.cc: Per Bench HALF-precision source
-  - ./dnn_sources/src/promise/${BENCH}.cc: Per Bench Layer-API source
+- `dnn_sources`:
-- ./bin:
-  - ./bin/install_runtime.sh: Script for moving Tensor RT files to ./lib
-  - ./bin/run_autotuner.py: Python script for running Autotuner experiments
-  - ./bin/setup_tyler_paths.sh: Tyler-specific path setup for Tensor RT
-  - ./bin/setup_jetson.sh: Jetson board specific path setup for Tensor RT
-  - ./bin/setup_cuda_paths.sh: Place-holder script for setting CUDA paths
-  - ./bin/swing_selection.py: Script for hardware mapping
-    - NOTE: Includes the L2,L1 norm mapping to hardware knobs
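
The updated README states its CUDA requirement as a plain version bound (`CUDA >= 9.1`), which can be checked before invoking the build. The following is a minimal sketch, not part of this commit: the function names `version_ge`, `cuda_version`, and `check_cuda` are hypothetical, and it assumes `nvcc` is on `PATH`, that its output contains a `release X.Y` token, and that GNU `sort -V` is available.

```shell
#!/usr/bin/env bash
# Sketch only: helper names are illustrative and do not exist in the HPVM repository.

# version_ge A B: succeeds when version A >= version B.
# Assumes GNU coreutils `sort -V` (version sort).
version_ge() {
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# cuda_version: print the "X.Y" release reported by `nvcc --version`,
# or nothing if nvcc is missing or its output has no "release X.Y" token.
cuda_version() {
  nvcc --version 2>/dev/null |
    sed -n 's/.*release \([0-9][0-9]*\.[0-9][0-9]*\).*/\1/p'
}

# check_cuda: compare the detected CUDA release against the README minimum (9.1).
check_cuda() {
  local found
  found="$(cuda_version)"
  if [ -z "$found" ]; then
    echo "nvcc not found on PATH"
    return 1
  fi
  if version_ge "$found" "9.1"; then
    echo "CUDA $found OK"
  else
    echo "CUDA $found is older than 9.1"
    return 1
  fi
}
```

Calling `check_cuda` before `make -j ${NUM_THREADS} tensor_runtime` would surface a too-old toolkit early. The cuDNN bound is not covered here, since how its version is exposed depends on the installation.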