This repo adds several new custom TensorRT plugins:
- BatchedNMSLandmark_TRT, BatchedNMSLandmarkDynamic_TRT: Batched NMS with face landmarks
- BatchedNMSLandmarkConf_TRT, BatchedNMSLandmarkConfDynamic_TRT: Batched NMS with face landmarks & confidence
- EfficientNMSLandmark_TRT: Efficient NMS with face landmarks
- EfficientNMSCustom_TRT: Same as Efficient NMS, but also returns box indices
- RoIAlignDynamic: Same as ONNX RoIAlign, copied from MMCVRoIAlign
- RoIAlign2Dynamic: Same as pyramidROIAlignPlugin, but with only one feature map
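Once the rebuilt libnvinfer_plugin.so is installed (see the build and replacement steps below), you can sanity-check that these plugins are registered. This is a minimal sketch, assuming the TensorRT Python bindings (`tensorrt`) are installed; the grep pattern simply filters for the plugin names listed above:

```bash
# List all registered TensorRT plugin creators and filter for the custom ones.
python3 -c "
import tensorrt as trt
trt.init_libnvinfer_plugins(trt.Logger(trt.Logger.WARNING), '')
print('\n'.join(c.name for c in trt.get_plugin_registry().plugin_creator_list))
" | grep -E 'Landmark|RoIAlign|EfficientNMSCustom'
```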
- DeepStream 6.0.1 or DeepStream 6.1
Please refer to the guide at github.com/NVIDIA-AI-IOT/deepstream_tao_apps.
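To confirm which DeepStream release you have installed (and therefore which TensorRT OSS branch to check out from the table below), you can check the version file; this assumes the default DeepStream install location:

```bash
# Print the installed DeepStream version (default install path).
cat /opt/nvidia/deepstream/deepstream/version
# Or, if deepstream-app is on your PATH:
deepstream-app --version-all
```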
Building TensorRT OSS requires CMake >= v3.13, so install a newer CMake (e.g. 3.19.4, as below) if your version is older than 3.13:
wget https://github.com/Kitware/CMake/releases/download/v3.19.4/cmake-3.19.4.tar.gz
tar xvf cmake-3.19.4.tar.gz
cd cmake-3.19.4/
mkdir -p $HOME/install
./configure --prefix=$HOME/install
make -j$(nproc)
make install  # installs into $HOME/install, so sudo is not needed
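You can verify the newly installed CMake before using it for the TensorRT OSS build:

```bash
# Should report cmake version 3.19.4
$HOME/install/bin/cmake --version
```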
| DeepStream Release | TRT Version | TRT_OSS_CHECKOUT_TAG | Support |
| --- | --- | --- | --- |
| 5.0 | TRT 7.0.0 | release/7.0 | No |
| 5.0.1 | TRT 7.0.0 | release/7.0 | No |
| 5.1 | TRT 7.2.X | 21.03 | No |
| 6.0 EA | TRT 7.2.2 | 21.03 | No |
| 6.0 GA | TRT 8.0.1 | release/8.0 | No |
| 6.0.1 | TRT 8.2.1 | release/8.2 | Yes |
| 6.1 | TRT 8.2.5.1 | release/8.2 | Yes |
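To check which TensorRT version is installed (and thus which row of the table applies), you can query dpkg on Debian/Ubuntu-based systems:

```bash
# Shows installed TensorRT packages and their versions, e.g. 8.2.1-1+cuda11.4
dpkg -l | grep nvinfer
```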
git clone -b release/8.2 https://github.com/hiennguyen9874/TensorRT
cd TensorRT/
git submodule update --init --recursive
export TRT_SOURCE=`pwd`
cd $TRT_SOURCE
mkdir -p build && cd build
## NOTE: as mentioned above, please make sure your GPU_ARCHS is listed in the TRT OSS CMakeLists.txt
## if GPU_ARCHS is not in the TRT OSS CMakeLists.txt, add -DGPU_ARCHS=xy as below; for xy, refer to the "How to Get GPU_ARCHS" section below
$HOME/install/bin/cmake .. -DGPU_ARCHS=xy -DTRT_LIB_DIR=/usr/lib/x86_64-linux-gnu/ -DCMAKE_C_COMPILER=/usr/bin/gcc -DTRT_BIN_DIR=`pwd`/out
make nvinfer_plugin -j$(nproc)
After the build completes successfully, libnvinfer_plugin.so* will be generated under `pwd`/out/ (i.e. the out/ directory inside build/).
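A quick check that the plugin library was actually produced (run from the build/ directory created above):

```bash
# The rebuilt plugin library and its symlinks should be listed here.
ls -l out/libnvinfer_plugin.so*
```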
# backup the original libnvinfer_plugin.so.x.y, e.g. libnvinfer_plugin.so.8.0.0
sudo mv /usr/lib/x86_64-linux-gnu/libnvinfer_plugin.so.8.p.q ${HOME}/libnvinfer_plugin.so.8.p.q.bak
# only replace the real file, don't touch the symlinks, e.g. libnvinfer_plugin.so, libnvinfer_plugin.so.8
sudo cp $TRT_SOURCE/build/out/libnvinfer_plugin.so.8.m.n /usr/lib/x86_64-linux-gnu/libnvinfer_plugin.so.8.p.q
sudo ldconfig
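After replacing the library, you can verify that the dynamic linker picks up the new file and that it actually contains the custom plugins (the plugin names are embedded as strings in the binary). This uses the same 8.p.q placeholder as above; substitute your actual version:

```bash
# Confirm the dynamic linker resolves libnvinfer_plugin
ldconfig -p | grep libnvinfer_plugin
# Confirm the custom plugin names are present in the replaced library
strings /usr/lib/x86_64-linux-gnu/libnvinfer_plugin.so.8.p.q | grep -E 'BatchedNMSLandmark|EfficientNMSLandmark|RoIAlign'
```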
You can use either of the following methods to get GPU_ARCHS.
- The GPU_ARCHS value can be obtained with the "deviceQuery" CUDA sample:
cd /usr/local/cuda/samples/1_Utilities/deviceQuery
sudo make
./deviceQuery
- If "/usr/local/cuda/samples" does not exist on your system, you can compile the deviceQuery.cpp in this folder instead:
nvcc deviceQuery.cpp -o deviceQuery
./deviceQuery
The output will look like the following, which indicates that GPU_ARCHS is 75 (the CUDA capability 7.5 with the dot dropped).
./deviceQuery
Detected 2 CUDA Capable device(s)
Device 0: "Tesla T4"
CUDA Driver Version / Runtime Version 10.2 / 10.2
CUDA Capability Major/Minor version number: 7.5
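Alternatively, on reasonably recent NVIDIA drivers, nvidia-smi can report the compute capability directly, which gives GPU_ARCHS without compiling anything; this assumes your driver supports the compute_cap query field:

```bash
# Prints e.g. "7.5" per GPU; stripping the dot gives GPU_ARCHS=75
nvidia-smi --query-gpu=compute_cap --format=csv,noheader | tr -d '.'
```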