
TensorRT Plugin #1

Open
johnnynunez opened this issue May 22, 2024 · 10 comments

Comments

@johnnynunez

I've seen your NVIDIA/TensorRT#3859.
Is it possible to have it on TRT 10?
I'm working on a Jetson AGX Orin, which is now compatible with CUDA 12.5, cuDNN 9.1.1 and TensorRT 10.0.1.6.
Also, is it compatible with YOLOv8?

@levipereira
Owner

Yes, it can be easily implemented on TRT 10, and for any YOLO version since v4, because it's the same implementation as the End2End EfficientNMS plugin, just with a new det_indices output layer added.
I will try to find some free time and implement it on 8.5 and 10.0.
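For context, EfficientNMS-style end-to-end exports emit (num_dets, boxes, scores, classes); the extra det_indices output maps each kept detection back to its position in the raw prediction tensor. Here is a minimal NumPy sketch of that idea (illustrative only, not the plugin's actual CUDA implementation; function and parameter names are invented for the example):

```python
import numpy as np

def nms_with_indices(boxes, scores, iou_thres=0.45, score_thres=0.25, max_det=100):
    """Greedy NMS that also returns the original indices of the kept boxes,
    mimicking the extra det_indices output of the YoloNMS plugin.
    boxes: (N, 4) in x1, y1, x2, y2 format; scores: (N,)."""
    # Sort by descending confidence, then drop low-score candidates.
    order = np.argsort(-scores)
    order = order[scores[order] > score_thres]
    keep = []
    while order.size > 0 and len(keep) < max_det:
        i = order[0]
        keep.append(i)
        if order.size == 1:
            break
        # IoU of the current top box against all remaining candidates.
        xx1 = np.maximum(boxes[i, 0], boxes[order[1:], 0])
        yy1 = np.maximum(boxes[i, 1], boxes[order[1:], 1])
        xx2 = np.minimum(boxes[i, 2], boxes[order[1:], 2])
        yy2 = np.minimum(boxes[i, 3], boxes[order[1:], 3])
        inter = np.clip(xx2 - xx1, 0, None) * np.clip(yy2 - yy1, 0, None)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        areas = (boxes[order[1:], 2] - boxes[order[1:], 0]) * \
                (boxes[order[1:], 3] - boxes[order[1:], 1])
        iou = inter / (area_i + areas - inter + 1e-9)
        # Suppress candidates that overlap the kept box too much.
        order = order[1:][iou <= iou_thres]
    keep = np.array(keep, dtype=np.int64)
    # The third return value is the det_indices-style output: indices
    # into the original box tensor for each surviving detection.
    return boxes[keep], scores[keep], keep
```

With two heavily overlapping boxes and one distant box, only the higher-scoring of the overlapping pair survives, and the returned indices point back into the input array.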

@levipereira
Owner

NVIDIA/TensorRT#3859 (comment)

@levipereira
Owner

@johnnynunez Check this out.
https://github.com/levipereira/ultralytics -- Added Support for TRT Plugin YoloNMS on Yolov8 for Instance Segmentation and Object Detection

I have tested/validated on deepstream with yolov8n -- https://github.com/levipereira/deepstream-yolov9

from ultralytics import YOLO
# model = YOLO("yolov8n-seg.pt") 
model = YOLO("yolov8n.pt") 
model.export(format="onnx_trt")
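Downstream of the export, the engine produces fixed-shape end-to-end outputs rather than a raw prediction tensor, so the consumer side just slices out the valid detections per image. A hedged sketch of that step (output names and shapes are assumptions modeled on EfficientNMS conventions, batch-first, with padding past num_dets):

```python
import numpy as np

def decode_end2end(num_dets, det_boxes, det_scores, det_classes, batch=0):
    """Slice the fixed-size end-to-end outputs down to the valid detections
    for one image. Assumed shapes: num_dets (B, 1), det_boxes (B, max_det, 4),
    det_scores (B, max_det), det_classes (B, max_det)."""
    n = int(num_dets[batch, 0])
    return (det_boxes[batch, :n],
            det_scores[batch, :n],
            det_classes[batch, :n])

# Dummy tensors standing in for TensorRT engine outputs (max_det = 4).
num_dets = np.array([[2]], dtype=np.int32)
det_boxes = np.zeros((1, 4, 4), dtype=np.float32)
det_boxes[0, 0] = [10, 10, 50, 50]
det_boxes[0, 1] = [60, 60, 90, 90]
det_scores = np.array([[0.9, 0.8, 0.0, 0.0]], dtype=np.float32)
det_classes = np.array([[0, 2, -1, -1]], dtype=np.int32)

boxes, scores, classes = decode_end2end(num_dets, det_boxes, det_scores, det_classes)
```

The padding entries past num_dets are ignored entirely, which is what makes these exports convenient for DeepStream and Triton pipelines.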

@johnnynunez
Author

johnnynunez commented May 23, 2024

@levipereira awesome! But I may still have to make predict compatible.
These guys did it: https://github.com/nkb-tech/ultralytics

> @johnnynunez Check this out. https://github.com/levipereira/ultralytics -- Added Support for TRT Plugin YoloNMS on Yolov8 for Instance Segmentation and Object Detection
>
> I have tested/validated on deepstream with yolov8n -- https://github.com/levipereira/deepstream-yolov9
>
> from ultralytics import YOLO
> # model = YOLO("yolov8n-seg.pt")
> model = YOLO("yolov8n.pt")
> model.export(format="onnx_trt")

@johnnynunez
Author

@levipereira also, can you create a PR to ultralytics?

@levipereira
Owner

@johnnynunez

With Triton Server and Triton Client, we can easily perform inference and evaluation on any YOLO Series model. Check out the evaluation results of YOLOv8 models using YOLO_NMS_TRT at the link below:

YOLOv8 Evaluation Results

Implementing inference using the TensorRT API and Custom Plugin within the Ultralytics project involves a significant amount of work. I may consider implementing it in the future.

Using Triton Server, we can build and test any model without additional effort.

For more information, visit:
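As an illustration of the "no additional effort" point, serving such an engine on Triton mostly comes down to a model configuration like the sketch below (the model name, tensor names, dims, and max_det value of 100 are assumptions based on common end-to-end YOLO export conventions, not copied from the linked repo):

```protobuf
name: "yolov8n_end2end"
platform: "tensorrt_plan"
max_batch_size: 8
input [
  {
    name: "images"
    data_type: TYPE_FP32
    dims: [ 3, 640, 640 ]
  }
]
output [
  {
    name: "num_dets"
    data_type: TYPE_INT32
    dims: [ 1 ]
  },
  {
    name: "det_boxes"
    data_type: TYPE_FP32
    dims: [ 100, 4 ]
  },
  {
    name: "det_scores"
    data_type: TYPE_FP32
    dims: [ 100 ]
  },
  {
    name: "det_classes"
    data_type: TYPE_INT32
    dims: [ 100 ]
  },
  {
    name: "det_indices"
    data_type: TYPE_INT32
    dims: [ 100 ]
  }
]
```

Because the NMS runs inside the engine, the client only has to read the fixed-size output tensors; no custom postprocessing backend is needed.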

@levipereira
Owner

levipereira commented May 24, 2024

> @levipereira also, can you create a PR to ultralytics?

I will implement end-to-end export with EfficientNMS or YOLO_NMS_TRT and open a PR.

@johnnynunez
Author

@levipereira do you have lower mAP with efficient_nms in COCO eval?

@levipereira
Owner

> @levipereira do you have lower mAP with efficient_nms in COCO eval?

No, I did not get a lower mAP. The results were consistent with the baseline evaluation.

@levipereira
Owner

levipereira commented Jun 16, 2024

@johnnynunez https://github.com/levipereira/triton-server-yolo?tab=readme-ov-file#evaluation-test-on-tensorrt
I got the same result, even with FP16.
