speeding up yolov5 megadetector inference #105

Merged: 10 commits merged on Apr 7, 2023

Conversation

@rbavery (Contributor) commented on Mar 25, 2023

Inference for the fully reproduced MegaDetector v5a model is currently about 9 seconds per image. This PR speeds it up by:

  • compiling the model to ONNX, independent of image size changes (see the export sketch after this list)
  • reducing image size while preserving detection performance as much as possible
  • we did not apply any other optimizations (NeuralMagic or direct custom ONNX sparsification)
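
For reference, a size-independent ONNX export looks roughly like the sketch below. In practice YOLOv5's own export tooling does the heavy lifting; the weights path, loading call, input/output names, and opset here are assumptions, not the exact code in this PR.

```python
import torch

# Load the MegaDetector v5a weights through the YOLOv5 hub entry point.
# The weights filename and autoshape flag are illustrative assumptions.
model = torch.hub.load("ultralytics/yolov5", "custom", path="md_v5a.0.0.pt", autoshape=False)
model.eval()

# The dummy input only sets a reference shape; dynamic_axes below keep batch,
# height, and width flexible so one ONNX graph can serve any image size.
dummy = torch.zeros(1, 3, 640, 640)
torch.onnx.export(
    model,
    dummy,
    "megadetector_v5a.onnx",
    input_names=["images"],
    output_names=["output"],
    dynamic_axes={
        "images": {0: "batch", 2: "height", 3: "width"},
        "output": {0: "batch"},
    },
    opset_version=12,
)
```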

See the README.md for instructions on getting started with downloading model weights, packaging the model, running the torchserve container, and sending image POST requests (a request sketch follows the notebook list below). This PR also adds two notebooks that can be used to

  1. compare models on folders of images, or
  2. run single-image inference locally, debugging each step and comparing results with the torchserve container.
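
As a rough illustration of the request step, a TorchServe prediction call typically looks like the sketch below; the host, port, model name, and image path are assumptions, so follow the README for the actual endpoint.

```python
import requests

# TorchServe's default inference API listens on port 8080; the model name
# "mdv5" and the test image path are assumptions -- see the README for real values.
with open("test_image.jpg", "rb") as f:
    resp = requests.post(
        "http://localhost:8080/predictions/mdv5",
        data=f.read(),
        headers={"Content-Type": "application/octet-stream"},
    )
print(resp.json())  # detections returned by the handler
```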

@rbavery (Contributor, Author) commented on Mar 25, 2023

Inference is currently about 8 seconds per image. This branch is for investigating how to speed this up by:

  • compiling to TorchScript, independent of image size changes
  • reducing image size while preserving performance as much as possible, potentially with multiple compiled TorchScript models for different image sizes
  • other optimizations (NeuralMagic, ONNX, TensorRT)

See the README.md for instructions on getting started with downloading model weights, packaging the model, running the torchserve container, and sending image POST requests.

Goal: an average inference time of 2-3 seconds per image. We were able to achieve this by resizing all images to 640x640 px and using a TorchScript model compiled for that size, but this degraded detection performance.
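
For context, a fixed-size TorchScript trace of the kind described above might look like this sketch (the loading call, filenames, and 640x640 size are assumptions); because tracing bakes in the input shape, every image has to be resized to that size, which is the trade-off noted above.

```python
import torch

# Weights path and loading call are illustrative assumptions.
model = torch.hub.load("ultralytics/yolov5", "custom", path="md_v5a.0.0.pt", autoshape=False)
model.eval()

# Tracing specializes the graph to this 640x640 example input,
# so images must be resized/letterboxed to 640x640 at inference time.
example = torch.zeros(1, 3, 640, 640)
traced = torch.jit.trace(model, example)
traced.save("megadetector_v5a_640.torchscript.pt")
```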

@rbavery (Contributor, Author) commented on Mar 31, 2023

After compiling to ONNX we get inference speeds of 1.7 seconds per image vs. ~5 seconds without compilation! This is on my local desktop. We'll test this on an endpoint early next week.
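
A back-of-the-envelope way to reproduce this kind of comparison locally is to time an ONNX Runtime session directly; the filename, input name, input size, and execution provider below are assumptions, not the notebook's exact code.

```python
import time
import numpy as np
import onnxruntime as ort

# Model filename, input name, and the 1280x1280 input size are illustrative assumptions.
session = ort.InferenceSession("megadetector_v5a.onnx", providers=["CPUExecutionProvider"])
img = np.random.rand(1, 3, 1280, 1280).astype(np.float32)  # stand-in for a letterboxed, normalized image

start = time.time()
outputs = session.run(None, {"images": img})
print(f"inference took {time.time() - start:.2f} s")
```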

@nathanielrindlaub (Member) left a comment

Awesome - all looks great!
