Agent for TensorRT Prediction

This is the TensorRT agent for MLModelScope, an open-source, framework- and hardware-agnostic, extensible and customizable platform for evaluating and profiling ML models across datasets, frameworks, and systems, and within AI application pipelines.

Currently it has Caffe models from the Caffe Model Zoo built in, and more built-in models are coming. You can evaluate the models on any system of interest with either a local TensorRT installation or the TensorRT Docker images.

Check out MLModelScope; contributions are welcome.

Installation

Install Go if you have not done so, following Go Installation.

Download and install the MLModelScope TensorRT Agent:

go get -v github.com/rai-project/tensorrt
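To confirm that everything landed where the later steps expect it, you can check the Go toolchain and list the fetched source (go get uses the standard pre-modules GOPATH layout):

go version
ls $GOPATH/src/github.com/rai-project/tensorrt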

The agent requires the TensorRT C library and a number of Go packages.

Go packages

You can install the dependencies through go get:

cd $GOPATH/src/github.com/rai-project/tensorrt
go get -u -v ./...

Or use Dep.

dep ensure -v

This installs the dependencies in vendor/.

Note: The CGO interface passes Go pointers to the C API, which the CGO runtime reports as an error. Disable this check by placing

export GODEBUG=cgocheck=0

in your ~/.bashrc or ~/.zshrc file, and then running source ~/.bashrc or source ~/.zshrc.
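For example, one way to append the setting and reload the configuration in a bash setup is:

echo 'export GODEBUG=cgocheck=0' >> ~/.bashrc
source ~/.bashrc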

TensorRT

TensorRT is required. If you use the TensorRT Docker images (e.g. from NVIDIA GPU CLOUD (NGC)), skip this step. Refer to go-tensorrt for TensorRT installation instructions.
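If you are unsure whether a local TensorRT installation is visible to the linker, a quick check (a sketch assuming a Linux system) is to look for the core inference library:

ldconfig -p | grep libnvinfer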

External services

Refer to External services.
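As a rough sketch of what launching those services can look like (an assumption based on a typical MLModelScope deployment, which uses a registry, a tracer, and a database; the authoritative list and versions are in the External services documentation):

docker run -d -p 8500:8500 consul
docker run -d -p 16686:16686 -p 6831:6831/udp jaegertracing/all-in-one
docker run -d -p 27017:27017 mongo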

Use within TensorRT Docker Images

Refer to Use within TensorFlow Docker Images in the MLModelScope TensorFlow agent for the analogous setup.

Continue if you have:

  • installed all the dependencies,
  • downloaded carml_config_example.yml to $HOME as .carml_config.yml, and
  • launched the Docker external services on the host machine of the Docker container you are going to use.

Otherwise, read the sections above first.

An example of using the NGC TensorRT Docker image:

nvidia-docker run --shm-size=1g --ulimit memlock=-1 --ulimit stack=67108864 -it --privileged=true --network host \
-v $GOPATH:/workspace/go1.12/global \
-v $GOROOT:/workspace/go1.12_root \
-v ~/.carml_config.yml:/root/.carml_config.yml \
nvcr.io/nvidia/tensorrt:19.06-py3

NOTE: Docker's default SHMEM allocation limit is 64MB, which may be insufficient for TensorRT. NVIDIA therefore recommends the flags used in the example above: nvidia-docker run --shm-size=1g --ulimit memlock=-1 --ulimit stack=67108864 ...

Within the container, set up the environment so that the agent can find the TensorRT installation.

# point Go at the volumes mounted into the container above
export GOPATH=/workspace/go1.12/global
export GOROOT=/workspace/go1.12_root
export PATH=$GOROOT/bin:$PATH

# let cgo find the CUDA and CUPTI libraries at build and run time
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64/
export CGO_LDFLAGS="${CGO_LDFLAGS} -L /usr/local/cuda/lib64 -L /usr/local/cuda/extras/CUPTI/lib64/"

# pick up Go-installed binaries and disable the cgo pointer check (see above)
export PATH=$PATH:$(go env GOPATH)/bin
export GODEBUG=cgocheck=0
cd $GOPATH/src/github.com/rai-project/tensorrt/tensorrt-agent
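From here, a minimal sketch of building and inspecting the agent (the exact subcommands and flags are documented in the Usage section below; the -h flag is assumed here only to list the available commands):

go build
./tensorrt-agent -h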

Usage

Refer to Usage.
