[Question] What is the difference between gptManagerBenchmark and gptSessionBenchmark in cpp benchmarks? #2092

hcnhcn012 · 2024-08-06T09:12:01Z

version of TensorRT-LLM: v0.11.0

What is the difference between gptManagerBenchmark and gptSessionBenchmark in the cpp benchmarks? Please give me some example scenarios for using each benchmark. Thanks a lot :)

yuhengxnv · 2024-08-25T18:45:16Z

gptSessionBenchmark seems to be simpler to use, but it's recommended to use gptManagerBenchmark.

lfr-0531 · 2024-09-02T06:41:34Z

gptSessionBenchmark benchmarks the GptSession runtime, which supports only static batching and is now deprecated. gptManagerBenchmark benchmarks the executor and gptManager runtime, and it supports both static batching and inflight batching. Since GptSession will be removed in a future release, we recommend using gptManagerBenchmark. For instructions on running the benchmarks, please refer to the cpp benchmark README: https://github.com/NVIDIA/TensorRT-LLM/tree/main/benchmarks/cpp#benchmark-c-runtime
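To make the static vs. inflight batching distinction concrete, here is a rough toy simulation. It is not TensorRT-LLM code and does not use any of its APIs; the function names, the request lengths, and the "one decoding step per token" cost model are all assumptions made purely for illustration.

```python
from collections import deque


def static_batching_steps(output_lens, batch_size):
    """Toy model: requests are grouped into fixed batches, and each batch
    occupies the runtime until its longest request has finished."""
    steps = 0
    for i in range(0, len(output_lens), batch_size):
        steps += max(output_lens[i:i + batch_size])
    return steps


def inflight_batching_steps(output_lens, batch_size):
    """Toy model: as soon as a request finishes, its slot is handed to the
    next queued request instead of waiting for the rest of the batch."""
    queue = deque(output_lens)  # remaining tokens per waiting request
    active = []                 # remaining tokens per running request
    steps = 0
    while queue or active:
        # Fill any free slots from the queue before the next decoding step.
        while queue and len(active) < batch_size:
            active.append(queue.popleft())
        steps += 1
        # Every active request emits one token; finished requests leave.
        active = [r - 1 for r in active if r - 1 > 0]
    return steps


# Hypothetical workload: output lengths (in tokens) for six requests.
lens = [2, 8, 3, 8, 2, 2]
print("static  :", static_batching_steps(lens, batch_size=2))
print("inflight:", inflight_batching_steps(lens, batch_size=2))
```

With this made-up workload and two slots, the static schedule takes 18 steps and the inflight schedule takes 13, because short requests no longer wait for the longest request in their batch. Real runtimes have many more factors (context phase, KV cache limits, scheduling policy), so treat this only as intuition for why the two modes can benchmark differently.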

lfr-0531 added the question (Further information is requested) label Sep 2, 2024

lfr-0531 self-assigned this Sep 4, 2024

lfr-0531 added the triaged (Issue has been triaged by maintainers) label Sep 4, 2024

lfr-0531 closed this as completed Sep 7, 2024
