[Error] Static dimension mismatch while setting input shape while running Llama 3 8B #2071
Closed
Labels: bug
System Info
- CPU architecture: x86_64
- CPU/Host memory size: 187 GB
- GPU properties:
  - GPU name: A10
  - GPU memory size: 24 GB
- Libraries:
  - TensorRT-LLM tag: v0.10.0
  - Model: Llama 3 8B
  - Container used: nvcr.io/nvidia/tritonserver:24.07-trtllm-python-py3
The engine was built with the command below:
Who can help?
@byshiue @kaiyux
Reproduction
Docker image: nvcr.io/nvidia/tritonserver:24.07-trtllm-python-py3
Used the Llama 3 8B model and the quantization scripts from the TensorRT-LLM examples.
Expected behavior
Model inference runs successfully.
Actual behavior
Fails with the error: "Static dimension mismatch while setting input shape."
Additional notes
I experimented with these configs, but they didn't make any difference:
--max_input_len=4096
--max_batch_size=1
--max_num_tokens=4096
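For context, flags like the ones above are passed to `trtllm-build` when creating the engine. A representative invocation is sketched below; the checkpoint and output directory paths are placeholders and not from this report, and only the three `--max_*` flags come from the issue itself:

```shell
# Hypothetical trtllm-build invocation for a Llama 3 8B engine on TensorRT-LLM v0.10.0.
# ./llama3-8b-checkpoint and ./llama3-8b-engine are assumed placeholder paths.
trtllm-build \
    --checkpoint_dir ./llama3-8b-checkpoint \
    --output_dir ./llama3-8b-engine \
    --max_input_len=4096 \
    --max_batch_size=1 \
    --max_num_tokens=4096
```

These flags fix the shape ranges the engine accepts at runtime; one common cause of a "static dimension mismatch" is a runtime input whose batch size or sequence length falls outside the ranges the engine was built with.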