Which source file contains the code for loading the model? #2068

HongfengDu · 2024-07-31T11:38:43Z

I want to customize the model loading process and modify the logic for loading the model

nv-guomingz · 2024-07-31T12:24:41Z

https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/llama/convert_checkpoint.py#L414

HongfengDu · 2024-07-31T14:10:49Z

https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/llama/convert_checkpoint.py#L414
Thank you, I would like to modify the CPP file. Another issue， Executor(std::vector<uint8_t> const& engineBuffer, std::string const& jsonConfigStr, ModelType modelType, ExecutorConfig const& executorConfig); the interface how call

github-actions · 2024-08-31T01:57:55Z

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days."

nv-guomingz added the question Further information is requested label Jul 31, 2024

github-actions bot added the stale label Aug 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Which source file contains the code for loading the model? #2068

Which source file contains the code for loading the model? #2068

HongfengDu commented Jul 31, 2024

nv-guomingz commented Jul 31, 2024

HongfengDu commented Jul 31, 2024

github-actions bot commented Aug 31, 2024

Which source file contains the code for loading the model? #2068

Which source file contains the code for loading the model? #2068

Comments

HongfengDu commented Jul 31, 2024

nv-guomingz commented Jul 31, 2024

HongfengDu commented Jul 31, 2024

github-actions bot commented Aug 31, 2024