Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by lvhan028
Issues list
[Bug] ValueError: Tokenizer class Qwen2Tokenizer does not exist or is not currently imported.
#1903
opened Jul 3, 2024 by zhyncs
[Bug] Using the turbomind engine, prompting more than 10k tokens will result in garbage output.
#1896
opened Jul 2, 2024 by dafu-wu
[Bug] CUDA runtime error: an illegal memory access was encountered when 8bit kv quant was enabled
#1895
opened Jul 1, 2024 by aabbccddwasd
[Bug] AttributeError: 'LlavaNextConfig' object has no attribute 'hidden_size'
#1868
opened Jun 27, 2024 by zhaozeno
Loading Qwen1.5-32B-Chat via pipeline with tp=4: when prompted in the OpenAI prompt format to clean Chinese text, all generated replies are in English
#1864
opened Jun 26, 2024 by Yang-bug-star
How do I extract the reply text from a response obtained with OpenAI-format input? The returned response appears to be segmented
#1863
opened Jun 26, 2024 by Yang-bug-star
[Bug] Segmentation fault: address not mapped to object at address 0x2058
#1849
opened Jun 25, 2024 by austingg
[Bug] InternLM2MLP.forward() missing 1 required positional argument: 'im_mask'
#1847
opened Jun 25, 2024 by jiangjingz
[Feature] How to support bf16 when inferencing Internvl-chat
#1839
opened Jun 24, 2024 by Leo-yang-1020