PLATO-2 inference is 10x slower than DialoGPT at a similar parameter count #1743
Comments
May I ask whether your DialoGPT model is from paddlenlp? After generating replies, PLATO-2 runs an additional model to rank and select among them, which can make it slower. Because of PLATO-2's special architecture, it has not yet been integrated into paddlenlp.transformers, and for now we have no further way to speed it up. Apologies.
The DialoGPT model is from the transformers library. Understood, thanks.
@nonstopfor Hi, PLATO-2 has not yet been added to the pretrained model library because it requires two stages, candidate generation and candidate ranking. We are considering integrating the generation-stage model; see #1735. For pretrained dialogue models, paddlenlp.transformers already includes PLATO-mini (100M, Chinese) and PLATO-xl (11B, English, the first open-source dialogue model at the 10B+ scale), which you can try as needed. For generation models already integrated into paddlenlp.transformers, PaddleNLP's generation API provides high-performance acceleration: at the 345M scale (the gpt-medium-en model) we expect an 8-9x speedup over transformers. You are welcome to try it, and contributions of further models such as DialoGPT, following the GPT implementation in paddlenlp.transformers, are also welcome. We are also working on compatibility with transformers models; for some popular models we expect direct loading of transformers weights. Stay tuned.
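The two-stage pipeline described above suggests why PLATO-2 is slower than a single-pass decoder like DialoGPT: it generates one candidate per latent value and then pays an extra ranking pass. A rough, illustrative cost model (the `rank_overhead` parameter and the example latent count are assumptions for illustration, not profiled numbers):

```python
# Rough cost model for a generate-then-rank dialogue pipeline.
# PLATO-2 generates `latent_type_size` candidate replies and then scores
# them with a separate ranking model, so per-reply work is roughly
#   latent_type_size * cost_generate + cost_rank,
# while a plain decoder such as DialoGPT pays cost_generate once.

def relative_cost(latent_type_size: int, rank_overhead: float = 1.0) -> float:
    """Ratio of two-stage pipeline work to a single-pass decoder.

    rank_overhead: cost of the ranking pass, expressed in units of one
    generation pass (a hypothetical parameter for illustration).
    """
    return latent_type_size + rank_overhead

if __name__ == "__main__":
    # e.g. with 20 latent values the pipeline does roughly 21x the work
    # of one generation pass under this simplified model.
    print(relative_cost(20))  # 21.0
```

This is only a back-of-the-envelope sketch; real timings also depend on sequence length, batching, and how much of the candidate generation is parallelized.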
This issue is stale because it has been open for 60 days with no activity.
This issue was closed because it has been inactive for 14 days since being marked as stale.
I am using the PLATO-2 example at https://github.com/PaddlePaddle/PaddleNLP/tree/f079f91c48f17a1b9382076e89ce7050de83ea1d/examples/dialogue/plato-2 with the 24-layer model (310M parameters). On the same GPU with the same input, inference is roughly 10x slower than the medium DialoGPT (345M parameters). Is this expected, and is there any way to speed up PLATO-2 inference? I tried wrapping inference in `with paddle.no_grad()` and setting latent_type_size=1, but neither produced a noticeable speedup. Thanks!
Environment:
Linux, Python 3.8
paddlepaddle-gpu==2.2.2
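For a comparison like the one above (same GPU, same input), a small model-agnostic timing harness helps keep the measurement fair; `generate_fn` below is a hypothetical callable wrapping whichever model is being timed, and the warm-up call keeps one-off costs (e.g. CUDA kernel compilation, cache warm-up) out of the average:

```python
import time
from typing import Callable

def benchmark(generate_fn: Callable[[str], str], prompt: str, runs: int = 5) -> float:
    """Return mean seconds per call over `runs` calls, after one warm-up.

    generate_fn: hypothetical wrapper around a model's inference call
    (e.g. PLATO-2 or DialoGPT); it takes a prompt and returns a reply.
    """
    generate_fn(prompt)  # warm-up run, excluded from timing
    start = time.perf_counter()
    for _ in range(runs):
        generate_fn(prompt)
    return (time.perf_counter() - start) / runs
```

Timing both models through the same harness, with identical prompts and decoding settings where possible, makes the 10x figure easier to attribute to the model pipeline rather than to measurement noise.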