Add gcu llama readme #8950

EnflameGCU · 2024-08-16T07:43:21Z

PR types

New features

PR changes

Docs

Description

Add gcu llama2-13b readme

paddle-bot · 2024-08-16T07:43:26Z

Thanks for your contribution!

codecov · 2024-08-16T08:16:55Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 54.14%. Comparing base (3ef7c1b) to head (6116026).
Report is 225 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8950      +/-   ##
===========================================
+ Coverage    54.06%   54.14%   +0.08%     
===========================================
  Files          650      650              
  Lines       103883   103883              
===========================================
+ Hits         56164    56250      +86     
+ Misses       47719    47633      -86

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚨 Try these New Features:

Flaky Tests Detection - Detect and resolve failed and flaky tests

wawltor · 2024-08-21T12:48:00Z

llm/gcu/llama/README.md

@@ -0,0 +1,162 @@
+## 🚣‍♂️ 使用PaddleNLP在GCU下运行llama2-13b模型 🚣
+
+PaddleNLP在燧原GCU（[了解燧原](https://www.enflame-tech.com/)）上对llama2-13B模型进行了深度适配和优化，实现了燧原GCU推理入口和GPU的基本统一，仅需修改device即可完成推理任务的迁移。


可以加上一些GCU硬件的描述

已添加S60加速卡基本情况介绍，并附官方链接

wawltor · 2024-08-21T12:48:43Z

llm/gcu/llama/README.md

+### 1. 环境准备：(这将花费您10～20min时间)
+
+1. 初始化环境，安装驱动<br/>
+  **注：您可以联系燧原工程师以获取软件驱动包和其他帮助**


联系方式提供下？

已添加联系方式

wawltor · 2024-08-21T12:50:32Z

llm/gcu/llama/README.md

+cd PaddleNLP
+python -m pip install -r requirements.txt
+# 源码编译安装 paddlenlp
+python setup.py bdist_wheel && python -m pip uninstall paddlenlp -y && python -m pip install dist/paddlenlp*


推荐一个固定版本的PaddleNLP版本会不会更好，然后再提供 git clone & python setup.py install 的方式

推荐使用PaddleNLP release版本v3.0.0-beta0，已更新步骤描述

wawltor · 2024-08-21T12:51:46Z

llm/gcu/llama/README.md

+tar -zxf wikitext-103.tar.gz
+```
+### 3. 推理：(这将花费您15~30min时间)
+下载权重文件后需要更改推理配置文件，以获取更大的性能提升。将```/root/.paddlenlp/models/__internal_testing__/sci-benchmark-llama-13b-5k/config.json```更改为下面的内容：


默认是不会提供 model config.json的？这里提供下怎么下载 internal_testing/sci-benchmark-llama-13b-5k 的模型方式？

首次推理将自动下载权重和配置，已更新描述

DrownFish19 · 2024-08-26T09:05:34Z

llm/gcu/llama/README.md

+2. 拉取镜像
+```bash
+# 注意此镜像仅为paddle开发环境，镜像中不包含预编译的飞桨安装包、TopsPlatform安装包等
+docker pull registry.baidubce.com/paddlepaddle/paddle:latest-dev


这里建议限定可运行版本，latest-dev不是好的tag

DrownFish19 · 2024-08-26T09:06:03Z

llm/gcu/llama/README.md

+```
+3. 参考如下命令启动容器
+```bash
+docker run --name paddle-gcu-test -v /home:/home --network=host --ipc=host -it --privileged registry.baidubce.com/paddlepaddle/paddle:latest-dev /bin/bash


此处docker启动命令也需按照对应镜像名称修改。

wawltor

LGTM

wawltor reviewed Aug 21, 2024

View reviewed changes

EnflameGCU force-pushed the gcu_llama_readme branch 2 times, most recently from f3a7669 to 16f869c Compare August 26, 2024 03:08

[GCU] Add gcu llama2-13b readme

6116026

EnflameGCU force-pushed the gcu_llama_readme branch from 16f869c to 6116026 Compare August 26, 2024 08:13

DrownFish19 reviewed Aug 26, 2024

View reviewed changes

wawltor approved these changes Aug 26, 2024

View reviewed changes

wawltor merged commit 154928a into PaddlePaddle:develop Aug 26, 2024
10 of 12 checks passed

lixcli pushed a commit to lixcli/PaddleNLP that referenced this pull request Aug 28, 2024

[GCU] Add gcu llama2-13b readme (PaddlePaddle#8950)

eddc061

Mangodadada pushed a commit to Mangodadada/PaddleNLP that referenced this pull request Sep 10, 2024

[GCU] Add gcu llama2-13b readme (PaddlePaddle#8950)

b9ac92e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add gcu llama readme #8950

Add gcu llama readme #8950

EnflameGCU commented Aug 16, 2024

paddle-bot bot commented Aug 16, 2024

codecov bot commented Aug 16, 2024 •

edited

Loading

wawltor Aug 21, 2024

EnflameGCU Aug 22, 2024 •

edited

Loading

wawltor Aug 21, 2024

EnflameGCU Aug 22, 2024

wawltor Aug 21, 2024

EnflameGCU Aug 22, 2024

wawltor Aug 21, 2024

EnflameGCU Aug 22, 2024

DrownFish19 Aug 26, 2024 •

edited

Loading

DrownFish19 Aug 26, 2024

wawltor left a comment

		@@ -0,0 +1,162 @@
		## 🚣‍♂️ 使用PaddleNLP在GCU下运行llama2-13b模型 🚣

		PaddleNLP在燧原GCU（[了解燧原](https://www.enflame-tech.com/)）上对llama2-13B模型进行了深度适配和优化，实现了燧原GCU推理入口和GPU的基本统一，仅需修改device即可完成推理任务的迁移。

Add gcu llama readme #8950

Add gcu llama readme #8950

Conversation

EnflameGCU commented Aug 16, 2024

PR types

PR changes

Description

paddle-bot bot commented Aug 16, 2024

codecov bot commented Aug 16, 2024 • edited Loading

Codecov Report

Choose a reason for hiding this comment

EnflameGCU Aug 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DrownFish19 Aug 26, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wawltor left a comment

Choose a reason for hiding this comment

codecov bot commented Aug 16, 2024 •

edited

Loading

EnflameGCU Aug 22, 2024 •

edited

Loading

DrownFish19 Aug 26, 2024 •

edited

Loading