Skip to content

Commit

Permalink
Fix typo
Browse files Browse the repository at this point in the history
  • Loading branch information
xusenlin committed Aug 11, 2023
1 parent a6a7b11 commit aafaaf9
Show file tree
Hide file tree
Showing 2 changed files with 1 addition and 19 deletions.
2 changes: 1 addition & 1 deletion api/patches.py
Original file line number Diff line number Diff line change
Expand Up @@ -165,4 +165,4 @@ def apply_rerope_patch(training_length: int = 4096, window_size: int = 512):
global TRAINING_LENGTH, WINDOW_SIZE
TRAINING_LENGTH, WINDOW_SIZE = training_length, window_size
LlamaAttention.forward = forward_with_rerope
logger.info(f"Apply NTK scaling with TRAINING_LENGTH={TRAINING_LENGTH}")
logger.info(f"Apply ReRoPE with TRAINING_LENGTH={TRAINING_LENGTH}")
18 changes: 0 additions & 18 deletions docs/SCRIPT.md
Original file line number Diff line number Diff line change
Expand Up @@ -325,24 +325,6 @@ docker run -it -d --gpus all --ipc=host --net=host -p 80:80 --name=aquila-chat-7
--embedding_name moka-ai/m3e-base
```

### NewHope

SLAM-group/NewHope:

```shell
docker run -it -d --gpus all --ipc=host --net=host -p 80:80 --name=newhope \
--ulimit memlock=-1 --ulimit stack=67108864 \
-v `pwd`:/workspace \
llm-api:pytorch \
python api/app.py \
--port 80 \
--allow-credentials \
--model_name newhope \
--model_path SLAM-group/NewHope \
--device cuda \
--embedding_name moka-ai/m3e-base
```

### Qwen-7b-chat

除已有的环境之外,推荐安装下面的依赖以提高运行效率和降低显存占用
Expand Down

0 comments on commit aafaaf9

Please sign in to comment.