convert_checkpoint.py failed with Llama 3.1 8B Instruct #2105
Comments
I am also experiencing this issue when running benchmarks.
Sync up: it's fixed on the main branch, post the v0.11.0 release.
Llama 3.1 is not supported in TRT-LLM 0.11; it is only supported on the main branch now. The first commit in which we support Llama 3.1 is
Discussed with @byshiue: Llama 3.1 models require transformers >= 4.43.0. Maybe a workaround is to temporarily
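A quick way to verify the environment meets that requirement (a minimal sketch; the rope_scaling detail reflects the usual failure mode with pre-4.43.0 transformers, not output quoted in this issue):

```python
# Minimal sanity check (assumption: the failure stems from transformers being
# too old to parse Llama 3.1's new "llama3" rope_scaling config format).
from packaging import version
import transformers

MIN_VERSION = "4.43.0"
if version.parse(transformers.__version__) < version.parse(MIN_VERSION):
    raise RuntimeError(
        f"transformers {transformers.__version__} is too old for Llama 3.1; "
        f"upgrade with: pip install 'transformers>={MIN_VERSION}'"
    )
print(f"transformers {transformers.__version__} is new enough for Llama 3.1")
```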
Is this supported in a released version now?
Ouch, sorry for the noise. I see on https://github.com/NVIDIA/TensorRT-LLM/releases/tag/v0.12.0 that it's supported.
System Info
Debian 11
Who can help?
@byshiue @juney-nvidia
Information

Tasks
- An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...)

Reproduction
Expected behavior
convert_checkpoint.py should work with Llama 3.1.
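For reference, a minimal check of the same step the conversion script trips over (a sketch, not the reporter's actual command; assumes the standard Hugging Face model id and access to the gated meta-llama repo):

```python
# Sketch: convert_checkpoint.py loads the Hugging Face checkpoint config via
# transformers, so if this parse fails, the conversion fails too.
# Requires transformers >= 4.43.0 for the Llama 3.1 rope_scaling format.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
print(cfg.rope_scaling)  # Llama 3.1 introduces the "llama3" rope scaling type
```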
Actual behavior
Additional notes
N/A