mT5 directory structure #279

Open
hpk23 opened this issue Jan 4, 2024 · 1 comment

hpk23 commented Jan 4, 2024

I'm trying to use the mT5 model. For mT5, the tensorrt-llm build creates two engines, one for the encoder and one for the decoder. How should I organize the directory structure in this case? (All the existing examples seem to cover only decoder-only models.)

ensemble
 - 1
 - config.pbtxt

postprocessing
 - 1
   - model.py
 - config.pbtxt

preprocessing
 - 1
   - model.py
 - config.pbtxt

tensorrt_llm
 - 1
   - model.py
 - config.pbtxt
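
For anyone who wants to reproduce the layout listed above while experimenting, here is a minimal sketch that scaffolds it with empty placeholder files. It only mirrors the decoder-only layout from this question; it does not show an encoder/decoder arrangement, since that structure has not been settled in this thread. The `LAYOUT` mapping and the `scaffold` helper are hypothetical names introduced for illustration and are not part of TensorRT-LLM or Triton.

```python
from pathlib import Path
import tempfile

# Layout copied from the listing in the question (decoder-only case).
# Keys are model directories; entries ending in "/" are directories,
# everything else is created as an empty placeholder file.
LAYOUT = {
    "ensemble": ["1/", "config.pbtxt"],
    "postprocessing": ["1/model.py", "config.pbtxt"],
    "preprocessing": ["1/model.py", "config.pbtxt"],
    "tensorrt_llm": ["1/model.py", "config.pbtxt"],
}


def scaffold(root, layout=LAYOUT):
    """Create the directory tree under `root` and return the created relative paths."""
    root = Path(root)
    created = []
    for model, entries in layout.items():
        for entry in entries:
            path = root / model / entry
            if entry.endswith("/"):
                # Version directory with no files listed in the question.
                path.mkdir(parents=True, exist_ok=True)
            else:
                path.parent.mkdir(parents=True, exist_ok=True)
                path.touch()  # empty placeholder, to be replaced by real content
            created.append(path.relative_to(root).as_posix())
    return sorted(created)


if __name__ == "__main__":
    # Scaffold into a throwaway directory and print what was created.
    with tempfile.TemporaryDirectory() as tmp:
        for rel in scaffold(tmp):
            print(rel)
```

The placeholder files are empty on purpose: the real `config.pbtxt` and `model.py` contents have to come from the backend's own templates.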

byshiue added the triaged label on Jan 8, 2024

symphonylyh (Collaborator) commented

Hi @hpk23, we're working on a Triton backend example for the T5 structure. Progress is tracked under NVIDIA/TensorRT-LLM#800. We appreciate your patience for a few more weeks while we finalize the structure.
