Failure to compile a model on Inf1 with optimum-cli due to lack of arguments #471
Closed
Labels: bug
System Info
Who can help?
@JingyaHuang
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction (minimal, reproducible, runnable)
When I attempted to compile the bert-base-uncased model on an Inf1 instance following the official documentation, I encountered the following error. I used the pre-built PyTorch environment for Inf1 provided by "Deep Learning AMI Neuron PyTorch 1.13 (Ubuntu 20.04) 20240102".
Expected behavior
This error occurs because neuron.py does not use arguments such as --library_name, --subfolder, --compiler_workdir, --disable-weights-neff-inline, and other arguments in the level_group category, which are used in neuronx.py. When I modified neuron.py to use the same arguments as neuronx.py, the model was successfully compiled. The output is as follows:
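As a separate illustration of the mismatch described above, here is a minimal, self-contained sketch using Python's argparse. It is not the optimum-neuron source: the two parser builders and export_model are hypothetical stand-ins for neuron.py, neuronx.py, and the shared export code. The only assumption is that the shared code reads an attribute that one of the parsers never registers.

```python
import argparse
from typing import List


def build_neuronx_style_parser() -> argparse.ArgumentParser:
    # Hypothetical stand-in for a parser that registers the extra options.
    parser = argparse.ArgumentParser()
    parser.add_argument("--model", required=True)
    parser.add_argument("--library_name", default=None)
    parser.add_argument("--subfolder", default="")
    return parser


def build_neuron_style_parser() -> argparse.ArgumentParser:
    # Hypothetical stand-in for a parser that omits those options.
    parser = argparse.ArgumentParser()
    parser.add_argument("--model", required=True)
    return parser


def export_model(args: argparse.Namespace) -> str:
    # Hypothetical shared code path: it reads attributes that exist only
    # if the parser registered the corresponding options.
    return (
        f"model={args.model} library={args.library_name} "
        f"subfolder={args.subfolder!r}"
    )


def try_export(parser: argparse.ArgumentParser, argv: List[str]) -> str:
    # Parse the arguments and report an AttributeError instead of crashing.
    args = parser.parse_args(argv)
    try:
        return export_model(args)
    except AttributeError as exc:
        return f"failed: {exc}"


if __name__ == "__main__":
    argv = ["--model", "bert-base-uncased"]
    print(try_export(build_neuronx_style_parser(), argv))
    print(try_export(build_neuron_style_parser(), argv))
```

In this sketch the first call succeeds, while the second fails because the parsed namespace has no library_name attribute. Registering the same options in both parsers, as the modification to neuron.py described above does, removes the mismatch.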