Replies: 4 comments
-
llama.cpp does support it, but the main code does not yet work with ALiBi.
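For context, ALiBi (Attention with Linear Biases) replaces positional embeddings with a per-head linear penalty on attention scores that grows with key-query distance. A minimal NumPy sketch of the idea (hypothetical illustration of the scheme, not code from llama.cpp):

```python
import numpy as np

def alibi_bias(n_heads: int, seq_len: int) -> np.ndarray:
    """Per-head linear distance penalties added to attention scores.

    Illustrative sketch of the ALiBi scheme; not llama.cpp's implementation.
    """
    # Head-specific slopes form a geometric sequence: 2^(-8/n), 2^(-16/n), ...
    slopes = [2 ** (-8 * (h + 1) / n_heads) for h in range(n_heads)]
    # Bias is slope * (j - i): zero on the diagonal, increasingly
    # negative for tokens further in the past.
    pos = np.arange(seq_len)
    dist = pos[None, :] - pos[:, None]           # j - i
    dist = np.minimum(dist, 0)                   # causal: past tokens only
    return np.stack([m * dist for m in slopes])  # (n_heads, seq, seq)

bias = alibi_bias(n_heads=4, seq_len=5)
```

The returned tensor is added to the raw attention logits before the softmax, which is why a backend needs explicit support for it.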
-
There is a $2,000 bounty for CPU inference support for Refact LLM: smallcloudai/refact#77
-
Tracking issue: #3061
-
Reddit announcement: https://www.reddit.com/r/LocalLLaMA/comments/169yonh/we_trained_a_new_16b_parameters_code_model_that/
Blog: https://refact.ai/blog/2023/introducing-refact-code-llm/
Code: https://github.com/smallcloudai/refact/
Model: https://huggingface.co/smallcloudai/Refact-1_6B-fim
Do I understand correctly that this model cannot be used in llama.cpp yet, since there is no support for Multi-Query Attention?
Is this the only blocker?
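For reference, Multi-Query Attention differs from standard multi-head attention only in that all query heads share a single key/value head, which shrinks the KV cache. A minimal NumPy sketch of that sharing (a hypothetical illustration, not llama.cpp code):

```python
import numpy as np

def multi_query_attention(q, k, v):
    """Multi-Query Attention: many query heads, one shared K/V head.

    Illustrative sketch only.
    q:    (n_heads, seq, d)  per-head queries
    k, v: (seq, d)           the single shared key/value head
    """
    d = q.shape[-1]
    # Every query head attends over the same shared keys and values.
    scores = q @ k.T / np.sqrt(d)                          # (n_heads, seq, seq)
    scores = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = scores / scores.sum(axis=-1, keepdims=True)  # softmax per row
    return weights @ v                                     # (n_heads, seq, d)

rng = np.random.default_rng(0)
out = multi_query_attention(
    q=rng.random((4, 5, 8)), k=rng.random((5, 8)), v=rng.random((5, 8))
)
```

Because K and V have no head dimension, the KV cache is a factor of `n_heads` smaller than in standard multi-head attention, which is the main appeal for inference.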