llama : add Refact support #3061

ggerganov · 2023-09-07T13:54:25Z

This is a new 1.6B code model: https://huggingface.co/smallcloudai/Refact-1_6B-fim

We should look into adding support into llama.cpp similar as to how we did for Falcon and Baichuan:

I haven't looked into the architecture yet, but I'm hoping it is similar to GPT, which we already know how to handle OK thanks to the Falcon experience. I see it also uses Alibi which we should have support for, but we haven't tested extensively, so there might be issues there.

The text was updated successfully, but these errors were encountered:

Jipok · 2023-09-09T13:31:39Z

Related: #3013

discobot · 2023-09-11T21:18:04Z

Let me attempt this :)

ggerganov added help wanted Extra attention is needed good first issue Good for newcomers model Model specific labels Sep 7, 2023

ggerganov added this to ggml : roadmap Sep 7, 2023

ggerganov moved this to Todo in ggml : roadmap Sep 7, 2023

olegklimov mentioned this issue Sep 21, 2023

[bounty] CPU inference support, Mac M1/M2 inference support smallcloudai/refact#77

Open

ds5t5 mentioned this issue Sep 25, 2023

add refact model #3329

Merged

ggerganov closed this as completed in #3329 Oct 4, 2023

ggerganov moved this from Todo to Done in ggml : roadmap Oct 4, 2023

coder543 mentioned this issue Jan 25, 2024

Add Refact model ollama/ollama#468

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : add Refact support #3061

llama : add Refact support #3061

ggerganov commented Sep 7, 2023

Jipok commented Sep 9, 2023

discobot commented Sep 11, 2023

llama : add Refact support #3061

llama : add Refact support #3061

Comments

ggerganov commented Sep 7, 2023

Jipok commented Sep 9, 2023

discobot commented Sep 11, 2023