Summarization Task not working as expected #377
Unanswered
jpborgesmoura
asked this question in
Q&A
-
Hi @jpborgesmoura!
Yes, that setting affects the spaCy pipeline per se, but doesn't change the prompt. It might be useful to automatically append this setting to the prompt, but there are also downsides to it. We'll consider it.
LLMs tend to pick up on the language of the prompt automatically if they were trained on multilingual data. In the case of Dolly v2, the model was trained on English data only (see here). I recommend you
-
Hello everyone,
I'm a newcomer to the artificial intelligence field, running some tests and research with the spacy-llm tool.
I need to perform text summarization and, for my experiments, I'm using the open-source Dolly model in its 3b version.
I have a reasonably good understanding of how it works, but I'm running into trouble getting the model to summarize my text under the restrictions I need: summarize texts written in Brazilian Portuguese, keep the summary in that language, and use a maximum of 30 words.
For this purpose, I'm setting in my config.cfg file, among other things, the attributes 'max_n_words' and 'template', where I use a slightly adjusted version of the default template summarization.v1.jinja that explicitly requires the final result to be in Brazilian Portuguese (setting the language to 'pt' in the config file alone doesn't seem to be enough).
Here's what my config.cfg file looks like:
In my template, I updated the prompt as follows: "You are an expert summarization system. Your task is to accept Text as input and summarize the Text in a concise way. The summary must always be written in Brazilian Portuguese."
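For readers following along, a setup like the one described might look roughly like the sketch below. It is not the original file. It assumes the "spacy.Summarization.v1" task, the "spacy.Dolly.v1" model and the "spacy.FileReader.v1" reader from the spacy-llm registry, and "my_summarization.jinja" is a hypothetical path to the adjusted template. The config text is held in a Python string and read with the standard library only to show which values are being set.

```python
import configparser

# Illustrative spacy-llm config, NOT the poster's original file.
# Assumptions: the registry names below come from spacy-llm;
# "my_summarization.jinja" is a hypothetical path to the adjusted template.
CONFIG_TEXT = """
[nlp]
lang = "pt"
pipeline = ["llm"]

[components]

[components.llm]
factory = "llm"

[components.llm.task]
@llm_tasks = "spacy.Summarization.v1"
max_n_words = 30

[components.llm.task.template]
@misc = "spacy.FileReader.v1"
path = "my_summarization.jinja"

[components.llm.model]
@llm_models = "spacy.Dolly.v1"
name = "dolly-v2-3b"
"""

# spaCy configs use an INI-like syntax, so configparser can read the raw
# values for a quick sanity check without spaCy being installed.
parser = configparser.ConfigParser(interpolation=None)
parser.read_string(CONFIG_TEXT)

task = parser["components.llm.task"]
print("task:", task["@llm_tasks"])                # raw value, quotes included
print("max_n_words:", int(task["max_n_words"]))   # word limit passed to the prompt
print("lang:", parser["nlp"]["lang"])             # pipeline language, not the prompt
```

With spacy-llm installed, the same text saved as config.cfg could presumably be loaded with `spacy_llm.util.assemble("config.cfg")`. Note that `lang = "pt"` only configures the spaCy pipeline; the language instruction has to live in the template itself.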
However, when the model runs on my text, the results don't always come back in Portuguese (sometimes they're in English), and the number of words exceeds the maximum I set. For the 'max_n_words' attribute, I've read in the API docs that 'this should not expected to work exactly', but in my case it seems to be ignored completely.
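Since the word limit and the language are only requests in the prompt, one possible workaround is to check the model output after the fact. The sketch below is a minimal illustration under stated assumptions, not part of spacy-llm: the helper names and the small hint-word lists are hypothetical, and the language check is a crude heuristic rather than real language detection.

```python
import re

# Hypothetical hint words for a crude language guess; not from spacy-llm.
PT_HINTS = {"de", "que", "não", "uma", "para", "com", "os", "em", "é", "da"}
EN_HINTS = {"the", "and", "of", "to", "is", "in", "that", "for", "with", "it"}


def count_words(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def truncate_to_max_words(text: str, max_n_words: int) -> str:
    """Hard-cut a summary to at most max_n_words words."""
    words = text.split()
    if len(words) <= max_n_words:
        return text.strip()
    return " ".join(words[:max_n_words])


def looks_portuguese(text: str) -> bool:
    """Return True if Portuguese hint words outnumber English ones."""
    tokens = re.findall(r"\w+", text.lower())
    pt_hits = sum(token in PT_HINTS for token in tokens)
    en_hits = sum(token in EN_HINTS for token in tokens)
    return pt_hits > en_hits


summary = "O modelo resume textos em português e não deve passar de trinta palavras"
if not looks_portuguese(summary) or count_words(summary) > 30:
    summary = truncate_to_max_words(summary, 30)
print(count_words(summary), looks_portuguese(summary))
```

A hard cut can end a summary mid-sentence, so in practice it may be preferable to rerun the prompt when a check fails and keep truncation as a last resort.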
Here's the code for my experiment:
I don't know if I'm missing something, so I'd appreciate any help you can offer. Thank you in advance for your attention, and I apologize if this is a foolish question or if I made mistakes in my English.
Best regards,
João Paulo