Generator streamlining, docs #682

leondz · 2024-05-14T09:58:26Z

Added generator docs

Made streamlining changes to generators.base.Generator logic

Resolves #662

…nerate() return would/would not be filtered for None that was predicated on whether a model supported multiple outputs

jmartin-tech

👍

My question may already be answered somewhere, it does not change that current expectations are as documented.

jmartin-tech · 2024-05-14T17:12:51Z

docs/source/garak.generators.base.rst

+  #. Otherwise, we need to assemble the outputs over multiple calls. There are two options here.
+    #. Is garak running with ``parallel_attempts > 1`` configured? In that case, start a multiprocessing pool with as many workers as the value of ``parallel_attempts``, and have each one of these work on building the required number of generations, in any order.
+    #. Otherwise, call ``_call_model()`` repeatedly to collect the requested number of generations.
+  #. Strip ``None`` responses from the outputs, and return the resulting list of prompt responses.


This is the current practice, however I am interested in the reasoning for this. Is there value in knowing that call returned None, in theory we can determine this happened from inspection of configuration, however I wonder if a None response might be valuable in some probes patterns. Consider if a model is setup to mitigate certain responses by simply closing the connection. In practice this is not a good pattern as a 200 response with a rejection is the common response, yet I can akin it to the idea that if you call someone to ask a question they might simply hang up the phone.

This was a great time to ask this question. None is intended to indicate that generation failed, so would definitely be good for the situation you mention. Perhaps then generate() should not filter out Nones at all. But also - perhaps when a model fails when given a blank prompt, it should return None, instead of exception handling and a string. I think that we're in a better place if any str-typed output from a generator is an indication that the model successfully output a string, and that this output is the string component of the output. Wdyt?

cf.: #689 (comment)

leondz added 3 commits May 14, 2024 11:55

streamline logic at top of generate(): remove an instability where ge…

55f518c

…nerate() return would/would not be filtered for None that was predicated on whether a model supported multiple outputs

start doc intro softer

1ba91df

describe generate() flow and generator structure

ca2ef0b

leondz added documentation Improvements or additions to documentation generators Interfaces with LLMs labels May 14, 2024

leondz requested a review from jmartin-tech May 14, 2024 09:58

jmartin-tech approved these changes May 17, 2024

View reviewed changes

leondz mentioned this pull request May 21, 2024

stablize openai parallel #689

Merged

jmartin-tech mentioned this pull request May 23, 2024

Set generator _call_model() and _generate() type hints; amend functions #694

Merged

rm bit about None-stripping

406a3e2

leondz merged commit b4c8c81 into main May 27, 2024
3 checks passed

github-actions bot locked and limited conversation to collaborators May 27, 2024

leondz deleted the docs/generator branch May 29, 2024 07:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generator streamlining, docs #682

Generator streamlining, docs #682

leondz commented May 14, 2024

jmartin-tech left a comment

jmartin-tech May 14, 2024

leondz May 21, 2024

Generator streamlining, docs #682

Generator streamlining, docs #682

Conversation

leondz commented May 14, 2024

jmartin-tech left a comment

Choose a reason for hiding this comment

jmartin-tech May 14, 2024

Choose a reason for hiding this comment

leondz May 21, 2024

Choose a reason for hiding this comment