# Custom output formatter schema

This document provides the schema of the output formatter, with which you can write your own custom output formatter.

## Signature of your own output_formatter

To write your own custom output formatter, follow the signature below:

```python
from djl_python.output_formatter import RequestOutput


def custom_output_formatter(request_output: RequestOutput) -> str:
    # your implementation here
    ...
```

## RequestOutput schema

The RequestOutput class is designed to encapsulate the output of a request in a structured format. Here is an in-depth look at its structure and the related classes:

```mermaid
classDiagram
    RequestOutput <|-- TextGenerationOutput
    RequestOutput *-- RequestInput
    TextGenerationOutput "1" --> "1..*" Sequence
    Sequence "1" --> "1..*" Token
    RequestInput <|-- TextInput
    class RequestOutput{
        +int request_id
        +bool finished
        +RequestInput input
    }
    class TextGenerationOutput{
        +map sequences
        +int best_sequence_index
        +list[Token] prompt_tokens_details
    }
    class Sequence{
        +list[Token] tokens
        +float cumulative_log_prob
        +string finish_reason
        +has_next_token()
        +get_next_token()
    }
    class RequestInput{
        +int request_id
        +dict parameters
        +Union[str, Callable] output_formatter
    }
    class TextInput{
        +str input_text
        +list[int] input_ids
        +any adapters
        +any tokenizer
    }
    class Token{
        +int id
        +string text
        +float log_prob
        +bool special_token
        +as_dict()
    }
```
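
For orientation, here is a minimal sketch of how these classes relate; it is not part of the library. The helper name inspect_request_output and the printed labels are illustrative, it only touches attributes shown in the diagram above, and it assumes the request input is a TextInput (the text generation case):

```python
from djl_python.output_formatter import TextGenerationOutput


def inspect_request_output(request_output: TextGenerationOutput) -> None:
    """Hypothetical helper: walk the schema shown in the diagram above."""
    # RequestOutput carries the original request input; for text generation this is
    # assumed to be a TextInput, which adds input_text and input_ids on top of RequestInput.
    print("request_id:", request_output.request_id)
    print("prompt:", request_output.input.input_text)
    print("parameters:", request_output.input.parameters)

    # TextGenerationOutput keeps its sequences in a map, and best_sequence_index
    # points at the sequence with the highest probability.
    best_sequence = request_output.sequences[request_output.best_sequence_index]
    print("finish_reason:", best_sequence.finish_reason)
    print("cumulative_log_prob:", best_sequence.cumulative_log_prob)
```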

### Detailed Description

- **RequestOutput**: The main class that encapsulates the output of a request.
- **TextGenerationOutput**: This subclass of RequestOutput is specific to text generation tasks, which are currently the only tasks that support a custom output formatter. Each text generation task can generate multiple sequences.
  - best_sequence_index: the index of the best sequence, i.e. the one with the highest cumulative log probability. Use this index when looking up the output sequence.
  - Note that only one sequence is generated at the moment; generating multiple sequences will be supported in a future release.
- **Sequence**: Represents a sequence of generated tokens and its details.
  - The has_next_token() and get_next_token() methods work like an iterator; in iterative generation, each step produces a single token.
  - get_next_token() advances the iterator to the next token and returns a Token instance along with flags indicating whether it is the first token (first_token) and whether it is the last token (last_token). See the sketch after this list for how these pieces fit together.
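
To make the iterator behaviour concrete, here is a minimal sketch. The helper name collect_generated_text is hypothetical; it relies only on the Sequence and Token attributes documented above:

```python
from djl_python.output_formatter import TextGenerationOutput


def collect_generated_text(request_output: TextGenerationOutput) -> str:
    """Hypothetical helper: drain the tokens currently available on the best sequence."""
    best_sequence = request_output.sequences[request_output.best_sequence_index]
    pieces = []
    # has_next_token()/get_next_token() behave like an iterator over generated tokens;
    # get_next_token() also reports whether the token is the first or the last one.
    while best_sequence.has_next_token():
        token, first_token, last_token = best_sequence.get_next_token()
        pieces.append(token.text)
        if last_token:
            break
    return "".join(pieces)
```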

## Example

Here is an example of a custom output formatter:

```python
from djl_python.output_formatter import TextGenerationOutput
import json


def custom_output_formatter(request_output: TextGenerationOutput):
    """
    Replace this function with your custom output formatter.

    Args:
        request_output (TextGenerationOutput): The request output

    Returns:
        (str): Response string
    """
    best_sequence = request_output.sequences[request_output.best_sequence_index]
    next_token, first_token, last_token = best_sequence.get_next_token()
    result = {
        "token_id": next_token.id,
        "token_text": next_token.text,
        "token_log_prob": next_token.log_prob,
    }
    if last_token:
        result["finish_reason"] = best_sequence.finish_reason
    return json.dumps(result) + "\n"
```
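
Since each call to get_next_token() returns a single newly generated token, the example above emits one JSON line per token. A formatter is not required to produce JSON; as a variant, a minimal sketch (the name custom_text_only_formatter is hypothetical) that streams only the raw token text could look like this:

```python
from djl_python.output_formatter import TextGenerationOutput


def custom_text_only_formatter(request_output: TextGenerationOutput) -> str:
    """Hypothetical variant: return the plain text of the newly generated token."""
    best_sequence = request_output.sequences[request_output.best_sequence_index]
    next_token, first_token, last_token = best_sequence.get_next_token()
    # Return just the token text; no finish_reason or other metadata is attached.
    return next_token.text
```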