LLMS such as OpenAI GPT are notoriously bad in consistently generating well-formed datastructures, even as simple as JSON. LLMJson aims to make your life a lot easier
This library is based on the TinyJson library. While it is great, it comes with the same shortcomings and LLM-JSON and adds a it's own:
- Limited to parsing <2GB JSON files add most
- It will not parse abstract classes or interfaces
- It's slow: it might very well be the slowest JSON library in the world.
However, all that is less relevant for the intended use of this library, namely sharing data structures with an LLM in an understandable manner and interpreting data structures returned by the LLM.
- LLMJson can serialize its values, but it can also generate descriptions in a way that LLMs understand how to fill in the fields
- It is robust against leading and trailing text, markup errors, comments and missing or superfluous fields
- It is very good at interpreting field values that are non standard.
- It allows special property wrappers that allow, at runtime, adding descriptions, hiding fields or making them immutable, or provide custom parsers
As an example, let's suppose we are building a roleplaying game where the character stats are continuously being updated by the LLM based on how a story unfolds. For this to work, we would this case we would serialize a persona including descriptions of the fields and sends it to OpenAI to update the persona. Next, it receives the result and extracts and deserializes the Json. finally the received object is shown as JSON to confirm its correctness.
Let's see what that would look like:
In our program we have a class that defines the statem that is being updated in the back-and-forth with the LLM
{
public enum Sex { Male, Female, Other}
public class Person
{
[DescriptionAttribute("Firstname only")]
public string Name { get; set; }
[DescriptionAttribute("Age of person. Range between 0 and 100")]
public int Age { get; set; }
public Sex Sex { get; set; }
public JsonProp<int> Iq { get; set; }
public DateTime Birthday { get; set; }
[DescriptionAttribute("A list of character traits, e.g. [\"optimistic\", \"smart\"]")]
public List<string> Traits { get; set; }
public JsonProp<Dictionary<string, int>> Stats { get; set; } = new(new Dictionary<string, int>(),
"A dictionary of character statistics with a percentage between 0 and 100, e.g. {{ \"quick thinking\", 100}}");
}
}
We will see that Description
attributes will add comments to json that we generate. JsonProp<...>
is a special kind of property especially useful for dynamic usage, we will get back to that object later.
Next, we create a prompt for the LLM, requesting it to update the model, adding the model with descriptions
var personJson = person.ToJson(OutputModes.ValueAndDescription);
var myPrompt = "Ideate and update at least two elements of this persona, " +
"make sure all aspects of the persona are consistent. \n\n " +
JsonWriter.GetPrompt() + " \n\n " +
personJson;
In the code sample above, person.ToJson(OutputModes.ValueAndDescription)
serialized the object in one of 3 types of JSON, this one particularly suited for updating already filled structures
The call JsonWriter.GetPrompt()
returns a prompt that has shown to work well priming LLMs to update the structure and very often return valid JSON.
Together the prompt becomes
Ideate and update at least two elements of this persona, make sure all aspects of the persona are consistent.
Update the data in the described format below. Comment are added for your understanding, remove in updated response. provide a RFC8259 compliant JSON response, following this format without deviation, but values may be changed, lists and dictionaries may change in length. Add additional explanation of the changes made after the json structure.
{
"Name" : "Nigel Thornberry" \\ Firstname only. Is of type string
,"Age" : 47 \\ Age of person. Range between 0 and 100. Is of type 32-bit integer
,"Sex" : "Male" \\ Is of type enum. Possible values are Male,Female,Other
,"Iq" : 130 \\ Intelligence coefficient. Is of type 32-bit integer
,"Birthday" : "09/16/2023 00:00:00" \\ Is of type date and time in format dddd, dd MMMM yyyy HH:mm:ss
,"Traits" : ["sneaky" ,"funny"] \\ A list of character traits, e.g. ["optimistic", "smart"]. Is of type List, items are of type string
,"Stats" : {"bravery":80 ,"nimbleness":70} \\ A dictionary of character statistics with a percentage between 0 and 100, e.g. {{"bravery", 100}, { "quick thinking", 100}}. Is of type Dictionary . The key is of type string, the value of 32-bit integer
}
Note all the comments added to the JSON structure. This makes the JSON invalid according the standard, but it optimizes the understanding of the LLM. It adds not only the added comments, but also field types and even the valid enums.
sent to an LLM, it will an answer similar to the following
Here is the updated persona
{
"Name" : "Nigel Thornberry",
"Age" : "fourty seven",
"Sex" : "Male",
"Iq" : "130",
"Birthday" : "09/16/1974 00:00:00",
"Traits" : ["adventurous", "charismatic", "outgoing"],
"Stats" : {
"bravery": 80,
"nimbleness": 70,
"empathy": 60
}
}
In this updated persona, we have added two new elements and modified one existing element.
Firstly, we have updated the "Birthday" to reflect a more accurate date of birth for the persona, changing it from 2023 to 1974 to ensure the age of 47 makes sense.
Secondly, we have added two new traits to the persona; "adventurous" and "charismatic", which align with the existing trait of "outgoing".
Finally, we have added a new statistic to the existing "Stats" dictionary, which is "empathy" with a value of 60 out of 100. This helps to provide a more well-rounded understanding of the persona's character attributes.
Note that this answer is, in fact, not valid Json:
- it has text before and after the json struct
- Age is described in words instead of a number
- IQ is between quotation marks
Still, we are going to hand it over to the JSON Deserializer as is:
llmResult.FromJson<Person>();
The Deserializer is so robust, that it is able to fix the non-valid JSON and interpret the non-standard values.
Let's look at the serialization in a bit more detail:
Serialization can be done either throughJsonWriter.ToJson(person, outputMode)
or person.ToJson(outputMode)
where outputMode has 3 distinct modes:
OutputModes.Value
outputs normal, well-formed JSON as a normal serializer
{
"Name" : "Nigel Thornberry",
"Age" : 47,
"Sex" : "Male",
"Iq" : 130,
"Birthday" : "09/16/2023 00:00:00",
"Traits" : ["sneaky", "funny"],
"Stats" : { "bravery": 80, "nimbleness": 70 }
}
OutputModes.Description
the JSON structure together with explanation how to fill in the fields, but without any values. This is usefull when requesting the LLM to fill in a JSON structure with all new values:
When outputted as descriptions it will give back the following
{
"Name" : "Firstname only. Is of type string",
"Age" : "Age of person. Range between 0 and 100. Is of type 32-bit integer",
"Sex" : "Is of type enum. Possible values are Male,Female,Other",
"Iq" : "Intelligence coefficient. Is of type 32-bit integer",
"Birthday" : "date and time in format dddd, dd MMMM yyyy HH:mm:ss",
"Traits" : [] \\ A list of character traits, e.g.["optimistic", "smart"].Is of type List, items are of type string,
"Stats" : {} \\ A dictionary of character statistics with a percentage between 0 and 100, e.g. {{"bravery",100}, {"quick thinking",100}}.Is of type Dictionary. The key is of type string, the value of is type string
}
Note that this is not a JSON schema nor valid JSON, but a pseudo-JSON that LLMs understand typically very well. Types are automatically extracted from the class definition, and comments are added through attributes.
OutputModes.ValueAndDescription
combines both description and values as shown in the example.
Now, if you want to share with an LLM both values and descriptions, you will get
{
"Name": "Nigel Thornberry" \\ Firstname only.Is of type string,
"Age": 47 \\ Age of person.Range between 0 and 100. Is of type 32 - bit integer,
...
}
Again, this invalid JSON, but structured such that LLMs understand it very well.
Note that based on the different formats, JsonWriter.GetPrompt()
returns different prompts.
Still not satisfied? You can also tune your own format for optimal data transfer with your LLM:
var personJson = JsonWriter.ToJson(
person,
OutputModes.Custom,
(value, type, description) => $"{value}{" \\\\ ".IfBothNotEmpty(description.IfBothNotEmpty(". ") +
"The field is of type ".IfBothNotEmpty(type))}\n"
);
where IfBothNotEmpty
is a helper function that shows both strings if both not empty, otherwise it shows none.
For more flexibility you can wrap class properties in the special 'JsonProp' property. So why is this useful?
Let's stay at our example of a role playing game. If you want to have an LLM fill in a larger JSON model, it often pays of to do that in multiple passes, especially if additional guidance is needed for different areas, as the attention of the LLM is limited.
This means that in the first pass you may not bother the LLM with Character traits:
person.Stats.Visible = false
Now, the serialized JSON will not include the character traits dictionary. Let's say that in the first pass, the character that was created was an Or. Now, for the second pass, you enable the stats again, and you update the comments to suite the character
person.Stats.Description = "A dictionary of Orc characteristics with a percentage between 0 and 100, e.g. {{\"strength\", 100}, { \"sense of smell\", 100}}"
Serialization can be done either throughJsonParser.FromJson<Person>(PersonJson,basePerson)
or PersonJson.FromJson<Person>(basePerson)
, where the object basePerson
is the object that will be updated with the values from JSON. This can be new Person()
if want just the values from JSON.
LLMJSON uses JSONRepairSharp to pre-parse the JSON Response coming from the LLM to remove multiple types of mistakes:
- It is robust against leading and trailing text, as LLMs tend to add introductions and explanations
- It is robust against missing fields and additional fields and comments
- It is (often) robust against errors such as missing quotes, missing escape characters, missing commas, missing closing brackets and more
You can turn this feature off by JsonParser.UseRepair = false
but there does not seem to be any reason to do so.
Another source of mistakes is that fields are filled with incompatible values. For example
{
"BoolField" : "true", // true should not be between quotes
"IntField" : 455.7, // Int value should not have digits
"stringField : 10 // string field should not be between quotes
}
and so on. The LLMJSON parser is able to resolve these and other malformatted fields. Sometimes LLMs may give values back in even more non-standard responses
{
"OrdinalField" : "eleventh",
"FloatField" : "eight point six",
"DateTimeField": "8:00pm 5 jan 2021"
}
Using the Microsoft Recognizer library, these values can still be parsed, when the normal parser fails. Be aware that this can be very, very slow. So, if speed is more important than robustness, turn off the recognizers using JsonParser.UseRecognizer = false
Also in Deserializing JsonProp may come in useful. Let's consider a second pass over the datamodel again. Suppose you want the LLM to know about a property, but you do not want it to be updated. In this case you can say
person.IQ.Immutable = true
This means that this field of the baseObject will not be overwritten with the value of the JsonFile.
It also allows for injecting your own parsers. Look at this example for a temperature parsing property
public class TemperatureProp : JsonProp<float>
{
public TemperatureProp(float value, bool visible = true, bool immutable = false) :
base(value, "Temperature in Celcius", visible, immutable,
rawValue =>
{
// Start with normal Parsing. PrepString does some basic cleaning
var stringValue = SafeParseUtils.PrepString(rawValue).Trim('C');
// First try to directly interpret as number
var success = float.TryParse(stringValue, out float floatValue);
// Return if a success
if (success) return new Tuple<float, bool>(floatValue, success);
// Not successful? Let's try a Microsoft Recognizer
stringValue = SafeParseUtils.RecognizerResultToValue(NumberWithUnitRecognizer.RecognizeTemperature(rawValue, Culture.English));
// Return whether a success or not
success = float.TryParse(stringValue, out floatValue);
return new Tuple<float, bool>(floatValue, success);
})
{}
}