
Fun: What to do about duplicated fields? #11

Open
robojumper opened this issue May 30, 2019 · 2 comments
Labels
impl-java Pertaining to the Java implementation

Comments

@robojumper
Owner

robojumper commented May 30, 2019

```java
// Whether this File has duplicate fields that will get lost when converting
// to string. This doesn't seem to be causing any issues, but is important
// for test coverage, because files with duplicate fields will re-encode to
// a different size.
public boolean hasDuplicateFields() {
    Set<String> fields = new HashSet<>();
    for (int i = 0; i < rootFields.size(); i++) {
        if (!fields.add(rootFields.get(i).name)) {
            return true;
        }
        if (hasDuplicateFields(rootFields.get(i))) {
            return true;
        }
    }
    return false;
}

private boolean hasDuplicateFields(DsonField field) {
    Set<String> fields = new HashSet<>();
    if (field.type == FieldType.TYPE_OBJECT) {
        for (int i = 0; i < field.children.length; i++) {
            if (!fields.add(field.children[i].name)) {
                return true;
            }
            if (hasDuplicateFields(field.children[i])) {
                return true;
            }
        }
    }
    return false;
}
```

```java
// Files with duplicate fields will not have the same size anyway.
// Filter them out here.
if (!dupeFieldFiles.contains(i)) {
    assertEquals(reEncodedFiles.get(i).length, files.get(i).length,
            fileList.get(i) + " encodes to different number of bytes");
}
```

The decoder eats duplicated fields.

This was an issue before; I simply excluded the failing file name from the byte size equality tests. In #9, a new duplicated field turned up; excluding that entire file name would regress test coverage. Files therefore now have a method that can be used to check whether discrepancies between the original file size and the re-encoded file size are expected. This is still not ideal.

Here's where the "Fun:" part comes in: One could find a reasonable upper bound for allowed differences, as in "we know field X is duplicated with name of length Y and data size of Z -- the file is allowed to have as many as 12+16+Y+Z+3 more bytes" (12+16 for header, 3 for alignment).
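That bound could be sketched as a small helper. Note that `allowedExtraBytes` and `sizeWithinBound` are hypothetical names, and the split of the 12 and 16 into specific header structures is an assumption taken from the estimate above:

```java
// Sketch of the upper-bound estimate above: a duplicated field with a name of
// length nameLen and data of size dataLen may account for at most
// 12 + 16 (header entries, per the estimate) + nameLen + dataLen
// + 3 (alignment padding) extra bytes in the original file.
final class DuplicateFieldBound {
    // Assumption: 12 and 16 are the per-field header entry costs from the
    // comment above; 3 is the maximum alignment padding.
    private static final int HEADER_ENTRY_COST = 12 + 16;
    private static final int MAX_ALIGNMENT_PADDING = 3;

    static int allowedExtraBytes(int nameLen, int dataLen) {
        return HEADER_ENTRY_COST + nameLen + dataLen + MAX_ALIGNMENT_PADDING;
    }

    // The strict equality assert could then become a bounded check:
    // re-encoding drops duplicates, so the original may only be larger.
    static boolean sizeWithinBound(int originalLen, int reEncodedLen, int allowedExtra) {
        int diff = originalLen - reEncodedLen;
        return diff >= 0 && diff <= allowedExtra;
    }
}
```

For example, a duplicated field with a 5-byte name and 10 bytes of data would allow up to 46 extra bytes.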

One could take this even further and derive a number of allowed differing bytes for files without duplicated fields: assuming that one or two bytes per Meta2 block contain garbage bits and that the header contains a bunch of garbage, we could have a more fine-grained test.
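A fine-grained test of that shape might look like the following sketch. The per-block and header allowances are placeholder estimates taken from the paragraph above, not measured values, and the helper names are hypothetical:

```java
// Sketch: tolerance for byte-level differences between an original file and
// its re-encoding, for files *without* duplicated fields. Allows up to
// GARBAGE_PER_META2_BLOCK differing bytes per Meta2 block plus a fixed
// allowance for garbage bits in the header. Both constants are placeholder
// estimates from the discussion above, not measured values.
final class ReEncodeTolerance {
    private static final int GARBAGE_PER_META2_BLOCK = 2;
    private static final int HEADER_GARBAGE_ALLOWANCE = 8; // placeholder

    static int maxDifferingBytes(int numMeta2Blocks) {
        return HEADER_GARBAGE_ALLOWANCE + GARBAGE_PER_META2_BLOCK * numMeta2Blocks;
    }

    static int countDifferingBytes(byte[] original, byte[] reEncoded) {
        if (original.length != reEncoded.length) {
            // Size mismatch: count every byte as differing.
            return Math.max(original.length, reEncoded.length);
        }
        int diff = 0;
        for (int i = 0; i < original.length; i++) {
            if (original[i] != reEncoded[i]) {
                diff++;
            }
        }
        return diff;
    }
}
```

A test would then assert `countDifferingBytes(original, reEncoded) <= maxDifferingBytes(numMeta2Blocks)` instead of strict equality.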

@robojumper robojumper added the impl-java Pertaining to the Java implementation label Mar 22, 2020
@thanhnguyen2187

Hi there. Any update on this? I am also trying to write my own version of Darkest Dungeon Save Editor, and encountered this issue in persist.progression.json. It was something like this, where slay_a_squiffy_with_jester gets mentioned twice as a field. Is it safe to ignore the duplicated data?

```json
{
  "__revision_dont_touch": 1683488768,
  "base_root": {
    "version": 2,
    "dungeon": {
      ...
    },
    "completed_plot_quests_data": {
      ...
    },
    ...
    "achievements": {
      ...
    },
    "real_achievements": {
      ...
      "slay_a_squiffy_with_jester": {
        "rtti": 1935132924,
        "id": "slay_a_squiffy_with_jester",
        "completed": false,
        "awarded": false,
        "conditions": {
          "0": {
            "enemies_killed": 0
          },
          "1": {
            "enemies_killed": 0
          }
        }
      },
      ...
    },
    "infestation": {
      ...
    },
    "flashback_completion_counts": {
      ...
    }
  }
}
```

By the way, thanks for the awesome tool and documentation!

@robojumper
Owner Author

I can't give you an authoritative answer on how to deal with these. I'm not aware of any issues caused by dropping these duplicates, but the scope of this project was the binary encoding of the save data, not the actual semantics of the data, so there might be lots of issues I don't know about.

I'm glad you're finding this project useful though!
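For what it's worth, dropping duplicates could be sketched as a first-wins pass over the field tree, mirroring the HashSet-based detection in hasDuplicateFields above. The DsonField stand-in here is minimal and hypothetical, and whether first-wins matches the game's own behavior is unverified:

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Minimal stand-in for a field node, mirroring the snippet above.
final class DsonField {
    final String name;
    final List<DsonField> children = new ArrayList<>();
    DsonField(String name) { this.name = name; }
}

final class Dedup {
    // Drops duplicated field names, keeping the first occurrence.
    // Each object scopes its own names, so recurse per child list.
    static void dropDuplicates(List<DsonField> fields) {
        Set<String> seen = new HashSet<>();
        fields.removeIf(f -> !seen.add(f.name));
        for (DsonField f : fields) {
            dropDuplicates(f.children);
        }
    }
}
```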
