Add parsable grid data output #52

Open
wants to merge 30 commits into base: develop

Conversation

spencerharmon
Contributor

The purpose of this patch is to add an option to output the grid state in JSON format each epoch, alongside the .mfs save files. This output includes a string with data member names and values.
I've built a bit of tooling around this patch, and I'm convinced it's a workable solution for easily analyzing logical (non/extra-spatial?) relationships between atoms, and changes to systems of atoms within the MFM, over time.
There are some improvements that could be made to this feature:

1. It doesn't output base-layer state, which would be nice to have.
2. Data member names and values are output as a non-JSON string that requires further parsing. I've written a Python parser, mfm-griddata-parser, to convert this into standard JSON.
3. Very long strings representing data member names and values are truncated, and some data is lost.
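
For orientation, here is a rough sketch of the per-epoch dump this patch performs: walk the grid and, for each non-empty site, emit one record built from the element and the atom. Only GetWritableAtom, LookupElement, and AsUlamElement come from the patch itself (they appear in the review snippets below); the loop bounds, the skip test, and the EmitSiteJson helper are stand-ins for illustration.

// Sketch only -- not the literal patch code. Walks every site once and emits a
// JSON record per non-empty atom. FileByteSink, SPoint, and u32 are MFM types;
// the size accessors and EmitSiteJson are assumptions made for this sketch.
template <class GC>
static void DumpGridJson(Grid<GC>& grid, FileByteSink& fs)
{
  typedef typename GC::EVENT_CONFIG EC;
  typedef typename Grid<GC>::T T;                        // the grid's atom type

  fs.Printf("{\"event_layer_atoms\":[");
  bool first = true;
  for (u32 y = 0; y < grid.GetHeightSites(); ++y) {      // assumed accessor
    for (u32 x = 0; x < grid.GetWidthSites(); ++x) {     // assumed accessor
      const SPoint siteInGrid(x, y);
      T* atom = grid.GetWritableAtom(siteInGrid);        // as used in the patch
      if (!atom) continue;
      const Element<EC>* e = grid.LookupElement(atom->GetType());
      if (!e) continue;                                   // skip unknown/empty sites
      if (!first) fs.Printf(",");
      first = false;
      EmitSiteJson(fs, *e, *atom, siteInGrid);            // hypothetical helper
    }
  }
  fs.Printf("]}\n");
}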

//for some reason, grid.GetAtomInSite doesn't work. Maybe because it's constant? Optimizing happening?
//for some reason, grid.GetAtomInSite doesn't do what we need. Maybe because it's constant? Optimizing happening?
//todo: figure out why that doesn't work. GetAtomInSite has an option to get the base layer, which would
//be nice to have.
T* atom = grid.GetWritableAtom(siteInGrid);

Owner

Yeah, I'd think 'const T* atom = grid.GetAtomInSite(getFromBase, siteInGrid);' What goes wrong?

Contributor Author

I'll check this out tonight. I made that change initially back in November, so it escapes me what happens in that case. I'd love to have another option to export the base layer or append it in a separate list in the same document.

Contributor Author

I didn't get a chance to check on this tonight after all, but I'll check it out tomorrow.

Contributor Author

Finally got to check this out. You're right: GetAtomInSite works as expected. Latest commit adds base layer to JSON output!
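
For the record, a minimal sketch of the const accessor form suggested above, reading one site from both layers; everything beyond the quoted call (grid.GetAtomInSite(getFromBase, siteInGrid)) is an assumption here.

// Sketch: same site, both layers, via the const accessor (no GetWritableAtom).
// The bool-first parameter order follows the call quoted earlier in this thread.
const T* eventAtom = grid.GetAtomInSite(false, siteInGrid);  // event layer
const T* baseAtom  = grid.GetAtomInSite(true,  siteInGrid);  // base layer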

const Element<EC> * e = grid.LookupElement(t);
const UlamElement<EC> * uelt = e->AsUlamElement();

//todo: are there any alternatives to the string in this buffer, which contains the names of the data members and their values at the current epoch?

Owner

I'd consider adding like
PRINT_FORMAT_JSON = 0x00001000, //< Format data members in JSON
at UlamClass.h:71 or so (hmm and at DebugUtils.ulam:20 or so in ULAM),
then checking for PRINT_FORMAT_JSON like around UlamClass.tcc:103 and 110.
Not super easy, but it simplifies and flattens the JSON.
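
A minimal sketch of that flag as it might sit in the print-flags enum; only the PRINT_FORMAT_JSON line itself comes from the suggestion above, and the enum's name and neighboring values are elided/assumed.

// UlamClass.h (sketch, around line 71): add a JSON mode to the print flags.
enum /* existing print-flags enum */ {
  // ... existing PRINT_* flags ...
  PRINT_FORMAT_JSON = 0x00001000  //< Format data members in JSON
};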

Contributor Author

If it means nobody else has to use the Python deserializer I wrote, or implement their own, I think it's worth it. I'll see if I can pick up on what you're suggesting when I get to this tonight.
I assume the idea would be to add an additional method so as not to mess up the display of data member values in the mfms GUI; is that the right approach?

Contributor Author

I think the new commits are in the spirit of this. I added the flag and a condition for it in UlamClass.tcc. It could be prettier, since I repeated most of what's already there; I can clean that up, but the resulting output looks closer to what I was hoping for in the first place.
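
Roughly, the condition in UlamClass.tcc then branches on the new flag while walking the data members. This is only a sketch of the shape; the member-iteration names (memberCount, MemberName, PrintMemberValue) are invented for illustration, not taken from the actual file.

// Sketch of the PRINT_FORMAT_JSON branch while printing data members.
const bool json = (flags & PRINT_FORMAT_JSON) != 0;
if (json) bs.Printf("{");
for (u32 i = 0; i < memberCount; ++i) {
  if (i > 0) bs.Printf(json ? "," : " ");
  if (json) bs.Printf("\"%s\":", MemberName(i));   // quoted key for JSON
  else      bs.Printf("%s=", MemberName(i));       // existing flat name=value style
  PrintMemberValue(bs, i, flags);                  // emit the value either way
}
if (json) bs.Printf("}");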

@@ -1244,7 +1243,7 @@ namespace MFM
"\"symbol\":\"%s\", "
"\"name\":\"%s\", "
"\"argb\":%d, "
"\"data_string\":\"%s\"}"
"\"data_members\":%s}"

Owner

If this Printf was broken into two parts -- the part up through "data_string":, and the part after the %s there -- then the OString buff could be avoided entirely. (Admittedly it was always gross.) fs.Printf(FRONT_PART); then uelt->Print(ucr,fs,*atom,ETC); then fs.Printf(BACK_PART). Passing fs to uelt->Print instead of buff.

If we're pretty confident OString2048 will do the job, given how much expansion we might get printing one atom in JSON, maybe not such an issue.
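
Concretely, the split might look like the sketch below: print the front of the record, let uelt->Print write the members straight into fs, then print the closing brace, so the OString staging buffer goes away. The symbol/name/argb arguments and the printFlags variable are placeholders for whatever the existing call site already has.

// Sketch of the two-part Printf, writing directly to the epoch's FileByteSink.
fs.Printf("{\"symbol\":\"%s\", \"name\":\"%s\", \"argb\":%d, \"data_members\":",
          symbol, name, argb);
uelt->Print(ucr, fs, *atom, printFlags);  // printFlags would include PRINT_FORMAT_JSON
fs.Printf("}");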

Contributor Author

I could begin to speculate about the max character length of the data members object, but chances are I'd be wrong.
Funnily enough, I left a todo in the latest commit about converting the event_layer_atoms and base_layer_atoms lists into char buffers (up to the number of sites in the grid times the max length of an atom long!). I didn't do it because I'd want to use one of these OverflowableCharBufferByteSink templates with a buffsize that scales with the grid size (though I imagine this might kill compile times, so it may not be worth it for that reason alone).
The problems I was hoping to solve with this approach were:

  1. to avoid excessive fopen/fclose operations by caching the whole JSON document in memory before streaming the whole thing to disk in one go. I mean, I'm running a rather large tmpfs for my /tmp, but supposing someone specifies their output to be written to, e.g. a 5500 rpm HDD, I'm wondering if that would unnecessarily increase iowait on the system running the simulation (and the frustration of the programmer using the simulator). And,
  2. so that I can grab non-empty atoms from the event layer and base layer in one pass.

I tried digging into FileByteSink to see if this problem is already solved, but I didn't get to the bottom of it.
I think maybe I'm trying to optimize too much for an unlikely scenario.

tl;dr: if you think I should drop the buffers altogether, I can pass the FileByteSink object instead.
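
For what it's worth, one possible shape if the buffers do go away (a sketch of an option, not what the patch currently does): keep a single open/close per epoch and stream each list in its own pass over the grid, so nothing is staged in OStrings. The FILE*-based FileByteSink construction and the WriteLayerJson helper are assumptions here.

// Sketch: one fopen/fclose per epoch, records streamed as they are produced.
FILE* fp = fopen(jsonPath, "w");
FileByteSink fs(fp);                    // assumed: FileByteSink wraps a FILE*
fs.Printf("{\"event_layer_atoms\":[");
WriteLayerJson(fs, grid, false);        // hypothetical helper: event-layer pass
fs.Printf("],\"base_layer_atoms\":[");
WriteLayerJson(fs, grid, true);         // hypothetical helper: base-layer pass
fs.Printf("]}\n");
fclose(fp);

That trades a second walk over the grid for never holding the whole document in memory.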

…uary, it seems. It looks like I was in the process of removing this atombuff buffer. This other data_members thing in the atom printf statement rings a bell, too. It fixed some corner case, I think. Anyway, I'm trusting myself here because I need to go troubleshoot a Makefile.
DaveAckley pushed a commit that referenced this pull request Oct 10, 2022