Explore using JSON for LB stats files for a more consistent, readable format #1469

lifflander · 2021-06-09T17:43:42Z

What Needs to be Done?

Choose a new format for the LB stats file.

lifflander · 2021-06-09T18:47:46Z

Proposal for format:

{
    "phases": [
        {
            "phase": [
                {
                    "id": 1,
                    "tasks": [
                        {
                            "resource": "cpu",
                            "node": 10,
                            "object": 1993438,
                            "time": 5.34
                        },
                        {
                            "resource": "cpu",
                            "node": 10,
                            "object": 283883,
                            "time": 3.4,
                            "suphases": [
                                {
                                    "id": 1,
                                    "time": 1.4
                                },
                                {
                                    "id": 2,
                                    "time": 2.0
                                }
                            ]
                        }
                    ]
                }
            ]
        }
    ]
}

@nlslatt @PhilMiller @JacobDomagala @jstrzebonski

jstrzebonski · 2021-06-09T18:57:26Z

If there won't be anything except phases, then it could be a simple array, which still is valid JSON.

[
        {
            "phase": [
                {
                    "id": 1,
                    "tasks": [
                        {
                            "resource": "cpu",
                            "node": 10,
                            "object": 1993438,
                            "time": 5.34
                        },
                        {
                            "resource": "cpu",
                            "node": 10,
                            "object": 283883,
                            "time": 3.4,
                            "suphases": [
                                {
                                    "id": 1,
                                    "time": 1.4
                                },
                                {
                                    "id": 2,
                                    "time": 2.0
                                }
                            ]
                        }
                    ]
                }
            ]
        }
    ]

jstrzebonski · 2021-06-09T18:57:39Z

One question, why is phase an array?

lifflander · 2021-06-09T19:01:04Z

One question, why is phase an array?

Because there will be multiple phases. I just have phase 1 in this example, but there will be many in real data.

lifflander · 2021-06-09T19:01:36Z

Oh, I see what you mean. I misunderstood. I think you are right on this.

jstrzebonski · 2021-06-09T19:01:56Z

OK, so than that would be sufficient I guess:

[
        {
            "phase": {
                    "id": 1,
                    "tasks": [
                        {
                            "resource": "cpu",
                            "node": 10,
                            "object": 1993438,
                            "time": 5.34
                        },
                        {
                            "resource": "cpu",
                            "node": 10,
                            "object": 283883,
                            "time": 3.4,
                            "suphases": [
                                {
                                    "id": 1,
                                    "time": 1.4
                                },
                                {
                                    "id": 2,
                                    "time": 2.0
                                }
                            ]
                        }
                    ]
                }
        },
...
    ]

jstrzebonski · 2021-06-09T19:19:22Z

On second thought, maybe like this:

{
  "phases": [
    {
      "id": 1,
      "tasks": [
        {
          "resource": "cpu",
          "node": 10,
          "object": 1993438,
          "time": 5.34
        },
        {
          "resource": "cpu",
          "node": 10,
          "object": 283883,
          "time": 3.4,
          "subphases": [
            {
              "id": 1,
              "time": 1.4
            },
            {
              "id": 2,
              "time": 2.0
            }
          ]
        }
      ]
    },
    {
      "id": 2,
      "tasks": [
        {
          "resource": "cpu",
          "node": 10,
          "object": 1993438,
          "time": 5.34
        },
        {
          "resource": "cpu",
          "node": 10,
          "object": 283883,
          "time": 3.4,
          "subphases": [
            {
              "id": 1,
              "time": 1.4
            },
            {
              "id": 2,
              "time": 2.0
            }
          ]
        }
      ]
    }
  ]
}

that way phases, tasks and subphases are all simply arrays of objects, which, I think, we expect.

nlslatt · 2021-06-10T16:19:20Z

Are we including node just for completeness? Or are we no longer going to have one file per rank?

PhilMiller · 2021-06-10T21:54:09Z

If we do include node, then depending on the reset of the structure, the files could in theory be simply concatenated or the parsed results merged (i.e. set/map union)

lifflander · 2021-06-10T22:12:35Z

If we do include node, then depending on the reset of the structure, the files could in theory be simply concatenated or the parsed results merged (i.e. set/map union)

That's exactly why I included it. So we could combine the files and it would be correct. I'm still intending to have one file per rank because that will be most efficient to output I think. But this would allow us to easily combine them and know which rank they came from without relying on the filename to know.

lifflander · 2021-06-10T22:15:17Z

So the optional meta-data file would look like this:

{
  "subphases": [
    {
      "id": 1,
      "name": "mySubphaseName1"
    },
    {
      "id": 2,
      "name": "mySubphaseName2"
    },
  ]
}

lifflander · 2021-06-10T23:31:10Z

So the issue with the specification is that if we incrementally output, we can't create an array for phases unless we read it in and then write it out again (AFAIK). Basically, the code I wrote ends up with something like this for an incremental builder:

{
    "phases": [
        null,
        null,
        null,
        null,
        null,
        null,
        null,
        {
            "tasks": [
                {
                    "time": 1.6927719116210938e-05,
                    "subphases": [
                        {
                            "time": 1.6927719116210938e-05,
                            "id": 0
                        }
                    ],
                    "resource": "cpu",
                    "object": 107374182400,
                    "node": 0
                },
                {
                    "time": 0.012003183364868164,
                    "subphases": [
                        {
                            "time": 0.012003183364868164,
                            "id": 0
                        }
                    ],
                    "resource": "cpu",
                    "object": 25769803776,
                    "node": 0
                },
                {
                    "time": 0.011924028396606445,
                    "subphases": [
                        {
                            "time": 0.011924028396606445,
                            "id": 0
                        }
                    ],
                    "resource": "cpu",
                    "object": 0,
                    "node": 0
                }
            ],
            "id": 7
        }
    ]
}

But the code to output this is easy to write:

  using json = nlohmann::json;

  json j;
  j["phases"] = {};
  j["phases"][phase]["id"] = phase;

  std::size_t i = 0;
  for (auto&& elm : node_data_.at(phase)) {
    ElementIDStruct id = elm.first;
    TimeType time = elm.second;
    j["phases"][phase]["tasks"][i]["resource"] = "cpu";
    j["phases"][phase]["tasks"][i]["node"] = theContext()->getNode();
    j["phases"][phase]["tasks"][i]["object"] = id.id;
    j["phases"][phase]["tasks"][i]["time"] = time;

    auto const& subphase_times = node_subphase_data_.at(phase)[id];
    std::size_t const subphases = subphase_times.size();
    if (subphases != 0) {
      for (std::size_t s = 0; s < subphases; s++) {
        j["phases"][phase]["tasks"][i]["subphases"][s]["id"] = s;
        j["phases"][phase]["tasks"][i]["subphases"][s]["time"] = subphase_times[s];
      }
    }
    i++;
  }

  fmt::print("j={}\n", to_string(j));

lifflander · 2021-06-10T23:32:15Z

So I think we need to not output an array for phases, instead just a grouping on the name.

lifflander · 2021-06-10T23:36:32Z

Actually, now I read more, I think to do this "correctly" we have to read all the json in, add what we want, and then write it out again. So we will need some work-around for the incremental output.

nlslatt · 2021-06-11T17:14:11Z

Given that limitation, is JSON really the right format to use?

…detected

… examples

lifflander added the type: task label Jun 9, 2021

lifflander assigned PhilMiller and lifflander Jun 9, 2021

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: lib: add nlohmann/json library (v3.9.1)

8d1530c

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: lib: add brotli library (v1.0.9)

6b2416a

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: cmake: add brotli and json library to bundled build

416a20a

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: utils: implement streaming compressor using brotli interface

908c9b1

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: utils: implement output adaptor for compression json

a5efe88

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: utils: implement incremental json appender with compression

e12890c

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: cmake: add new directories to build

13a5490

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: utils: implement base appender to reduce header deps

b190f49

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: lb: implement JSON writer using streaming append

a745178

lifflander mentioned this issue Jun 13, 2021

1469 Output LB statistics as JSON #1475

Merged

7 tasks

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: lb: fix a small bug

d2091f4

lifflander added a commit that referenced this issue Jun 13, 2021

#1469: lb: remove old code, use proper name for file

fef5787

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: lib: remove pkg_config causing failure on CI

96c75a7

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: lib: brotli cmake fixes for Intel and cmake

b6591c8

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: lib: brotli add version to project command

5534db6

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: lib: brotli explicitly set policy as NEW

58be599

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: lib: json library fix whitespace causing CI error

b684943

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: lib: json library work around Intel warning

a26bb6a

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: util: fix warning (-1) for std::size_t

73f8fb1

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: lib: json fix Hedley TPL to force attributes off not properly …

b613e13

…detected

lifflander added a commit that referenced this issue Jun 17, 2021

#1469: lib: json fix warning in nvcc 11

a5387b4

lifflander added a commit that referenced this issue Jun 22, 2021

#1469: docker: add support for nvidia nvcc 10.2 (useful in the future)

4134874

lifflander added a commit that referenced this issue Jun 22, 2021

#1469: lib: json work around nvcc 10.1 bug after identifying it

3c5b95d

lifflander added a commit that referenced this issue Jun 22, 2021

#1469: license: fix headers with new template generated

7a2f1da

lifflander added a commit that referenced this issue Jun 22, 2021

#1469: args: fix duplicated code

dc479f4

lifflander added a commit that referenced this issue Jun 22, 2021

#1469: utils: fix accidentally added whitespace

6b740c5

lifflander added a commit that referenced this issue Jun 22, 2021

#1469: lib: build brotli in portable mode to avoid undefined behavior

2c19c2e

lifflander added a commit that referenced this issue Jun 22, 2021

#1469: lib: remove new option from brotli to avoid policy problems

8f7d74f

lifflander added a commit that referenced this issue Jun 23, 2021

#1469: docs: update documentation on stat file output along with some…

4eebc6f

… examples

lifflander added a commit that referenced this issue Jun 24, 2021

#1469: lb: read node from file instead of using this_node

92127ac

lifflander added a commit that referenced this issue Jun 24, 2021

#1469: docs: fix typo about communication

be62a55

lifflander added a commit that referenced this issue Jun 24, 2021

#1469: util: improve error messages from Brotli

c7d5099

lifflander added a commit that referenced this issue Jun 24, 2021

#1469: lb: optimize restart reader with emplace

b771338

lifflander added a commit that referenced this issue Jun 24, 2021

#1469: tests: simplify expression for equality

77ab6ec

lifflander added a commit that referenced this issue Jun 24, 2021

#1469: util: use class variable for consistency

6d0ea70

lifflander added a commit that referenced this issue Jun 24, 2021

#1469: util: abstract into variable for clarity

96110b3

lifflander added a commit that referenced this issue Jun 24, 2021

#1469: util: change visibility to private

c16659b

lifflander added a commit that referenced this issue Jun 25, 2021

#1469: util: abstract isCompressed into a function in JSON reader

bf2fa1e

lifflander added a commit that referenced this issue Jun 25, 2021

#1469: util: change type of buffer to uint8_t to reduce casting

121711c

lifflander added a commit that referenced this issue Jun 25, 2021

#1469: lb: use automatic conversion for vector

89926e8

lifflander pushed a commit that referenced this issue Jun 28, 2021

#1469: util: use const ref instead of std::unique_ptr when possible

1852358

lifflander closed this as completed in #1475 Jun 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Explore using JSON for LB stats files for a more consistent, readable format #1469

Explore using JSON for LB stats files for a more consistent, readable format #1469

lifflander commented Jun 9, 2021

lifflander commented Jun 9, 2021

jstrzebonski commented Jun 9, 2021

jstrzebonski commented Jun 9, 2021

lifflander commented Jun 9, 2021

lifflander commented Jun 9, 2021

jstrzebonski commented Jun 9, 2021

jstrzebonski commented Jun 9, 2021 •

edited

Loading

nlslatt commented Jun 10, 2021

PhilMiller commented Jun 10, 2021

lifflander commented Jun 10, 2021 •

edited

Loading

lifflander commented Jun 10, 2021

lifflander commented Jun 10, 2021

lifflander commented Jun 10, 2021

lifflander commented Jun 10, 2021

nlslatt commented Jun 11, 2021

Explore using JSON for LB stats files for a more consistent, readable format #1469

Explore using JSON for LB stats files for a more consistent, readable format #1469

Comments

lifflander commented Jun 9, 2021

lifflander commented Jun 9, 2021

jstrzebonski commented Jun 9, 2021

jstrzebonski commented Jun 9, 2021

lifflander commented Jun 9, 2021

lifflander commented Jun 9, 2021

jstrzebonski commented Jun 9, 2021

jstrzebonski commented Jun 9, 2021 • edited Loading

nlslatt commented Jun 10, 2021

PhilMiller commented Jun 10, 2021

lifflander commented Jun 10, 2021 • edited Loading

lifflander commented Jun 10, 2021

lifflander commented Jun 10, 2021

lifflander commented Jun 10, 2021

lifflander commented Jun 10, 2021

nlslatt commented Jun 11, 2021

jstrzebonski commented Jun 9, 2021 •

edited

Loading

lifflander commented Jun 10, 2021 •

edited

Loading