
doc: create ADR 0004 for Skate MQTT Hierarchy #2686

Merged
1 commit merged into main on Aug 1, 2024

Conversation

firestack
Member

@firestack firestack commented Jul 4, 2024

@firestack firestack requested review from Whoops and joshlarson July 4, 2024 17:45

github-actions bot commented Jul 4, 2024

Coverage of commit 3f21acc

Summary coverage rate:
  lines......: 93.7% (3276 of 3498 lines)
  functions..: 73.5% (1347 of 1832 functions)
  branches...: no data found

Files changed coverage rate: n/a

Download coverage report

@firestack firestack marked this pull request as ready for review July 4, 2024 18:43
@firestack firestack requested a review from a team as a code owner July 4, 2024 18:43
Contributor

@joshlarson joshlarson left a comment


Similar to #2685 (review), I think it'd be worth discussing some alternatives, and why we're choosing this approach.

Alternatives that we should at least mention:

  • A single topic with no retained messages where we put explicit "diff" messages (Detour A exists now, Detour B now applies to trip xyz, Detour C is deactivated, etc)
  • Topics split out by detour ID with retained messages, but instead of deactivating with a 0-byte message, deactivating with an explicit "deactivated" message
  • No deactivation at all - we just stop adding trip IDs to a particular detour (or remove them if they've already been added) once a detour no longer applies to current or future trips.

@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from 3f21acc to 5a94454 Compare July 8, 2024 13:51


@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from 5a94454 to 70c3dd0 Compare July 8, 2024 21:45
@firestack firestack force-pushed the kf/doc/mqtt-adr branch 2 times, most recently from 52211ca to c539214 Compare July 8, 2024 21:47
@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from 70c3dd0 to a373637 Compare July 8, 2024 21:47


@firestack firestack changed the title doc: create ADR for Skate MQTT Hierarchy doc: create ADR 0004 for Skate MQTT Hierarchy Jul 9, 2024
@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from a373637 to 3d9b684 Compare July 9, 2024 14:54


@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from 3d9b684 to 124dfdc Compare July 9, 2024 15:25


@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from 124dfdc to 00ca2f3 Compare July 9, 2024 20:41
@firestack
Member Author

firestack commented Jul 9, 2024

Similar to #2685 (review), I think it'd be worth discussing some alternatives, and why we're choosing this approach.

Alternatives that we should at least mention:

* A single topic with no retained messages where we put explicit "diff" messages (Detour A exists now, Detour B now applies to trip xyz, Detour C is deactivated, etc)

* Topics split out by detour ID with retained messages, but instead of deactivating with a 0-byte message, deactivating with an explicit "deactivated" message

* No deactivation at all - we just stop adding trip IDs to a particular detour (or remove them if they've already been added) once a detour no longer applies to current or future trips.

@joshlarson I copied your thoughts and added some responses, let me know what you think?
https://github.com/mbta/skate/compare/124dfdc2beb3cc2b22d6279b2fdefc5676cd89d1..00ca2f322b14d8813ad2450c3f7215cdc48ba1ef

@joshlarson
Contributor

So the one thing I'm still uncertain about is what happens if we publish a zero-byte deletion while Transit Data is down.

My fear is something like the following sequence of events:

  1. Dispatcher activates a detour
  2. Skate sends GTFS-TM message
  3. Transit Data activates that detour downstream using their Transit Data ways
  4. Transit Data goes down for some reason
  5. Dispatcher deactivates that detour
  6. Skate deletes the GTFS-TM message off that queue with a zero-byte message
  7. Transit Data comes back online
  8. No new message (since what we did was a deletion, not a "delete message")
  9. Everyone downstream of Transit Data thinks the detour is still active

I see a few ways around this, some of which depend on precisely what MQTT does with zero-byte messages.

MQTT could delete the topic. If that happens, I'm not sure what we do, TBH.

MQTT could keep the topic around with no retained message (I think this is the most likely outcome), and when new clients come online, it treats the topic as empty. Transit Data could iterate through all of the topics, and treat any that have no retained message as "deactivated" and do its Transit Data thing to deactivate them downstream. This seems fine, but does run the risk of Transit Data having to iterate over an increasingly large collection of topics on startup.

MQTT could deliver the zero-byte message on startup, which means this isn't even a concern (I'd be surprised if that's how this works, but I like surprises 🙂)

@firestack
Member Author

firestack commented Jul 11, 2024

  • Transit Data comes back online

  • No new message (since what we did was a deletion, not a "delete message")

  • Everyone downstream of Transit Data thinks the detour is still active

As I understand it, Concentrate is ephemeral and does not have a DB; it'll re-read open topics when it comes online and publish a new GTFS-RT file without the removed trip modification in it.

@firestack
Member Author

MQTT could delete the topic. If that happens, I'm not sure what we do, TBH.

Topics don't really "exist"; you only know about them when a message is sent on them.

MQTT could deliver the zero-byte message on startup, which means this isn't even a concern (I'd be surprised if that's how this works, but I like surprises 🙂)

A zero-byte message "deletes the topic", by deleting the last retained message, and therefore new subscribers after the deletion are not even aware the message or topic ever existed.

MQTT could keep the topic around with no retained message (I think this is the most likely outcome)

I don't understand? But see above, I think this is an incorrect assumption?

[..] when new clients come online, it treats the topic as empty. Transit Data could iterate through all of the topics, and treat any that have no retained message as "deactivated"

This is what's expected, except that when it first connects it'll receive all retained messages for the topics it subscribes to (which should include the wildcard that subscribes to all IDs).

and do its Transit Data thing to deactivate them downstream.

There's not much to do aside from publishing a new GTFS-RT file with the current data. Because it's stateless, there's nothing to "remove", only to "omit" (and omission is a side effect of the data disappearing from the feed via the zero-byte message).

This seems fine, but does run the risk of Transit Data having to iterate over an increasingly large collection of topics on startup.

We should be deleting any topics that aren't live anymore; if the collection is growing, then either all those detours are live or there's a bug, IMO.
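The retained-message behavior discussed in this thread can be sketched with a tiny in-memory stand-in for a broker. This is not a real MQTT client or broker; the `Broker` class and topic names are illustrative, and only single-level `+` wildcards are handled:

```python
class Broker:
    """Toy model of MQTT retained-message semantics (illustrative only)."""

    def __init__(self):
        self.retained = {}  # topic -> last retained payload

    def publish(self, topic, payload, retain=True):
        if retain and payload == b"":
            # A zero-byte retained publish deletes the retained message, so
            # later subscribers never learn the topic ever existed.
            self.retained.pop(topic, None)
        elif retain:
            self.retained[topic] = payload

    def subscribe(self, topic_filter):
        # On subscribe, the broker delivers the retained message for every
        # matching topic -- this is how a restarted, stateless consumer
        # rebuilds its view of the live detours.
        return {
            t: p for t, p in self.retained.items()
            if self._matches(topic_filter, t)
        }

    @staticmethod
    def _matches(topic_filter, topic):
        f_parts, t_parts = topic_filter.split("/"), topic.split("/")
        return len(f_parts) == len(t_parts) and all(
            f in ("+", t) for f, t in zip(f_parts, t_parts)
        )


broker = Broker()
broker.publish("skate/trip_modifications/detour-a", b"modification-a")
broker.publish("skate/trip_modifications/detour-b", b"modification-b")

# Detour A is deactivated (zero-byte message) while the consumer is offline:
broker.publish("skate/trip_modifications/detour-a", b"")

# On reconnect, only the still-live detour comes back:
state = broker.subscribe("skate/trip_modifications/+")
assert state == {"skate/trip_modifications/detour-b": b"modification-b"}
```

In this model the "growing collection of topics" concern goes away as long as deactivated detours get the zero-byte delete, which matches the point above.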

@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from dc0d1bb to 785ac3e Compare July 12, 2024 15:08


@joshlarson
Contributor

As I understand it, Concentrate is ephemeral and does not have a DB; it'll re-read open topics when it comes online and publish a new GTFS-RT file without the removed trip modification in it.

This is helpful context, thank you!

I guess I wasn't (maybe am still not?) totally clear on who TD's downstream consumers are, and what data they need - for instance, I thought that TD was going to publish something directly to Swiftly, which might mean that they need to publish a deactivation to Swiftly as well?

Anyway, assuming that Walton/TransitData approves, then I think this looks great!

@firestack
Member Author

for instance, I thought that TD was going to publish something directly to Swiftly, which might mean that they need to publish a deactivation to Swiftly as well?

My understanding is that Swiftly consumes our GTFS-RT feed, so they'd just implicitly get these once we started publishing them to the feed, and they'd continue to make predictions for us based on this info. I'd defer to @Whoops to make sure, but I'm fairly certain we won't be tying into Swiftly via any specific APIs at this point.

@Whoops

Whoops commented Jul 12, 2024

for instance, I thought that TD was going to publish something directly to Swiftly, which might mean that they need to publish a deactivation to Swiftly as well?

My understanding is that Swiftly consumes our GTFS-RT feed, so they'd just implicitly get these once we started publishing them to the feed, and they'd continue to make predictions for us based on this info. I'd defer to @Whoops to make sure, but I'm fairly certain we won't be tying into Swiftly via any specific APIs at this point.

@fsaid90 This is the question I mentioned in our meeting. Can you put what you said in writing? :-)

@fsaid90

fsaid90 commented Jul 12, 2024

for instance, I thought that TD was going to publish something directly to Swiftly, which might mean that they need to publish a deactivation to Swiftly as well?

My understanding is that Swiftly consumes our GTFS-RT feed, so they'd just implicitly get these once we started publishing them to the feed, and they'd continue to make predictions for us based on this info. I'd defer to @Whoops to make sure, but I'm fairly certain we won't be tying into Swiftly via any specific APIs at this point.

@fsaid90 This is the question I mentioned in our meeting. Can you put what you said in writing? :-)

Yep!

Right now the flow is:

Busloc -> Swiftly -> Consumers

Busloc assembles GTFS-RT VehiclePositions + TripUpdates sans predictions and delivers them to Swiftly. Swiftly sends back adjusted feeds with predictions. Concentrate consumes them, applies block waivers to cancel trips, then publishes to consumers.

We'll have a new flow as well:

Skate -> Somewhere? S3? Concentrate? -> Swiftly -> Concentrate -> consumers

Skate would send TripModifications to us, we'd have to forward them on to Swiftly (to adjust predictions), then we'd receive updated GTFS-RT from Swiftly that factors in the TripModifications, and we'd publish that GTFS-RT (alongside our TripModifications). I don't expect that Swiftly will adjust TripModifications, they just need to know about them to update predictions accordingly (i.e. we can publish the TripModifications as is from Skate).

@Whoops you mentioned this might be a circular dependency, but I'm not sure that it is! Let me know what you think.

@firestack
Member Author

firestack commented Jul 12, 2024

Skate -> Somewhere? S3? Concentrate? -> Swiftly -> Concentrate -> consumers

@Whoops you mentioned this might be a circular dependency, but I'm not sure that it is! Let me know what you think.

Seems a little circular to me 😅

graph TD
	Skate
	-- MQTT --> Concentrate
	-- without predictions --> Swiftly
	-- with predictions --> Concentrate
	--> Consumers

But also, Skate could become a consumer at some point because we'd potentially want the Alert information that Alerts would add eventually.

graph TD
	Skate
	-- MQTT --> Concentrate
	-- without predictions --> Swiftly
	-- with predictions --> Concentrate
	--> Consumers -.->  Skate

@fsaid90

fsaid90 commented Jul 12, 2024

Skate -> Somewhere? S3? Concentrate? -> Swiftly -> Concentrate -> consumers

@Whoops you mentioned this might be a circular dependency, but I'm not sure that it is! Let me know what you think.

Seems a little circular to me 😅

graph TD
	Skate
	-- MQTT --> Concentrate
	-- without predictions --> Swiftly
	-- with predictions --> Concentrate
	--> Consumers

But also, Skate could become a consumer at some point because we'd potentially want the Alert information that Alerts would add eventually.

graph TD
	Skate
	-- MQTT --> Concentrate
	-- without predictions --> Swiftly
	-- with predictions --> Concentrate
	--> Consumers -.->  Skate

Haha definitely looks circular 😅

I'm really thinking about it as two separate flows:

GTFS-RT TripUpdates + VehiclePositions flow continues to be the same: Busloc -> S3 -> Swiftly -> Concentrate -> public

TripModifications: Skate -> Concentrate -> Public AND Skate -> Concentrate -> S3 -> Swiftly <- it ends there since we're not waiting on the resource to come back to us?

But obviously I'm not a developer so I'll defer to y'all 😃

@Whoops

Whoops commented Jul 12, 2024

I think the big question is what exactly Swiftly is expecting to consume. Right now Concentrate takes in a bunch of RT feeds (including Swiftly's), and soon these Skate messages, and outputs them into an RT feed. If Swiftly is then going to consume that feed, we have a situation where Swiftly is both consuming that RT feed to generate its output, and Swiftly's output goes into that RT feed.

I'm curious what you mean by "without predictions" in the diagram above. The only Concentrate output that doesn't have predictions in my mind is VehiclePositions.

@firestack
Member Author

firestack commented Jul 12, 2024

I'm curious what you mean by "without predictions" in the diagram above. The only Concentrate output that doesn't have predictions in my mind is VehiclePositions.

Oh that was probably unneeded and just adds confusion, but I'm thinking that Swiftly will need the VehiclePositions without predictions AND the Trip Modifications, so that it can make predictions on the detoured trips.

Where/How does Swiftly currently get our VehiclePositions and TripUpdates without predictions?

@fsaid90

fsaid90 commented Jul 12, 2024

@Whoops

I think the big question is what exactly is Switly expecting to consume?

  • VehiclePositions from Busloc (via S3)
  • TripUpdates from Busloc (via S3)
  • TripModifications from somewhere

If Swiftly is going to then consume that feed then we have a situation where Swiftly is both consuming that RT feed to generate its output, and Swiftly's output goes into that RT feed.

Here's where I might be missing something - in my thinking:

  • Swiftly is taking in Busloc VehiclePositions + TripUpdates output as per the above
  • Swiftly now also takes in TripModifications (to make proper modifications for adjusted trips)
  • Swiftly sends us back only TripUpdates as they currently do
  • Concentrate publishes:
    • VehiclePositions from Busloc
    • TripModifications from Skate (Swiftly won't be modifying TripModifications - I don't see a reason for them to do so unless you do?)
    • TripUpdates from Swiftly (factoring in TripModifications if they exist)

@firestack

Where/How does Swiftly currently get our VehiclePositions and TripUpdates without predictions?

Swiftly retrieves VehiclePositions and TripUpdates that Busloc produces and dumps into S3 - basically polling

@joshlarson
Contributor

joshlarson commented Jul 13, 2024

I don't expect that Swiftly will adjust TripModifications, they just need to know about them to update predictions accordingly (i.e. we can publish the TripModifications as is from Skate).

I'm wondering about the propagated_modification_delay field in the Modification message.

Our current code doesn't do anything to compute that field, and I'd been thinking about that field as effectively a prediction, which puts it into Swiftly's wheelhouse. So I had envisioned a TripModification going through a pipeline like

  • Skate generates TripModification message without propagated_modification_delay and sends it to TransitData
  • TransitData sends that message to Swiftly
  • Swiftly updates predictions for the selected trips and lets TransitData know in some way (as well as updating predictions that Skate receives)
  • TransitData publishes TripModification messages with propagated_modification_delay populated to the actual GTFS-RT feed

How do you all envision propagated_modification_delay being set? Do we even need that field (I think yes, or else rider-facing info will have inaccurate predictions)? Should Skate figure it out and attach it to the message, rather than relying on Swiftly?

(I think this field is what Kayla was referring to by "with/without predictions".)

@firestack
Member Author

firestack commented Jul 13, 2024

How do you all envision propagated_modification_delay being set? Do we even need that field (I think yes, or else rider-facing info will have inaccurate predictions)? Should Skate figure it out and attach it to the message, rather than relying on Swiftly?

I think that because Swiftly is responsible for providing the TripUpdates data, their predictions will just include the info computed from the Trip Modification data we give them. We should verify that assumption, but it seems most correct to me. So I don't think we need to provide propagated_modification_delay; IMO that's only needed if we didn't have live-updating TripUpdates and the client instead had to propagate the delay through a static schedule.

Edit!!:
Upon thinking about it some more, it's a great point: maybe we do need Swiftly to populate that field eventually, so that future trips that don't yet have predictions also get schedule updates!
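For reference, here's a rough sketch of where `propagated_modification_delay` sits in a TripModifications entity, written as plain Python dicts rather than real protobufs. The field names follow the GTFS-RT TripModifications reference, but the IDs and values are invented:

```python
# Illustrative only: plain dicts standing in for GTFS-RT protobuf messages.
# Field names follow the TripModifications reference; values are invented.
trip_modification = {
    "selected_trips": [{
        "trip_ids": ["trip-xyz"],
        "shape_id": "skate-generated-detour-shape",  # the NEW detour shape
    }],
    "modifications": [{
        "start_stop_selector": {"stop_id": "stop-1"},
        "end_stop_selector": {"stop_id": "stop-9"},
        # Seconds of delay propagated to the stops after the detour; the open
        # question above is which system (Skate or Swiftly), if any, should
        # fill this in.
        "propagated_modification_delay": 0,
    }],
}
```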

On `Shape` Messages, the
[`Shape.shape_id`](https://gtfs.org/realtime/reference/#message-shape)
field will also have a unique ID generated by Skate, which will be
referenced by the corresponding Trip Modification's
Member


question: if the shape also has an ID, why not:

  • <prefix>/trip_modifications/#{modification id}
  • <prefix>/shapes/#{shape id}

Member Author


Given this feedback, this does give me pause!

cc: @mbta/skate-developers

Member Author


I'm a little stuck here.

One issue with separating them out is that you have to predict which IDs you want to subscribe to if you want to subscribe to only a single modification and all of its dependencies.

I know that this is attempting to predict the future, and we don't have a use case for subscribing to individual topics, but following the "Best Practices" from HiveMQ, I wonder if having a shared ID identifying all resources related to a trip modification is more extensible?

OTOH, IMO it's possible (though unlikely, given ordering requirements if, for example, it applied to both an inbound and an outbound trip) that we'd have a single shape referenced by multiple trip modifications, in which case the proposed structure falls apart.

Collaborator


I cannot find the relationship between shapes and trip modifications to decide whether the risk you note in the last paragraph ("... in which the proposed structure falls apart") is likely to be a problem. TripModifications contains a list of SelectedTrips which have Shapes of the existing route/trip, but the Modifications field doesn't have any nested references to Shapes, it seems. If that's the case, then the logic in paragraph 2 especially makes sense to me

Member Author

@firestack firestack Jul 16, 2024


I think that if two trips had different lists of missed stops but the same detour shape, then we'd need to publish two trip modifications. I don't think we're at the point where we'd do that sort of optimization (sharing a single shape between "disjoint" trips, or trips on different routes), but the current structure would not cope with that as well as the format @paulswartz proposed.
It's still doable, and not that much of an issue, if the subscriber subscribes to the wildcards:

<prefix>/trip_modifications/+/trip_modification
<prefix>/trip_modifications/+/shape
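To illustrate the subscription reasoning, here's a sketch of how `+` wildcard filters behave against the nested layout. `topic_matches` is a hand-rolled stand-in for a broker's matcher, and `skate` stands in for `<prefix>`:

```python
# Hand-rolled single-level '+' wildcard matching (illustrative, not a real
# MQTT client); "skate" is a placeholder for <prefix>.
def topic_matches(topic_filter, topic):
    f_parts, t_parts = topic_filter.split("/"), topic.split("/")
    return len(f_parts) == len(t_parts) and all(
        f in ("+", t) for f, t in zip(f_parts, t_parts)
    )

topics = [
    "skate/trip_modifications/abc123/trip_modification",
    "skate/trip_modifications/abc123/shape",
    "skate/trip_modifications/def456/trip_modification",
    "skate/trip_modifications/def456/shape",
]

# One modification and all of its dependencies, without having to predict
# which resource types exist under it:
one_detour = [t for t in topics
              if topic_matches("skate/trip_modifications/abc123/+", t)]

# Or every shape across all modifications, matching the wildcards above:
all_shapes = [t for t in topics
              if topic_matches("skate/trip_modifications/+/shape", t)]
```

Under the flat `trip_modifications/` + `shapes/` alternative, the first subscription would instead need the shape ID known up front, which is the extensibility trade-off being weighed here.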

Collaborator


Oh oh ohhhhh I was somehow missing that selected_trips contain a reference to the new detour shape, not the existing trip shape. Ok, so I can totally see a situation where different trip mods can reference the same shape. And I can see where trips on different routes would get sent down the same detour path, right? But I agree that the reuse of shapes for different trip_mods would be infrequent, and so nesting shapes under trip_mods seems logical (for your subscription reasoning)

@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from 785ac3e to 3483134 Compare July 16, 2024 18:01

Coverage of commit 3483134

Summary coverage rate:
  lines......: 93.6% (3309 of 3535 lines)
  functions..: 73.7% (1365 of 1853 functions)
  branches...: no data found

Files changed coverage rate: n/a

Download coverage report

@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch 2 times, most recently from d33e08e to 49c24b6 Compare July 25, 2024 17:45

Coverage of commit 49c24b6

Summary coverage rate:
  lines......: 93.6% (3314 of 3541 lines)
  functions..: 73.6% (1366 of 1855 functions)
  branches...: no data found

Files changed coverage rate: n/a

Download coverage report


Co-authored-by: Hannah Purcell <69368883+hannahpurcell@users.noreply.github.com>
Co-authored-by: Josh Larson <jlarson@mbta.com>
@firestack firestack force-pushed the kf/doc/mqtt-hierarchy-adr branch from 49c24b6 to 61465b8 Compare August 1, 2024 15:43

github-actions bot commented Aug 1, 2024

Coverage of commit 61465b8

Summary coverage rate:
  lines......: 93.1% (3319 of 3566 lines)
  functions..: 72.6% (1357 of 1870 functions)
  branches...: no data found

Files changed coverage rate: n/a

Download coverage report

@firestack firestack merged commit c22571e into main Aug 1, 2024
9 checks passed
@firestack firestack deleted the kf/doc/mqtt-hierarchy-adr branch August 1, 2024 17:59