-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Handle ref duplicates #152
Comments
IMHO, using
The 3rd option has my preference, and the 4th can be a fall back if necessary. |
I think we do good to separate the conversation into:
As the standard behaviour I like what you point our @nlehuby and group on the fly if they seem to be the same. This would be a much better behaviour. But still there are a lot of "unless if". And how we address them? a. We append automatically the field that caused the difference (the agency, the type of transport, etc.) - this might dangerous and we don't have always a suitable value In any case if there is any duplicate we probably want to keep the warning in place, so that people are informed to use one (of the) fallback solutions. I'm still not so happy about neither a. nor b. Do you have any other better idea? For fallbacks, which need manual interaction either with code or the configuration file, we have the two options (3 and 4 from @prhod) and we could implement both:
Personally, I thing we are ok, with giving this flexibility only to the creators (fallback option 4). This is because I dislike a bit to encourage anybody using the OSM |
It is not the option that seems to win the vote but I prefer my last suggestion to use the
So if I had to choose between @prhod options, I obviously choose the 3rd one. For the standard behaviour, I think we need to append the osm line anyhow, because grouping is uncertain and discarding is too violent. |
So, for the standard behaviour we can choose between the uncertain, the violent or the ugly? 😄 i follow you, @nlehuby to give preference to the ugly. Just wondering whether we want it to be a bit nicer:
For the fallback, the handling in creators (option 4) is there already by nature. And the question is whether we want to add a configuration option to exchange the |
To sum up, we have an agreement on the following:
|
Follow-up conversation from this PR.
For now, if we have multiple OSM route_masters with the same
ref
tag, we only keep the first one and discard the others.But multiple lines with the same line number can be a thing. See examples in the PR in France and Ivory Coast.
OSM
ref
tag is used to set these values in routes.txt files:source: https://github.com/grote/osm2gtfs/wiki/Source-for-GTFS-values
GTFS spec does not forbid to have duplicates on
route_short_name
, but GTFS Best Practices discourages this practice.But GTFS spec does forbid duplicates on
route_id
.How should we handle that ?
For now, we discard the duplicates and only keep the first one. The result in case of
ref
duplicates is an incomplete GTFS.We could also group all OSM lines with
ref
duplicates under the same GTFS route.It can be a strong option, unless if
route_long_name
and actually represent a different public_transport lineWe could keep them distinct and create a GTFS route for each one of them, but we then need to change the
route_id
construction process in the default creator.relation/{osm_id}
orosm_id
is a good option already used in some connectors.What do you think ?
The text was updated successfully, but these errors were encountered: