Skip to content

Future import pipeline planning #1752

@jbruechert

Description

@jbruechert

I think it might be time to replace gtfsclean for a number of reasons:

  • MOTIS is gaining NeTEx support. We could not do our usual checks and filtering from gtfsclean on netex
  • No support for gtfs-flex. We currently skip gtfsclean for these feeds, but that again skips filtering.
  • MOTIS itself is more forgiving in parsing than gtfsclean.
  • our import pipeline is currently weirdly split between gtfsclean features and lua scripts.

IMO the only thing that makes sense is doing the filtering in MOTIS itself, to avoid implementing support for data formats multiple time. This could be done by adding the relevant features to MOTIS, or by automatically adding a lua script that does this everywhere. The lua scripting does not currently expose the needed features though.

Required features:

  • null-coordinate filtering (null-coordinates still there, but don't lead to problems)
  • speed-based filtering (miiiight not be needed if problems only stem from (0,0)-coordinates, not sure)
  • agency-based filtering (doable in lua)
  • finding gtfs in a subdirectory of the zip. Needs checking if this still occurs. The one feed I remember is meanwhile replaced.

It would still be nice to be able to re-publish our feeds as we currently do. I'd propose to:

  • Try adding GTFS writing functionality to MOTIS. This would also give us a generic NeTEx to GTFS converter for free which as fas as I can tell the ecosystem is currently lacking. Particularly in the beginning, this feature might not be able to write all GTFS features.
  • re-publish the unprocessed feeds as well. This is probably useful regardless of the rest. Sometimes people might need the files without filtering.

Steps

  • Make sure required MOTIS features are ready
  • Add option to opt out of gtfsclean per-feed
  • Flip default to make gtfsclean opt-in

Metadata

Metadata

Assignees

No one assigned

    Labels

    importregarding the download & import of data sourcesmotischanges to the MOTIS routing engine are necessarytooling

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions