Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GTFS Updates: Smart Updates & Storage #60

Open
nateschickler0 opened this issue Sep 19, 2018 · 3 comments
Open

GTFS Updates: Smart Updates & Storage #60

nateschickler0 opened this issue Sep 19, 2018 · 3 comments

Comments

@nateschickler0
Copy link
Contributor

GTFS data is not updated automatically and will cause routing to fail if updated too early or not at all. We want to

  1. automatically download and verify the correct GTFS data on setup/install/run
  2. store non-current (old and future) GTFS data both locally and on S3
  3. detect and switch out-of-date GTFS data files with new GTFS data automatically to ensure 0 downtime
  4. ensure the server/graph-hopper replaces and rebuild all caches and graphs when new GTFS data is provided (bonus: without needing to restart the server or redeploy)
@nateschickler0
Copy link
Contributor Author

#52 The install/deploy script includes functionality to download and unzip GTFS data, so these issues are somewhat related.

@nateschickler0 nateschickler0 mentioned this issue Oct 2, 2018
@nateschickler0
Copy link
Contributor Author

Data scripts removed and functionality placed into TCATUtils.js, also added date validation and data expiration and invalid notifications #89

@mattbarker016
Copy link
Contributor

mattbarker016 commented Oct 28, 2018

Summary

GTFS data is not updated automatically. This will cause all routing to fail if updated too early or not at all.

Implementation / Scope:

  • Updating graphhopper to latest release
  • GTFS diff-ing, versioning, tagging, and downloading
  • Creating a database
  • Pulling and pushing GTFS data to/from our own database
  • DB Management
    • Redundant backups
    • Delete old and unusable data
    • Maintain data for current period
  • Updating the GTFS data for the Graphhopper routing services
    • Rebuilding the graph cache
    • Restarting the service automatically

Key Issues:

  • Graphhopper services run on a separate servers and Docker containers but need to update the GTFS data in sync right as the old version becomes invalidated
  • Future-proofing for possible changes in how TCAT formats or serves the GTFS data

@mattbarker016 mattbarker016 changed the title Add smart GTFS updates and storage GTFS Updates: Smart Updates & Storage Oct 28, 2018
@young-k young-k assigned young-k and unassigned JunyoungLim Jan 31, 2019
@alanna-zhou alanna-zhou self-assigned this Sep 1, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants