Collect stops systematically from RIS::Stations #3

traines-source · 2024-12-17T00:22:49Z

As always, this turned out to be a bit less straightforward than intended...

Idea: Take stop information from RIS::Stations, because this API allows downloading all stops with only a few hundred queries and contains basically the same information as HAFAS.

To build, you need to set the environment variables DB_API_KEY and DB_CLIENT_ID, which can be obtained for free via the above link (10k requests per month). Pre-built data files are available here for convenience:
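As a minimal sketch of the credential check described above: only the two variable names come from this PR; the helper name `load_credentials` and the returned dictionary shape are hypothetical and not taken from the build script.

```python
import os


def load_credentials(env=None):
    """Return the two credentials the build expects, or raise if one is unset.

    `env` defaults to the process environment; passing a plain dict makes the
    function easy to try out. The helper itself is illustrative only.
    """
    if env is None:
        env = os.environ
    required = ("DB_API_KEY", "DB_CLIENT_ID")
    # Treat empty strings the same as unset variables.
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return {"api_key": env["DB_API_KEY"], "client_id": env["DB_CLIENT_ID"]}
```

Failing early like this avoids spending part of the monthly request quota on a run that cannot authenticate.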

This results in an up-to-date dataset with 290708 stops and, in theory, no missing stops in Germany. Some caveats apply:

Some stops outside of Germany, particularly bus stops, are not contained in RIS::Stations and are therefore missing, including Malaga from the issue above (in total, 46510 stops are missing compared to my last HAFAS collection).
Disused stops are no longer contained.
META stations are not contained.
Groß Gerau station (8000136) is completely missing from RIS::Stations for some reason; I will report that to DB (and surely some bus stations are missing as well).
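A comparison like the one behind the "missing stops" count can be sketched as a set difference on stop IDs. This assumes each stop is a dictionary with an `id` key; the function name and the two tiny sample lists are illustrative and are not the real datasets.

```python
def missing_stop_ids(reference_stops, new_stops):
    """Return the sorted IDs that occur in reference_stops but not in new_stops."""
    new_ids = {stop["id"] for stop in new_stops}
    # A set removes duplicate IDs within the reference collection.
    return sorted({stop["id"] for stop in reference_stops} - new_ids)


# Illustrative sample data only (assumed record shape).
hafas_sample = [
    {"id": "8000105", "name": "Frankfurt(Main)Hbf"},
    {"id": "8000136", "name": "Groß Gerau"},
]
ris_sample = [
    {"id": "8000105", "name": "Frankfurt(Main)Hbf"},
]

missing = missing_stop_ids(hafas_sample, ris_sample)
```

With the sample data, `missing` contains only the ID of the Groß Gerau record, mirroring the gap noted above.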

In general, RIS::Stations also has other quality issues (like misassigned station groups, see e.g. Elisenstraße, München). It is difficult to say whether they are worse than in HAFAS.

Missing fields:

lines

Additional fields, mainly from Stada, which is now also integrated for train stations:

stadaId, ifoptId, ris100Ids, facilities, reisezentrumOpeningHours, priceCategory

The weight is now calculated based on the products, the number of children (for the main station of a group) and, for German train stations, the priceCategory. I think the price category is a very good indicator of a station's importance, so much so that I included it as an exponential factor, assuming that importance is approximately inversely proportional to the number of stations of the same importance. I tested the weighting a bit with db-hafas-stations-autocomplete and was satisfied, but feel free to play around :)
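To make the idea of an exponential price-category factor concrete, here is a rough sketch. It is not the formula used in this PR: the function name, the linear base term, the base of 2 and the way the inputs are combined are all assumptions chosen for illustration. It only relies on DB price categories running from 1 (most important) to 7.

```python
MAX_PRICE_CATEGORY = 7  # DB price categories run from 1 (largest hubs) to 7


def stop_weight(products, child_count=0, price_category=None, base=2.0):
    """Illustrative weight: a linear term scaled by an exponential category factor.

    products: mapping of product name to bool (hypothetical shape)
    child_count: number of child stops of a group's main station
    price_category: 1..7 for German train stations, None otherwise
    base: growth per category step (assumed value, not from the PR)
    """
    linear = sum(1 for available in products.values() if available) + child_count
    if price_category is None:
        # Stops without a price category get no exponential boost.
        return float(linear)
    return linear * base ** (MAX_PRICE_CATEGORY - price_category)
```

In this sketch each step towards category 1 doubles the weight, so a category 1 station outweighs an otherwise identical category 7 station by a factor of 2^6. Any real tuning would have to be checked against autocomplete results, as described above.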

Of course, instead of merging this into this repo, we could also create a separate db-ris-stations, but I think this would be more confusing than helpful. The only real drawback is the missing international stations, which are also incomplete in the existing collection. In both cases we could set up a GitHub workflow cronjob to automatically update and push the dataset/package, e.g. once a month.

traines-source force-pushed the ris branch from 5246e72 to b6a2edd Compare December 17, 2024 00:33

collect stops systematically from RIS::Stations

6fae5bf

traines-source force-pushed the ris branch from b6a2edd to 6fae5bf Compare December 17, 2024 01:00
