Acceptance tests keep running for hours #1587
Comments
I tested with this dataset: https://storage.googleapis.com/storage/v1/b/mdb-latest/o/de-unknown-ulmer-eisenbahnfreunde-gtfs-1081.zip?alt=media (it's 900 MB with 40 000 000 shape rows). It seems the problem started with PR #1553: I locally removed the code from that PR and it ran fast. Possible solutions:
Also, it would be useful to add more logging. I don't think there is any DEBUG logging in the validator.
Looking at #1553, there is definitely some inefficient code in there. For example:
This is iterating over every line in shapes.txt, when the set of unique shape ids is already available via
Then, the worst offender:
For each shape id, we are again looping over every entry in shapes.txt to find matching shape points. This is where the NxM blow-up and slow-down is likely coming from. Again, I'd also point out that the shape points and stop points below should both probably be filtered by
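To illustrate the difference in cost, here is a rough Python sketch (not the validator's actual code; the row layout and both function names are hypothetical). The first function mirrors the pattern described above, rescanning all rows once per shape id; the second buckets the rows by shape id in a single pass.

```python
from collections import defaultdict

# Hypothetical in-memory rows of shapes.txt, as tuples of
# (shape_id, shape_pt_sequence, shape_pt_lat, shape_pt_lon).


def points_by_shape_nested(rows):
    """Slow pattern: for each shape id, rescan every row (N shapes x M rows)."""
    shape_ids = {row[0] for row in rows}
    result = {}
    for shape_id in shape_ids:
        matching = [row for row in rows if row[0] == shape_id]
        result[shape_id] = sorted(matching, key=lambda r: r[1])
    return result


def points_by_shape_grouped(rows):
    """Single pass: bucket rows by shape id, then sort each bucket by sequence."""
    buckets = defaultdict(list)
    for row in rows:
        buckets[row[0]].append(row)
    return {sid: sorted(pts, key=lambda r: r[1]) for sid, pts in buckets.items()}
```

Both functions return the same mapping, but the grouped version touches each row once before sorting, so it should scale far better on a feed with tens of millions of shape rows.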
Description
Acceptance tests kept running without stopping and had to be canceled.
Examples of long-running tests: