feat: add memory metrics to acceptance tests #1874

davidgamez · 2024-10-07T20:45:47Z

Description

Memory consumption depends on the state of the JVM at the moment the memory is sampled, so there can be a considerable difference between the reference (master) and latest (PR) executions.
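To illustrate why the numbers move between runs, here is a minimal sketch of sampling used heap through `java.lang.Runtime`. The class and method names are hypothetical and are not taken from this PR; the point is only that the value reflects whatever the garbage collector has or has not reclaimed at that instant.

```java
// Hypothetical helper for illustration; not the collector used in this PR.
final class MemorySampleSketch {

  /** Returns the heap bytes in use as reported by the JVM at this instant. */
  static long usedMemoryBytes() {
    Runtime runtime = Runtime.getRuntime();
    return runtime.totalMemory() - runtime.freeMemory();
  }

  public static void main(String[] args) {
    long before = usedMemoryBytes();
    byte[][] blocks = new byte[64][];
    for (int i = 0; i < blocks.length; i++) {
      blocks[i] = new byte[1024 * 1024]; // allocate 1 MiB per block
    }
    long after = usedMemoryBytes();
    // The difference is signed: a garbage collection between the two samples
    // can make "after" smaller than "before", so the result varies per run.
    System.out.println("blocks held: " + blocks.length);
    System.out.println("difference in bytes: " + (after - before));
  }
}
```

Running this several times usually prints different differences, which is the same effect that shows up when comparing reference and latest reports.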

From our AI friend ;-) :

This pull request includes significant changes to the output-comparator module, focusing on refactoring the memory usage comparison logic and improving the performance metrics collection. The most important changes include the removal of the BoundedPriorityQueue class, the introduction of new comparators, and the refactoring of the ValidationPerformanceCollector class to streamline memory usage reporting.

Memory Usage Comparison Refactor:

Removed the BoundedPriorityQueue class, which was previously used for maintaining bounded priority queues of dataset memory usage. (BoundedPriorityQueue.java)
Introduced the UsedMemoryDecreasedComparator class to compare DatasetMemoryUsage objects based on the minimum difference in used memory. (UsedMemoryDecreasedComparator.java)
Updated the UsedMemoryIncreasedComparator class to compare DatasetMemoryUsage objects based on the maximum difference in used memory, reversing the comparison for sorting purposes. (UsedMemoryIncreasedComparator.java)
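As a rough sketch of the two orderings described above, assuming each dataset carries a single signed difference in used memory: the `Usage` class and its field are hypothetical stand-ins for `DatasetMemoryUsage`, whose real shape is not shown in this thread.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

final class MemoryComparatorSketch {

  /** Hypothetical stand-in for DatasetMemoryUsage: one signed difference per dataset. */
  static final class Usage {
    final String datasetId;
    final long usedMemoryDiffBytes; // latest minus reference (assumption)

    Usage(String datasetId, long usedMemoryDiffBytes) {
      this.datasetId = datasetId;
      this.usedMemoryDiffBytes = usedMemoryDiffBytes;
    }
  }

  /** Largest increase first: natural order on the difference, reversed. */
  static final Comparator<Usage> INCREASED_FIRST =
      Comparator.comparingLong((Usage u) -> u.usedMemoryDiffBytes).reversed();

  /** Largest decrease first: the most negative difference sorts to the front. */
  static final Comparator<Usage> DECREASED_FIRST =
      Comparator.comparingLong((Usage u) -> u.usedMemoryDiffBytes);

  /** Returns the dataset ids of a sorted copy; the input list is left untouched. */
  static List<String> sortedIds(List<Usage> usages, Comparator<Usage> order) {
    List<Usage> copy = new ArrayList<>(usages);
    copy.sort(order);
    List<String> ids = new ArrayList<>();
    for (Usage usage : copy) {
      ids.add(usage.datasetId);
    }
    return ids;
  }

  public static void main(String[] args) {
    List<Usage> usages = new ArrayList<>();
    usages.add(new Usage("a", -100));
    usages.add(new Usage("b", 500));
    usages.add(new Usage("c", 20));
    System.out.println(sortedIds(usages, INCREASED_FIRST)); // [b, c, a]
    System.out.println(sortedIds(usages, DECREASED_FIRST)); // [a, c, b]
  }
}
```

Sorting a plain list with a comparator and then truncating it is one way to get the same top-N view that a bounded priority queue would give.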

Validation Performance Collection Refactor:

Refactored the ValidationPerformanceCollector class to remove the use of BoundedPriorityQueue and instead use lists for memory usage tracking. (ValidationPerformanceCollector.java)
Consolidated the performance metrics calculation into a new PerformanceMetrics class and added methods to compute average, median, standard deviation, min, and max values. (ValidationPerformanceCollector.java) [1] [2]
Simplified the generation of performance metrics logs by introducing a helper method generatePerformanceMetricsLog to format and append metrics to the log string. (ValidationPerformanceCollector.java) [1] [2]
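For readers who want the statistics spelled out, a minimal sketch follows. It is not the `PerformanceMetrics` class from this PR: all names are hypothetical, the standard deviation is assumed to be the population form, and `formatRow` only imitates the tab-separated rows in the report.

```java
import java.util.Arrays;
import java.util.Locale;

final class PerformanceMetricsSketch {

  private static void requireNonEmpty(double[] values) {
    if (values.length == 0) {
      throw new IllegalArgumentException("at least one value is required");
    }
  }

  static double average(double[] values) {
    requireNonEmpty(values);
    return Arrays.stream(values).sum() / values.length;
  }

  static double median(double[] values) {
    requireNonEmpty(values);
    double[] sorted = values.clone(); // sort a copy, keep the caller's order
    Arrays.sort(sorted);
    int mid = sorted.length / 2;
    return sorted.length % 2 == 1 ? sorted[mid] : (sorted[mid - 1] + sorted[mid]) / 2.0;
  }

  /** Population standard deviation (divides by n); an assumption of this sketch. */
  static double standardDeviation(double[] values) {
    double mean = average(values);
    double sumOfSquares = 0.0;
    for (double value : values) {
      sumOfSquares += (value - mean) * (value - mean);
    }
    return Math.sqrt(sumOfSquares / values.length);
  }

  static double min(double[] values) {
    requireNonEmpty(values);
    return Arrays.stream(values).min().getAsDouble();
  }

  static double max(double[] values) {
    requireNonEmpty(values);
    return Arrays.stream(values).max().getAsDouble();
  }

  /** Formats one tab-separated row: metric, reference, latest, signed difference. */
  static String formatRow(String metric, double reference, double latest) {
    return String.format(
        Locale.ROOT, "%s\t%.2f\t%.2f\t%+.2f", metric, reference, latest, latest - reference);
  }

  public static void main(String[] args) {
    double[] reference = {2, 4, 4, 4, 5, 5, 7, 9};
    double[] latest = {3, 4, 4, 5, 5, 6, 7, 9};
    System.out.println(formatRow("Average", average(reference), average(latest)));
    System.out.println(formatRow("Median", median(reference), median(latest)));
    System.out.println(
        formatRow("Standard Deviation", standardDeviation(reference), standardDeviation(latest)));
  }
}
```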

These changes aim to improve the maintainability and readability of the memory usage comparison logic and enhance the performance metrics reporting in the output-comparator module.

github-actions · 2024-10-09T17:03:58Z

This contribution does not follow the conventions set by the Google Java style guide. Please run the following command line at the root of the project to fix formatting errors: ./gradlew goJF.

github-actions · 2024-10-11T16:28:51Z

📝 Acceptance Test Report

📋 Summary

✅ The rule acceptance has passed for commit 265d9a2
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (0 out of 1588 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Errors (0 out of 1588 datasets, ~0%) ✅

No changes were detected due to the code change.

New Warnings (0 out of 1588 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Warnings (0 out of 1588 datasets, ~0%) ✅

No changes were detected due to the code change.

🛡️ Corruption Check

0 out of 1588 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric	Dataset ID	Reference (s)	Latest (s)	Difference (s)
Average	--	3.98	4.03	⬆️+0.05
Median	--	1.39	1.41	⬆️+0.03
Standard Deviation	--	11.34	11.50	⬆️+0.16
Minimum in Reference Reports	us-oregon-high-desert-point-gtfs-636	0.50	0.56	⬆️+0.06
Maximum in Reference Reports	gb-unknown-uk-aggregate-feed-gtfs-2014	290.38	295.16	⬆️+4.78
Minimum in Latest Reports	us-california-catalina-express-gtfs-299	0.52	0.50	⬇️-0.02
Maximum in Latest Reports	gb-unknown-uk-aggregate-feed-gtfs-2014	290.38	295.16	⬆️+4.78

📜 Memory Consumption

Metric	Dataset ID	Reference	Latest	Difference
Average	--	480.91 MiB	489.40 MiB	⬆️+8.49 MiB
Median	--	245.34 MiB	248.00 MiB	⬆️+2.67 MiB
Standard Deviation	--	854.66 MiB	880.42 MiB	⬆️+25.76 MiB
Minimum in Reference Reports	tr-kocaeli-metro-izmir-gtfs-1824	34.05 MiB	34.10 MiB	⬆️+48.00 KiB
Maximum in Reference Reports	gb-unknown-uk-aggregate-feed-gtfs-2014	10.00 GiB	9.82 GiB	⬇️-185.38 MiB
Minimum in Latest Reports	us-california-flex-v2-developer-test-feed-1-gtfs-1817	34.05 MiB	34.05 MiB	⬇️0 bytes
Maximum in Latest Reports	gb-unknown-uk-aggregate-feed-gtfs-2014	10.00 GiB	9.82 GiB	⬇️-185.38 MiB
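The memory values in this table use binary units and signed differences. The formatter that produced them is not shown in this thread; the following is only a sketch of how such strings could be built, with hypothetical names and an assumed two-decimal layout.

```java
import java.util.Locale;

final class MemoryFormatSketch {

  private static final String[] UNITS = {"KiB", "MiB", "GiB", "TiB"};

  /** Formats a byte count with binary units; negative inputs keep their sign. */
  static String formatBytes(long bytes) {
    long abs = Math.abs(bytes); // note: Long.MIN_VALUE is not handled in this sketch
    String sign = bytes < 0 ? "-" : "";
    if (abs < 1024) {
      return sign + abs + " bytes";
    }
    double value = abs;
    int unit = -1;
    // Divide by 1024 until the value fits the unit or the largest unit is reached.
    while (value >= 1024 && unit < UNITS.length - 1) {
      value /= 1024;
      unit++;
    }
    return String.format(Locale.ROOT, "%s%.2f %s", sign, value, UNITS[unit]);
  }

  /** Formats a difference, adding an explicit plus sign for increases. */
  static String formatDifference(long diffBytes) {
    return (diffBytes > 0 ? "+" : "") + formatBytes(diffBytes);
  }

  public static void main(String[] args) {
    System.out.println(formatBytes(10L * 1024 * 1024 * 1024)); // 10.00 GiB
    System.out.println(formatDifference(48L * 1024)); // +48.00 KiB
    System.out.println(formatDifference(-16L * 1024)); // -16.00 KiB
    System.out.println(formatDifference(0)); // 0 bytes
  }
}
```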

github-actions · 2024-10-15T18:06:50Z

📝 Acceptance Test Report

📋 Summary

✅ The rule acceptance has passed for commit 940ff27
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (0 out of 1601 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Errors (0 out of 1601 datasets, ~0%) ✅

No changes were detected due to the code change.

New Warnings (0 out of 1601 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Warnings (0 out of 1601 datasets, ~0%) ✅

No changes were detected due to the code change.

🛡️ Corruption Check

0 out of 1601 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric	Dataset ID	Reference (s)	Latest (s)	Difference (s)
Average	--	3.99	4.04	⬆️+0.05
Median	--	1.39	1.41	⬆️+0.03
Standard Deviation	--	11.44	11.44	⬆️+0.00
Minimum in Reference Reports	us-oregon-hut-airport-shuttle-gtfs-635	0.52	0.59	⬆️+0.07
Maximum in Reference Reports	gb-unknown-uk-aggregate-feed-gtfs-2014	295.53	295.97	⬆️+0.44
Minimum in Latest Reports	us-massachusetts-massachusetts-area-express-max-gtfs-431	0.60	0.52	⬇️-0.07
Maximum in Latest Reports	gb-unknown-uk-aggregate-feed-gtfs-2014	295.53	295.97	⬆️+0.44

📜 Memory Consumption

Metric	Dataset ID	Reference	Latest	Difference
Average	--	485.51 MiB	483.42 MiB	⬇️-2.10 MiB
Median	--	246.06 MiB	248.08 MiB	⬆️+2.02 MiB
Standard Deviation	--	879.81 MiB	874.43 MiB	⬇️-5.38 MiB
Minimum in Reference Reports	tr-kocaeli-metro-izmir-gtfs-1824	34.05 MiB	34.05 MiB	⬆️+8.00 KiB
Maximum in Reference Reports	gb-unknown-uk-aggregate-feed-gtfs-2014	10.00 GiB	10.21 GiB	⬆️+210.13 MiB
Minimum in Latest Reports	us-oregon-hut-airport-shuttle-gtfs-635	34.06 MiB	34.05 MiB	⬇️-16.00 KiB
Maximum in Latest Reports	gb-unknown-uk-aggregate-feed-gtfs-2014	10.00 GiB	10.21 GiB	⬆️+210.13 MiB

cka-y

LGTM -- it made the memory consumption much easier to read

davidgamez added 22 commits September 30, 2024 16:33

add memory usage records to the JSON report

f16f52e

downgrade aspectj dependencies to be compatible with jdk 11

3efa591

add memory usage to validator comparator

5f6c71b

run acceptance tests with sample data

848d798

fix memory usage serialization

e89816c

fix performance collector

b66828f

fix npe

0416d1c

support negative memory usage for logging

0cc1833

simplify memory usage report

f6789c8

Merge branch 'master' into chore/memory-monitor

e59614c

fix compilation issue

3efd975

add feeds with no reference

39182dd

add no references to the report

06fe749

fix failing tests

a70accf

fix memory table formatting

b10320d

sort feeds on the no reference list and limit them to 25 maximum items

97678ee

add documentation and sort memory usage for feed with no reference

94b9077

sorting from the highest to the lowest memory usage

98c0275

improve acceptance tests documentation

a16fdee

Merge branch 'master' into chore/memory-monitor

f95843d

revert acceptance tests sample running

db36328

remove large feeds from exclude list

2a7a1f6

davidgamez marked this pull request as ready for review October 7, 2024 20:45

davidgamez added the "do not merge" label (This PR needs more work/discussion or is not meant to be merged) Oct 7, 2024

davidgamez changed the title ~~Test/large feeds~~ test: large feeds with ci pipeline Oct 7, 2024

davidgamez added 2 commits October 9, 2024 12:59

Merge branch 'master' into test/large-feeds

8fa5651

Merge branch 'master' into test/large-feeds

f9ba38e

davidgamez added 2 commits October 9, 2024 13:05

fix formatting

5f644fc

fix ordering

c7ef809

davidgamez added 5 commits October 9, 2024 14:52

fix unit test

b50f9bf

add decreased memory comparator

4bd6064

add memory metrics

053cdbe

fix comment formatting

2513c23

fix invalid references

8a304bb

MobilityData deleted a comment from github-actions bot Oct 10, 2024

davidgamez changed the title ~~test: large feeds with ci pipeline~~ feat: add memory metrics to acceptance tests Oct 11, 2024

remove memory full list

ce54a4d

MobilityData deleted a comment from github-actions bot Oct 11, 2024

davidgamez added 2 commits October 11, 2024 11:41

delete unused comparators

b372c2c

delete unused code

f0bf1eb

davidgamez removed the "do not merge" label Oct 11, 2024

MobilityData deleted a comment from github-actions bot Oct 15, 2024

davidgamez requested a review from cka-y October 15, 2024 17:29

Merge branch 'master' into test/large-feeds

8bc962f

davidgamez requested a review from jcpitre October 15, 2024 18:51

cka-y approved these changes Oct 16, 2024

davidgamez merged commit 7426549 into master Oct 16, 2024
335 checks passed

davidgamez deleted the test/large-feeds branch October 16, 2024 14:50

davidgamez mentioned this pull request Oct 28, 2024

GitHub workflow action failures for large feeds #1304

Closed
