Test history refactoring and improvements #625

mdealer · 2024-07-09T13:33:57Z

This is an update of #446

Links to HPI iterations if you want to try it:

https://github.com/mdealer/junit-plugin/releases/

TL;DR:
Major performance and usability improvements to the test history file storage, history charts and test failure age computation.

Improve large JUnit test history performance and usability. RAM and CPU resources are better utilized to parse and cache the test results for faster retrieval.

JUnit SQL Storage plugin does not solve the problem fully as there are too many requests generated by the existing implementation.

To fully benefit from this, assign more CPU cores, e.g. 8+, and increase the Java heap size, e.g. -Xmx8g.

Changes:

Test result XML files (junitResult.xml) are now parsed using an XML reader instead of XStream.
Test result history is loaded in parallel.
The test result history window (start and end build number) is now dynamic, allowing better navigation to past results via the web page.
Test result history can now handle much larger test and build counts, in the thousands.
Test history graphs have been reimplemented to avoid the build trend plugin and use ECharts directly for greater flexibility.
Test result age now scans back over at most 25 builds without test results instead of resetting the age at the first failed build.
The test trend graph is shown more often on the build page (some issues still remain in Jenkins core).
Some limited support for Dark Reader; without it the graphs may turn unreadable.
Added a new parameter to the junit pipeline step: keepTestNames, for cases when your results already contain a namespace.
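
The age change in the list above can be pictured with a small, self-contained Java sketch. This is not the plugin's code: the class name `FailureAgeSketch`, the `Outcome` enum and the assumption that the 25-build limit applies to consecutive builds without results are all illustrative guesses based on the wording above.

```java
import java.util.List;

// Hypothetical illustration only; names and exact semantics are assumptions.
final class FailureAgeSketch {
    /** Outcome of one test in one build; NO_RESULT means the build recorded no test results. */
    enum Outcome { PASSED, FAILED, NO_RESULT }

    /** Assumed limit, taken from the "at most 25 builds" wording in the change list. */
    static final int MAX_GAP = 25;

    /**
     * Counts for how many builds with results the test has been failing,
     * walking from the newest build to the oldest.
     */
    static int failureAge(List<Outcome> newestFirst) {
        int age = 0;
        int gap = 0; // consecutive builds without results seen so far
        for (Outcome outcome : newestFirst) {
            if (outcome == Outcome.NO_RESULT) {
                if (++gap > MAX_GAP) {
                    break; // too many builds without results: stop scanning
                }
                continue; // skip the gap instead of resetting the age
            }
            gap = 0;
            if (outcome != Outcome.FAILED) {
                break; // the failure streak ends at the first passing result
            }
            age++;
        }
        return age;
    }

    public static void main(String[] args) {
        List<Outcome> history = List.of(
                Outcome.FAILED, Outcome.NO_RESULT, Outcome.FAILED, Outcome.PASSED, Outcome.FAILED);
        System.out.println(failureAge(history)); // prints 2
    }
}
```

Under these assumptions a single build without results no longer resets the age, while a gap longer than the limit still ends the scan.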

The binary is now bigger due to a third-party math library used for processing historical results.

The more CPU cores are available, the faster the result XML files are parsed and the quicker the initial and subsequent history page loads. The larger the heap, the more test results remain cached (using SoftReferences) and the lower the CPU and disk I/O usage when browsing the results back and forth.
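
To make the heap-versus-CPU trade-off concrete, here is a minimal Java sketch of a soft-reference cache. It is not the plugin's implementation: `SoftResultCache` and the `loader` callback are hypothetical names, and the sketch only assumes what is said above, namely that parsed results are held through SoftReferences so the JVM can drop them under memory pressure and they are re-parsed on the next access.

```java
import java.lang.ref.SoftReference;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

// Hypothetical illustration only; not the plugin's cache class.
final class SoftResultCache<V> {
    // Instance field (not static), so the cache lives and dies with its owner.
    private final ConcurrentHashMap<String, SoftReference<V>> entries = new ConcurrentHashMap<>();

    /**
     * Returns the cached value for the key, or loads it again if it was never
     * cached or the garbage collector has already cleared the soft reference.
     * Two threads may occasionally load the same key at once; the last write wins.
     */
    V get(String key, Function<String, V> loader) {
        SoftReference<V> ref = entries.get(key);
        V value = (ref == null) ? null : ref.get();
        if (value == null) {
            value = loader.apply(key); // expensive step, e.g. parsing a result file
            entries.put(key, new SoftReference<>(value));
        }
        return value;
    }

    public static void main(String[] args) {
        SoftResultCache<String> cache = new SoftResultCache<>();
        String first = cache.get("junit_test/1", k -> "parsed:" + k);
        String second = cache.get("junit_test/1", k -> "parsed:" + k);
        System.out.println(first == second); // same cached instance while it is strongly reachable
    }
}
```

With a larger heap the collector clears soft references less often, so fewer results have to be parsed again when paging through the history.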

New parameter in 'junit' pipeline step: keepTestNames

Allows disabling the test name mangling when uploading results from multiple stages or parallel branches in a single build, via the 'keepTestNames' parameter. A change in the plugin a few years ago effectively broke the result URLs (e.g. when linking to the results from some external system); this parameter is a way to undo that change.

def c = {
    node(null) {
        writeFile file: 'result.xml', text: '<testsuite><testcase time="2.6875" classname="MyTest" name="Run 0" assertions="3 total, 0 failed, 3 succeeded"/></testsuite>'
        junit keepTestNames: true, testResults: 'result.xml'
    }
}

parallel 'p1': c, 'p2': c

Related tickets that are potentially fixed

There are various already reported issues that I also encountered along the way. This list must be double-checked, but based on the code I looked at, there is a very high chance that most of the 'closed' tickets below were wrongly closed and that this PR improves the situation for them.

Status

TODOs:

Testing done

Live 'tested' for over 1 year, and resynced multiple times in between, so we are talking about multiple Jenkins versions and thousands of builds with thousands of test results each.

The UI changes are functional, but I am not familiar enough with Jenkins to make them fully clean. I think that should be done as part of another PR.

The existing tests should cover nearly all of the changes.

There are no 'before' screenshots because before it didn't work: the history page loaded forever.

See it in action

If you want to test it with a large number of tests, create a Pipeline job, call it 'junit_test' and add this Groovy script to it:

def c = { target ->
    node(null) {
        //if (params.NUM?.toInteger() % 35 < Math.random() * 35) {
            def sb = new StringBuilder()
            sb.append('<testsuite>')
            def bn = (env.BUILD_NUMBER.toInteger() % 3) + 1
            def sets = []
            for (int i = 0; i < 20; ++i) {
                if (Math.random() > 0.2) {
                    sets.add(target + '.S' + i)
                }
            }
            
            def sr = Math.random()
            sets.each { set ->
                def failOnce = false
                int i = 0;
                for (; i < 200 + (env.BUILD_NUMBER.toInteger() * 0.01 * (Math.random() / 100 + 0.9)); ++i) {
                    def duration = env.BUILD_NUMBER.toInteger() / 10000 + Math.random() * 50 + Math.random() * Math.random() * 50 - 10000 * Math.random()
                    if (i % bn == 0 && sr < 0.3 && (sr < 0.1 ? !failOnce : true)) {
                        failOnce = true
                        sb.append("<testcase time=\"${duration}\" classname=\"${set}.SomeLongLongLongLongLongTestName_${i}\" name=\"Run 0\" assertions=\"3 total, 0 failed, 3 succeeded\">")
                        sb.append("<failure message=\"failure message ${i}\" type=\"Failure\"/>")
                        sb.append("</testcase>")
                    } else {
                        sb.append("<testcase time=\"${duration}\" classname=\"${set}.SomeLongLongLongLongLongTestName_${i}\" name=\"Run 0\" assertions=\"3 total, 0 failed, 3 succeeded\"/>")
                    }
                }
                int start = i
                for (; i < start + 40 + (env.BUILD_NUMBER.toInteger() * 0.01); ++i) {
                    sb.append("<testcase time=\"0\" classname=\"${set}.SomeLongLongLongLongLongTestName_${i}\" name=\"Run 0\" assertions=\"3 total, 0 failed, 3 succeeded\">")
                    sb.append("<skipped type=\"Skipped\" message=\"This test was disabled.\"/>")
                    sb.append("</testcase>")
                }
            }
            sb.append('</testsuite>')
            writeFile file: 'result.xml', text: sb.toString()
            junit keepTestNames: true, testResults: 'result.xml'
            archiveArtifacts 'result.xml'
        //}
    }
}
def p1c = { c('p1') }
def p2c = { c('p2') }
def m = [ 'p1': p1c, 'p2': p2c ]
parallel m

Then run the job thousands of times, or as often as you like. Running takes some time, but navigating the results should still be fast. The NUM parameter is only needed to produce some builds without test results in between; you can use it if you want.

To speed things up, set up a second trigger job that executes this:

for (int i = 0; i < 500; ++i) {
    build quietPeriod: 0, wait: false, propagate: false, job: 'junit_test' //, parameters: [string(name: 'NUM', value: i.toString())]
}

mdealer · 2024-07-23T12:51:02Z

I discovered that properties are not loaded properly and I am fixing it (properties were added by another PR in the last year or so and I forgot to implement the parsing).

I will also see if I can implement a test for writing+parsing.
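
As a rough picture of what explicit property parsing with a streaming XML reader involves, here is a minimal Java sketch using the standard StAX API. It is not the plugin's parser: the class name `PropertiesParseSketch` is hypothetical, and the `<property name="..." value="..."/>` layout is the JUnit report style, assumed here for illustration; the plugin's junitResult.xml may store properties differently.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical illustration only; the element layout is an assumption.
final class PropertiesParseSketch {
    /** Collects name/value pairs from every <property> element, in document order. */
    static Map<String, String> readProperties(String xml) {
        Map<String, String> properties = new LinkedHashMap<>();
        XMLInputFactory factory = XMLInputFactory.newInstance();
        // Disable DTDs and external entities, since result files are untrusted input.
        factory.setProperty(XMLInputFactory.SUPPORT_DTD, false);
        factory.setProperty(XMLInputFactory.IS_SUPPORTING_EXTERNAL_ENTITIES, false);
        try {
            XMLStreamReader reader = factory.createXMLStreamReader(new StringReader(xml));
            try {
                while (reader.hasNext()) {
                    if (reader.next() == XMLStreamConstants.START_ELEMENT
                            && "property".equals(reader.getLocalName())) {
                        String name = reader.getAttributeValue(null, "name");
                        String value = reader.getAttributeValue(null, "value");
                        if (name != null) {
                            properties.put(name, value);
                        }
                    }
                }
            } finally {
                reader.close();
            }
        } catch (XMLStreamException e) {
            throw new IllegalArgumentException("Malformed test result XML", e);
        }
        return properties;
    }

    public static void main(String[] args) {
        String xml = "<testsuite><properties>"
                + "<property name=\"os\" value=\"linux\"/>"
                + "</properties><testcase classname=\"MyTest\" name=\"Run 0\"/></testsuite>";
        System.out.println(readProperties(xml));
    }
}
```

A round-trip test for writing and parsing would then write a result with known properties, read it back through the same reader and compare the two maps.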

timja · 2024-07-24T12:39:22Z

Any final concerns or shall we ship?

timja · 2024-07-24T14:22:55Z

Going to ship now to benefit from some of @mdealer's availability if required.

timja · 2024-07-24T15:02:42Z

Thanks @mdealer!

Released in https://github.com/jenkinsci/junit-plugin/releases/tag/1279.v72cf99b_25c43

basil · 2024-08-07T00:33:50Z

pom.xml

+        <dependency>
+            <groupId>ca.umontreal.iro.simul</groupId>
+            <artifactId>ssj</artifactId>
+            <version>3.3.1</version>


This was outdated at the time of integration. I think I have managed to upgrade it in #628.

basil · 2024-08-07T00:34:55Z

pom.xml

+        <dependency>
+            <groupId>com.pivovarit</groupId>
+            <artifactId>parallel-collectors</artifactId>
+            <version>2.6.1</version>


This is up-to-date, but Renovate is trying to upgrade it to version 3.x in e.g. #629, which won't work because 3.x requires Java 21 or newer. Can we update the Renovate configuration to exclude 3.x until we are ready to require Java 21 or newer? That way, we will continue to get updates to version 2.x.

basil · 2024-09-06T15:16:52Z

Breaks org.jenkinsci.plugins.parallel_test_executor.ParallelTestExecutorTest.unloadableTestResult with

java.lang.AssertionError: 

Expected: a string containing "Build #2 has no loadable test results (supposed count 1), skipping"
     but: was "Started
[Pipeline] Start of Pipeline
[Pipeline] splitTests
Using build #2 as reference
1 test classes (0ms) divided into 1 sets. Min=0ms, Average=0ms, Max=0ms, stddev=0ms
[Pipeline] End of Pipeline
Finished: SUCCESS
"
        at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:20)
        at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:6)
        at org.jvnet.hudson.test.JenkinsRule.assertLogContains(JenkinsRule.java:1570)
        at org.jenkinsci.plugins.parallel_test_executor.ParallelTestExecutorTest.unloadableTestResult(ParallelTestExecutorTest.java:124)
        at java.base/java.lang.reflect.Method.invoke(Method.java:569)
        at org.jvnet.hudson.test.JenkinsRule$1.evaluate(JenkinsRule.java:655)
        at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
        at java.base/java.lang.Thread.run(Thread.java:840)

jglick · 2024-09-06T15:49:12Z

src/main/java/hudson/tasks/junit/TestResultAction.java

+        return r;
+    }
+
+    static ConcurrentHashMap<String, SoftReference<TestResult>> resultCache = new ConcurrentHashMap<>();


This is incorrect FTR: Jenkins plugins should never keep state in static fields. Rather use instance fields of some @Extension. Noticed because of a test failure in jenkinsci/parallel-test-executor-plugin#296 where we try to simulate a Jenkins restart without using RealJenkinsRule and thus a new JVM.

mdealer · 2024-09-09

I am not that familiar with the Jenkins extension mechanism; I just assumed that it's OK because

it is in a gray area, between the state of the plugin and that of the file system

plugins cannot be unloaded

Maybe there is a good example of how to do it?

jglick · 2024-09-09

In practice this mainly causes trouble for functional tests. If there is not already an @Extension which would be a natural home for the state, you can simply add one:

@Extension
public static final class TheCache {
    static TheCache get() {
        return ExtensionList.lookupSingleton(TheCache.class);
    }
    private final Cache<Whatever, Else> data = Caffeine.newCache().orSomething();
    // etc.
}

Edgars Batna added 30 commits March 3, 2023 14:09

Allow disabling test name mangling (keepTestNames).

3dfeb90

Fix tests.

589bcc1

Improve comment.

5deff15

Assume history always available.

84367f8

More logging. Parallel history stream.

Paralellize test history handling to unbreak it with large amounts of…

b11b825

… tests. Add timeout and build count limit.

Implement explicit XML parsing to avoid slow reflection access.

c29f0cc

Make test history dynamic and increase table size.

379ef56

Fix trend charts not shown after carousel slide.

8ba78e7

Fix carousel not redrawing.

0bba408

Reduce trend chart loading times. Display same count of builds in trend chart as in table. Workaround for jQuery issue.

Show same range as table in test result trend charts.

165207c

Show test status in duration chart.

3cb7ab3

Support decimal duration. Increase builds in view to 100.

Remove the other test history chart.

e2d7b54

Round maximum up if over 0.5.

Support 'count' parameter in history URLs.

6a7f600

Reuse the retrieved history for chart generator to avoid multiple requests.

Improve test history appearance.

7f2a9e7

Improve test history appearance.

673397a

Fix click on chart.

Remove carousel as charts are no longer async (and because it was a b…

60d9ef8

…it goofy). Improve chart appearance. Show both charts.

Make it compatible to XUnit plugin.

fae3c33

Improve test history appearance.

d195628

Add history size links. Improve history page load time.

Put both charts into one.

0142b86

Support Dark Reader to some extent.

d68d1c9

Add Total line. More distinct line colors. Improve overall appearance.

Improve test history appearance.

8a682dc

Improve chart appearance.

21e7a9c

Fix history URL.

80f2727

Implement cache for TestResult.

03f950b

Increase age computation window to 25 to skip over builds without results.

Less sorting on test result freeze.

955443b

Improve test history chart appearance.

65db008

Fix test compilation error. Update POM.

Update POM.

ad7ec09

Fix JUnit test result writes not updated in cache.

81205f6

Fix history chart layout with only few results.

Update POM.

72589da

Fix build.

81c21f5

Merge branch 'test-history-refactor-1265' of github.com:mdealer/junit…

4ca39a7

…-plugin into test-history-refactor-1265

timja mentioned this pull request Jul 23, 2024

Test junit refactor jenkinsci/bom#3394

Closed

6 tasks

timja reviewed Jul 23, 2024

src/main/java/hudson/tasks/junit/ClassResult.java

mdealer pushed a commit to mdealer/junit-attachments-plugin that referenced this pull request Jul 23, 2024

Prepare for API change in JUnit.

0018bf0

See jenkinsci/junit-plugin#625

mdealer pushed a commit to mdealer/junit-attachments-plugin that referenced this pull request Jul 23, 2024

Prepare for API change in JUnit.

12db0eb

See jenkinsci/junit-plugin#625

Edgars Batna added 2 commits July 23, 2024 14:05

Fix properties not loaded.

6cb42d1

Merge branch 'test-history-refactor-1265' of https://github.com/mdeal…

d958cd6

…er/junit-plugin into test-history-refactor-1265

mdealer mentioned this pull request Jul 23, 2024

Prepare for API change in JUnit. jenkinsci/junit-attachments-plugin#127

Merged

6 tasks

Implement XML parsing for more fields (including properties).

31b862c

Implement related test.

timja mentioned this pull request Jul 23, 2024

Bump JUnit plugin version jenkinsci/junit-attachments-plugin#128

Closed

6 tasks

jglick mentioned this pull request Jul 23, 2024

Avoid calling ClassResult.getChildren from tests jenkinsci/junit-attachments-plugin#129

Merged

timja merged commit 72cf99b into jenkinsci:master Jul 24, 2024
16 checks passed

MarkEWaite added a commit to MarkEWaite/docker-lfs that referenced this pull request Jul 24, 2024

Use junit plugin 1279.v72cf99b_25c43

2bee40f

jenkinsci/junit-plugin#625 includes history improvements and performance improvements. Especially helpful for large JUnit results.

MarkEWaite mentioned this pull request Jul 25, 2024

Install most recent JUnit plugin on ci.jenkins.io and friends jenkins-infra/helpdesk#4197

Closed

timja mentioned this pull request Jul 29, 2024

Restore compatiblity with junit-sql-storage #630

Merged

6 tasks

basil reviewed Aug 7, 2024

timja mentioned this pull request Aug 14, 2024

Skip 3.x of parallel-collectors in renovate #635

Merged

This was referenced Sep 6, 2024

Bump BOM to fix JUnit issues jenkinsci/parallel-test-executor-plugin#296

Closed

Recent refactoring broke Parallel Test Executor #644

Closed

jglick reviewed Sep 6, 2024

jglick added a commit to jglick/parallel-test-executor-plugin that referenced this pull request Sep 6, 2024

Working around jenkinsci/junit-plugin#625 (comment)

ab594bc

dwnusbaum mentioned this pull request Sep 10, 2024

Avoid String concatenation in compareTo #645

Merged

6 tasks
