
Do we know why random tests are failing? #388

Open
neukym opened this issue Oct 23, 2024 · 4 comments · May be fixed by #392
Assignees
Labels
🪲 bug Something isn't working 📋 tests

Comments

@neukym
Member

neukym commented Oct 23, 2024

Do we know why random tests are failing during CI? They fail, and then pass on the second attempt - it's a bit annoying for something that should be reproducible.

@caiw
Member

caiw commented Oct 25, 2024

It's bizarre, isn't it? Here is one example failure:

=========================== short test summary info ============================
FAILED tests/test_expression.py::test_hes_rename_functions_just_one - assert <kymata.entit...x7f36ca0be350> == <kymata.entit...x7f36ca193010>

  Full diff:
  - <kymata.entities.expression.HexelExpressionSet object at 0x7f36ca193010>
  ?                                                                  ^^ ^^
  + <kymata.entities.expression.HexelExpressionSet object at 0x7f36ca0be350>
  ?                                                                  ^^^ ^
=========== 1 failed, 146 passed, 3 skipped, 290 warnings in 11.28s ============
Error: Process completed with exit code 1.

And the intermittent failures I've inspected always look similar: an equality assertion fails, and the diff shows only the two objects' memory addresses.

However, looking at that test, test_hes_rename_functions_just_one():

import numpy as np
from kymata.entities.expression import HexelExpressionSet


def test_hes_rename_functions_just_one():
    data_left = [np.random.randn(5, 10) for _ in range(2)]
    data_right = [np.random.randn(5, 10) for _ in range(2)]

    es = HexelExpressionSet(
        functions=["first", "second"],
        hexels_lh=range(5),
        hexels_rh=range(5),
        latencies=range(10),
        data_lh=data_left,
        data_rh=data_right,
    )
    target_es = HexelExpressionSet(
        functions=["first_renamed", "second"],
        hexels_lh=range(5),
        hexels_rh=range(5),
        latencies=range(10),
        data_lh=data_left,
        data_rh=data_right,
    )
    assert es != target_es
    es.rename(functions={"first": "first_renamed"})
    assert es == target_es

That should not be failing intermittently.

Here's the only thing I can imagine:

Tests for ExpressionSet equality are done by comparing data blocks (and other things). This involves many float comparisons, and the data blocks in this test are randomly generated. Perhaps some random floats fail the equality comparison because of some spooky nondeterministic behaviour in the GitHub CI runner. (I've never made it fail by running locally.)

If this is the case, then using non-random data would at least fix this issue, even if it didn't fully explain it.
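As a sketch of the non-random-data fix: seeding the generator makes the test data reproducible across runs, so any remaining failure would have to come from the comparison itself rather than the data. (The rng variable names and the seed value here are ours, not from the kymata-core test suite.)

```python
import numpy as np

# Seed the RNG so the test data is bit-identical on every CI run.
rng = np.random.default_rng(seed=388)
data_left = [rng.standard_normal((5, 10)) for _ in range(2)]
data_right = [rng.standard_normal((5, 10)) for _ in range(2)]

# Regenerating from the same seed yields exactly the same arrays,
# so exact float equality is safe here.
rng2 = np.random.default_rng(seed=388)
data_left_again = [rng2.standard_normal((5, 10)) for _ in range(2)]
assert all(np.array_equal(a, b) for a, b in zip(data_left, data_left_again))
```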

@caiw
Member

caiw commented Oct 25, 2024

Or it could be nondeterministic behaviour in the sparse package, I suppose, which converts the numpy arrays to sparse.COO objects in HexelExpressionSet.__init__... I really don't know how we could track that down, though, unless we can find a specific example of test data which causes the test to fail.

@caiw
Member

caiw commented Oct 25, 2024

Here is another example of a randomly failing test involving float comparisons but not involving sparse...

@caiw
Member

caiw commented Oct 25, 2024

One thing which might work would be to force (e.g.) numpy.float32 in all tests, in case it's some kind of heterogeneous architecture issue with the GitHub CI runners.
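A minimal sketch of what that dtype-forcing would look like in the test setup (variable names are illustrative, taken from the test quoted above, not a committed change):

```python
import numpy as np

# Cast all randomly generated test data to a fixed dtype, so every runner
# compares values with the same precision regardless of architecture.
data_left = [np.random.randn(5, 10).astype(np.float32) for _ in range(2)]
data_right = [np.random.randn(5, 10).astype(np.float32) for _ in range(2)]

assert all(block.dtype == np.float32 for block in data_left + data_right)
```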

@caiw caiw self-assigned this Oct 25, 2024
@neukym neukym added the 🪲 bug Something isn't working label Oct 31, 2024
@caiw caiw linked a pull request Nov 1, 2024 that will close this issue