Splits invalid when collection order not deterministic #25

mbkroese · 2021-06-17T20:53:14Z

The big assumption underlying the two splitting algorithms is that the order of collected items is constant.
However, I've come across a case where this assumption was violated.
In my case I had a test parametrised with pytest.mark.parametrize, but the items to parametrize with would sometimes change order.

Take this example:

import pytest

@pytest.mark.parametrize('name', set(['henk', 'ingrid']))
def test_hello(name):
    pass

If you run this often enough you'll see that the order changes:

[2021-06-17 22:47:10] test_temp.py::test_hello[henk] PASSED [ 50%]
[2021-06-17 22:47:10] test_temp.py::test_hello[ingrid] PASSED [100%]

and

[2021-06-17 22:47:10] test_temp.py::test_hello[ingrid] PASSED [ 50%]
[2021-06-17 22:47:10] test_temp.py::test_hello[henk] PASSED [100%]
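A likely reason the order flips here: Python randomizes str hashes per interpreter process unless PYTHONHASHSEED is fixed, so iterating a set of strings can give a different order on each run. A minimal sketch of a workaround on the test side, assuming the parameter values are sortable (the helper name stable_params is made up for illustration):

```python
def stable_params(values):
    """Return the values in sorted order so every run collects them identically."""
    return sorted(values)


# The set's iteration order may vary between processes; the sorted list does not.
names = stable_params({"henk", "ingrid"})

# Hypothetical use in the test module from the example above:
#   @pytest.mark.parametrize("name", stable_params({"henk", "ingrid"}))
#   def test_hello(name):
#       pass
```

This only helps when the test author controls the parametrization; it does not make the plugin itself robust against unstable collection order.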

I'm not sure how to address this, but I think there are a few options:

1. Not splitting over different values of parametrize for the same test. In other words, make sure that a single group will run all tests for test_hello.
2. Try to create some deterministic order out of test cases by sorting. I'm not sure this will work in all cases though (for example, it might not work for objects).
3. Do the splitting on one machine, save the splits and just call pytest with those pre-calculated groups (so not really using this plugin as a plugin :p).
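To make the first option concrete, here is a rough sketch, not the plugin's actual code. It assumes node IDs look like the ones in the output above and that the parametrize suffix starts at the first "[". The function names base_nodeid and split_by_function are hypothetical, and groups are filled round-robin rather than by duration.

```python
from collections import defaultdict


def base_nodeid(nodeid):
    """Drop the parametrize suffix: 'a.py::test_x[henk]' -> 'a.py::test_x'."""
    return nodeid.split("[", 1)[0]


def split_by_function(nodeids, n_groups):
    """Assign whole test functions to groups, independent of collection order."""
    buckets = defaultdict(list)
    for nodeid in nodeids:
        buckets[base_nodeid(nodeid)].append(nodeid)

    groups = [[] for _ in range(n_groups)]
    # Sorting the function-level keys gives every machine the same assignment.
    for i, key in enumerate(sorted(buckets)):
        groups[i % n_groups].extend(sorted(buckets[key]))
    return groups


collected = [
    "test_temp.py::test_hello[ingrid]",
    "test_temp.py::test_other",
    "test_temp.py::test_hello[henk]",
]
groups = split_by_function(collected, 2)
```

Because the assignment depends only on the function-level names, reordering the parametrized cases does not move any test to a different group.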


jerry-git · 2021-06-20T16:49:05Z

I assume it's not deterministic here because of set, or can you repro it also with list or tuple?

mbkroese · 2021-06-20T17:12:24Z

No, this problem occurs when either the data structure has non-deterministic order or the code generating the parametrised test cases is for some reason not deterministic.

jerry-git · 2021-06-21T08:26:39Z

I think we could go with 1. aka make sure the tests inside same parametrize are run in the same group. However, the downside is ofc that if one parametrised test is very time consuming vs the rest of the suite, the splits would not be great.

OTOH, maybe it's better to make sure that we don't accidentally skip tests (or run some test in multiple groups) 🤔

With these thoughts, I'd go with 1. 🙂
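The risk of skipped or duplicated tests can be shown with a toy example. This is a deliberately naive positional split, assumed for illustration only and not the plugin's algorithm; positional_group is a hypothetical helper.

```python
def positional_group(items, n_groups, group_index):
    """Naive split: cut the collected list into contiguous slices by position."""
    size = -(-len(items) // n_groups)  # ceiling division
    return items[group_index * size:(group_index + 1) * size]


# Two CI machines collect the same tests in a different order.
order_on_machine_1 = ["test_hello[henk]", "test_hello[ingrid]"]
order_on_machine_2 = ["test_hello[ingrid]", "test_hello[henk]"]

# Machine 1 runs group 0, machine 2 runs group 1.
ran = (
    positional_group(order_on_machine_1, 2, 0)
    + positional_group(order_on_machine_2, 2, 1)
)
never_ran = set(order_on_machine_1) - set(ran)
```

Here test_hello[henk] runs on both machines and test_hello[ingrid] runs on neither, even though each machine did exactly what it was asked to do.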

mbkroese · 2021-06-22T19:53:02Z

> downside is ofc that if one parametrised test is very time consuming vs the rest of the suite, the splits would not be great.

Yes, and I wonder if we should perhaps be safe by default (i.e. option 1) and allow users to opt into the unsafe thing (the existing behaviour). If we make the tradeoffs clear, users can decide for themselves. In other words, add a parameter that defaults to --split-level=func but can be set to --split-level=parametrized or --split-level=file?
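A sketch of how such an option might map onto grouping keys. --split-level is only the proposal from this comment, not an existing option of the plugin. The sketch uses plain argparse instead of pytest's option hooks to stay self-contained, and split_key is a hypothetical helper.

```python
import argparse

SPLIT_LEVELS = ("func", "parametrized", "file")


def build_parser():
    """Stand-in parser for the proposed option; 'func' is the safe default."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--split-level", choices=SPLIT_LEVELS, default="func")
    return parser


def split_key(nodeid, level):
    """Return the unit that must stay together in one group at the given level."""
    if level == "parametrized":
        return nodeid  # every parametrized case may land in its own group
    if level == "func":
        return nodeid.split("[", 1)[0]  # keep all cases of one test together
    if level == "file":
        return nodeid.split("::", 1)[0]  # keep a whole test file together
    raise ValueError(f"unknown split level: {level}")


args = build_parser().parse_args(["--split-level", "file"])
key = split_key("test_temp.py::test_hello[henk]", args.split_level)
```

Coarser levels are more robust against unstable ordering but give the splitter fewer, larger units to balance, which is the tradeoff raised earlier in the thread.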

dpanici · 2024-08-08T23:57:06Z

Has this been implemented yet? If not, I think it would be a great feature to have, as I have some tests under the same parametrize decorator that I want run in the same group.

mbkroese mentioned this issue Jun 24, 2021

Improve least_duration algorithm by sorting durations #28

Merged
