Add splitting algorithm #16

mbkroese · 2021-06-11T07:04:40Z

This makes two updates:

adds an argument to use a different splitting algorithm
we print the estimated duration of each group.
adds algorithm used to split tests

I'll share some anecdotal information about how the new algorithm performs vs the old.

mbkroese · 2021-06-11T07:35:05Z

For my testing suite:

1790 tests, 3 splits

new:
947.01s, 948.57s, 948.51s

old:
961s, 955s, 914s

mbkroese · 2021-06-11T07:35:52Z

So the new grouping is more balanced, although the difference is not massive.

mbkroese · 2021-06-11T07:36:33Z

@jerry-git @sondrelg please review

mbkroese · 2021-06-11T07:52:27Z

Links back to this discussion: #14 (comment)

sondrelg

This makes a lot of sense to me @mbkroese 👍 Never seen heapq before - looks pretty neat 🙂

tests/test_plugin.py

jerry-git · 2021-06-14T10:10:33Z

README.md

+* Absolute Order: whether each group contains all tests between first and last element in the same order as the original list of tests
+* Relative Order: whether each test in each group has the same relative order to its neighbours in the group as in the original list of tests
+
+The `duration_based_chunks` algorithm aims to find optimal boundaries for the list of tests and every test group contains all tests between the start and end bounary.


I think it could be valuable to mention also in the usage section that one can specify the splitting algorithm via command line arg and also mention what is the default behaviour

Added a small note about it.

jerry-git · 2021-06-14T10:11:30Z

src/pytest_split/algorithms.py

+    from _pytest import nodes
+
+
+ALGORITHMS = ["duration_based_chunks", "least_duration"]


I think enum could make more sense here

Changed to enum. Please let me know if I made the change you intended.

src/pytest_split/algorithms.py

jerry-git · 2021-06-14T10:29:26Z

tests/test_algorithms.py

+
+class TestAlgorithms:
+    @pytest.mark.parametrize("algo_name", algorithms.ALGORITHMS)
+    def test__split_test(self, algo_name):


Let's keep consistency, other test modules seem to use single _ after test 🙂

It is consistent in the sense that I use the format: test_{func}_{does_something}_{when}.
And in this case the func is called _split_test.

Please let me know if you still prefer the change.

tests/test_plugin.py

mbkroese · 2021-06-14T17:37:23Z

@jerry-git I don't understand why the CI fails. It seems like the module is not actually installed?

jerry-git · 2021-06-16T07:34:32Z

Hmm, it's interesting that it works for pull_request_target but not for pull_request 🤔 @sondrelg Any knowledge on this?

sondrelg · 2021-06-16T07:40:37Z

I had the same problem in the Poetry setup PR but I can't remember how I fixed it. I wonder if you should just remove the pull_request_target trigger in the test workflow - as long as it passes in this repo, it must work right? 😛

jerry-git · 2021-06-16T08:21:41Z

Yeah I was considering removing pull_request_target but in this case it's the pull_request which is failing, not pull_request_target

sondrelg · 2021-06-16T08:47:18Z

Which ever it is, the tests running in this repo seem to pass and the tests running in the fork fail, right

I can't think of a reason why this would happen, but there might be a subtle difference between poetry install --no-interaction and pip install -e . that's relevant 🤔

jerry-git · 2021-06-16T12:36:16Z

@mbkroese please rebase from master, I think I got it fixed in #21

The new splitting algorithm goes over the items, determines a best estimate for the duration and then adds it to the smallest group. The are edge-cases that this algorithm can't handle properly. For example, when the last test item has a large test duration, the current solution won't create the optimal solution. This can be improved by sorting the test durations first, and starting with assignment of tests that have the largest test duration.

The old algorithm was restored, and the new one was added as well. The user can now choose which algorithm is most suitable for their use case.

mbkroese · 2021-06-16T18:31:47Z

thanks @jerry-git , the tests succeed now.

sondrelg reviewed Jun 11, 2021

View reviewed changes

jerry-git self-requested a review June 11, 2021 10:39

jerry-git reviewed Jun 11, 2021

View reviewed changes

tests/test_plugin.py Outdated Show resolved Hide resolved

mbkroese changed the title ~~Mkroese/update splitting algorithm~~ Add splitting algorithm Jun 12, 2021

jerry-git reviewed Jun 14, 2021

View reviewed changes

src/pytest_split/algorithms.py Outdated Show resolved Hide resolved

jerry-git reviewed Jun 14, 2021

View reviewed changes

src/pytest_split/algorithms.py Outdated Show resolved Hide resolved

jerry-git reviewed Jun 14, 2021

View reviewed changes

src/pytest_split/algorithms.py Outdated Show resolved Hide resolved

jerry-git reviewed Jun 14, 2021

View reviewed changes

tests/test_plugin.py Outdated Show resolved Hide resolved

mbk added 9 commits June 16, 2021 20:30

Removed unused messages attribute

9afe747

More precise typing information

d4eaf55

Printing estimated duration

5d1ffd8

Removed duplicate test

aa09d63

Added user option to specify splitting algorithm

456811a

The old algorithm was restored, and the new one was added as well. The user can now choose which algorithm is most suitable for their use case.

Add printing name of splitting algorithm

3a5f273

Use enum to list available algorithms

e829189

Fix linting violations

1c7a76b

Added a section about the newly added CLI option in the README

fd563d8

jerry-git merged commit 627c6ee into jerry-git:master Jun 17, 2021

jerry-git mentioned this pull request Dec 24, 2021

[Actions] Auto-Update cookiecutter template #42

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add splitting algorithm #16

Add splitting algorithm #16

mbkroese commented Jun 11, 2021 •

edited

Loading

mbkroese commented Jun 11, 2021

mbkroese commented Jun 11, 2021

mbkroese commented Jun 11, 2021

mbkroese commented Jun 11, 2021

sondrelg left a comment

jerry-git Jun 14, 2021

mbkroese Jun 14, 2021

jerry-git Jun 14, 2021

mbkroese Jun 14, 2021

jerry-git Jun 14, 2021

mbkroese Jun 14, 2021

mbkroese commented Jun 14, 2021

jerry-git commented Jun 16, 2021

sondrelg commented Jun 16, 2021

jerry-git commented Jun 16, 2021

sondrelg commented Jun 16, 2021

jerry-git commented Jun 16, 2021

mbkroese commented Jun 16, 2021

		from _pytest import nodes


		ALGORITHMS = ["duration_based_chunks", "least_duration"]

Add splitting algorithm #16

Add splitting algorithm #16

Conversation

mbkroese commented Jun 11, 2021 • edited Loading

mbkroese commented Jun 11, 2021

mbkroese commented Jun 11, 2021

mbkroese commented Jun 11, 2021

mbkroese commented Jun 11, 2021

sondrelg left a comment

Choose a reason for hiding this comment

jerry-git Jun 14, 2021

Choose a reason for hiding this comment

mbkroese Jun 14, 2021

Choose a reason for hiding this comment

jerry-git Jun 14, 2021

Choose a reason for hiding this comment

mbkroese Jun 14, 2021

Choose a reason for hiding this comment

jerry-git Jun 14, 2021

Choose a reason for hiding this comment

mbkroese Jun 14, 2021

Choose a reason for hiding this comment

mbkroese commented Jun 14, 2021

jerry-git commented Jun 16, 2021

sondrelg commented Jun 16, 2021

jerry-git commented Jun 16, 2021

sondrelg commented Jun 16, 2021

jerry-git commented Jun 16, 2021

mbkroese commented Jun 16, 2021

mbkroese commented Jun 11, 2021 •

edited

Loading