I am trying to run my codebase's tests (Same project as #28),
which include a lot of things like
(rule
(alias runtest)
(action
(diff pretty.expected pretty.output)))
Where the test produces some pretty.output, and pretty.expected is checked in to the repo
(so-called "gold" tests)
The difficulty is one run of these tests produces all the .output files in the _build folder. This means
that if the first mutation causes a test failure, all the later ones will as well, because they're just comparing
to the same test output.
This is definitely related to #16 #18, but it seems different enough I wanted to open my own issue for it.
I've got my currently-not-working branch here: https://github.com/stan-dev/stanc3/compare/master...WardBrian:stanc3:mutaml?expand=1
The way this currently proceeds if you run make mutation-test is that the first few runs timeout, while they're creating output files, and then the rest all fail because those output files exist and will not compare equal to the .expect files
I am trying to run my codebase's tests (Same project as #28),
which include a lot of things like
Where the test produces some pretty.output, and pretty.expected is checked in to the repo
(so-called "gold" tests)
The difficulty is one run of these tests produces all the
.outputfiles in the _build folder. This meansthat if the first mutation causes a test failure, all the later ones will as well, because they're just comparing
to the same test output.
This is definitely related to #16 #18, but it seems different enough I wanted to open my own issue for it.
I've got my currently-not-working branch here: https://github.com/stan-dev/stanc3/compare/master...WardBrian:stanc3:mutaml?expand=1
The way this currently proceeds if you run
make mutation-testis that the first few runs timeout, while they're creating output files, and then the rest all fail because those output files exist and will not compare equal to the.expectfiles