Difficulty running any test built on dune's `diff` action

I am trying to run my codebase's tests (Same project as #28),
which include a lot of things like

```
(rule
 (alias runtest)
 (action
  (diff pretty.expected pretty.output)))
```

Where the test produces some pretty.output, and pretty.expected is checked in to the repo 
(so-called "gold" tests)

The difficulty is one run of these tests produces all the `.output` files in the _build folder. This means
that if the first mutation causes a test failure, all the later ones will as well, because they're just comparing
to the same test output.

This is definitely related to #16 #18, but it seems different enough I wanted to open my own issue for it.

I've got my currently-not-working branch here: https://github.com/stan-dev/stanc3/compare/master...WardBrian:stanc3:mutaml?expand=1

The way this currently proceeds if you run `make mutation-test` is that the first few runs timeout, while they're creating output files, and then the rest all fail because those output files exist and will not compare equal to the `.expect` files

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Difficulty running any test built on dune's `diff` action #31

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Difficulty running any test built on dune's diff action #31

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

Difficulty running any test built on dune's `diff` action #31