Use a different set of commands to detect fourslash test updates #1923

jakebailey · 2025-10-22T16:58:56Z

This is the weird set of commands I have been using locally to keep things up to date while working around the problem of "failing tests" leaving baselines behind. It seems like updatefailing should do something like this, but I think it might be too annoying.

(Sending first without the baseline update to show that it works.)

Copilot

Pull Request Overview

This PR updates the CI workflow to use a more comprehensive process for detecting and updating fourslash test changes. The new approach addresses an issue where failing tests leave behind baselines, which the existing updatefailing command doesn't properly handle.

Replaces single updatefailing command with a multi-step process
Clears failing test list, regenerates fourslash tests, and iteratively updates baselines
Ensures all test state is properly synchronized by removing and regenerating fourslash baselines

.github/workflows/ci.yml

Copilot · 2025-10-22T16:59:33Z

.github/workflows/ci.yml

+        run: |
+          echo "" > ./internal/fourslash/_scripts/failingTests.txt
+          npm run convertfourslash
+          npx hereby test
+          npx hereby baseline-accept
+          npm run updatefailing
+          npx hereby baseline-accept
+          rm -rf testdata/baselines/reference/fourslash
+          npx hereby test


This complex multi-step process lacks inline comments explaining why each step is necessary. Add comments documenting the purpose of each phase (e.g., why tests are run multiple times, why fourslash baselines are removed and regenerated) to help maintainers understand this workaround.

Suggested change

run: |

echo "" > ./internal/fourslash/_scripts/failingTests.txt

npm run convertfourslash

npx hereby test

npx hereby baseline-accept

npm run updatefailing

npx hereby baseline-accept

rm -rf testdata/baselines/reference/fourslash

npx hereby test

run: |

# Clear the list of failing fourslash tests to start fresh.

echo "" > ./internal/fourslash/_scripts/failingTests.txt

# Convert fourslash test sources to the latest format.

npm run convertfourslash

# Run all tests to generate new baselines.

npx hereby test

# Accept all generated baselines to update reference outputs.

npx hereby baseline-accept

# Update the list of failing tests after baseline update.

npm run updatefailing

# Accept any new baselines generated by updatefailing.

npx hereby baseline-accept

# Remove existing fourslash baselines to force full regeneration.

rm -rf testdata/baselines/reference/fourslash

# Run tests again to regenerate all fourslash baselines from scratch.

npx hereby test

# Accept the freshly regenerated baselines.

Use a different set of commands to detect fourslash test updates

60ae2d2

jakebailey requested review from Copilot and gabritto October 22, 2025 16:58

Copilot AI reviewed Oct 22, 2025

View reviewed changes

or true

3774646

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Use a different set of commands to detect fourslash test updates #1923

Use a different set of commands to detect fourslash test updates #1923

Uh oh!

jakebailey commented Oct 22, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI Oct 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Use a different set of commands to detect fourslash test updates #1923

Are you sure you want to change the base?

Use a different set of commands to detect fourslash test updates #1923

Uh oh!

Conversation

jakebailey commented Oct 22, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Copilot AI Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants