Allow running the acceptance tests in pulumi/examples as part of the test suite #823

thomas11 · 2024-02-14T20:16:51Z

This change adds a new optional flag testPulumiExamples that can be set in .ci-mgmt.yaml. When set, the acceptance tests in pulumi/examples will be run as part of the test suite.

The goal is to increase test coverage in provider PRs or releases. pulumi/examples has a rich set of realistic programs that can be used for this via ProgramTest.

Providers don't necessarily need to gate PRs on these tests, they can be informational only.

This requires that the provider's test suite in examples/ is enabled for this. See pulumi/pulumi-azure#1717 for an example. It's not enough to just run the test suite in pulumi/examples because we want to use or inject the locally built provider and SDKs of the current PR.

I chose the approach using the test job's matrix because the new pulumi/examples tests should run concurrently to the regular ones, but to run them in a separate job would require copying the lengthy setup steps of the test job, or extracting the setup into a composite action which I thought could be avoided.

iwahbe

I'm not particularly good at GH Actions, so it would be really helpful to see a demo run of this change.

You can use the Makefile copy this change over by modifying the ci-mgmt target. Just change @master to the SHA of the commit you want to test.

It looks like we want to take this change on PRs (run-acceptance-tests) but not master, release or cron. Is that the plan?

provider-ci/internal/pkg/templates/bridged-provider/.github/workflows/run-acceptance-tests.yml

thomas11 · 2024-02-15T09:52:32Z

I'm not particularly good at GH Actions, so it would be really helpful to see a demo run of this change.

The p-azure PR I linked, pulumi/pulumi-azure#1717, is the demo! Specifically, you can see the new test job [test (nodejs, pulumiExamples)](https://github.com/pulumi/pulumi-azure/actions/runs/7913493113/job/21601801140?pr=1717) running the p/examples tests for the azure-classic examples.

It also demonstrates part of the value - some examples are failing because the examples are outdated. It could also go the other way round: a provider change breaks a previously working example.

It looks like we want to take this change on PRs (run-acceptance-tests) but not master, release or cron. Is that the plan?

I hope we can run these tests for each PR, but we don't need to decide just yet. Running them on releases could also be valuable. Either way, we won't make their success required just yet, as the p-azure example shows.

iwahbe · 2024-02-21T08:33:44Z

I'm not particularly good at GH Actions, so it would be really helpful to see a demo run of this change.

The p-azure PR I linked, pulumi/pulumi-azure#1717, is the demo! Specifically, you can see the new test job [test (nodejs, pulumiExamples)](https://github.com/pulumi/pulumi-azure/actions/runs/7913493113/job/21601801140?pr=1717) running the p/examples tests for the azure-classic examples.

Great. That looks good.

It also demonstrates part of the value - some examples are failing because the examples are outdated. It could also go the other way round: a provider change breaks a previously working example.

It looks like we want to take this change on PRs (run-acceptance-tests) but not master, release or cron. Is that the plan?

I hope we can run these tests for each PR, but we don't need to decide just yet. Running them on releases could also be valuable. Either way, we won't make their success required just yet, as the p-azure example shows.

I'm trying to get at the purpose of the new tests here. If we run them only in PRs (run-acceptance-tests), then they will be ignored. Engineers have no way to verify if they broke the tests or if tests were already broken on master.

I generally believe that 90% of the use of tests here is gating, and if tests are allowed to be broken then they loose that ability. I'd rather that they run on PRs and master, and that they do block PRs. If the tests are sufficiently flakey that it is unsafe to block on, I'm skeptical they provide enough signal to make them worth running.

t0yv0 · 2024-02-21T15:46:15Z

I'm not sure I agree, this sounds to me like an argument that says "we can't make the gating workflow ideal so we'll continue willfully shipping changes that break examples." Having examples evergreen (or almost) does not sound like something we should give up on already? If examples are evergreen, then a PR failing this check is an indication the PR has a problem, which is what we want.

t0yv0 · 2024-02-21T15:47:49Z

If we have to enforce.. I would argue it'd be much more customer-centric to open P1s for when examples are broken than to open P1s for when CI workflows are broken as is our current practice.

t0yv0 · 2024-02-21T15:51:48Z

Actually to take the step back, I'd like to ask if we can link Thomas' design doc and epic/unit of work here? I was under the impression that running these tests was what was agreed on in the multiple-round reviewed design doc, am I perhaps misremembering or there's some new information to reconsider?

thomas11 · 2024-02-22T15:48:36Z

Great points, @iwahbe and @t0yv0. The design doc is Test Coverage and Maintenance of Examples but it's Pulumi-internal.

We had indeed agreed to run these tests in order to increase test coverage, but there are still variables as to when and how we run them.

Ideally, as Ian said, we'd like to have an additional gate, at PR or release time, that prevents us from shipping regressions. Before adding the pulumi/examples tests in this way, however, we need to consider that this repo belongs to another team and gets new commits from all over Pulumi. If one of these is faulty - although theoretically that should be prevented by pulumi/example's own test suites - the provider gating its PRs on these examples is blocked from merging.

Some ways around this:

Fail the PR/release, and when investigation shows that p/examples is at fault, manually override the branch protection. However, not long ago we actually removed the manual overrides for increased safety.
Don't run the tests against the HEAD of pulumi/examples but a fixed version. That solves this problem but requires regular updates of the dependency or we'll test against old examples soon.
Don't require the pulumi/examples tests to pass, just have developers look at them before merging. As Ian pointed out, they might ignore the tests. It also doesn't compose with auto-merge.
- This option could be improved by adding a Slack message or other ping when the p/examples tests fail

t0yv0 · 2024-02-22T20:18:57Z

Don't require the pulumi/examples tests to pass, just have developers look at them before merging

I thought this was the agreement and I still support this option, that's the only viable option IMO.

iwahbe · 2024-02-23T15:46:53Z

@thomas11 Let's get the tests in as advisory. We can always increase the impact of the tests once they are in place, and I don't want this work to bitrot while we discuss.

…ests

thomas11 · 2024-02-26T14:24:21Z

@thomas11 Let's get the tests in as advisory. We can always increase the impact of the tests once they are in place, and I don't want this work to bitrot while we discuss.

Sounds good, thanks! I just rebased it removed the last TODO. I need you or @t0yv0 to stamp it, though.

iwahbe

LGTM

thomas11 requested review from guineveresaenger, iwahbe and a team February 14, 2024 20:16

thomas11 force-pushed the tkappler/test-pulumi-examples branch from 2c5be3a to c854704 Compare February 14, 2024 20:27

iwahbe reviewed Feb 14, 2024

View reviewed changes

provider-ci/internal/pkg/templates/bridged-provider/.github/workflows/run-acceptance-tests.yml Outdated Show resolved Hide resolved

mjeffryes assigned thomas11 Feb 16, 2024

thomas11 added 3 commits February 26, 2024 15:15

Allow running the acceptance tests in pulumi/examples as acceptance t…

735978e

…ests

Switch new flag to more legible testTarget={local,pulumiExamples}

c57131d

Switch to p/examples default branch now that #1583 is merged

86cf165

thomas11 force-pushed the tkappler/test-pulumi-examples branch from 9ad78fd to 86cf165 Compare February 26, 2024 14:20

iwahbe self-requested a review February 26, 2024 14:40

iwahbe approved these changes Feb 26, 2024

View reviewed changes

thomas11 merged commit 2665e75 into master Feb 26, 2024
3 checks passed

thomas11 deleted the tkappler/test-pulumi-examples branch February 26, 2024 15:28

danielrbradley mentioned this pull request Jul 12, 2024

Extract shared test workflows and action #1037

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow running the acceptance tests in pulumi/examples as part of the test suite #823

Allow running the acceptance tests in pulumi/examples as part of the test suite #823

thomas11 commented Feb 14, 2024 •

edited

Loading

iwahbe left a comment

thomas11 commented Feb 15, 2024

iwahbe commented Feb 21, 2024

t0yv0 commented Feb 21, 2024

t0yv0 commented Feb 21, 2024

t0yv0 commented Feb 21, 2024

thomas11 commented Feb 22, 2024 •

edited

Loading

t0yv0 commented Feb 22, 2024

iwahbe commented Feb 23, 2024

thomas11 commented Feb 26, 2024

iwahbe left a comment

Allow running the acceptance tests in pulumi/examples as part of the test suite #823

Allow running the acceptance tests in pulumi/examples as part of the test suite #823

Conversation

thomas11 commented Feb 14, 2024 • edited Loading

iwahbe left a comment

Choose a reason for hiding this comment

thomas11 commented Feb 15, 2024

iwahbe commented Feb 21, 2024

t0yv0 commented Feb 21, 2024

t0yv0 commented Feb 21, 2024

t0yv0 commented Feb 21, 2024

thomas11 commented Feb 22, 2024 • edited Loading

t0yv0 commented Feb 22, 2024

iwahbe commented Feb 23, 2024

thomas11 commented Feb 26, 2024

iwahbe left a comment

Choose a reason for hiding this comment

thomas11 commented Feb 14, 2024 •

edited

Loading

thomas11 commented Feb 22, 2024 •

edited

Loading