Add mutation testing as part of ci pipeline. by bernata · Pull Request #39 · codeheadsystems/pk-auth

bernata · 2026-06-07T19:02:20Z

Or run manually: npx stryker run
Sample report:

An example of a specific mutant of product code that survived all tests of ceremonies [red is original code, green is what got replaced; i.e. mutant, but all the tests passed]

Or run manually: npx stryker run

wolpert · 2026-06-08T14:14:16Z

I like the idea, just wondering how to operationalize it. If it runs silently and I have to manually look for the report, it'll eventually be ignored.

Thoughts?

@bernata

StrykerJS mutation testing — added by Andrew Bernat (@bernata) in PR codeheadsystems#39 (codeheadsystems#39) — reported a 65.1% mutation score for the browser SDK and pinpointed real test gaps that plain line coverage hid: code that ran but had no assertion pinning its behaviour, so a mutated version still passed. This adds the missing assertions. Why this matters: these guards, header rules, and error-mapping branches are the SDK's public contract with the server. A test that exercises them without asserting on them gives false confidence — exactly what mutation testing flags. Credit to Andrew for wiring up the tooling that surfaced these. Gaps closed (each new case kills a previously-surviving mutant): - results.ts: had ZERO coverage (0% score). New results.test.ts pins both branches of isCeremonySuccess / isCeremonyFailure and the success narrowing. - http.ts: assert the `accept` header is always sent; content-type/body are omitted on body-less requests; JSON `null`/primitive bodies leave PkAuthHttpError.data undefined (the && vs || object check); and an absent getToken yields the friendly validation error, not a raw TypeError. - refresh.ts: assert the negative branch of both type guards, the default /auth/refresh path, and that a non-JSON 401 body maps to "unknown" instead of throwing (the e.data?.detail optional-chaining branch). - admin.ts: cover the two previously-untested methods (completeEmailVerification, startPhoneVerification) and add a matrix that asserts every admin call carries the Bearer token, so the `authenticated` flag can no longer be mutated to false undetected. Note: a handful of remaining base64url survivors are equivalent mutants (out-of-bounds typed-array writes are ignored, atob is lenient, the padding regex anchor is moot for btoa output) and are intentionally left as-is. All 67 browser-SDK tests pass; tsc --noEmit clean. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

@bernata

The updated StrykerJS report (PR #39, @bernata) lifted the SDK from 65.1% to 79.3% after the first hardening pass, leaving ceremonies.ts at 43.2% as the dominant gap: 36 of its mutants were no-coverage. The existing suite drove only the register() happy path, so authenticate(), the credential create/get helpers, the cancellation branches, and the navigator.credentials guard were never executed. These flows ARE unit-testable — CeremonyOptions.credentials injects a fake CredentialsContainer — so this drives them end to end: - authenticate(): start -> get -> finish, token returned, POST verbs, and the start-body username ?? null default; - conditional mediation set only when conditional=true (and absent otherwise); - register() start/finish body contract: displayName ?? username, label ?? null, challenge null, and both steps POSTed (kills the surviving "POST"/default and logical-operator mutants); - create/get cancellation: a null credential rejects with the cancelled error; - the navigator.credentials guard throws its clear message (jsdom has no navigator.credentials), killing the guard's conditional/string mutants. Also adds a base64url DataView/subarray case that kills the toUint8Array ArrayBufferView dispatch mutant. The few remaining base64url survivors are equivalent mutants (lenient atob, ignored out-of-bounds writes, the padding regex anchor) and are left as-is, as are the navigator typeof-flips that only differ in a non-browser environment. All 77 browser-SDK tests pass; tsc --noEmit clean. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

bernata · 2026-06-08T15:52:09Z

I like the idea, just wondering how to operationalize it. If it runs silently and I have to manually look for the report, it'll eventually be ignored.

Thoughts?

Yes. This is a problem [i.e. making it operationally useful]. Here is what I'd like to do:

Merge this PR
Have you sign up for a free for open source stryker account
You would get a stryker token and set it as a repository secret called STRYKER_DASHBOARD_TOKEN
I would modify the github action workflow to use this token which should then make the report available through the stryker dashboard; and i think we can create a link to get to the dashboard [i think] -- i haven't actually used this one before.

I think the same problem exists for code coverage [i.e. jacoco report is some artifact uploaded on the workflow run]. I'd like to do the same thing with SonarQube free for open source to create saved coverage reports tracked over time and a dash of sast.

After that, I think i can write some test code.

Let me know what you think of this plan and we can sort out next steps.

Andrew Bernat and others added 2 commits June 7, 2026 12:01

Add mutation testing as part of ci pipeline.

527276d

Or run manually: npx stryker run

Merge branch 'main' into mutation-testing

c396d66

wolpert mentioned this pull request Jun 8, 2026

test(browser): close mutation-testing coverage gaps in passkeys SDK #40

Merged

Merge branch 'main' into mutation-testing

911479d

Merge branch 'main' into mutation-testing

7426bff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add mutation testing as part of ci pipeline.#39

Add mutation testing as part of ci pipeline.#39
bernata wants to merge 4 commits into
codeheadsystems:mainfrom
bernata:mutation-testing

bernata commented Jun 7, 2026 •

edited

Loading

Uh oh!

wolpert commented Jun 8, 2026

Uh oh!

bernata commented Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bernata commented Jun 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wolpert commented Jun 8, 2026

Uh oh!

bernata commented Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bernata commented Jun 7, 2026 •

edited

Loading