H4 fresh-agent workflow — 2026-06-10
Status: H4 bounded workflow passed · Audience: maintainers and agents
Claim
A fresh agent completed one bounded mcp_flutter workflow using declared repository surfaces before raw exploration: prove the first Skill Steward dogfood scenario status. This is an H4 proof for the bounded contract-status workflow, not a claim that all product or runtime workflows are H4.
Task
Prove the first Skill Steward dogfood scenario status for mcp_flutter using declared repository surfaces before raw exploration.
Repo state
git status
# On branch codex/steward-adoption-h2
# nothing to commit, working tree clean
git status --short --branch
# ## codex/steward-adoption-h2
Declared surfaces used first
AGENTS.mdGovernance & Skill Steward section.makefile/make check-contractsnative gate.- At proof time,
tool/steward/run.shwas the temporary Skill Steward bridge named inAGENTS.md; current reusable guidance uses the releasedstewardexecutable.
Transcript
bash tool/steward/run.sh doctor --json
# Passed after selecting a compatible Steward command surface.
make check-contracts
# Passed.
# Existing warnings only: skill metadata warnings for missing license and references/sources.md.
bash tool/steward/run.sh doctor --json
# Passed: config valid, no diagnostics, one quick-eligible action.
bash tool/steward/run.sh actions list --json
# Passed: listed fmt.check.tool-prefix.
bash tool/steward/run.sh action inspect fmt.check.tool-prefix --json
# Passed: bounded_local, auto, no writes, no git, no network, no secrets, no destructive effects.
bash tool/steward/run.sh probe --json --profile quick
# Passed: executed fmt.check.tool-prefix, exit_code 0.
bash tool/steward/run.sh benchmark --scenario mcp_flutter.web-dogfood-warm --strict --json
# Passed: result=pass, proof.status=ready, durability.status=ready.
Outcome
- Fresh-agent workflow: passed.
- Fallback needed at proof time: yes. The global
stewardbinary was present but stale for the declared contract workflow, so the agent used the then-documented temporary wrapper and local Skill Steward override. - Files edited by fresh agent: none.
- Outputs persisted by fresh agent: none.
- Review artifact added later:
docs/evidence/generated/mcp_flutter.web-dogfood-warm.strict.redacted.json. - Current adoption note: the temporary wrapper was not promoted. New adopters should install and run
stewarddirectly.
Remaining gaps and non-claims
- This proves one bounded contract-status workflow, not the WebMCP runtime dogfood path.
- The scenario
source.commitnames the pre-proof subject commit. The benchmark treats a local HEAD mismatch as a warning rather than remote equivalence proof. - Existing skill metadata/source warnings remain in
make check-contracts; they do not fail the native gate. - The original raw sibling-checkout command and temporary wrapper are local provenance only and are no longer reusable adopter guidance.
- H5 remains unclaimed because no repeated evidence has promoted a reusable diagnostic, action, eval, or harness capability with owner, risk, redaction, and validation.
