Add E2E test pipeline for azure.ai.agents extension (Tier 0/1) by v1212 · Pull Request #8607 · Azure/azure-dev

v1212 · 2026-06-11T08:49:01Z

Status: DRAFT - pending secrets setup and --allow-tool syntax confirmation

Summary

Adds a Copilot CLI-driven E2E test pipeline for the azure.ai.agents extension. Uses the official copilot -p programmatic mode (per GitHub docs).

Architecture (identical to local testing)

Copilot CLI (npm install -g @github/copilot)
  | MCP protocol (stdio, config: ~/.copilot/mcp-config.json)
cli-interactive-tester (MCP server)
  | tmux sessions
azd ai agent CLI (under test)

Workflow

Checkout (ref: trangevi/test-scenarios)
Setup: Node.js, Go, Python 3.12
Install Copilot CLI (npm install -g @github/copilot)
Build azd + install extension
Install cli-interactive-tester (git clone + pip install -e)
Azure Login + gh auth (Tier 1/2 only)
Create profile.local.yaml from secrets
Configure MCP (write ~/.copilot/mcp-config.json)
Run: copilot -p "" --allow-tool='cli-interactive-tester(*)' --no-ask-user
Upload results as artifacts
Always-run teardown for Tier 2

Trigger

workflow_dispatch only (tier selector: 0 / 0+1 / 0+1+2)
No PR trigger until pipeline is validated

Items to confirm

--allow-tool syntax for MCP-registered tools (is it cli-interactive-tester(*)?)
COPILOT_PAT - who creates the Fine-grained PAT with "Copilot Requests" permission?
cli-interactive-tester repo - public or private? If private, need authenticated clone
prompt-ci-run.md - needs to be created in scenarios directory (CI-adapted prompt)

Secrets needed

COPILOT_PAT (Fine-grained PAT with "Copilot Requests" permission)
AZURE_CLIENT_ID / AZURE_TENANT_ID / AZURE_SUBSCRIPTION_ID
FOUNDRY_PROJECT_ENDPOINT
GH_TOKEN (for gh auth + manifest-url scenario)

Validation

All 44 scenarios validated PASS in manual Copilot-driven local run.

Copilot

Pull request overview

Adds a new GitHub Actions workflow to run end-to-end (E2E) scenario-based checks for the azure.ai.agents azd extension on PRs and via manual dispatch, intended to cover Tier 0 (offline) and Tier 1 (Azure-authenticated, no provision) scenarios, with a commented Tier 2 placeholder.

Changes:

Introduces .github/workflows/e2e-ext-azure-ai-agents.yml with Tier 0 and Tier 1 jobs that build azd, install the extension, and execute scenario YAMLs.
Adds workflow_dispatch input scaffolding for a future Tier 2 live-Azure run (currently commented out).

Copilot CLI-driven pipeline using cli-interactive-tester MCP tool. Same architecture as local testing — Copilot reads scenario goals and drives terminal via MCP protocol. Implementation: - Copilot CLI installed via npm install -g @github/copilot - Auth via COPILOT_GITHUB_TOKEN (Fine-grained PAT, Copilot Requests perm) - MCP config in ~/.copilot/mcp-config.json (auto-loaded by Copilot) - Execution: copilot -p prompt --allow-tool=... --no-ask-user - workflow_dispatch with tier selector (0 / 0+1 / 0+1+2) - ubuntu-22.04 runner - Checkout: trangevi/test-scenarios (until PR Azure#8524 merges) - Tier 2 has always-run teardown for Azure resource cleanup - Results uploaded as artifacts TODO: - Confirm --allow-tool syntax for MCP-registered tools - Configure COPILOT_PAT secret (Fine-grained PAT) - Confirm cli-interactive-tester repo visibility - Create prompt-ci-run.md in scenarios directory Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Add setup-node + npm install -g @github/copilot - Use COPILOT_GITHUB_TOKEN env var with COPILOT_PAT secret - MCP config at ~/.copilot/mcp-config.json (auto-loaded) - Use copilot -p with --allow-tool and --no-ask-user - Add conditional on Azure Login (skip for Tier 0) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Use printf instead of heredoc to avoid leading whitespace in profile.local.yaml and mcp-config.json - Fix concurrency group to prevent parallel runs (was using run_id) - Add existence check for prompt-ci-run.md before Copilot invocation Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Create CI-adapted prompt file for Copilot CLI execution - Pass TIER input as env var so Copilot knows which phases to run - Prompt includes profile loading, scenario ordering, and output format Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

cwd may not be supported by Copilot CLI's MCP config schema. Use bash wrapper to cd before launching the MCP server. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

glharper · 2026-06-11T15:05:26Z

Review findings

This is a draft targeting wujia/e2e-pipeline-staging, and several items are already flagged by the author (--allow-tool syntax, COPILOT_PAT, repo visibility, prompt-ci-run.md). The notes below are beyond those.

🔴 Concrete bug — `azure/login` will fail without `id-token: write`

The Azure Login step uses OIDC federated credentials (client-id/tenant-id/subscription-id, no client-secret). OIDC login requires permissions: id-token: write, but the workflow only declares contents: read. As written, azure/login@v2 cannot fetch the OIDC token and Tier 1/2 runs will fail at the login step. Add id-token: write.

🟠 Convention — authenticated workloads should prefer ADO pipelines

Per cli/azd/AGENTS.md: "Prefer Azure DevOps pipelines for jobs that need secrets or Azure credentials — the team uses internal ADO pipelines for authenticated workloads in this public repo." No other workflow in .github/workflows/ currently uses azure/login — this PR would be the first. The Tier 1/2 path (Azure creds + COPILOT_PAT + GH_TOKEN + FOUNDRY_PROJECT_ENDPOINT) is exactly the kind of authenticated workload the team routes through ADO. Worth confirming with maintainers before this leaves staging. (Tier 0 is offline/no-auth and is a good fit for GitHub Actions.)

🟠 Tier 2 teardown can silently leak Azure resources

cd ~/working/azd-agents-shared/*/ 2>/dev/null && azd down --force --purge || true

If the glob matches multiple dirs, cd gets multiple args and fails.
If cd fails (path missing, e.g. setup aborted early), && short-circuits and || true keeps the job green — so no teardown runs and resources keep costing money with no signal. Given the explicit "~$2-5" cost note, consider failing loudly (or emitting a ::warning::) when the expected directory isn't found, and quoting/guarding the glob.

🟡 Minor / hardening

Secret interpolation in run: scripts. "Create test profile" inlines ${{ secrets.AZURE_SUBSCRIPTION_ID }} / FOUNDRY_PROJECT_ENDPOINT directly into the shell. Prefer passing them via env: (as the Copilot step correctly does for the token) to avoid shell-injection / log-exposure edge cases.
actions/setup-node@v4 has no node-version — pin one for reproducibility.
Hardcoded ref: trangevi/test-scenarios (already TODO'd) — must not reach main; it ties the workflow to an unmerged branch.
PATH across steps — re-exporting cli/azd in each step works, but echo "$DIR" >> $GITHUB_PATH once (per AGENTS.md guidance) is cleaner.
Trailing blank lines at EOF of the workflow file.
COPILOT_GITHUB_TOKEN env name — verify the Copilot CLI actually reads this (on the author's confirm list).

✅ Good

Least-privilege permissions block present (just needs id-token).
workflow_dispatch-only with tier gating and a Tier 2 cost-confirmation input — appropriately cautious.
concurrency with cancel-in-progress: false avoids clobbering in-flight Azure resource runs.
Tier-gated steps (if: contains(inputs.tier, ...)) are consistent and correct.
prompt-ci-run.md is clear; if: always() artifact upload + teardown are sensible.

Bottom line: Right architecture and appropriately gated, but not mergeable beyond staging yet: fix the id-token: write permission (hard blocker for Tier 1/2), resolve the ADO-vs-Actions convention question for the authenticated tiers, and harden the teardown. The Tier-0-only path is close to ready.

Required for azure/login OIDC federated credentials (Tier 1/2). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Remove id-token:write (not needed for Tier 0) - Comment out Tier 1/2 steps with TODO explaining ADO vs GHA decision - Fix Azure#3: teardown glob guard with warning (commented out) - Fix Azure#4: secrets via env: not inline shell (commented out) - Fix Azure#5: pin node-version: 20 - Fix Azure#7: use GITHUB_PATH instead of per-step export Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

All tiers (0/1/2) are now active — if team decides GHA is not appropriate for authenticated workloads, PR can be abandoned. Fixes from Glen's review: - id-token: write restored (required for azure/login OIDC) - Teardown: glob guard with ::warning:: instead of silent || true - Secrets: passed via env: instead of inline shell interpolation - setup-node: pin node-version 20 - PATH: use GITHUB_PATH once instead of per-step export - Create test profile: add if: condition (skip for Tier 0) - Trailing whitespace cleaned Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

v1212 · 2026-06-12T03:59:06Z

Thanks @glharper, great review! Addressed all items in the latest commit (d08b5c9). Here's the breakdown:

🔴 id-token: write — Fixed. Added to permissions block.

🟠 ADO convention — Totally valid point. I'm aware of the convention in AGENTS.md. This PR's goal is to validate whether this pattern (Copilot driving cli-interactive-tester to exercise azd interactively) works end-to-end in GitHub CI. If the team decides authenticated tiers should move to ADO, I'm fully supportive of that. Happy to discuss in our next sync.

🟠 Teardown glob — Fixed. Now uses explicit guard with ::warning:: annotation instead of silent || true.

🟡 Secret interpolation — Fixed. Secrets now passed via env: block, referenced as ${VAR} in shell.

🟡 Node version — Pinned to node-version: '20'.

🟡 PATH — Switched to >> $GITHUB_PATH in the build step, removed per-step exports.

🟡 Hardcoded ref — Keeping TODO as-is, will update once #8524 merges.

Re: overall direction — This is still a draft targeting a staging branch for all tests with the MCP tool and copilot driven framework. If later it is determined to move tier 1/2 tests to ADO, I would propose to abandon this PR but implement same pipeline in ADO for tier 0/1/2, leaving only tier 0 in GitHub action seems not helpful enough.

- Fail fast if Tier 2 selected without confirm_tier2_cost=true - Clarify that Azure/GitHub auth is only active for Tier 1/2 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

v1212 · 2026-06-12T08:44:03Z

https://github.com/v1212/azure-dev/actions/runs/27403167828 validated the pipeline in fork repo, it proved the testing framework working with copilot, mcp tool, and test cases. @glharper

microsoft-github-policy-service · 2026-06-19T12:37:08Z

Hi @@v1212. Thank you for your interest in helping to improve the Azure Developer CLI experience and for your contribution. We've noticed that there hasn't been recent engagement on this pull request. If this is still an active work stream, please let us know by pushing some changes or leaving a comment. Otherwise, we'll close this out in 7 days.

v1212 requested review from RickWinter, danieljurek, tg-msft and vhvb1989 as code owners June 11, 2026 08:49

Copilot AI review requested due to automatic review settings June 11, 2026 08:49

Copilot started reviewing on behalf of v1212 June 11, 2026 08:49 View session

microsoft-github-policy-service Bot assigned v1212 Jun 11, 2026

v1212 marked this pull request as draft June 11, 2026 08:51

v1212 changed the base branch from main to wujia/e2e-pipeline-staging June 11, 2026 08:52

Copilot AI reviewed Jun 11, 2026

View reviewed changes

v1212 force-pushed the wujia/ext-agents-e2e-pipeline branch 7 times, most recently from 68aab86 to 03fe499 Compare June 11, 2026 09:48

v1212 force-pushed the wujia/ext-agents-e2e-pipeline branch from 03fe499 to 5d996f5 Compare June 11, 2026 10:28

Jian Wu and others added 4 commits June 11, 2026 18:34

fix: remove cwd from MCP config, use bash -c cd instead

cd00189

cwd may not be supported by Copilot CLI's MCP config schema. Use bash wrapper to cd before launching the MCP server. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

github-actions Bot added the ext-agents azure.ai.agents extension label Jun 11, 2026

Jian Wu and others added 3 commits June 12, 2026 11:01

fix: add id-token:write permission, clean trailing lines

15ec239

Required for azure/login OIDC federated credentials (Tier 1/2). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

fix: add Tier 2 cost confirmation guard, fix prompt env description

be27a05

- Fail fast if Tier 2 selected without confirm_tier2_cost=true - Clarify that Azure/GitHub auth is only active for Tier 1/2 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

microsoft-github-policy-service Bot added the no-recent-activity identity issues with no activity label Jun 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add E2E test pipeline for azure.ai.agents extension (Tier 0/1)#8607

Add E2E test pipeline for azure.ai.agents extension (Tier 0/1)#8607
v1212 wants to merge 9 commits into
Azure:wujia/e2e-pipeline-stagingfrom
v1212:wujia/ext-agents-e2e-pipeline

v1212 commented Jun 11, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

glharper commented Jun 11, 2026

Uh oh!

v1212 commented Jun 12, 2026

Uh oh!

v1212 commented Jun 12, 2026 •

edited

Loading

Uh oh!

microsoft-github-policy-service Bot commented Jun 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

v1212 commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Architecture (identical to local testing)

Workflow

Trigger

Items to confirm

Secrets needed

Validation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

glharper commented Jun 11, 2026

Review findings

🔴 Concrete bug — azure/login will fail without id-token: write

🟠 Convention — authenticated workloads should prefer ADO pipelines

🟠 Tier 2 teardown can silently leak Azure resources

🟡 Minor / hardening

✅ Good

Uh oh!

v1212 commented Jun 12, 2026

Uh oh!

v1212 commented Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

microsoft-github-policy-service Bot commented Jun 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

v1212 commented Jun 11, 2026 •

edited

Loading

🔴 Concrete bug — `azure/login` will fail without `id-token: write`

v1212 commented Jun 12, 2026 •

edited

Loading