Code Review Skill

Standalone skill that evaluates PR code changes for correctness, safety, performance, and consistency with .NET MAUI conventions. Can be invoked directly by users or by other agents/skills.

Trigger phrases: "review code for PR #XXXXX", "code review PR #XXXXX", "review this PR's code", "analyze code changes in PR", "check PR code quality"

Do NOT use for: "what does PR #XXXXX do?", "summarize PR", "describe the changes", or any informational query — just answer those directly without invoking this skill.

How this differs from other skills:

pr-review — End-to-end PR workflow (4 phases: pre-flight, gate, try-fix, report). Use when you want the full pipeline including test verification and fix attempts.

pr-finalize — Verifies PR title/description match implementation + light code review. Use before merging.

code-review (this skill) — Deep code-only review with MAUI domain rules. Use when you want a thorough code analysis without running tests or modifying the PR.

Core Principles

Independence-first — Form your assessment from the code BEFORE reading the PR description. This prevents anchoring on the author's framing
Full-context — Read entire source files, not just diffs. Check callers, consumers, and git history
Empirical grounding — Reference specific code, line numbers, and call sites. No vague concerns
Severity calibration — Distinguish errors from warnings from suggestions. Not everything is critical
Devil's advocate — Challenge your own conclusions before finalizing

Inputs

Input	Required	Description
pr_number	Yes	GitHub PR number to review

Outputs

Field	Description
`verdict`	`LGTM`, `NEEDS_CHANGES`, or `NEEDS_DISCUSSION`
`confidence`	`high`, `medium`, or `low`
`findings`	Categorized findings with severity levels

Review Workflow

Step 1: Gather Code Context (No PR Narrative)

Do NOT read the PR description or issue yet.

Get the diff:
```
gh pr diff <PR_NUMBER>
```
Read full source files for every changed file (not just diff hunks):
```
gh pr diff <PR_NUMBER> --name-only
# Then read each file in full
```
Check callers and consumers of changed methods/properties:
- Use LSP findReferences and incomingCalls for modified symbols
- Understand how the changed code is used

Review git history of changed files:

git log --oneline -10 -- <changed-file>

Step 2: Delegate to Expert Reviewer

Delegate to the maui-expert-reviewer agent (.github/agents/maui-expert-reviewer.md) which runs per-dimension sub-agent evaluation. The agent's sole output is inline-findings.json — file:line comments in GitHub Review API format.

After the agent finishes:

If COMMENTS_VIA_FILE=true (CI): Done. The pipeline calls post-inline-review.ps1 to post findings using GH_COMMENT_TOKEN.

If COMMENTS_VIA_FILE is unset (local): Post inline findings directly:

COMMIT_SHA=$(gh pr view $PR_NUMBER --json headRefOid --jq .headRefOid)
gh api repos/dotnet/maui/pulls/$PR_NUMBER/reviews \
  --method POST \
  --input <(jq -n \
    --arg sha "$COMMIT_SHA" \
    --arg body "Expert review — see inline comments." \
    --argjson comments "$(cat CustomAgentLogsTmp/PRState/$PR_NUMBER/PRAgent/inline-findings.json)" \
    '{commit_id: $sha, body: $body, event: "COMMENT", comments: [$comments[] | {path, line, body, side: "RIGHT"}]}')

Step 3: Form Independent Assessment

Based ONLY on the code (no PR description), answer:

What does this change do? Describe the behavioral change in your own words
Why might it be needed? Infer motivation from the code
Is the approach sound? Would a simpler alternative work?
What problems do you see? Run through the agent's dimension CHECKs for matched dimensions

Step 4: Read PR Narrative and Reconcile

Now read the PR description, linked issue, and comments. Treat these as claims to verify, not facts.

Where your assessment disagrees with the author's claims, investigate further
If the PR claims a bug fix, verify the root cause analysis matches the code
Check existing review comments to avoid duplicating feedback

Step 5: Check CI Status

gh pr checks <PR_NUMBER>

Review CI results. Never post ✅ LGTM if any required CI check is failing. If CI is failing:

If caused by the PR's code changes, flag as ❌ error
If a known infrastructure issue or pre-existing flake, note it but still use ⚠️
If the PR description acknowledges the failure, note it in the summary

Step 6: Devil's Advocate and Verdict

Before finalizing your verdict:

Challenge your findings — For each issue you flagged, ask: "Am I sure, or am I guessing?"
Challenge your approval — If you're leaning LGTM, ask: "What could go wrong that I'm not seeing?"
Check platform blind spots — If the change touches platforms you can't fully reason about, say so explicitly

Then deliver your verdict:

LGTM — Code is correct, safe, and consistent with MAUI patterns. Ready for human approval.
NEEDS_CHANGES — Concrete issues found that should be addressed before merge.
NEEDS_DISCUSSION — Complex tradeoffs or architectural questions that need human judgment.

Review Output Format

Constraints (from Android team's approach):

Only comment on added/modified lines — don't flag pre-existing code
One issue per comment. If the same issue appears many times, flag once with a note listing all affected files
Don't pile on. 3 important comments > 15 nitpicks
Don't flag what CI catches. Skip compiler errors, formatting the linter will catch, etc.
Avoid false positives. Verify the concern actually applies given full context. If unsure, phrase as a question.

## Code Review — PR #XXXXX

### Independent Assessment
**What this changes:** [Your understanding from code alone]
**Inferred motivation:** [Why this change seems needed]

### Reconciliation with PR Narrative
**Author claims:** [Summary of PR description]
**Agreement/disagreement:** [Where your assessment matches or differs]

### Findings

#### ❌ Error — [Brief description]
[Explanation with specific file:line references]

#### ⚠️ Warning — [Brief description]
[Explanation with specific file:line references]

#### 💡 Suggestion — [Brief description]
[Explanation]

### Devil's Advocate
[Challenges to your own conclusions]

### Verdict: LGTM / NEEDS_CHANGES / NEEDS_DISCUSSION
**Confidence:** high / medium / low
**Summary:** [2-3 sentences explaining the verdict]

Verdict Consistency Rules

The verdict must match your most severe finding. If you have any ❌ Error findings, the verdict must be NEEDS_CHANGES. If only ⚠️ Warnings, use judgment but explain.
Devil's advocate before finalizing. Re-read all findings. For each warning, ask: "Would I be comfortable if this merged as-is?"
Never approve what you can't verify. If the fix touches platform code you can't fully reason about, say so explicitly and use NEEDS_DISCUSSION.
LGTM means no ❌ Errors. You can LGTM with 💡 Suggestions. You can LGTM with ⚠️ Warnings only if you've explained why they're acceptable.
🚨 NEVER use --approve or --request-changes on GitHub. Only post comments. Approval is a human decision.
Write findings to disk, do not post directly. The agent does not have the GitHub comment token. Write findings to CustomAgentLogsTmp/PRState/{PR}/PRAgent/ — the CI pipeline or posting scripts handle GitHub interaction.

Posting the Review

The agent writes findings to disk. Posting is done separately by Review-PR.ps1:

Inline review comments (preferred — findings at exact file:line):

# Preview first:
pwsh .github/scripts/post-inline-review.ps1 -PRNumber <PR_NUMBER> -DryRun

# Post when ready:
pwsh .github/scripts/post-inline-review.ps1 -PRNumber <PR_NUMBER>

Wall-of-text summary (phase content assembled into a single PR comment):

# Called by Review-PR.ps1 automatically:
pwsh .github/scripts/post-ai-summary-comment.ps1

In CI (eng/pipelines/ci-copilot.yml), Review-PR.ps1 calls both post-inline-review.ps1 (for inline findings) and post-ai-summary-comment.ps1 (for the wall-of-text from {phase}/content.md files), using GH_COMMENT_TOKEN.

Completion Criteria

Full source files read (not just diffs)
Independent assessment formed before reading PR narrative
MAUI-specific checklist walked through for each applicable section
Findings categorized by severity (❌ / ⚠️ / 💡)
Devil's advocate check performed
Verdict is consistent with findings
Output follows the format above