CI Integration

Last verified: 2026-04-17 · next review in 118 days

CI is where agents scale from "neat assistant" to "automated teammate". This page covers the concrete patterns: non-interactive invocation, PR review bots, scheduled maintenance jobs, and the guardrails that keep the bills manageable.

Non-interactive agent calls

Most agents offer a non-interactive mode, usually a flag such as -p or --message and sometimes a dedicated subcommand, that runs a single prompt, prints the result to stdout, and exits:

# Claude Code
claude -p "Summarize changes in src/ since last week" --output-format json

# Codex CLI
codex exec "add a return type to getUserById in src/db/users.ts"

# Aider
aider --message "Fix the type error in src/pipeline.ts and commit" --yes

These compose with shell pipelines — feed the output to jq, grep for a pass/fail string, fail the job on error.
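
When the gate outgrows a jq one-liner, the same check can live in a small script. The sketch below assumes the agent prints a single JSON object with a text `result` field and an optional `is_error` flag (field names differ between tools and versions, so treat both as assumptions) and that the prompt told the agent to end its answer with PASS or FAIL. `gate` is a hypothetical helper name.

```python
import json


def gate(raw_output: str, pass_marker: str = "PASS") -> int:
    """Return a process exit code (0 = pass, 1 = fail) for one agent run.

    Assumes a single JSON object with a text `result` field and an
    optional boolean `is_error` field. Both names are assumptions;
    adjust them to whatever your agent actually prints.
    """
    try:
        payload = json.loads(raw_output)
    except json.JSONDecodeError:
        return 1  # Unparseable output is treated as a failure.

    if not isinstance(payload, dict) or payload.get("is_error"):
        return 1

    lines = str(payload.get("result", "")).strip().splitlines()
    # Only the final line counts, so a stray "PASS" in the middle of
    # the answer cannot turn a failing run green.
    verdict = lines[-1].strip() if lines else ""
    return 0 if verdict == pass_marker else 1
```

In a workflow step this could sit at the end of the pipeline, with the script calling `sys.exit(gate(sys.stdin.read()))` so a non-PASS verdict fails the job.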

Pattern 1: PR review bot

A GitHub Actions workflow triggered on pull_request that runs an agent to review the diff and comment.

# .github/workflows/ai-review.yml
name: AI Review
on:
  pull_request:
    branches: [main]

permissions:
  contents: read
  pull-requests: write

jobs:
  review:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0

      - name: Run review
        env:
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
        run: |
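          # Pipe the diff on stdin: a large diff inlined into the prompt
          # can exceed the shell's argument-size limit.
          git diff origin/main...HEAD | claude -p "Review this diff for bugs, security issues, and missing tests. Be specific and concise." \
            --allowed-tools "" \
            --output-format text > review.md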

      - name: Post comment
        uses: marocchino/sticky-pull-request-comment@v2
        with:
          path: review.md

Variants:

Run the review agent with --allowed-tools "" to keep it read-only
Target specific scopes: "security review", "test coverage", "architecture review"
Use Continue for a more structured source-controlled-check workflow
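
Scoped reviews are easier to keep consistent when the prompt is assembled in one place. A minimal sketch, assuming three scope names and a character cap on the diff to bound token spend; `SCOPE_INSTRUCTIONS`, `build_review_prompt`, and the 60,000-character default are all hypothetical choices, not values from any tool.

```python
# Hypothetical scope instructions; tune the wording to your codebase.
SCOPE_INSTRUCTIONS = {
    "security": "Review this diff for security issues only.",
    "tests": "Review this diff for missing or weak test coverage only.",
    "architecture": "Review this diff for architectural problems only.",
}


def build_review_prompt(diff: str, scope: str, max_chars: int = 60_000) -> str:
    """Build a scoped review prompt, capping the diff to bound token spend."""
    if scope not in SCOPE_INSTRUCTIONS:
        raise ValueError(f"unknown scope: {scope!r}")

    body = diff[:max_chars]
    note = ""
    if len(diff) > max_chars:
        # Tell the model the input is partial so it does not guess at the rest.
        note = "\n\n[Diff truncated; review only the part shown.]"

    return f"{SCOPE_INSTRUCTIONS[scope]} Be specific and concise.\n\n{body}{note}"
```

Running one job per scope keeps each comment focused, at the price of one agent call per scope.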

Pattern 2: Scheduled maintenance agent

Nightly jobs that keep the codebase tidy:

# .github/workflows/nightly-tidy.yml
name: Nightly tidy
on:
  schedule: [{ cron: '0 3 * * *' }]
  workflow_dispatch:

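# The job commits, pushes a branch, and opens a PR, so it needs write access.
permissions:
  contents: write
  pull-requests: write

jobs:
  tidy:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      # Assumes claude and pnpm are already installed on the runner.
      - name: Dependency patch bumps
        env:
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
        run: |
          git config user.name "github-actions[bot]"
          git config user.email "41898282+github-actions[bot]@users.noreply.github.com"
          # Work on a branch so the commit never lands directly on main.
          git switch -c "nightly/patch-bumps-${{ github.run_id }}"
          claude -p "Check package.json for patch-version bumps safe to apply. Update pnpm-lock.yaml. If tests pass, commit with message 'chore: patch bumps'. If not, abort." \
            --permission-mode auto \
            --allowed-tools "Read,Edit,Bash(pnpm *),Bash(git add *),Bash(git commit *)"
      - name: Push and open PR
        env:
          GH_TOKEN: ${{ github.token }}
        run: |
          # Nothing to do when the agent made no commit.
          if [ "$(git rev-parse HEAD)" = "${{ github.sha }}" ]; then exit 0; fi
          git push -u origin HEAD
          gh pr create --title "Nightly patch bumps" --body "Automated"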

Use this pattern for: patch bumps, typo fixes, lint cleanup, doc freshness checks.
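
For patch bumps in particular, a deterministic check can confirm that whatever the agent changed really is patch-only before the PR is opened. A minimal sketch, assuming exact `MAJOR.MINOR.PATCH` versions with no range prefixes or pre-release tags; `is_patch_bump` is a hypothetical helper.

```python
import re

# Exact three-part versions only; "^1.2.3" or "1.2.3-beta.1" are rejected.
_EXACT_VERSION = re.compile(r"^(\d+)\.(\d+)\.(\d+)$")


def is_patch_bump(current: str, candidate: str) -> bool:
    """True when candidate raises only the patch component of current."""
    old = _EXACT_VERSION.match(current)
    new = _EXACT_VERSION.match(candidate)
    if not old or not new:
        return False  # Anything unparseable is treated as unsafe.

    old_parts = tuple(int(part) for part in old.groups())
    new_parts = tuple(int(part) for part in new.groups())
    same_major_minor = old_parts[:2] == new_parts[:2]
    return same_major_minor and new_parts[2] > old_parts[2]
```

Rejecting anything that does not parse errs on the side of skipping a bump rather than letting a minor or major upgrade through.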

Pattern 3: Agentic Universe pipeline

The canonical multi-stage pipeline in CI — what this very project runs:

Cron (6 AM UTC)
   ↓
Ingest (RSS / HN / GitHub / Reddit / ArXiv)
   ↓
Filter (keywords → embeddings → LLM)
   ↓
Categorize + Summarize (Anthropic Batch API)
   ↓
Deliver (Telegram / Slack / RSS / Email)
   ↓
ISR revalidate (Vercel webhook)

Runs on GitHub Actions rather than Vercel, because pipeline runs longer than 60 s exceed Vercel's function time limit. See Orchestration Patterns and architecture notes.
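
The Filter stage reads as a cheapest-first funnel: keyword matching, then embeddings, then an LLM call, so the expensive check only sees items that survived the earlier ones. A minimal sketch of that shape, assuming each stage can be written as a keep/drop predicate; `run_funnel` and the stage names are hypothetical and are not this project's actual code.

```python
from typing import Callable, Iterable

# A stage is any function that decides whether to keep one item.
Stage = Callable[[str], bool]


def run_funnel(
    items: Iterable[str],
    stages: list[tuple[str, Stage]],
) -> tuple[list[str], dict[str, int]]:
    """Apply stages in order; return survivors and per-stage call counts.

    The call counts show how much work each stage did, which is what
    you watch to confirm the costly stages see only a small slice.
    """
    survivors = list(items)
    calls: dict[str, int] = {}
    for name, keep in stages:
        calls[name] = len(survivors)
        survivors = [item for item in survivors if keep(item)]
    return survivors, calls
```

Pass the stages ordered from cheapest to most expensive; the returned counts make it easy to log how many items reached the LLM stage on each run.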

Budget guards

CI can accidentally burn money. Three safety nets:

Per-run spend cap

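# Check the reported cost after the run; fail the job if it is over budget
MAX_COST_USD=2.00

claude -p "..." --output-format json > response.json

# jq -e exits non-zero when the expression is false, so a missing cost
# field or an over-budget run both fail the step
jq -e --argjson max "$MAX_COST_USD" \
  '.total_cost_usd != null and .total_cost_usd <= $max' response.json > /dev/null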
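
A job that makes several agent calls needs a running total rather than a single check. A minimal sketch, assuming each response has been parsed into a dict carrying a numeric `total_cost_usd` field as in the snippet above; `BudgetTracker` and `BudgetExceeded` are hypothetical names.

```python
class BudgetExceeded(RuntimeError):
    """Raised when cumulative spend passes the per-run cap."""


class BudgetTracker:
    """Accumulate reported cost across calls and stop once over the cap."""

    def __init__(self, max_cost_usd: float) -> None:
        if max_cost_usd <= 0:
            raise ValueError("max_cost_usd must be positive")
        self.max_cost_usd = max_cost_usd
        self.spent_usd = 0.0

    def record(self, response: dict) -> float:
        """Add one response's cost; return the remaining budget in USD."""
        cost = response.get("total_cost_usd")
        # bool is a subclass of int, so exclude it explicitly.
        if isinstance(cost, bool) or not isinstance(cost, (int, float)):
            raise ValueError("response has no numeric total_cost_usd")

        self.spent_usd += float(cost)
        if self.spent_usd > self.max_cost_usd:
            raise BudgetExceeded(
                f"spent ${self.spent_usd:.2f} of ${self.max_cost_usd:.2f}"
            )
        return self.max_cost_usd - self.spent_usd
```

Call `record` after every agent invocation and let the exception abort the job. The call that crosses the cap has already been paid for, so this limits further spend rather than preventing the overrun itself.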

Provider-side budget alerts

The Anthropic Console, the OpenAI dashboard, and most other provider consoles let you set a monthly spend cap that throttles or denies requests past the limit. Set these before going to production; they are the backstop.

Monthly budget GitHub Action

# Runs daily, alerts if MTD spend is tracking too high
- name: Budget check
  run: node scripts/check-llm-budget.mjs --mtd-limit 50
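
The budget script itself is not shown here. One plausible reading of "tracking too high" is a linear projection of month-to-date spend out to month end, sketched below; `projected_month_spend` and `is_over_pace` are hypothetical helpers, and fetching the month-to-date figure is left out because it depends on the provider.

```python
import calendar
from datetime import date


def projected_month_spend(mtd_spend_usd: float, today: date) -> float:
    """Extrapolate month-to-date spend linearly to the end of the month."""
    days_in_month = calendar.monthrange(today.year, today.month)[1]
    daily_rate = mtd_spend_usd / today.day  # today.day is always >= 1
    return daily_rate * days_in_month


def is_over_pace(mtd_spend_usd: float, monthly_limit_usd: float, today: date) -> bool:
    """True when the current pace would end the month above the limit."""
    return projected_month_spend(mtd_spend_usd, today) > monthly_limit_usd
```

A linear projection is noisy in the first few days of a month, so a real check might hold off alerting until a week of data has accumulated.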

Secrets handling

Never pass API keys as command-line arguments: they show up in process lists and shell history
Pass keys through env: in the workflow YAML, sourced from the secrets context; GitHub masks secret values in logs automatically
Rotate on leak: if a key ever appears in a log, revoke it in the provider console first, then issue a replacement
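
Helper scripts in the pipeline can follow the same rules: read the key from the environment, fail fast when it is missing, and never print it whole. A minimal sketch; `load_api_key` and `mask` are hypothetical helpers, and showing the last four characters is an arbitrary choice.

```python
import os


def load_api_key(name: str = "ANTHROPIC_API_KEY") -> str:
    """Read a key from the environment; fail fast if it is absent."""
    key = os.environ.get(name, "").strip()
    if not key:
        raise RuntimeError(f"{name} is not set; define it under env: in the workflow")
    return key


def mask(secret: str, visible: int = 4) -> str:
    """Return the secret with all but the last few characters starred out."""
    if len(secret) <= visible:
        return "*" * len(secret)
    return "*" * (len(secret) - visible) + secret[-visible:]
```

Logging `mask(key)` instead of the key lets a debug line confirm which key was loaded without exposing it.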

Gotchas

Agents can hang. Always wrap with a timeout: timeout 10m claude -p "...".
--yes / --bypass modes in CI. Necessary (no interactive approval available) but risky — always pair with tight --allowed-tools or a sandbox runner.
Concurrent provider limits. A morning cron firing off 20 parallel jobs can hit tokens-per-minute (TPM) caps. Cap matrix concurrency with jobs.<job_id>.strategy.max-parallel.
Cost per PR can surprise. A review bot on a 1000-line PR with a heavy model can cost $1+. Budget accordingly.
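
When the agent is launched from a script rather than directly from the shell, the timeout guard can be applied there instead. A minimal sketch using the standard library; `run_with_timeout` is a hypothetical helper, and returning 124 on expiry is a choice made here to mirror the exit code GNU timeout uses.

```python
import subprocess


def run_with_timeout(cmd: list[str], timeout_s: float) -> int:
    """Run cmd; return its exit code, or 124 if it exceeds timeout_s."""
    try:
        # subprocess.run kills the child before raising TimeoutExpired.
        completed = subprocess.run(cmd, timeout=timeout_s, check=False)
    except subprocess.TimeoutExpired:
        return 124
    return completed.returncode
```

A call such as `run_with_timeout(["claude", "-p", "..."], 600)` gives the same ten-minute ceiling as the shell wrapper above.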
