For the complete documentation index, see llms.txt.

Source: docs/DEBUGGING_DEPLOYMENTS.md

Debugging Deployments

Use this runbook when a deploy fails or a live app looks unhealthy.

Quick triage (5 minutes)

1. Ask the Deployment Agent

Open Agent in the header:

  • "Why did my last deployment fail?"
  • "Is {repo} {service} healthy right now?"

The agent lists deployments, checks history, or loads runtime health (read-only, up to 2 tool calls).

2. Check deployment status

In the deploy workspace Overview:

| Status | Meaning |
| --- | --- |
| deploying | Pipeline still running; watch live logs |
| running | Last deploy succeeded; check runtime health if URL fails |
| failed | Last deploy did not complete; open History |
| degraded / unreachable | Runtime health probe failing |

See Deployment Status Reference.

3. Open Deployment History

Find the latest failed entry:

  1. Note failed step (Build, Verify, Deploy, etc.)
  2. Expand step logs and find the first error line
  3. Note failure code if shown (for example CODEBUILD_DOCKER_IMAGE_BUILD_FAILED)
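If you have copied the step logs out as text, the scan for the first error line and a failure code can be sketched as below. This is a minimal sketch, not the platform's own parser: the error markers, the assumed shape of failure codes (upper-case words joined by underscores, as in CODEBUILD_DOCKER_IMAGE_BUILD_FAILED), and both function names are assumptions made for illustration.

```python
import re
from typing import Optional

# Assumption: an error line contains one of these words (case-insensitive).
ERROR_MARKERS = re.compile(r"\b(error|failed|fatal|exception)\b", re.IGNORECASE)

# Assumption: failure codes are upper-case tokens with at least two underscores,
# for example CODEBUILD_DOCKER_IMAGE_BUILD_FAILED.
FAILURE_CODE = re.compile(r"\b[A-Z][A-Z0-9]*(?:_[A-Z0-9]+){2,}\b")


def first_error_line(log_text: str) -> Optional[str]:
    """Return the first log line that contains an error marker, or None."""
    for line in log_text.splitlines():
        if ERROR_MARKERS.search(line):
            return line.strip()
    return None


def failure_code(log_text: str) -> Optional[str]:
    """Return the first token that looks like a failure code, or None."""
    match = FAILURE_CODE.search(log_text)
    return match.group(0) if match else None
```

The first error line is usually closer to the root cause than the last one, since later errors are often consequences of the first.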

4. Escalate by failure type

| Failed at | Guide |
| --- | --- |
| Build / Publish | Build Failures |
| Verify | Health Checks, Startup and Runtime Failures |
| Deploy / Rollout | Startup and Runtime Failures |
| URL loads but wrong behavior | Runtime logs, env vars; see Environment Variables |
| URL does not resolve | Domain and TLS Issues |
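For teams that script their triage notes, the step-to-guide routing above can be written as a lookup. This is only an illustrative sketch: the dictionary transcribes the failed-step rows of the table, while the lower-case keys and the `guides_for` name are assumptions, not part of the product.

```python
from typing import Dict, List

# Failed pipeline step -> guides to read, transcribed from the table above.
# Keys are lower-cased step names (an assumption made for this sketch).
GUIDES_BY_STEP: Dict[str, List[str]] = {
    "build": ["Build Failures"],
    "publish": ["Build Failures"],
    "verify": ["Health Checks", "Startup and Runtime Failures"],
    "deploy": ["Startup and Runtime Failures"],
    "rollout": ["Startup and Runtime Failures"],
}


def guides_for(failed_step: str) -> List[str]:
    """Return the guides for a failed step; empty list if the step is unknown."""
    return GUIDES_BY_STEP.get(failed_step.strip().lower(), [])
```

An empty result means the failure is a symptom rather than a pipeline step (for example, a URL that does not resolve), so fall back to the symptom rows of the table.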

5. Analyze failure (optional)

On a specific history entry, run Analyze failure to get an LLM summary based on the full run logs.

6. Fix and redeploy

  • Config issue → update env vars / branch / subdomain → redeploy
  • Build plan issue → Improve scan → review blueprint → redeploy
  • App code issue → fix repo → push → redeploy

Severe production outage

If users are impacted and you need service back before root-cause analysis:

  1. Rollback to the last successful history entry (manual, by commit)
  2. Confirm URL healthy via Overview or Deployment Agent
  3. Debug the failed commit separately

See Deployment History and Rollback.
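Picking the rollback target amounts to finding the most recent successful entry in the history. The sketch below shows that selection over a hypothetical in-memory list; the `HistoryEntry` fields and the `rollback_target` function are assumptions for illustration and do not reflect the actual history schema or any API.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Iterable, Optional


@dataclass(frozen=True)
class HistoryEntry:
    # Hypothetical shape of a deployment history entry.
    commit_sha: str
    succeeded: bool
    finished_at: datetime


def rollback_target(history: Iterable[HistoryEntry]) -> Optional[HistoryEntry]:
    """Return the most recently finished successful entry, or None if there is none."""
    successful = [entry for entry in history if entry.succeeded]
    return max(successful, key=lambda entry: entry.finished_at, default=None)
```

If the function returns `None`, there is no successful deploy to roll back to, and the failed commit has to be fixed forward instead.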

Failure codes for automatic rollback exist in the failure classification, but manual rollback by commit is the supported recovery path today.

Collect evidence before asking for help

  1. Exact error text from step logs
  2. Repo name, service name, branch, commit SHA
  3. Failure code from history (if any)
  4. Whether scan build_status was passed
  5. Recent config changes (env vars, subdomain)
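The checklist above can be kept as a small structured record so nothing is forgotten before escalating. This is a sketch under stated assumptions: the `Evidence` class, its field names, and the report layout are all hypothetical and only mirror the five items listed.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Evidence:
    # Fields mirror the checklist; names are illustrative, not a real schema.
    error_text: str
    repo: str
    service: str
    branch: str
    commit_sha: str
    failure_code: Optional[str] = None        # only if history shows one
    scan_build_passed: Optional[bool] = None  # None means not checked yet
    recent_config_changes: List[str] = field(default_factory=list)

    def missing(self) -> List[str]:
        """Names of required fields that are still blank."""
        required = ["error_text", "repo", "service", "branch", "commit_sha"]
        return [name for name in required if not getattr(self, name).strip()]

    def to_report(self) -> str:
        """Render the evidence as plain text for a ticket or chat message."""
        scan = {True: "passed", False: "not passed", None: "unknown"}[self.scan_build_passed]
        changes = ", ".join(self.recent_config_changes) or "none"
        return "\n".join([
            f"Error text: {self.error_text}",
            f"Repo/service: {self.repo} / {self.service}",
            f"Branch/commit: {self.branch} @ {self.commit_sha}",
            f"Failure code: {self.failure_code or '(none shown)'}",
            f"Scan build_status: {scan}",
            f"Recent config changes: {changes}",
        ])
```

Calling `missing()` before `to_report()` gives a quick check that the exact error text and commit SHA were captured rather than paraphrased.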

Related