Skip to main content

Operations runbook

This page does not replace your MongoDB Atlas or hosting provider console. It is a checklist for what to verify before and after launch.

Backups (MongoDB Atlas)​

  1. Confirm Cloud Backup (or equivalent) is enabled for the production cluster.
  2. Confirm retention meets your recovery policy (often 7–30 days minimum for production).
  3. Perform a restore drill to a scratch database or namespace at least once per quarter. Document who ran it and the outcome.

Uptime monitoring​

Configure external checks (e.g. Better Stack, UptimeRobot, Pingdom) for at least:

URLPurpose
https://api.attunelogic.com/healthAPI liveness
https://app.attunelogic.comWeb app
https://attunelogic.comMarketing site
https://docs.attunelogic.comDocumentation

Route alerts to a shared on-call channel (Slack or PagerDuty). For a small team, a dedicated operations phone is acceptable if someone acknowledges pages.

Sentry​

  • Configure alert rules for new issues in production and error-rate spikes.
  • Keep DSN per environment; do not reuse production DSN in staging if you need clean signal.

Email deliverability​

See Email deliverability for SPF, DKIM, and DMARC.