Core Concepts
Schedules
A schedule defines who is on-call and when:- Rotation: Team members take turns being on-call
- Shifts: Time periods (e.g., weekly, daily, custom)
- Overrides: Temporary changes for vacations or swaps
Escalation Policies
When alerts aren’t acknowledged, escalate to the next person:Creating an On-Call Schedule
- Dashboard
- API
- Go to Dashboard → On-Call → Create Schedule
- Name your schedule (e.g., “API Team On-Call”)
- Add team members
- Choose rotation type:
- Weekly: Each person on-call for a week
- Daily: Rotate daily
- Custom: Define specific shifts
- Set handoff time (e.g., 9:00 AM Monday)
- Save schedule
Schedule Types
Weekly Rotation
Most common for small teams:Follow-the-Sun
For global teams, hand off as business hours end:Layered Schedules
Combine schedules for 24/7 coverage:Escalation Policies
Define what happens when alerts go unacknowledged:- Notifies whoever is on-call from the API team schedule
- Waits 5 minutes for acknowledgment
- If not acknowledged, notifies the engineering manager
- Waits 10 more minutes
- If still not acknowledged, triggers PagerDuty
- Repeats the entire escalation 2 more times
Escalation steps support multiple notify methods:
web_push, voice_call, email, and sms. Add voice_call to automatically phone responders when alerts go unacknowledged. See Voice Alerts for setup and billing details.Schedule Overrides
When someone needs to swap shifts or take time off: Create overrides in the Dashboard or via API:- Dashboard
- API
- Go to Dashboard > On-Call > Schedules
- Click on a schedule and select Add Override
- Choose the user, time range, and reason
Who’s On-Call Right Now?
- Dashboard
- CLI
- API
View current on-call in Dashboard → On-Call → Now
Notification Preferences
Team members can set their notification preferences:- Push: Browser push notifications (all plans)
- Voice Call: AI-powered phone calls via Voice Alerts (Team+)
- SMS: Text messages for all alerts
- Email: Fallback for non-urgent
Best Practices
Keep rotations fair
Keep rotations fair
Distribute on-call burden evenly. Consider time zones and personal situations.
Set realistic timeouts
Set realistic timeouts
Give people time to respond—5 minutes for critical, 15+ for major. Too short causes stress, too long delays response.
Document runbooks
Document runbooks
Ensure on-call engineers know how to respond to common alerts.
Review escalations regularly
Review escalations regularly
Analyze escalation patterns monthly. If alerts frequently escalate, investigate why.
Respect off-hours
Respect off-hours
Minimize non-critical alerts during nights and weekends.