MonitorMojo Blog
Uptime Monitoring Checklist: What to Check, When to Check It, and What to Do
Uptime monitoring is most useful when it is systematic. A check that happens inconsistently tells you whether a site was available at a few random moments. A check that follows a defined process tells you about the site's reliable availability patterns, catches the failure modes that hit specific pages rather than the whole site, and creates a record of due diligence that supports client care plan delivery. This checklist covers the core elements of a practical uptime monitoring workflow.
What to include in an uptime check
A basic uptime check asks whether a URL returns a successful response. But a useful uptime check covers more: the specific URL being checked (not just the homepage), the HTTP status code returned, any redirect chain behavior, and the response time alongside the availability status.
For most business websites, the pages that matter most are not always the homepage. A booking form that fails, a checkout page that times out, or a login portal that returns an error can each represent complete unavailability for the visitor trying to complete that action — even if the homepage loads perfectly.
For agencies, this means uptime checks should cover the pages that matter most to the client's business: the contact or booking page, the checkout flow, the client portal, and any campaign landing pages that are actively driving traffic.
- Homepage URL included in every check
- Booking, checkout, or contact page included for sites where these are business-critical
- Any client portal or login page checked separately
- Active campaign landing pages included during campaign periods
- Subdomain availability checked separately if critical business functions run on subdomains
What status codes mean
Understanding what status codes mean is necessary for interpreting uptime check results. A 200 OK means the server responded successfully. A 301 or 302 means a redirect is in place. A 404 means the page was not found — a common result when a URL changes. A 500 means the server encountered an error processing the request.
Redirect responses (301, 302) are not failures — they indicate the server is working and directing traffic to the correct URL. But an unexpected redirect chain — a URL that redirects three times before reaching the final destination — may indicate a configuration issue worth reviewing.
Error responses (4xx, 5xx) indicate problems that warrant investigation. A 404 on a URL that was previously returning 200 means the page content has been removed or the URL structure has changed. A 500 means a server-side error that requires investigation of server logs or application configuration.
- 200 OK: site is responding normally
- 301/302 Redirect: responding, but redirecting to another URL — verify the final destination
- 403 Forbidden: server responding but blocking access — check authentication or IP restriction configuration
- 404 Not Found: page does not exist at this URL — verify the URL is correct
- 500 Server Error: server-side problem requiring investigation
- No response / timeout: connectivity or DNS issue requiring urgent investigation
Response time benchmarks
Uptime checks should include response time alongside availability status. A site that is technically 'up' but responding in five seconds is functionally unavailable for most mobile visitors. Response time gives you the quality dimension of availability, not just the binary up-or-down.
General response time benchmarks: under 200ms is fast, 200ms to 500ms is good, 500ms to 1 second is acceptable but worth monitoring, over 1 second warrants investigation, over 2 seconds is affecting visitor experience and likely conversions.
Track response time over time rather than just at a single point. A site that was running at 300ms last month and is now running at 1.2 seconds has a problem worth investigating — even if the current response time is still technically within acceptable limits.
- Is current response time within expected range for this site?
- Has response time changed significantly from the previous check?
- For ecommerce: is checkout response time under 1 second?
- For high-traffic pages: is response time acceptable under typical load?
- For mobile audiences: is response time fast enough for slower mobile connections?
Responding when a site is down
When an uptime check reveals that a site is unavailable, the response process matters as much as the detection. A structured response process reduces the time from detection to resolution and creates documentation of what happened.
First, verify the finding: run the check again from the same tool and also try loading the URL in an incognito browser window on a different network. One failed check may be a transient network issue. Two failed checks from different sources indicate a real problem.
Then identify the likely cause: check the hosting provider status page, verify DNS is resolving correctly, check for any recent site changes that might have caused the issue, and check whether the SSL certificate is expired or the domain has lapsed. The health check result will indicate which signal failed and give you a starting point for the investigation.
- Verify the outage with a second check from a different source
- Check the hosting provider's status page for any active incidents
- Verify DNS is resolving to the correct IP address
- Check for any recent deployments, updates, or configuration changes
- Check whether SSL certificate has expired
- Check whether domain registration is current
- Contact hosting provider support if the cause is not immediately apparent
- Notify the client if the outage is significant and resolution will take time
- Document the timeline: when the issue was detected, what was investigated, and when it was resolved
Frequently Asked Questions
How frequently should I run uptime checks?
For agencies running manual health check workflows, monthly checks on a consistent schedule are appropriate for most care plan clients, with additional checks after deployments or changes. For critical business websites where every hour of downtime has significant impact, more frequent checks give earlier detection.
Should I monitor every page on a website?
No. Focus on the pages that matter most: the homepage, business-critical conversion pages (checkout, booking, contact), and any page that is actively driving traffic from ads or campaigns. Checking every page on a large site is unnecessary — the signals from key pages are sufficient for most monitoring needs.
What should I do when a check shows a 301 redirect?
A 301 redirect means the server is responding and directing traffic to another URL. Verify that the final destination URL is correct and loading successfully. A redirect itself is not a problem, but a redirect to the wrong URL, a redirect loop, or a chain of more than two redirects may indicate a configuration issue.
How does MonitorMojo help with uptime monitoring?
MonitorMojo runs a combined website health check that covers reachability, HTTP status, SSL status, response time, security headers, and domain risk in one workflow. For agencies and small teams, this gives a more complete uptime picture than a simple ping-based availability check.
What is the difference between a website being down and a website being slow?
A website that is 'down' is not responding to requests at all, or is returning an error code like 500. A website that is 'slow' is responding but taking too long to do so — which can be just as damaging to visitor experience and conversions. Both are worth monitoring; response time captures the slow case that basic availability checks miss.