Every signal your server sends.
Nothing missed.
InfraCaptain monitors 47 different server signals every minute — metrics, security, stack health, and performance — and puts Captain AI on top of all of it so you get answers, not just data.
Server signals checked
every 60 seconds
Monitoring categories
in one dashboard
CPU overhead from
the monitoring agent
From install command
to live server data
Real-time performance,
historical trends
CPU, RAM, disk, and network — sampled every 60 seconds and stored with history so you can see patterns, not just snapshots.
CPU, RAM & Disk — live and historical
See current utilization at a glance. Trend charts build over time so you can tell whether a spike is a pattern or a one-off. p95 calculations highlight peak load.
- CPU usage with per-core breakdown and load average
- RAM used, free, and cached — plus swap usage
- Disk usage per mount point with growth rate prediction
- Inbound and outbound network traffic with anomaly detection
- Historical charts with 30-day (Basic) or 90-day (Pro) retention
- Spike annotations — AI identifies what caused each peak
Every process. Every job.
Every single run.
Track running processes and scheduled cron jobs with full execution history. Know when jobs fail silently — before they cause real damage.
Silent failures don't stay silent
InfraCaptain records every cron job execution — when it ran, how long it took, whether it succeeded or failed, and identifies patterns of failure.
- Full cron job execution history logged with exit code and duration
- Consecutive miss detection — alert after configurable threshold
- Runaway process detection flags processes running beyond normal duration
- Memory leak detection via per-process trend analysis over time
- 17 disabled cron jobs detected and flagged automatically
- Process persistence scoring — track which processes restart most frequently
Last success Apr 9 02:03 · Cron is running · Likely permission or path error in script
| Process | User | CPU% | MEM% | Status |
|---|---|---|---|---|
| apache2 | www-data | 73.6% | 7.1% | High |
| job_processor.py | www-data | 71.0% | 0.9% | Runaway |
| mongod | mongodb | 1% | 5.8% | Normal |
| node | deploy | 0.4% | 12.1% | Leak |
Your entire stack.
One screen.
Domains, SSL certificates, databases, web servers, third-party services — InfraCaptain maps your entire software stack and monitors each component's health continuously.
Web server, domains & database health
See every domain on your server, its web server configuration, SSL certificate status, and the databases powering it — all in one stack view.
- Virtual host detection — Apache, Nginx, and Caddy supported
- SSL certificate validity, expiry date, and mismatch detection
- Database status — MySQL, MariaDB, MongoDB, PostgreSQL, Redis
- Third-party service monitoring — email, cache, SMTP, message queues
- PageSpeed scores (mobile + desktop) for all detected domains
- Slow query detection with query count and uptime per database
Security monitoring that
never, ever sleeps.
Continuous security posture monitoring — not on-demand scans. Brute force detection, hidden processes, file integrity, and open port changes — all in real time.
Continuous, not on-demand
InfraCaptain monitors your server's security posture every minute. Hidden processes, brute force attacks, changed files, and SSL issues are flagged immediately.
- SSH brute force detection flagged after attempt threshold
- Hidden process detection — processes starting with dot flagged
- File integrity monitoring with SHA-256 hashing for system files
- Open port monitoring — any new port opening triggers alerts
- UFW firewall rule change detection included out of the box
- AI-generated security score (0–100) with prioritized fix list
- Attack timeline — 7-day view of all detected threat events
- Process security scan flags suspicious executables and locations
The AI that already knows
your server — before you ask.
Other AI tools make you explain the problem. Captain AI already has your server context — process history, resource trends, cron records, security events, log patterns — all structured and masked. Ask once. Get a specific, actionable answer in seconds.

The spike started at 16:31 — 4 minutes after job_processor.py triggered. That process is at 71% CPU and hasn't exited — it's been running for 23 minutes. Normal runtime is under 90 seconds.
This looks like an infinite loop triggered by a malformed entry in your jobs queue. Run this to confirm and stop it safely:

backup.sh missed Apr 10, 11, and 12. Last success was Apr 9 at 02:03 (ran for 4m 12s). On Apr 10 it started but exited in 3 seconds — too fast to complete.
Most likely cause: the backup destination directory was deleted or permissions changed. Check if /var/backups still exists and is writable by the cron user.
One question. One answer. Done.
Captain AI has full context on your server — not because you pasted it, but because InfraCaptain has been collecting it for days. Every conversation starts with your exact server state already loaded.
- Ask "why is my CPU high?" and get a specific process, timestamp, and fix
- Ask "why did my cron fail?" and get the exact failure pattern and likely cause
- Ask "will my disk fill up?" and get a days-until-full prediction with root cause
- Ask "is there a memory leak?" and get a trend analysis linked to your last deploy
- All sensitive values — IPs, emails, user IDs — masked before AI processing
- Runs on your existing server context — no extra data collection needed
25 minutes of back-and-forth pasting logs. Still no root cause. You're the context layer.
Root cause in one message. Specific fix included. Context was already there.
Alerts that tell you what to do,
not just what happened.
Every InfraCaptain alert includes the event, the context, and a suggested fix — so you're never woken up at 3am staring at a raw metric.
Instant alerts
Get notified the moment something crosses a threshold — CPU spike, memory leak, cron miss, SSL mismatch, brute force attempt.
- Email, webhook, or in-app
- Configurable thresholds
- Info, Warning, Critical
Context-rich alerts
Every alert includes AI-generated context — what triggered it, what it means for your server, and a specific suggested action.
- Root cause suggestion
- "How to Fix" guides
- Expert help on demand
Alert patterns
All alerts logged with resolution status. See recurring patterns across your servers and identify systemic issues before they escalate.
- Full alert timeline
- Resolution notes
- Pattern detection
Stop paying for resources
you don't use.
InfraCaptain tracks long-term CPU and RAM utilization trends. When a server consistently runs under capacity, we tell you how much you could save by downsizing.
Know exactly which servers to resize
Using p95 CPU and RAM calculations over 30 days, we identify servers that are consistently over-provisioned and give you a monthly savings estimate.
- 30-day p95 CPU and RAM analysis per server
- Flags servers with CPU p95 <40% and RAM p95 <50%
- Monthly savings estimate shown per server
- Alert triggered after 14 consecutive days of under-utilization
- Monthly digest showing combined savings across all servers
- Downsize safely — see peak usage to ensure headroom
💡 Combined monthly saving if you act on all 4 servers:
~$50/month ($600/year)
Reports your clients
actually read.
Automated monthly reports, white-label branding for agencies, and shareable server status links.
Monthly summaries
Auto-generated summaries covering uptime, performance trends, and savings opportunities — sent directly to your inbox.
- Per-server & multi-server
- Savings included
- PDF export available
White-label reports
Replace InfraCaptain branding with your own logo and colors. Appear as if they come from your agency — not a third-party tool.
- Custom logo & colors
- Sent from your domain
- Per-client grouping
Shareable status
Generate a read-only share link for any server. Clients or teammates can see health status and metrics without needing an account.
- Read-only access
- Revoke links anytime
- Password protection
All features, one platform.
Everything above is available from $9/month per server. No separate tools. No per-feature billing.
| Feature | Free | Basic | Pro |
|---|---|---|---|
| CPU, RAM, disk, network metrics | |||
| Live performance metrics | |||
| System info & uptime tracking | |||
| Basic alerts | |||
| Captain AI — last 24h only | |||
| Smart alerts with context & fix suggestions | – | ||
| Cron job execution history | – | ||
| SSL & certificate monitoring | – | ||
| Security monitoring AI-scored | – | ||
| File integrity monitoring | – | ||
| Software & stack monitoring | – | ||
| PageSpeed insights (mobile & desktop) | – | ||
| Captain AI — 30 days history | – | ||
| 30-day data retention | – | ||
| Report sharing & exports | – | ||
| Predictive system health monitoring | – | – | |
| Real-time performance tracking | – | – | |
| Full traffic intelligence & bot detection | – | – | |
| Advanced vulnerability monitoring | – | – | |
| AI automation & suggested actions | – | – | |
| Captain AI — 90 days history | – | – | |
| 90-day data retention | – | – | |
| Automated reports & scheduling | – | – | |
| White-label reporting | – | – | |
| API access & integrations | – | – |