How Fedora Copr is monitored¶
Nagios (with hypervisors’ checks) and Nagios external are the main monitoring services for Fedora Copr. That’s by fact that the Fedora Infrastructure team uses Nagios, and we tightly cooperate with that team (they help us with the ping&power support).
But we also — as a secondary solution — use the Prometheus monitoring (Graphana UI, visible internally within Red Hat).
We do the normal availability checks, storage space consumption checks, etc. but
we also have the functionality “black-box” copr-ping test that
periodically submits a simple build testing the whole Copr stack. This test is
triggered from the copr-backend
server.
On weekly basis, we analyze storage consumption to gather usage statistics.
On top of that all, we also use UptimeRobot to geographically check that our
CDN (implemented with AWS CloudFront) for copr-backend
works everywhere.
The following image illustrates the monitoring work-flow: