You are here: Team Handbook Infrastructure Monitoring Performance On this page Other Related Pages Monitoring
We do monitoring with Prometheus, leveraging available exporters like the node or the postgresql exporters, and we build whatever else is necessary within production engineering itself. The production team maintains the infrastructure for 2 monitoring interfaces, but almost anyone can add new monitoring. The two interfaces are:
No authentication is required Automatically syncs from the private monitoring infrastructure on every chef client execution. Don't change dashboards here, they will be overwritten. Refer to this interface by default; only use the private one for those cases where the public dashboard is not available. Highly Available setup Alerting feeds from this setup Toggle "public" to have the dashboard appear on the Public monitoring infrastructure. Default to making public, unless you can specify a good reason not to. Private GitLab account is required to access Separated from the public for security and availability reasons, they should have exactly the same graphs after we deprecate InfluxDB Useful System Health Dashboards Blackbox Monitoring GitLab Web Status: front end perspective of GitLab. Useful to understand how GitLab.com looks from the user perspective. Use this graph to quickly troubleshoot what part of GitLab is slow. GitLab Git Status: front end perspective of GitLab ssh access. Public Whitebox Monitoring Fleet overview: useful to see the fleet status from the inside of GitLab.com. Use this graph to quickly see if the workers or the database are under heavy load, and to check load balancer bandwidth. Postgres Stats: useful to understand how is the database behaving in depth. Use this graph to review if we have spikes of exclusive locks, active or idle in transaction processes Postgres Queries use this dashboard to understand if we have blocked or slow queries, dead tuples, etc. Storage Stats use this dashboard to understand storage use and performance. Private Whitebox Monitor Host Stats: useful to dive deep into a specific host to understand what is going on with it. Select a host from the dropdown on the top. Business Stats: shows many pushes, new repos and CI builds. Daily overview: shows endpoints with amount of calls and performance metrics. Useful to understand what is slow generally. GitLab Profiler
GitLab profiler data is a dashboard with links to request profiles and SQL queries run when loading pages on GitLab.com.
To add a page to this dashboard, create a merge request to the