Reliability Engineering

Workflow: How may we be of service? | GitLab.com Status: STATUS
Issue Trackers: Infrastructure: Milestones, OnCall | Production: Incidents, Changes, Deltas | Delivery
Slack Channels: #sre-lounge, #database | #alerts, #production | #g_delivery
Operations: Runbooks (please contribute!) | On-call: Handover Document, Reports

Mission

Reliability Engineering teams are the gatekeepers and primary caretakers of the operational environment hosting all of GitLab's user-facing services (most notably GitLab.com), focusing on their availability, performance and scalability through reliability considerations.

Vision

Reliability Engineering teams are composed of DBREs and SREs. As the role titles indicate, they have different areas of specialty but share the reliability of the environment as their unifying goal.

Reliability Engineering teams own the following operational processes:

The teams' overarching goal with respect to these processes is to make them obsolete through automation.

Key Metrics

Key metrics related to this group include:

Team

Each member of the Site Reliability Team is part of this vision:

Team Members

The following people are members of the Reliability Engineering Teams:

Person Role