Reliability Engineering

Workflow | How may we be of service?; GitLab.com Status; STATUS
Issue Trackers | Infrastructure: Milestones, OnCall; Production: Incidents, Changes, Deltas; Delivery
Slack Channels | #sre-lounge, #database, #alerts, #production, #g_delivery
Operations | Runbooks (please contribute!); On-call: Handover Document, Reports

Mission

Reliability Engineering teams are the gatekeepers and primary caretakers of the operational environment hosting all of GitLab's user-facing services (most notably GitLab.com). They focus on the availability, performance, and scalability of those services.

Vision

Reliability Engineering teams are composed of Database Reliability Engineers (DBREs) and Site Reliability Engineers (SREs). As the role titles indicate, they have different areas of specialty, but the reliability of the environment is their unifying goal.

Reliability Engineering teams own the following operational processes:

The teams' overarching goal with respect to these processes is to make them obsolete through automation.

Key Metrics

Key metrics related to this group include:

Team

Each member of the Site Reliability Team is part of this vision:

Team Members

The following people are members of the Reliability Engineering Teams:

Person | Role
Anthony Sandoval | Engineering Manager, Reliability Engineering (AS)
Andreas Brandl | Senior Database Reliability Engineer, Enablement
Henri Philipps | Senior Site Reliability Engineer, Verify & Release
Craig Barrett | Senior Site Reliability Engineer, Configure & Plan

Person | Role
Dave Smith | Engineering Manager, Reliability Engineering (DS)
John Northrup | Site Reliability Engineer, Verify & Release
Alejandro Rodríguez | Site Reliability Engineer, Geo
Devin Sylva | Senior Site Reliability Engineer, Manage
John T Skarbek | Senior Site Reliability Engineer, Secure
Cameron S McFarland | Senior Site Reliability Engineer
Casey Allen Shobe | Senior Database Reliability Engineer, Ops

Person | Role
Jose Cores Finotto | Engineering Manager, Reliability Engineering (JF)
Ahmad Sherif | Site Reliability Engineer, Monitor
Amarbayar Amarsanaa | Senior Site Reliability Engineer, Distribution & Package
Yun Guo | Senior Database Reliability Engineer, Dev
Hendrik Meyer | Site Reliability Engineer, Create