Data Team Programs

Data Programs.

Introduction

Welcome to the Data Programs page. Here you’ll find information about the various Data Programs around GitLab and those the Data Team supports, ranging from onboarding to day-to-day operations.

Show-n-Tell and Demos

Data & Analytics Demos are a great way for everyone involved in the Data Program to share progress, innovation, collaborate, and just have fun. Data & Analytics Demos are held every Thursday and recordings are posted to the GitLab Unfiltered Data Team playlist.

Data Science AMAs

The Data Science Team regularly holds AMAs to help spread awareness of Data Science and initiatives. Check out the AMA with GitLab Data Scientists Agenda to learn more.

Data Onboarding

If you are onboarding to GitLab and will be working in the Data Program as an Engineer, Analyst, or Developer, follow these steps:

  1. Open a new issue in GitLab Data Analytics with the Data Onboarding template.
  2. Give the issue a descriptive name: Your Name - Data Onboarding
  3. Assign the issue to your Manager to add/remove relevant content

Data Proof of Value Guide

The Data Team performs Proof of Value Evaluations (PoVs) for all new technologies we are considering adding to the Data Platform or the broader Technology stack. This Guide is intended to help you perform a PoV efficiently and with great results.

Phase 1: Calculate Value and Define Requirements

  1. Establish the Value the technology can provide GitLab. Value can be measured in a variety of ways, ranging from efficiency to increased Sales to reduced compute.
  2. Create a Requirements document to define the business and technical requirements the technology must meet to be successful. Indicate whether each requirement is Must Have or Nice to Have. Here is a template we have used for Data Visualization PoV and another we have used for Product Analytics PoV.

Phase 2: Scoping & Policy Awareness

  1. Review the Procurement New Software Guide to ensure you understand the latest procurement process to follow.
  2. Execute an NDA with each Vendor included in the Evaluation.
  3. Obtain preliminary pricing to help validate established budget. If no existing budget is established, work with the Department lead to determine if the project is feasible. Let’s not waste time or energy for projects we can’t fund.

Phase 3: Evaluation Design

Evaluation Design is the most complex part of the PoV.

  1. Decide how to test the technology versus defined requirements. Often, successfully testing data technologies requires simulating production workloads and constructing a ‘Production Level SAFE Workload’ is a key challenge in a Data PoV Project.

Phase 4: Procurement

  1. Using the Requirements design as a guide, collaborate with the Vendor to create a Statement of Work (SoW).
    • The Statement of Work should include Success Criteria, Expectations, and a Project Timeline
    • We do not pay for PoVs and all should be $0 Cost
  2. Along with the SoW, ask the vendor to send you their Master Services Agreement (MSA).
  3. Because request with an amount of $0 is not supported in Coupa, you need to submit the SoW and MSA to Procurement via GitLab in the Procurement project.

Phase 5: Assessment

  1. Create a shared Slack Channel to coordinate the PoV with the Vendor.
  2. Reach out to the vendor for references to schedule a reference Calculate. In a reference call you can:
    • ask about the experience with the technology.
    • ask about their lessons learned.
    • ask how we can setup for success.

Phase 6: Wrap-Up

To Be Defined

Program Name Purpose
Data Catalog Catalog of dashboards, data sets, and analytics projects
Data for Product Managers Information to help Product Managers
Data for Product Analysis Information to help Product Analysts
Analytics Instrumentation Group Information covering the Analytics Instrumentation team
Data for Marketing Analysts Information to help Marketing Analysts
Data for Sales Analysts Information to help Sales Analysts
Data Triage Daily process to ensure the data platform remains available for analytics.

Data For Product Managers
This page is intended to help Product Managers at GitLab understand what data is available to them and how they can use it to understand how their product is used. This page primarily covers two topics: how to consume data, and what data is available. How to Consume Data at GitLab The user-facing end of GitLab’s data stack is comprised of our BI Tool, Tableau which is connected to our Snowflake data warehouse.