
Data Engineer (Product/Engineering)

Department: Business Operations

Remote

This role requires an analytical, business-oriented mindset and the ability to implement rigorous database solutions and best practices, in order to produce high-quality data insights and drive their adoption in business decisions across all areas of GitLab. Data Engineers are essentially software engineers with a particular focus on data movement and orchestration.

Responsibilities

  • Collaborate with other functions across the company by building reports and dashboards with useful analyses and strong data insights
  • Explain trends across data sources, potential opportunities for growth or improvement, and data caveats for descriptive, diagnostic, predictive (including forecasting), and prescriptive data analysis
  • Develop a deep understanding of how data is created and transformed through GitLab products and third-party services, in order to help drive product design and service usage and to flag impacts on data reporting capabilities
  • Understand and document the full lifecycle of data and our common data framework so that all data can be integrated, modeled for easy analysis, and analyzed for data insights
  • Document every action in either issue/MR templates, the handbook, or READMEs so your learnings turn into repeatable actions and then into automation following the GitLab tradition of handbook first!
  • Expand our database with clean data (ready for analysis) by implementing data quality tests while continuously reviewing, optimizing, and refactoring existing data models
  • Craft code that meets our internal standards for style, maintainability, and best practices for a high-scale database environment. Maintain and advocate for these standards through code review
  • Provide data modeling expertise to all GitLab teams through code reviews, pairing, and training to help deliver optimal, DRY, and scalable database designs and queries in Snowflake and in Periscope
  • Approve data model changes as a Data Team Reviewer and code owner for specific database and data model schemas
  • Own the end-to-end process of on-call data triage: reading Airflow logs, diagnosing the data issue, and verifying and implementing a solution with an automated alerting system (ChatOps, etc.), as well as providing data support for all GitLab members
  • Contribute to and implement data warehouse and data modeling best practices, keeping reliability, performance, scalability, security, automation, and version control in mind
  • Follow and improve our processes and workflows for maintaining high quality data and reporting while implementing the DataOps philosophy in everything you do

  • This position reports to the Manager, Data

Requirements

  • 2+ years experience in an analytics role
  • Experience building reports and dashboards in a data visualization tool
  • Passionate about data, analytics and automation. Experience cleaning and modeling large quantities of raw, disorganized data (we use dbt)
  • Experience with a variety of data sources. Our data includes Salesforce, Zuora, Zendesk, Marketo, NetSuite, Snowplow and many others (see the data team page)
  • Demonstrated capacity to clearly and concisely communicate complex business logic, technical requirements, and design recommendations through iterative solutions
  • Deep understanding of SQL in analytical data warehouses (we use Snowflake SQL) and in business intelligence tools (we use Periscope)
  • Hands-on experience working with SQL, Python, API calls, and JSON to generate business insights and drive better organizational decision-making
  • Familiarity with Git and the command line
  • Deep understanding of relational and non-relational databases, SQL and query optimization techniques, and demonstrated ability to both diagnose and prevent performance problems
  • Effective communication and collaboration skills, including clear status updates
  • Positive and solution-oriented mindset
  • Comfort working in a highly agile, intensely iterative environment
  • Self-motivated and self-managing, with strong organizational skills
  • Ability to thrive in a fully remote organization
  • Share and work in accordance with our values
  • Successful completion of a background check

Product

  • Support the Product function by spearheading tracking and reporting initiatives
  • Focus on product usage metrics across SaaS and self-managed products
  • Build cross-functional analyses to drive strategic decision-making
  • Priorities will be set by a Director of Product, but you will collaborate with and report into the Data Team

 

Compensation

To view the full job description and its compensation calculator, view our handbook. The compensation calculator can be found towards the bottom of the page.

 

Additional details about our process can be found on our hiring page.
