This group will be focused on enabling data teams to build, test, and deploy their machine learning models. This will be net new functionality within GitLab and will bridge the gap between DataOps teams and ML/AI within production. In addition, MLOps will provide tuning, testing, and deployment of machine learning models, including version control and partial rollout and rollback.
Select algorithms based on data shape and analytics, with support for parameter tuning, feature selection, and data selection. This includes git-focused functionality for hosting machine learning models. Examples include JupyterHub and Anaconda, as well as DVC, Dolt, and Delta Lake.
Provide data teams with the tools to take large raw data sets, clean / morph / wrangle the data, and import the sanitized data into their models to prepare for deployment. Examples of this include Trifacta, TensorFlow Serving, UbiOps, and SageMaker.
Verify that everything works as expected and is ready for deployment. Examples of this include pytest, PyTorch, Keras, and scikit-learn.
Enable data teams to deploy their data models including partial rollout, partial rollback, and versioning of training data. Examples of this include Kubeflow and CML.
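As a rough illustration of what partial rollout and rollback of a model could look like, the sketch below routes a fixed percentage of requests to a candidate model version and the rest to the stable one. The function names, version labels, and the hash-based bucketing scheme are assumptions made for illustration only; they do not describe an existing GitLab feature or API.

```python
import hashlib


def rollout_bucket(request_id: str) -> int:
    """Map a request ID to a stable bucket in the range 0..99 (hypothetical helper)."""
    digest = hashlib.sha256(request_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % 100


def choose_model_version(request_id: str, stable: str, candidate: str,
                         candidate_percent: int) -> str:
    """Pick the model version that should serve this request.

    candidate_percent is the share of traffic (0-100) sent to the candidate.
    Setting it to 0 acts as a full rollback; 100 is a full rollout.
    """
    if not 0 <= candidate_percent <= 100:
        raise ValueError("candidate_percent must be between 0 and 100")
    if rollout_bucket(request_id) < candidate_percent:
        return candidate
    return stable


# Example: send roughly 10% of requests to a new model version.
version = choose_model_version("request-123", "model-v1", "model-v2", 10)
```

Because the bucket is derived from the request ID rather than from a random draw, the same request is always routed to the same version, which makes a partial rollout easier to reason about and to reverse.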
Our MLOps work could also start by enabling partners to integrate properly with GitLab. Potential partners include Domino Data Lab, Determined AI, and Maiot.
We’re currently hiring for this role and looking for someone who understands the market, the opportunities, and the complexities to help design and develop our entry into MLOps. You’ll need experience in bringing products to market, experience with machine learning tools, and experience developing and operating large-scale services. You should know the major competitors and partners in this space, and be able to architect capabilities around hosting machine learning models, training models, testing and certification, and finally enabling data teams to deploy with partial rollout, rollback, and versioning. Our tech stack is Ruby, Go, and Vue.js, and you’ll need to work across backend, frontend, database, and infrastructure to bring this opportunity to market.
Experiment: A machine learning experiment is a set of variables that are varied across multiple runs in order to observe, and learn from, the effects of changes.
MLOps is the process of taking an experimental machine learning model into a production web system.
ModelOps is a superset of ML/AI functionality including MLOps and DataOps. More information here
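To make the definition of an experiment above more concrete, here is a minimal sketch that varies a set of variables across multiple runs and records the outcome of each run. The `run_experiment` helper, the variable names, and the `toy_score` function are hypothetical; `toy_score` is a placeholder for real model training and evaluation.

```python
from itertools import product


def run_experiment(variables, train_and_score):
    """Run one trial per combination of variable values and record each result."""
    names = list(variables)
    runs = []
    for values in product(*(variables[name] for name in names)):
        params = dict(zip(names, values))
        runs.append({"params": params, "score": train_and_score(**params)})
    return runs


def toy_score(learning_rate, depth):
    # Placeholder for real training: the score peaks at learning_rate=0.1, depth=3.
    return -(learning_rate - 0.1) ** 2 - (depth - 3) ** 2


runs = run_experiment(
    {"learning_rate": [0.01, 0.1, 1.0], "depth": [2, 3, 4]},
    toy_score,
)

# Compare runs to learn which combination of variables performed best.
best = max(runs, key=lambda run: run["score"])
```

Each entry in `runs` keeps the variable values together with the resulting score, so the effect of a change can be traced back to the run that produced it.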