Intermediate Site Reliability Engineer, Gitaly:Cluster
Remote
GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating the rate of human progress. This mission is integral to our culture, influencing how we hire, build products, and lead our industry. We make this possible at GitLab by running our operations on our product and staying aligned with our values. Learn more about Life at GitLab.
An overview of this role
The GitLab DevSecOps platform empowers 100,000+ organizations to deliver software faster and more efficiently. We are one of the world's largest all-remote companies with 2,000+ team members and values that foster a culture where people embrace the belief that everyone can contribute. Learn more about Life at GitLab.
SREs with Gitaly work alongside Backend Engineers, focusing primarily on improving the availability and reliability of the Gitaly fleet on GitLab.com. While the Backend Engineers approach their responsibilities from a software developer's point of view, the SREs approach the same problems from an operational perspective. The two groups collaborate closely to find an optimal solution and to ensure that new Gitaly features can run at scale and be deployed to production safely.
Gitaly is the Git data storage tier of GitLab, providing a reliable, secure and fast distributed Git data store over gRPC. For more information about Gitaly, see the team's Direction page.
Gitaly's high-availability storage requires developers who understand distributed storage systems and their management, observability and availability. The Cluster team contributes features, fixes bugs and improves the performance of this software stack.
Currently, we're building a new distributed cluster solution and improving our Disaster Recovery readiness.
What you'll do
- Work with peer SREs to maintain Gitaly's environments within GitLab's SaaS offerings, including cost and performance optimization, capacity planning, migrations and debugging production issues.
- Participate in architectural discussions and decisions surrounding Gitaly, within the greater GitLab ecosystem.
- Design RPC interfaces for the Gitaly service.
- Scope, estimate and describe tasks to reach the team's goals.
- Develop production automation and tooling for Gitaly, for use both in SaaS and self-managed installations.
- Help ensure that Gitaly development tooling, releases and other processes serve the team and the product's goals. …
Job Profile
Restrictions: Location-based eligibility requirements
Benefits/Perks: Collaborative culture, Equity, Fully remote, Fully remote work, Incentive pay, Opportunity for professional growth, Remote-first company
Tasks:
- Debug production issues
- Hiring
- Peer Review
Acceptance Testing, AI, AI-powered DevSecOps, Automation, Capacity planning, Cloud Technologies, Coding, Debugging, DevSecOps, Disaster Recovery, Equity, Git, GitLab, Go, gRPC, High Availability, Infrastructure, Infrastructure as Code, Kubernetes, Monitoring, Observability, Open Source, Peer review, Performance Optimization, Production environments, Recruiting, Ruby, SaaS, Sales, Security, Site Reliability Engineering, Systems, Talent Acquisition, Terraform, Tooling, Writing
Experience: 3 years