Staff SRE - Cloud Data Store

United States - Remote Opportunity

Who We Are

Temporal is an open source programming model that can simplify code, make applications more reliable, and help developers focus on the important things like delivering features faster. Our amazing user community provides us with real-time feedback and contributions to improve the developer experience everywhere.

Backed by top VC firms, we have built a team of professionals from various successful start-ups and well-known technology companies. Together, we collaborate cross-functionally, engage with and support our users to improve the developer experience, deliver world-class open-source software, and enhance our commercial SaaS platform.

Temporal is growing and working towards building the best community. We are looking for those who share our values, challenge 'standard' thinking, and want to influence our future. If you have a passion for helping developer communities, open-source software, and want to be a part of our amazing team, we'd love to hear from you!

Summary

We have an opening for a Staff SRE (Service Reliability Engineer) on our Cloud Data Store (CDS) team, which is innovating in cloud storage and rethinking the persistence layer of Temporal in a customized way. We’re solving hard distributed systems problems related to databases and the way storage works for the cloud. The charter of the CDS team is to increase the reliability and scalability, and reduce the COGS, of running Cloud Temporal. See what the CDS team has been working on recently.

What You'll Do
  • Be the first Service Reliability Engineer in the CDS team.
  • Design, develop, and implement systems to enhance the operational efficiency and effectiveness of the service and, in turn, of the CDS team.
  • Work within a highly collaborative team – and across team boundaries – to ensure exceptional service reliability during a period of hyper-growth and expansion.
  • Implement operational best practices, such as alerting and runbooks, to efficiently manage and maintain a high-scale distributed database system.
  • Drive the team towards achieving a high degree of automation.
What You'll Bring
  • Experience contributing to complex cross-team engineering efforts focused on cloud, compute, networking, and storage infrastructure.
  • At least 10 years of coding experience (Go, Java, or another applicable language) and experience writing concurrent code.
  • Deep experience in at least one cloud infrastructure environment (AWS, GCP, or Azure) and familiarity with adjacencies.
  • 6+ years of industry experience designing, building, and operating large, highly concurrent, reliable, and scalable distributed systems.
  • Excellent collaboration and communication skills - a strong sense of ownership and integrity demonstrated through clear …