Staff SRE - Cloud Data Store
United States - Remote Opportunity
Hey, this job isn't fresh anymore! 👉 Find fresh remote jobs here
Who We Are Temporal is an open source programming model that can simplify code, make applications more reliable, and help developers focus on the important things like delivering features faster. Our amazing user community provides us with real-time feedback and contributions to improve the developer experience everywhere. Backed by top VC firms, we have built a team of professionals from various successful start-ups and well-known technology companies. Together, we collaborate cross-functionally, engage with and support our users to improve the developer experience, deliver world-class open-source software, and enhance our commercial SaaS platform. Temporal is growing and working towards building the best community. We are looking for those who share our values, challenge 'standard' thinking, and want to influence our future. If you have a passion for helping developer communities, open-source software, and want to be a part of our amazing team, we'd love to hear from you!
Summary
We have an opening for a Staff SRE (Service Reliability Engineer) in our Cloud Data Store team which is innovating cloud storage and rethinking the persistence layer of Temporal in a customized way. We’re solving hard distributed systems problems related to databases and the way storage works for the cloud. The charter of CDS team is to increase the reliability, and scalability and reduce COGS of running Cloud Temporal. See what the CDS team has been working on recently. What You'll Do- Be the first Service Reliability Engineer in the CDS team.
- Design, develop, and implement systems to enhance the operational efficiency and effectiveness of the Service and subsequently the CDS team.
- Work within a highly collaborative team – and across team boundaries – to ensure exceptional service reliability during a period of hyper-growth and expansion.
- Implement operational best practices, such as alerting and runbooks, to efficiently manage and maintain a high-scale distributed database system.
- Drive the team towards achieving a high degree of automation.
- Experience contributing to complex cross-team engineering efforts focused on cloud, compute, networking and storage infrastructure
- At least 10 years coding experience (Go, Java, or other applicable language) and experience writing concurrent code.
- Deep experience in at least one or more cloud infrastructure environments (AWS, GCP, or Azure) and familiarity with adjacencies.
- 6+ years of industry experience designing, building, and operating large, highly concurrent, reliable, and scalable distributed systems.
- Excellent collaboration and communication skills - a strong sense of ownership and integrity demonstrated through clear …
This job isn't fresh anymore!
Search Fresh JobsJob Profile
Regions
Countries
Remote Opportunity
Benefits/Perks12 Holidays + 2 Floating Holidays Additional perks for Learning & Development Equity plan Internet Stipend Unlimited PTO WFH Meals
Tasks- Drive automation
AWS Azure Cloud Cloud Infrastructure Concurrent Code Distributed Systems GCP Go Java Networking
Experience10 years
EducationB.S. in Computer Science MS in Computer Science
TimezonesAmerica/Anchorage America/Chicago America/Denver America/Los_Angeles America/New_York Pacific/Honolulu UTC-10 UTC-5 UTC-6 UTC-7 UTC-8 UTC-9