Site Reliability Engineer, Cloud

Remote - United States

At Yugabyte, we are on a mission to become the default transactional database for the cloud. We are well underway on this journey with YugabyteDB, the open source, high-performance, distributed SQL database that runs on any cloud and enables developers to get instantly productive using well-known APIs. We are looking for talented and driven people to join us on our ambitious mission and help us build a lasting and impactful company.

We announced a $188M Series C round at a $1.3B valuation in October 2021, but we believe we are still in the early stages of our company’s journey. The transactional database market is estimated to grow from $40B in 2021 to $64B by 2025. Because our database is cloud-native by design, offers on-demand horizontal scalability, and supports geographical distribution of data through built-in replication, we are well-positioned to address the market need for geo-distributed, high-scale, high-performance workloads.

The Role

As a Site Reliability Engineer focused on database availability and reliability, you will operate and automate the life cycle of the YugabyteDB DBaaS. You will design and build processes that spin up systems and the infrastructure that manages the databases, using secure, reliable, scalable, and highly observable methodologies. You will use, operate, and configure Kubernetes environments (GKE, EKS, AKS), Java frameworks, shell scripts, Python scripts, Terraform templates, and many other cloud technologies. You will participate in the on-call rotation (12 hours a day for 7 days, every 4-5 weeks) and manage incidents on the DBaaS infrastructure, coordinating support for our customers. You will learn how to diagnose problems with our database and infrastructure technology and help deliver reliable service to our customers.

We are looking for strong engineers who exemplify collaboration, teamwork, and empathy, and who like to lead by example. We enjoy working with people who are driven, who thrive in a fast-paced startup environment, and who have a strong desire to build an internet-scale, extensible control plane with a strong emphasis on simplicity and user experience.

Responsibilities

  • Design, develop, test, debug, troubleshoot, and maintain components of the DBaaS cloud infrastructure
  • Manage operational priorities of the DBaaS infrastructure
  • Establish processes for handling incidents on databases or infrastructure and for leading the response to them
  • Automate and manage regular maintenance operations such as upgrades
  • Design and build DBaaS processes for encryption, security key/password management, storage management, etc.
  • Utilize SRE golden signals to analyze …