FreshRemote.Work

Senior Site Reliability Engineer - APAC

APAC Remote

Customer.io is looking for a collaborative Site Reliability Engineer (SRE) who loves solving interesting puzzles and is excited to help us build out a scalable, reliable platform that our customers love. The successful candidate should also be able work independently when needed and be able to lead other SREs.

Our small, remote-first team is fun-loving, passionate and highly effective. We offer a lot of opportunity to grow and refine your skills while having a significant impact. The other members of our site reliability team have backgrounds in a wide variety of industries, from Telecom to Gaming, with 10 - 20 years of experience in DevOps. We believe that embracing our unique perspectives and expertise is key to our success as a collaborative team.

We are seeking product-minded, empowered individuals who work collaboratively with their peers on interesting problems, and get those solutions into the hands of customers quickly. We value diversity, attracting the best people in the world to serve as colleagues. Our flexibility and freedom to work from anywhere in the world enables you to craft a work environment in which you can do your best work.

As a Site Reliability Engineer you will:

  • Design, build, and maintain core infrastructure pieces that allow Customer.io scaling to support real-time processing and delivery of billions of messages
  • Plan the growth of Customer.io’s infrastructure
  • Automate the deployment process to make it as boring as possible
  • Be on our on-call rotation to respond to Customer.io availability incidents and provide support for technical support engineers with customer incidents
  • Ensure we have adequate observability of our infrastructure and applications.
  • Debug production issues across services and levels of the stack.
  • Take an active role in a friendly and supportive team that encourages you and the entire company to grow as individuals, professionals, and teams
  • Learn, practice, and share with your coworkers through code review, pair programming, team collaboration, and training to help improve our collective knowledge and best practices together

What we're looking for:

  • Preferably 7+ years of experience as a site reliability engineer and/or software engineer
  • Experience in managing and working with RDB systems (MySQL)
  • A solid understanding of problems of scalability and experience deploying and managing distributed applications on cloud infrastructure
  • Proven experience building cloud infrastructure via code using Terraform and automating operational toil
This job isn't fresh anymore!
Search Fresh Jobs