FreshRemote.Work

Senior Site Reliability Engineer

Remote (US & Canada)

Company Overview

Totango + Catalyst have joined forces to build a leading customer growth platform that helps businesses protect and grow their revenue. Built by an experienced team of industry leaders, our software integrates with all the tools CS teams already use to provide one centralized view of customer data. Our modern and intuitive dashboards help CS leaders develop impactful workflows and take the right actions to understand health, prevent churn, increase adoption, and drive expansion.

Position Overview

As a Senior Site Reliability Engineer at Totango + Catalyst, you will help shape our infrastructure and build the foundation our team relies on for the rapid delivery of our product. We’ll depend on you to instill best practices for building scalable distributed systems, emphasizing development experience, observability and fault tolerance. Our current stack consists of technologies such as Ruby on Rails, RDS, Elasticsearch, Java, and Kubernetes, and we are moving towards microservices and serverless. If you thrive in a growth-stage startup environment and are looking for more ownership and the ability to have a significant impact, we would love to meet you.

This role is opened to candidates working remotely anywhere in Canada and the U.S.

What You’ll Do

  • Manage our AWS infrastructure, with an emphasis on configuration as code.
  • Keep our site and our services up and running, or get it back up and running quickly when a failure occurs
  • Improve monitoring and work with developers to improve performance and reliability
  • Participate in technical design reviews and architecture planning
  • Debugging complex problems across an entire stack and creating solid solutions
  • Collaborate with product managers and developers to evolve our delivery pipeline
  • Working closely with internal partners and teams to ensure that we ship software that meets security, SLA, performance, and budget requirements
  • Help build our on-call policies and runbooks
  • Take ownership of projects and demonstrate a high level of accountability
  • Manage our data infrastructure and pipeline
  • Focus on quality, cost-effective scalability, and distributed system reliability and establish automated mechanisms

Who You Are:

  • You are passionate about learning. Obstacles and challenges don’t deter you, you find these as opportunities to learn and grow.
  • You have a positive demeanor and a go-getter attitude! 
  • You are a strong team player. You collaborate well with others, and want to work together …
This job isn't fresh anymore!
Search Fresh Jobs