Sr. Site Reliability Engineer
We are seeking a Senior Site Reliability Engineers in North America to join our team! As we continue to scale both our product and grow our team, we’re looking for experienced Site Reliability Engineers to help drive automation, performance and reliability in our cloud- based infrastructure. As part of this team, you’ll be responsible for the core elements of our AWS infrastructure and orchestration.
Using site reliability principles and a robust approach to observability, you will not only fix problems but solve the issues that contributed to them when things go wrong. You'll improve application reliability by using a software engineering approach to operations. You'll develop internal tools and systems for all engineering teams to leverage. You'll get to impact every engineering team in the organization and use a broad set of technologies. Maintaining excellent relationships and communicating effectively with teams will be key to success.
Building new features and services is a big part of this role. We are continually developing and implementing new ways to support our teams, understanding our customers needs, and becoming experts in site reliability.
When bad things happen, you'll have the support of your team to solve contributing causes, to learn from failures, and to build a robust and resilient system for our customers. We look for the solution that automates the problem away, not the one that requires manual effort.
If you’re interested in making a big impact and taking our infrastructure to the next level at a fast-growing and profitable startup, then read on.
We know applying for and taking on a new job at any company requires a leap of faith. We want you to feel comfortable and excited to apply at Zapier. To help share a bit more about life at Zapier, here are a few resources in addition to the job description that can give you an inside look at what life is like at Zapier. Hopefully, you'll take the leap of faith and apply.
- Our Commitment to Applicants
- Culture and Values at Zapier
- Zapier Guide to Remote Work
- Zapier Code of Conduct
- Diversity and Inclusivity at Zapier
Zapier is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce.
You’re an experienced technologist. You have at least 7 years of experience in the world of systems administration, systems engineering, or software development with at least 3 years of experience in Site Reliability Engineering, DevOps, or related field.
You know the cloud. You’ve designed and maintained highly available, cloud based infrastructures in AWS or another cloud offering. You understand how to leverage infrastructure as code tools, and have experience implementing best practices for reliability and observability. We use tools like Terraform, Kubernetes, Redis, Gitlab, and Datadog among others.
You can code. You have experience with a language like Python or Go to create automated tools. You believe in hands-off deployments and infrastructure as code. Well-honed experience with the fundamentals of software development goes a long way here.
You can solve complex systems challenges. You take ownership of complex challenges, understand how to improve performance, and help uncover opportunities for improvement. You’ve worked on problems where “just throw more hardware at it” isn’t enough for the system to scale.
You’re a great communicator. Not only do you know how to share your knowledge with the team and document things well so they can be consumed asynchronous (we do this a lot as a remote company), but you know how to communicate effectively with software and support teams.
You value our values. At Zapier, our values are at the heart of how we work together and how we think about our customers. In our remote setting, they help develop trust and ensure we work and collaborate together to democratize automation. You see how these values can empower meaningful work, you thrive in a collaborative setting, you are eager to continue growing and you’re excited to be part of the team.
Things We've Done Recently
- Develop new methods for retaining task history
- Migrating applications and services from EC2 to Kubernetes
- Write custom Kubernetes controllers to improve resilience
- Create deployment pipelines in ArgoCD
- Develop autoscaling strategies to handle bursts in workloads
- Implementing OPA to enforce policies across our Kubernetes Clusters
- Deploying ProxySQL for pooling connections against MySQL databases
How To Apply
We have a non-standard application process. To jump-start the process we ask a few questions we normally would ask at the start of an interview. This helps speed up the process and lets us get to know you a bit better right out of the gate.
After you apply, you are going to hear back from us, even if we don't seem like a good fit. In fact, throughout the process, we strive to make sure you never go more than seven days without hearing from us.
For the past nine years, Zapier has been helping people across the world automate the boring and tedious parts of their job. We do that by helping everyone connect the web applications they already use and love.
We believe that there are jobs a computer is best at doing and that there are jobs a human is best at doing. We want to empower businesses to create processes and systems that let computers do what they are best at doing and let humans do what they are best at doing.
We believe that with the right tools, you can have a big impact with less hassle.
We believe in small teams. Small teams are fast and nimble. Small teams mean less bureaucracy and less management and more getting things done.
We believe in a safe, welcoming, and inclusive environment. All teammates at Zapier agree to a code of conduct.
The Whole Package
Location: Remote in North America
Our team of 400+ is distributed (remote) because it lets us work with the best people. We have team members that live in the United Kingdom, Thailand, India, Nigeria, Taiwan, Guatemala, New Zealand, Australia, and more! While we can hire many roles in locations anywhere in the world, some require specific timezone fits.
- Competitive salary (we don't use remote as an excuse to pay less)
- Great healthcare + dental + vision coverage*
- Retirement plan with 4% company match*
- Profit sharing
- 2 annual company retreats to awesome places
- 14 weeks paid leave for new parents of biological or adopted children
- Pick your own equipment. We'll set you up with whatever Apple laptop + monitor combo you want plus any software you need.
- Unlimited vacation policy. Plus we require you to take at least 2 weeks off each year. We see most employees take 4-5 weeks off per year. This isn't a vague policy where unlimited vacation means no vacation.
- Work with awesome companies around the world. We partner with great software companies all over the world and you'll constantly get to interact with people from these great companies
*While we take care of our international folks as best we can, currently, healthcare and retirement plans are only available to UK, Canada, and US-based employees.
Optional: Share anonymously some demographic information about yourself to help us better track trends related to the backgrounds of candidates interested in working at Zapier in order for us to build a team that represents the users at Zapier and the broader world population.
Zapier is an equal opportunity employer. We're excited to work with talented and empathetic people no matter their race, color, gender, sexual orientation, religion, national origin, physical or mental disability, or age. Our code of conduct provides a beacon for the kind of company we strive to be, and we celebrate our differences because those differences are what allow us to make a product that serves a global user base.