Senior Site Reliability Engineer (2nd Shift)
Remote US
An inclusive work environment is an empowering one. At Cutover, we lead with empathy and enable others to succeed through curiosity, kindness, and self-expression.
Location: Remote, United States
2nd Shift: 2:00pm -11:00pm PST (10:00 PM - 7:00 AM UTC)
Choose your preferred work setup – whether it's fully remote or a hybrid model, with the flexibility to work from home or the office.Cutover’s Collaborative Automation SaaS platform enables enterprises to simplify complexity, streamline work, and increase visibility. Cutover’s automated runbooks connect teams, technology, and systems, increasing efficiency and reducing risk in IT disaster and cyber recovery, cloud migration, release management, and technology implementation. Cutover is trusted by world-leading institutions, including the three largest US banks and three of the world’s five largest investment banks.
What tech stack do we use here at Cutover?
As a Senior Reliability Engineer at Cutover, you'll be an integral part of our team, ensuring the robustness and performance of our Cutover Enterprise platform. The platform is crafted, featuring a ReactJS frontend with a Ruby on Rails API, and all hosted on the reliable infrastructure of Amazon Web Services (AWS).
Your role will involve close collaboration with our support and engineering teams. Together, we actively engage in maintaining and optimizing the platform's reliability, utilizing cutting-edge tools and occasionally incorporating the flexibility of Ruby scripting.
If you're passionate about ensuring the dependability and efficiency of complex systems and thrive in an environment where technologies like React, Ruby, AWS, Kubernetes, Terraform, Git, and Ansible are at the forefront, we invite you to join our team. Together, let's elevate the reliability of our Cutover Enterprise platform to new heights.
As a founding member of our reliability team you will use your experience to turbo charge the reliability of our product supporting our clients 24/7
As a Senior Reliability Engineer, here's what you'll be up to:
- Monitoring and Alerting: Help implement and maintain robust monitoring solutions using Datadog to proactively identify and address potential issues before they impact users.
Incident Response: Collaborate with cross-functional teams to enact incident response plans, participate in incident resolution, and conduct post-incident reviews to prevent future occurrences.
Automation: Develop and implement automation scripts and internal tools to streamline operational tasks and enhance system reliability.
Performance Analysis: Conduct performance analysis and identify areas for improvement in the React and Ruby components of our applications.
Documentation: …
This job isn't fresh anymore!
Search Fresh JobsJob Profile
20 days of PTO Collaborative environment Employee Referral Scheme Empowering culture Inclusive work environment Private health insurance Share Options
Tasks- Automation
- Documentation
- Incident response
- Performance analysis
Ansible Automation AWS Collaboration Datadog Documentation Git Incident Response Kubernetes Performance analysis React ReactJS Ruby Ruby on Rails System reliability Terraform
TimezonesAmerica/Anchorage America/Chicago America/Denver America/Los_Angeles America/New_York Pacific/Honolulu UTC-10 UTC-5 UTC-6 UTC-7 UTC-8 UTC-9