FreshRemote.Work

Senior Site Reliability Engineer

Remote

About the Team:

Netlify’s SRE team is on a mission to scale Netlify’s infrastructure to support our next million users. We focus on ensuring application resiliency and delivering a robust compute and network platform at scale. As a Site Reliability Engineer within the Infrastructure SRE team, you’ll play a key role in designing, developing, and delivering solutions that enhance the scalability, availability, and efficiency of our platform. Whether you're an experienced systems engineer or a software developer eager to dive into systems, we’re excited to hear from you!

We are a remote-first, globally distributed group, emphasizing asynchronous communication to prioritize execution over meetings. Documentation is at the core of how we operate, and we are deeply committed to our values of transparency, empowerment, and collective ownership. We foster a culture of curiosity, collaboration, and support, where there are no dumb questions. Everyone on our team knows their contributions matter, feels empowered in their role, and can quickly see the impact of their efforts. At Netlify, we believe diversity is key to innovation, and as a remote-first company, we value the unique perspectives that come from our team’s varied backgrounds. Whether you’re launching a new career or growing an established one, Netlify is a place where you can do impactful work while maintaining a healthy work-life balance

What You’ll Do: 

  • Manage full infrastructure lifecycle from design to decommission, ensuring systems are reliable and efficient.
  • Participate in an on-call rotation for the compute platform and related systems.
  • Automate routine tasks and develop tools to improve system efficiency and reduce the human intervention time on any tasks.
  • Conduct system performance tuning and troubleshooting, as well as capacity planning, to ensure system reliability and efficiency.
  • Participate in the creation and testing of disaster recovery plans.
  • Monitor and maintain observability systems to ensure issues are identified and resolved proactively.
  • Educate team members on security best practices and emerging threats.

What You'll Bring: 

  • Several years of experience in SRE, DevOps, or related roles.
  • Proven experience working in hyperscale cloud environments.
  • Demonstrated ability to lead infrastructure projects.
  • Strong understanding of network protocols and configurations.
  • Experience with automation tools (e.g., Ansible, Terraform) and scripting languages (e.g., Python, Bash, Golang).
  • Experience automating component deployment across multiple environments using tools like Jenkins, CircleCI, or GitHub Actions.
  • Proficient observability and log analysis techniques to detect and resolve system issues.
  • Effective communication skills for both technical and non-technical stakeholders.
  • Familiarity with compliance requirements and frameworks: PCI, ISO 2701, HIPAA, SOC

Applying

Not sure you meet 100% of our qualifications? Please apply anyway!

When applying please include: 

A resume or short listing of your job history & skills (link to a LinkedIn profile would be fine). We appreciate a cover letter explaining why you would enjoy working in this role at Netlify to get to know you a bit better, though this is not required and will not impact your application. Our mission is to “build a better web” and that cannot be done without a diversity of skill sets, backgrounds and thoughts. 

Of everything we've ever built at Netlify, we are most proud of our team. Netlify is an Equal Opportunity Employer. We are devoted to building a team of people with diverse backgrounds and lifestyles. Driving equality empowers our team, enables us to innovate, and helps us maintain a more inclusive environment. We don’t discriminate against employees or applicants based on gender identity or expression, sexual orientation, religion, age, race, military/veteran status, citizenship, pregnancy status, or any other differences. If we can do anything to provide a better interview, i.e. accommodate a disability, then please let us know by emailing accommodations@netlify.com

About Netlify

At Netlify, we’re on a mission to build a better web by making it easier than ever to build, deploy, and scale web applications. By unifying an entire ecosystem of web development tools, content sources, services, and APIs into one simplified workflow, Netlify empowers top brands to ship campaigns faster, reduce risk, and boost productivity and revenue. At the forefront of the composable web movement, with over 4 million web developers and businesses using the platform, with Netlify, you can connect everything and build anything. 

We are a Series D company that has raised over $200M from investors such as Andreessen Horowitz, Kleiner Perkins, EQT, Bessemer, BOND, and Menlo Ventures. As a fully distributed company, we aim to create a company culture where the best idea can come from anywhere and strive to be thoughtful, compassionate, and collaborative in our work. If this sounds like something you’d like to be part of, we’re excited to connect with you!

At Netlify, we are committed to a compensation philosophy that prioritizes fairness and equity, positions our employee compensation competitively in the market, recognizes and rewards performance, and takes a comprehensive approach to our rewards package. We anchor our compensation philosophy on a market-based approach, therefore salary ranges may differ depending on the labor cost in a particular location. The salary provided is in addition to robust benefits and participation in Netlify’s equity plan. Our base compensation for this role is targeted at $136,000 - $184,000  for most US-based locations. Candidates outside the US or in premium markets should consult with their Talent Acquisition partner regarding location-based ranges, as they may be higher or lower than the average US range listed. The starting pay will be determined based on multiple factors, including expertise and skills, market demands, experience, internal equity, and applicable geographic location. These compensation packages and ranges are subject to change and may be modified in the future.

Apply

Job Profile

Restrictions

Remote

Benefits/Perks

Diversity and Inclusion Equal opportunity employer Remote-first company Remote-first culture Work-life balance

Tasks
  • Automate routine tasks
  • Conduct performance tuning
  • Create disaster recovery plans
  • Educate on security best practices
  • Manage infrastructure lifecycle
  • Monitor observability systems
  • Participate in on-call rotation
Skills

Ansible Bash Capacity planning CircleCI Cloud environments Collaboration Communication DevOps Disaster Recovery GitHub GitHub Actions Golang Jenkins Log Analysis Network protocols Observability Python Security Best Practices Site Reliability Engineering Support Terraform Web development

Experience

5 years

Certifications

HIPAA ISO 2701 PCI SOC