Site Reliability Engineer III

United States

We’re Ookla, the company behind Speedtest, Downdetector, Ekahau and an ever-expanding suite of enterprise connectivity solutions. Our team is a group of people brought together through passion and inspired by possibility. We are looking for team members who enjoy solving complex problems, are motivated to challenge themselves, and are delighted with turning clever ideas into unique products.

Millions of consumer-initiated tests are performed through our products and billions of data points are collected globally every day. With all this measurement comes countless ways to dissect and analyze our data, and plenty of exciting opportunities to optimize our broader strategy. When you work for us, you are using Ookla data and insights to further our mission to improve connectivity for all.

We are looking for a highly capable engineer to join our Platform and Site Reliability engineering team. You will be responsible for building, maintaining and operating the infrastructure platform on which all Ookla services are built. In this role, you will build, maintain, and support a massive-scale dynamic infrastructure that is relied on by hundreds of millions of users around the world. You will obsess over systems performance, scalability, reliability, observability, and security. Most importantly, you will help deliver critical application functionality and help make the internet experience better for our users to help us achieve our goal of better connectivity for all.

We are committed to providing you a flexible work environment where individuality, fun, and talent are all valued equally. If you consider yourself innovative, adept at collaboration, and you care deeply about the work you do, we want to talk!

Expectations for Success
  • Maintaining a distributed, global ecosystem of thousands of cloud instances, containerized workflows, serverless applications, Linux servers, and associated infrastructure supporting billions of requests daily.
  • Maintaining transactional database infrastructure using MySQL, PostgreSQL, and managed services such as RDS/Aurora.
  • Supporting the use of NoSQL data storage engines such as DynamoDB and MongoDB.
  • Building and supporting data stream processing with Kinesis or Kafka.
  • Supporting data engineering and big data toolchains such as Spark.
  • Supporting production systems in a 24x7x365 environment, including on-call responsibilities.
  • Providing architectural and operational support to software engineers in a wide variety of focus areas.
  • Supporting software and data engineering teams and guiding operational best practices.
  • Implementing and overseeing security programs including vulnerability remediation, patch management, IDS/IPS, penetration testing, and interfacing with our corporate InfoSec team.
  • Supporting the development to production code deploy …