FreshRemote.Work

Data Reliability Engineer

Hybrid / Bungie-Approved Remote Locations

Data Reliability Engineering at Bungie is a core team of the Central Tech area that keeps our games and tooling running at scale. Our team owns the overall scalability, observability and resilience of the databases, data processing platforms and in-memory key-value stores used throughout the Bungie ecosystem. We partner with our engineering teams and business units on projects, services, designs, and processes We are the stewards of architecture and provide tools and services to enable engineering teams to meet their design requirements.

RESPONSIBILITIES

  • Design, deploy, and maintain highly available and scalable data infrastructure components including Kafka, RabbitMQ, Redis, Elasticsearch, and Graphite
  • Perform capacity planning and scalability assessments for data platforms
  • Troubleshoot and resolve issues related to data processing pipelines, message queuing, and performance including participation in on-call rotation
  • Ensure data security, integrity, and compliance with industry best practices and regulatory requirements
  • Document system configurations, procedures, and operational knowledge
  • Advise service owners on industry and company standards and best practices
  • Maintain reliability and performance levels for core data platform infrastructure
  • Data observability strategy and implementation
  • Data ownership strategy and documentation

REQUIRED SKILLS

  • Strong understanding of Linux operating systems and their administration
  • Effective communication skills and ability to collaborate effectively in a team environment
  • Experience with infrastructure automation and configuration management (e.g., Ansible, Terraform…)
  • Excellent troubleshooting skills and the ability to analyze and resolve complex infrastructure resource and application deployment issues
  • Experience working in a distributed production environment
  • Deep understanding of cluster management areas, such as scaling, consistency tuning, replication, and multi-datacenter configuration
  • Familiarity with time-series monitoring systems & tools (e.g., Datadog, Prometheus, Grafana and ELK)
  • Experience designing and implementing logging and metric pipelines
Listed below is this role's base pay range. In other locations that Bungie has approved for remote work, this pay range may be adjusted up to +/- 10% and some roles may be classified as non-exempt (hourly). 

This range is determined by an array of factors, including training, transferable skills, work experience, business needs, and market demands. Additionally, it is subject to change and may be modified in the future. 

In the journey to make incredible worlds, Bungie employees don't simply do meaningful work - they receive meaningful support. Competitive salaries and discretionary bonus opportunities are just the start. We offer comprehensive healthcare coverage, generous 401(k) matching, flexible time off policies, paid parental leave, social clubs, and so much more. Check out our benefits page: Benefits - Bungie Careers to see all the perks that come with joining Bungie.Bellevue Pay Range$135,000—$163,000 USD 

So you don't meet every requirement listed in this job description? Do you think your experience level is not where it needs to be? Studies show that members of certain underrepresented groups are less likely to apply to jobs unless they believe they meet every single qualification. Bungie is focused on building a diverse, inclusive, and equitable workplace. If you're interested in this position but your past experience doesn't line up perfectly with every qualification in the job description, we want you to apply anyway. The right role for you may be right around the corner!

Most Bungie full-time employees will adopt a digital first approach allowing remote work in Bungie approved locations (outside of positions identified as 100% onsite in Bellevue/Seattle, or individuals preferring a hybrid/flex environment). Prospective full-time employees who are located outside of a Bungie approved state or province will need to establish residency in an approved state or province within 45 days of their start date. Contractors will follow a digital first approach adhering to the location guidelines agreed upon by our third-party employer/vendor and Bungie. Bungie’s remote policy is subject to change at the company’s discretion.

Apply

Job Profile

Restrictions

Bungie-approved remote locations Hybrid

Benefits/Perks

401(k) matching Competitive salaries Comprehensive healthcare Comprehensive healthcare coverage Discretionary bonuses Discretionary bonus opportunities Flexible time off Flexible time off policies Healthcare coverage Hybrid work Paid parental leave Social clubs

Tasks
  • Advise service owners
  • Design and maintain data infrastructure
  • Document system configurations
  • Ensure data security and compliance
  • Maintain data platform reliability
  • Perform capacity planning
  • Troubleshoot data processing issues
Skills

Ansible Cluster Management Communication Configuration Management Datadog Data reliability engineering Documentation ElasticSearch ELK Grafana Graphite Infrastructure Automation Kafka Linux Observability Prometheus RabbitMQ Redis Resilience Scalability Terraform Time-series monitoring Troubleshooting