Staff Engineer - Data Reliability Engineering (REMOTE)
MD Chevy Chase (Office) - JPS
Staff Engineer - Data Reliability Engineering
Position Summary
We are seeking a highly experienced Staff Engineer for Data Reliability Engineering to join our dynamic team. In this role, you will lead the design and implementation of large-scale, modern data platforms with a strong focus on reliability, scalability, and performance. As a key technical leader, you will leverage Open-Source technologies and work with major cloud platforms or large-scale private data centers to ensure our data infrastructure meets the highest standards for availability, security, and efficiency.
Position Description
You will collaborate closely with data engineers, software developers, SREs, and stakeholders across the organization to build systems that provide robust data processing pipelines, monitoring solutions, and automated incident responses, ensuring continuous data integrity and reliability.
Position Responsibilities:
Lead the design and implementation of large-scale, fault-tolerant, and highly available data platforms.
Architect and develop end-to-end data pipelines that ensure the reliability, scalability, and performance of data processing systems.
Drive best practices for data reliability, disaster recovery, monitoring, alerting, and incident management.
Collaborate with cross-functional teams (data engineering, DevOps, SREs) to integrate, test, and improve platform reliability and performance.
Mentor and guide engineers across the organization, promoting a culture of engineering excellence and continuous improvement.
Leverage open-source tools and technologies to enhance platform capabilities, reduce costs, and increase flexibility.
Implement automation strategies for system monitoring, data quality checks, failure recovery, and incident resolution.
Optimize performance and cost efficiency across data infrastructures hosted on major cloud providers (AWS, GCP, Azure) or large-scale private data centers.
Establish and enforce security and compliance standards for data systems.
Qualifications:
Expertise in designing and managing large-scale distributed data systems.
Strong knowledge of modern data platforms (e.g. Snowflake, Spark, Kyuubi, Datalake, Kafka, Airbyte, Trino, Flink, Azure Data Factory, Nifi) and related open-source tools.
Hands-on experience with major cloud platforms (Azure, AWS, GCP) or large-scale private data center environments.
Proficiency in programming and scripting (Python, Java, Scala, Go, etc.) for automation, data processing, and systems engineering.
In-depth knowledge of CI/CD practices, containerization (Docker, Kubernetes), and infrastructure-as-code (Terraform, Ansible, Puppet, Chef).
Strong understanding of database technologies (SQL, NoSQL) and distributed computing frameworks.
Experience with monitoring, alerting, and troubleshooting tools (Prometheus, Grafana, Log Analytics, Datadog, etc.).
Proven ability to mentor engineers and lead technical initiatives across teams.
Excellent communication skills and ability to work effectively in a fast-paced, cross-functional environment.
Preferred Qualifications:
Experience with streaming technologies such as Apache Kafka, Flink, or Spark Streaming.
Expertise in data governance, security, and compliance within large-scale data systems.
Prior experience in disaster recovery planning and execution.
Familiarity with machine learning pipelines and advanced analytics platforms.
Experience:
8+ years of experience in Data Engineering, Site Reliability Engineering (SRE), or Software Engineering roles with a focus on data platforms.
8+ years of professional experience in data software development, programming languages and developing with big data technologies
5+ years of experience with architecture and design
5+ years of experience with AWS, GCP, Azure, or another cloud service
4+ years of experience on open-source development
Education
Bachelor’s Degree in Computer Science, Information Systems, or equivalent education or work experience.
#LI-TK1
Annual Salary
$115,000.00 - $230,000.00The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate/ annual salary to be offered to the selected candidate. Factors include, but are not limited to, the scope and responsibilities of the role, the selected candidate’s work experience, education and training, the work location as well as market and business considerations.
Benefits:
As an Associate, you’ll enjoy our Total Rewards Program* to help secure your financial future and preserve your health and well-being, including:
- Premier Medical, Dental and Vision Insurance with no waiting period**
- Paid Vacation, Sick and Parental Leave
- 401(k) Plan
- Tuition Reimbursement
- Paid Training and Licensures
*Benefits may be different by location. Benefit eligibility requirements vary and may include length of service.
**Coverage begins on the date of hire. Must enroll in New Hire Benefits within 30 days of the date of hire for coverage to take effect.
The equal employment opportunity policy of the GEICO Companies provides for a fair and equal employment opportunity for all associates and job applicants regardless of race, color, religious creed, national origin, ancestry, age, gender, pregnancy, sexual orientation, gender identity, marital status, familial status, disability or genetic information, in compliance with applicable federal, state and local law. GEICO hires and promotes individuals solely on the basis of their qualifications for the job to be filled.
GEICO reasonably accommodates qualified individuals with disabilities to enable them to receive equal employment opportunity and/or perform the essential functions of the job, unless the accommodation would impose an undue hardship to the Company. This applies to all applicants and associates. GEICO also provides a work environment in which each associate is able to be productive and work to the best of their ability. We do not condone or tolerate an atmosphere of intimidation or harassment. We expect and require the cooperation of all associates in maintaining an atmosphere free from discrimination and harassment with mutual respect by and for all associates and applicants.
ApplyJob Profile
Benefits/PerksDental Health and well-being Medical Paid training Paid Training and Licensures Paid Vacation Parental leave Total Rewards Program Tuition reimbursement Vision Vision Insurance
Tasks- Automation
- Collaborate with cross functional teams
- Design and implement data platforms
- Develop data pipelines
- Drive data reliability best practices
- Enforce security standards
- Implementation
- Implement automation strategies
- Mentor engineers
- Optimize performance and cost efficiency
Advanced Analytics Analytics Ansible Apache Apache Kafka Architecture Automation AWS Azure Big Data Big Data Technologies Chef CI/CD Cloud Cloud platforms Communication Compliance Compliance Standards Containerization Continuous Improvement Database technologies Datadog Data engineering Data Governance Data Pipelines Data processing Data processing pipelines Data Quality Data reliability engineering Data Systems Design DevOps Disaster Recovery Distributed computing Docker Engineering Flink GCP Go Governance Grafana Incident Management Java Kafka Kubernetes Log Analytics Machine Learning Monitoring Monitoring Solutions NiFi NoSQL Open Source Technologies Open-source tools Organization Programming languages Prometheus Puppet Python Reliability Reliability Engineering Scala Scripting Security Site Reliability Engineering Snowflake Software Development Software Engineering Spark Spark Streaming SQL Terraform Trino Troubleshooting
Experience8 years
EducationComputer Science Equivalent Equivalent Education Equivalent education or work experience Information Systems Work experience
Certifications