Sr Staff Software Engineer – IaaS (Cluster Management) REMOTE
MD Chevy Chase (Office) - JPS
Position Summary
GEICO is seeking an experienced Staff Engineer with a passion for building high-performance, low-maintenance, zero-downtime platforms and applications. You will help drive our insurance business transformation as we transition from a traditional IT model to a tech organization with engineering excellence as its mission, while co-creating a culture of psychological safety and continuous improvement.
Position Description
As a Staff Engineer, you will work with our Distinguished Engineers, Sr. Staff Engineers, and Sr. Engineers to innovate and build new systems, improve and enhance existing systems, and identify new opportunities to apply your knowledge to solve critical problems. You will lead the strategy and execution of a technical roadmap that will increase the velocity of delivering products and unlock new engineering capabilities. The Cluster Management team is driving the development of the next-gen Kubernetes-based container cluster platform, prioritizing security, reliability, scalability, and efficiency. We seek a candidate with deep technical expertise in designing, building, and maintaining secure cluster management systems on OpenStack IaaS, at scale across physical and public cloud environments.
Position Responsibilities
As a Staff Engineer, you will:
Provide technical and thought leadership across diverse areas.
Collaborate with teams, customers, and product managers to address challenges.
Develop and execute a strategic software development plan for IaaS, encompassing containers, cluster management, Kubernetes, and OpenStack. Prioritize security, and optimize for performance and efficiency across the entire development lifecycle.
Own solution quality, usability, and performance.
Mentor and exemplify technical excellence, influencing the engineering and product community.
Share best practices, refine processes, and drive continuous improvement.
Analyze and forecast costs, and integrate them into business plans.
Determine resource needs, assess processes, and ensure adaptability for continuous learning.
Fulfill on-call responsibilities and offer operational support.
Qualifications
Proficient in multi-cluster networking using service mesh technologies like Istio, Consul, or Envoy.
Expertise in multi-cluster metrics, observability, and operations utilizing frameworks such as Grafana and Prometheus.
In-depth understanding of containerization technologies, including Docker, Podman, and Rancher.
Proficient with advanced technologies like ArgoCD, KubeVirt, and Cluster API (CAPI).
In-depth knowledge and practical experience in Linux operating systems, internals, and command-line utilities.
Proven expertise in optimizing CI/CD for streamlined Kubernetes deployment and configuration using GitOps and ArgoCD.
Hands-on experience in public and/or private cloud environments, including OpenStack, Kubernetes, Azure, AWS, and GCP.
Extensive experience in API, Microservices, network, and security architectures, incorporating design patterns.
Strong foundations in software engineering, encompassing the entire software delivery lifecycle.
Professional experience in software development using modern programming languages like Go, Python, or Java.
Experience in security protocols and products, including Active Directory, SAML, and OAuth.
Demonstrated ability to design and implement resilient, scalable, and efficient solutions.
Experience in building architecture and design, covering patterns, reliability, and scaling for both new and existing systems.
Fluent in DevOps concepts and cloud architecture, with a strong foundation in Shell scripting.
Experience with datacenter structure, capabilities, and offerings, with a focus on the Azure platform and its native services.
Possesses relevant cloud and container certifications, coupled with in-depth knowledge of computer science data structures and algorithms.
Strong problem-solving abilities with a proactive approach to risk mitigation, along with excellent communication skills to convey technical concepts to diverse audiences.
Demonstrated ability to excel in dynamic, startup-like environments, showcasing adaptability and resilience.
Experience
8+ years in IaaS/PaaS software development, specializing in Kubernetes cluster management on a cloud platform.
4+ years of hands-on experience in open-source frameworks.
5+ years dedicated to architecture and design.
5+ years working with AWS, GCP, Azure, or other cloud services.
1+ years of experience leading a cross-functional technical team.
Education
Bachelor’s degree in Computer Science, Information Systems, or equivalent education or work experience.
Annual Salary
$105,000.00 - $260,000.00
The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate/annual salary to be offered to the selected candidate. Factors include, but are not limited to, the scope and responsibilities of the role, the selected candidate’s work experience, education and training, the work location, as well as market and business considerations.
Benefits:
As an Associate, you’ll enjoy our Total Rewards Program* to help secure your financial future and preserve your health and well-being, including:
- Premier Medical, Dental and Vision Insurance with no waiting period**
- Paid Vacation, Sick and Parental Leave
- 401(k) Plan
- Tuition Reimbursement
- Paid Training and Licensures
*Benefits may be different by location. Benefit eligibility requirements vary and may include length of service.
**Coverage begins on the date of hire. Must enroll in New Hire Benefits within 30 days of the date of hire for coverage to take effect.
The equal employment opportunity policy of the GEICO Companies provides for a fair and equal employment opportunity for all associates and job applicants regardless of race, color, religious creed, national origin, ancestry, age, gender, pregnancy, sexual orientation, gender identity, marital status, familial status, disability or genetic information, in compliance with applicable federal, state and local law. GEICO hires and promotes individuals solely on the basis of their qualifications for the job to be filled.
GEICO reasonably accommodates qualified individuals with disabilities to enable them to receive equal employment opportunity and/or perform the essential functions of the job, unless the accommodation would impose an undue hardship to the Company. This applies to all applicants and associates. GEICO also provides a work environment in which each associate is able to be productive and work to the best of their ability. We do not condone or tolerate an atmosphere of intimidation or harassment. We expect and require the cooperation of all associates in maintaining an atmosphere free from discrimination and harassment with mutual respect by and for all associates and applicants.