Senior Staff Engineer- Observability and Reliability Platform Engineering (REMOTE)
MD Chevy Chase (Office) - JPS
Position Summary
GEICO is seeking an experienced Staff Engineer with a passion for building high-performance, low maintenance, zero-downtime platforms, and applications. You will help drive our insurance business transformation as we transition from a traditional IT model to a tech organization with engineering excellence as its mission, while co-creating the culture of psychological safety and continuous improvement.
Position Description
Our Staff Engineer works with our Sr Staff Engineer and Sr. Engineers to innovate and build new systems, improve, and enhance existing systems as well as identify new opportunities to apply your knowledge to solve critical problems. You will lead Strategy and Execution of a technical roadmap that will increase the velocity of delivering products and unlock new engineering capabilities. The ideal candidate has deep technical expertise in the Open-Source Observability, Data platform domain.
Position Responsibilities
As a Staff Engineer, you will:
Focus on multiple areas and provide technical and thought leadership to the enterprise
Collaborate with product managers, team members, customers, and other engineering teams to solve our toughest problems
Develop and execute technical software development strategy for the Observability Engineering domain
Accountable for the quality, usability, and performance of the solutions
Be a role model and mentor, helping to coach and strengthen the technical expertise and know-how of our engineering and product community. Influence and educate executives
Consistently share best practices and improve processes within and across teams
Analyze cost and forecast, incorporating them into business plans
Determine and support resource requirements, evaluate operational processes, measure outcomes to ensure desired results, demonstrate adaptability and sponsor continuous learning
Willing to take on-call and operation support
Qualifications
Deep hands-on experience in building complex distributed system to process large scale telemetry and architectures to support the scale and performance, with great knowledge on Docker and Kubernetes
Advance knowledge of at least one OOP language such as Java, Go, Python, etc.
Great understanding of open-source databases like MySQL, PostgreSQL, etc. And strong foundation with No-SQL databases like Clickhouse, Cassandra. Apache Trino etc. Knowledge or Big data formats such as Parquet or Avro etc.
Experience in architecting, designing, building Observability platform solutions, Advanced data analytics using Open-Source technologies are a big plus.
Experience building distributed systems
Excellent communication skills – needs to be able to lead projects from the front and interact with clients and sponsors on a regular basis
Experience partnering with engineering teams and transferring research to production
Experience …
This job isn't fresh anymore!
Search Fresh JobsJob Profile
RestrictionsHybrid
Benefits/PerksContinuous learning Dental Health and well-being Hybrid work Medical Paid training Paid Training and Licensures Paid Vacation Parental leave Psychological safety Remote work Total Rewards Program Tuition reimbursement Vision Insurance
Tasks- Analyze cost and forecast
- Analyze costs and forecasts
- Collaborate with Product Managers
- Collaborate with teams
- Develop and execute technical software development strategy
- Enhance existing systems
- Evaluate operational processes
- Improve processes
- Innovate and build new systems
- Interact with clients
- Lead projects
- Lead technical roadmap
- Measure outcomes
- Mentor engineers
- Share best practices
- Share best practices and improve processes
- Support resource requirements
AI Algorithms Ansible Apache Applications Architecture Automation AWS Azure Big Data Building Cassandra CI/CD CircleCI Clickhouse Continuous delivery CS Data Structures CS data structures and algorithms Data Structures Data structures and algorithms Deployment Developer Tooling Docker GCP Git Go Grafana Infrastructure as Code Java Jenkins Kubernetes Lambda Leadership ML Monitoring MySQL NoSQL Observability OOP Operations Parquet PostgreSQL Prometheus Python Real-time Communication Reliability Software Development Software development life cycle Source Code SQL Streaming Data Task Management Technical Roadmap Terraform Test Automation Trino Windows Windows Server Windows Server Administration
Experience5 years
EducationBachelor's Bachelor's degree Bachelor's degree in Computer Science Computer Science Equivalent Education Equivalent education or work experience Information Systems Work experience
Certifications