SRE Engineer

Remote Opportunity

Full Time Contract
Openlogix Corporation logo
Openlogix Corporation
Openlogix is a Value Added Reseller & Technology Services Company - APIs, Cloud, BigData, Security, Mobile - IBM & MuleSoft Partner.
Apply now Apply later

Title: SRE EngineerDuration: 6 - 12 monthsLocation: Remote
How You’ll WorkWork closely with Software Engineering (SWE) counterparts and take an active role as a co-owner of production services to ensure services are built, maintained, and operated in a reliable and scalable way.Deep involvement with application services throughout the Software Development Lifecycle, serving as the local SRE domain specialist and point of contact.Collaborate with the software engineering teams to drive operational health improvements, root cause analysis, postmortem discussions and their associated remediations that serve to improve reliability and sublinearly scale operations.Partner with others to use tools, processes, and techniques to sublinearly scale operations and reduce business risk, in areas that include: infrastructure & configuration management.Identify common patterns in challenges with operating services in production, collaborate with other SRE teams to design and implement
What You’ll DoActively participate and contribute to code reviews and technical design documents, with an eye toward identifying performance and reliability bottlenecks.Work with SWE counterparts to identify and mitigate production issues; validate, document and exercise failover/disaster recovery plans and graceful degradation mechanisms policies and standard methodologiesCapacity planning and analysis, and infrastructure change management (including tuning, reshaping, resizing, and migrating infrastructure), for services and their immediate downstreams.Join with SWE service owners on in-progress large engineering projects, including migrating to the latest Twitter technologies and adopting related standard methodologies.Productionalize new services and features, as well as improve production landscape for existing services, providing SRE expertise and implementing standard methodologies in the areas of CI/CD.Attend team meetings, standups, and on-call handoffs.Participate in team on-call rotation.
Qualifications3+years of experience managing, diagnosing, and debugging large-scale distributed systems in production.Practical knowledge of at least one higher-level language (Python, Go, Ruby, or similar).Thorough understanding of Linux servers, specifically RHEL/CentOS.Detailed understanding of tools, methodologies, and analysis techniques in a distributed systems environment.Experience developing infrastructure, configuration, and deployment scripting and automation for large scale / high complexity services in a microservice environment.Experience dealing with large data sets that inform your knowledge around building robust data pipelines and architectures, and tuning java applications.Experience using containerization software such as: Mesos, Kubernetes, Docker or LXCExperience with Lucene based search systems and scatter gather query patterns is desirable.B.S. in Computer Science or equivalent experience.
OpenLogix Corporation is an Equal Opportunity Employer. OpenLogix does not discriminate on the basis of race, color, religion, gender, national origin, age, disability or any other characteristic protected by applicable state or federal civil rights laws.
Job region(s): Worldwide/Anywhere
Job stats:  0  0  0
  • Share this job via
  • or

Explore more Remote Work and WFH career opportunities