DevOps Engineer - Site Reliability
Global Headquarters
(*) This is a remote position; however, the candidate must reside within 30 miles of one of the following locations: Boston, MA or Portland, ME
About the Team/Role
The WEX Site Reliability Engineering (SRE) team seeks individuals passionate about developing software and solutions for observability, incident response, reliability, performance, operational excellence, and compliance. As part of the Site Reliability Engineering organization, you will support internal stakeholders and Payment Platform teams, tackling complex challenges and enhancing our engineering teams' and customers' experience. The ideal candidate will have a strong aptitude for learning new technologies, driving meaningful projects to completion, and thriving under pressure while closely collaborating with engineering teams.
As an SRE at WEX, you will be exposed to numerous code bases and languages. You will need to diagnose bugs during on-call events, find performance issues, and write custom tools that automate operational tasks.
How you’ll make an impact
Dig deep into code, networking, operating systems, and/or storage solutions to solve complex issues
Develop automation and utilize monitoring tools to ensure system reliability
Participate in incident response and troubleshooting
Participate in 24x7 Site Reliability rotations and escalation workflows
Identify and address performance bottlenecks through code optimization, configuration changes, or infrastructure upgrade recommendations
Collaborate with development teams to ensure software design meets operational requirements
Continuously improve processes and procedures to increase system reliability and efficiency
Stay up-to-date with the latest industry trends and technologies
Design, code, and debug applications while assisting with CI/CD pipelines, automating infrastructure tasks, and ensuring system scalability and security.
Experience you’ll bring
2+ years of hands-on experience as a Site Reliability Engineer or equivalent role
Development experience or solid working knowledge of at least one major programming language (C#, Java, Go, or Python)
Experience with Cloud Computing platforms (AWS, Azure, GCP)
Ability to thrive in a fast-paced development and operations world
Strong communication and collaboration skills
Experience with observability and logging technologies
Experience with at least one major RDBMS and at least one NoSQL data store
Experience with container technologies such as Docker or Kubernetes
BA/BS degree in Computer Science or a related technical field, or equivalent job experience
Nice to have
Experience with infrastructure as code, preferably Terraform
Working knowledge of designing and building RESTful APIs
Experience with Datadog, Grafana, and Splunk
Familiarity with Agile methodologies and practices
Experience with GitOps
Experience with Apache Kafka and eventing technologies
Job Profile
Location: Boston, MA. Must reside within 30 miles of Boston, MA or Portland, ME
Benefits/Perks: Dental insurance, disability insurance, flexible spending accounts, health insurance, health savings account, life insurance, paid time off, quarterly or annual bonus, retirement savings plan, total compensation package, tuition reimbursement, vision insurance
Tasks
- Automation
- Collaborate with development teams
- Develop automation
- Improve processes for reliability
- Participate in incident response
- Support internal stakeholders
- Utilize monitoring tools
Experience: 2 years
Education: B.A., B.S., Computer Science, Engineering, Equivalent, Technical field
Timezones: America/Anchorage, America/Chicago, America/Denver, America/Los_Angeles, America/New_York, Pacific/Honolulu, UTC-10, UTC-4, UTC-5, UTC-6, UTC-7, UTC-8, UTC-9