Site Reliability Engineer - Remote
Santa Clara, CA, United States
Company Description
PayNearMe develops technology to facilitate the end-to-end customer payment experience, making it easy for businesses to accept, disburse and manage payments. Our modern and reliable platform lowers the total cost of payments by increasing acceptance rates, driving self-service and simplifying exceptions. We future-proof our clients’ payments roadmap by including all payment types and channels through a single contract and integration. With PayNearMe, businesses can transform the outdated systems holding them back from achieving progress.
PayNearMe has over 200 employees, closed a $45M Series D round in June 2023, and is processing over $1B in payments monthly. We’re headquartered in Silicon Valley with our employees distributed all across the U.S. Help us solve our clients’ biggest payment problems.
As our Site Reliability Engineer, you will design, build, and maintain the systems and infrastructure that power our applications, ensuring their reliability, scalability, and performance. You will bring a software engineering approach to operations, automating processes and continuously improving the infrastructure and tools that support our business needs.
Job Description
- System Administration: Support our EC2 infrastructure to ensure it’s properly configured, reliable, and monitored, while also helping us modernize it towards more automation and containerization.
- Automation: Build and maintain our Ansible (and legacy Puppet) configuration management, while helping us increase our automation and reduce toil.
- Kubernetes and Containers: Deploy, manage, and optimize Kubernetes clusters and containerized applications using Docker. Implement best practices for container orchestration and management.
- Systems and Application Monitoring/Observability: Develop and maintain comprehensive monitoring and observability solutions using Datadog. Ensure detailed visibility into system performance and application health.
- CI/CD Pipeline Management: Create, enhance, and maintain continuous integration and continuous deployment pipelines using GitLab CI. Ensure seamless and reliable deployment processes.
- Security and Compliance: Implement security best practices and ensure compliance with industry standards. Regularly review and update security policies and procedures.
- Collaboration and Support: Work closely with development teams to ensure reliability and scalability of new features and services. Provide technical support and guidance on infrastructure-related issues.
- On-Call Rotation: Participate in an on-call rotation to address production issues and collaborate in incident response efforts.
Qualifications
- Linux System Administration: We’re migrating our monolith to microservices, but this role must still actively maintain our EC2 servers while helping us migrate to Kubernetes.
- Rails Production Environments: Experience supporting production environments running Ruby on Rails applications.
- Cloud Platform Experience: Proficient with cloud platforms such as AWS, GCP, or Azure. Experience with EC2, RDS, VPCs, and security groups is essential.
- Configuration Management: Ansible or equivalent experience for managing large fleets of EC2 or similar servers.
- Infrastructure as Code: Expert in using Terraform for infrastructure as code.
- Kubernetes and Containers: Strong experience with Kubernetes and Docker, including deployment, scaling, and management of containerized applications.
- Monitoring and Observability: Extensive experience with monitoring and observability tools like Datadog, Prometheus, Grafana, ELK stack, or Splunk. Skilled in setting up detailed monitoring and logging systems.
- Engineering Collaboration: Ability to work with other Engineering team members on troubleshooting, support, and projects for both Production and lower-level environments.
- DevOps Best Practices: Deep understanding of DevOps principles, practices, and tools to drive continuous improvement in the software development lifecycle.
Additional Information
Benefits
- Base salary per year (paid semi-monthly)
- Fast-paced and professional work culture
- Stock options with standard startup vesting - 1-year cliff; 4 years total
- $50 monthly communication expense stipend to go towards your phone/internet bill
- $250 stipend to enhance your WFH setup
- Reimbursement for peripheral equipment: monitor (up to $400), keyboard and mouse (up to $200)
- Premium medical benefits including vision and dental (100% coverage for employees)
- Company-sponsored life and disability insurance
- Paid parental bonding leave
- Paid sick leave, jury duty, bereavement
- 401k plan
- Flexible Time Off (our team members typically take off ~3-4 weeks per year)
- Volunteer Time Off
- 13 scheduled holidays
- In-person team meet-ups 4-6 times per year
Salary Range: $205,000 - $215,000
PayNearMe strives to create a workplace where all employees thrive. Our core values represent who we are today and we take pride in the way we work with each other as well as with our stakeholders.
We’re in this together to do the right thing. We deliver real results we are proud of while remaining respectful, transparent, and flexible.
PayNearMe is an equal opportunity employer. We are diligently and thoughtfully working towards cultivating a diverse workforce, which in turn enhances our products and services for the communities we serve. Applicants who represent all backgrounds are strongly encouraged to apply.
—
Candidate information will be treated in accordance with our job applicant privacy notice found at: https://home.paynearme.com/ccpa-privacy-notice-jobs-employees/
Assistance for Disabled Applicants
Alternative formats of this Notice are available to individuals with a disability. Please let us know if you need assistance.
All your information will be kept confidential according to EEO guidelines.
Tasks
- Automate processes
- Collaborate with development teams
- Design and maintain systems
- Implement security best practices
- Incident response
- Manage CI/CD pipelines
- Monitor application performance
- On-Call Rotation
- Provide technical support
- Security and Compliance
Ansible Automation AWS Azure CI/CD Communication Continuous Improvement Datadog Docker EC2 ELK stack GCP GitLab GitLab CI Go Grafana Incident Response Infrastructure as Code Integration Kubernetes Linux Monitoring Observability Payments Pipeline Management Prometheus Puppet Rails RDS Ruby Ruby on Rails Security Security Best Practices Site Reliability Engineering Software Development Splunk Technical Support Terraform Troubleshooting VPC
Education: Business, Equivalent experience
Timezones: America/Anchorage, America/Chicago, America/Denver, America/Los_Angeles, America/New_York, Pacific/Honolulu, UTC-10, UTC-5, UTC-6, UTC-7, UTC-8, UTC-9