FreshRemote.Work

Senior DevOps Infrastructure Engineer

Ontario, Canada (remote)

NMI is seeking a Senior DevOps Engineer with deep Linux, virtualization, and hardware experience who is passionate about running applications in an exceedingly high availability environment within our SRE organization.  This opportunity to work with similarly skilled professionals in a rapidly growing environment offers opportunities to level-up observability and automation skills while maintaining a mission critical, 4-nines availability platform, and participating in environment modernization.

The SRE team is responsible for the operation of all hardware and software within the production and SDLC environments.  This consists of a global network connecting numerous sites which must be highly available 24x7 with a minimal desired target of 99.99% availability.  The successful applicant as a Senior DevOps Infrastructure Engineer will be a core member of the SRE team with the opportunity to work with experts in the infrastructure, networking, and DevOps space.

The Ideal Candidate:

  • Will have a track record of implementing low-toil solutions to traditionally high-touch operational or administrative tasks.
  • Has a deep technical background and can engage with engineers with the nuances of complex systems, while also being able to zoom out and see the bigger picture.
  • Has a high level of competency implementing hardware projects in data center environments (server & storage installation, troubleshooting, decommissioning).
  • Enjoys being challenged to find creative solutions using both legacy and cutting edge technology.  This is a codespeak for us having a legacy system that has to be maintained and improved while also looking at new technology and tools to improve resiliency, performance, ease of administration, and observability.  It’s not all “the fun stuff”.
  • Wants to work with a globally distributed team of similarly skilled professionals, and is comfortable building relationships with teammates up to thousands of miles away.
  • Is as comfortable in a shell or VIM as an accountant is in QuickBooks.
  • Refuses to believe a service or appliance is production ready until they have the metrics and alerts to prove it.

Key duties:

  • Administration - Participate in maintenance and operations of our production environment, including patching, deployment, server administration, and troubleshooting, either using configuration as code tooling or manually.
  • Reliability & Performance - Ensure reliability, availability and performance of services.  Respond to incidents and resolve before they become customer impacting.
  • Projects - Deliver complex solutions that traverse all layers of the technology stack: Operating System, Virtualisation, Network, Storage & Cloud.
  • Data Centre - Participate and coordinate on-site deployments of critical hardware, including servers and storage.
  • Collaboration - Work closely with teammates, software, and security teams to rapidly meet customer, business, and compliance needs.
  • Automation - Drive the automation of operational tasks, and ensure our infrastructure is more like cattle than pets.
  • Observability - Develop and maintain internal and commercial or OSS tools to improve system health, performance, and deployment.
  • Continuous Improvement - Drive never-ending improvement in SRE processes, tools, and methodologies.  Take a leading role in blameless post-mortems to avoid repeat issues or mistakes and clearly document all lessons learned for others.  If you love writing actionable documentation, we’d love to set up an interview.
  • On-Call - Participate in a rotating 24x7 on-call schedule with your team to ensure availability of services across the production environment.

Essential Skills & Experience:

  • 5+ years of experience in Site Reliability Engineering, DevOps, System Administration, or similar roles.
  • Deep experience working in colocation facilities – we have a hybrid footprint, and if you have only worked in the public cloud space, this role is not a great fit for you.
  • Experience using Puppet, Ansible, or other common configuration as code tooling to deploy and configure systems.
  • Strong familiarity with Linux systems (any distro is fine, but we have a preference for RHEL downstreams).
  • Experience using Proxmox, VMWare, or KVM as virtualization platforms for large-scale production environments.
  • Experience administering enterprise grade SANs and load balancers is necessary to be successful in this role.
  • Demonstrated proficiency in one or more scripting or programming languages (e.g., Python, Go, Bash/ZSH, etc.)
  • Multiple years experience proactively implementing and responding to infrastructure, application, and network alerts using industry standard or homebrew toolchains.
  • Strong problem-solving skills and experience working in extreme high availability production environments (99.95% or greater), with high performance requirements, is required.

Preferred Skills and Experience:

  • Experience with F5 BigIP LTMs or NetApp SANs is highly desirable.
  • Experience using Grafana, Prometheus, and the ELK stack for observability is highly desirable.
  • Experience with MySql (any engine variant) will be extremely helpful in this role.
  • Kubernetes experience is a significant plus.  Alternatively, a burning desire to learn it.
  • Experience working with SaaS based WAF/DDoS protection services such as Silverline, CloudFlare, or Akamai is preferred.
  • Prior experience on a team following common agile processes such as Kanban or Scrum would be valuable.
  • Experience in the start-up to scale-up space will be very valuable.  We are not a calcified, enormous enterprise, and move quickly.
  • GitLab experience is a plus.

As well as being a part of something exciting everyday, you will also receive the following benefits:

  • Competitive compensation package
  • 40 hours week with flexi-time
  • Health and Dental Insurance
  • Life, ADD, Short-term and Long-term Disability insurance
  • Extended Health Care/Emergency Travel Assistance
  • Employee Assistance Program
  • Work Remotely
  • Vacation, Personal Wellness and Sick time
  • 10 Paid Holidays
  • Bonusly colleague reward scheme
  • Employee referral scheme with generous financial reward

Do you feel like you have a slightly out of the ordinary career path or history? We are open to all walks of life and very willing to hear your story. Please don’t feel like this should be a barrier to securing a great career at NMI! We appreciate success can come in all shapes and sizes. Fill in the ‘Additional Info’ box on our application to tell us more about your path.

What we do!

NMI enables our partners with choice, and challenges the one-size-fits-all approach to payments. You've probably used NMI in the last 24 hours without even realizing it. We’re the platform that powers success for innovative tech created by SMBs, entrepreneurs and fintech startups. We’re creative problem solvers who help visionaries smash through boundaries and think beyond what’s possible so they can think about what’s next. But we’re not just built for the tech savvy. We democratize the latest payments technology so that everyone can realize the benefits of easy payments across the full spectrum of commerce. We’re all about enabling more payments in more ways and more places.

We believe that having a diverse group of employees strengthens both our work and our workplace. We’re focused on making NMI more diverse and welcoming with initiatives like having a dedicated Diversity, Equity & Inclusion action group, diversity goals for hiring, anonymized resume screening, affinity groups such as our Women's network and LGBTQ+ Network, open forums for discussions on diversity and social justice, and measuring inclusion and belonging as part of our regular employee engagement surveys.

Equal Opportunity

NMI is committed to providing equal employment opportunity for all persons regardless of race, color, religion, sex, age, marital status, national origin, sexual orientation or sexual identity, genetic information, citizen status (except those that do not have the legal right to be employed in the United States), disability, military service, service member, veteran status, or any other basis protected by applicable law.

Please be aware that all offers of employment are made subject to receipt of satisfactory background and financial checks.

Attention job applicants: Please note that in compliance with the data protection regulations within your jurisdiction, any personal information submitted with your job application may be collected and used by NMI for the purpose of recruitment and employment-related activities. By submitting your application, you acknowledge and provide explicit consent to the processing of your personal information as described in our privacy policy found on our website. For more information on how we process your information, please read our privacy policy here: https://www.nmi.com/legal/privacy-policy/

#LI-Remote

 

Salary range, depending on experience:$150,000—$200,000 CAD Apply