Service Reliability Engineer
Draper, UT
It's fun to work in a company where people truly BELIEVE in what they're doing!
We're committed to bringing passion and customer focus to the business.
Corporate Overview
Proofpoint is a leading cybersecurity company protecting organizations’ greatest assets and biggest risks: vulnerabilities in people. With an integrated suite of cloud-based solutions, Proofpoint helps companies around the world stop targeted threats, safeguard their data, and make their users more resilient against cyber-attacks. Leading organizations of all sizes, including more than half of the Fortune 1000, rely on Proofpoint for people-centric security and compliance solutions mitigating their most critical risks across email, the cloud, social media, and the web.
We are singularly devoted to helping our customers protect their greatest assets and biggest security risk: their people. That’s why we’re a leader in next-generation cybersecurity.
Protection Starts with People. Proofpoint.
The Role
As a Service Reliability Engineer at Proofpoint you will develop a deep understanding of the various services and applications that come together to deliver Proofpoint’s next generation security products. You will contribute to the architecture to improve scalability, operability, service reliability, capacity, and performance. You will be responsible for provisioning, maintaining, and scaling our production services within server farms across multiple, world-wide data centers as well as AWS.
We are looking for passion, curiosity, attention to details, taking pride in one's work, taking ownership, and having ideas and opinions. If you’re an enthusiastic team player who cares about the infrastructure, remains calm in crisis, collaborates cross functionally, loves automation, then we want to talk to you.
Your day-to-day
• Build long lasting, effective partnerships across the organization to foster collaboration between Product, Engineering and Operations teams.
• Manage an international 24x7, multi-site production infrastructure powering the Proofpoint services, including deployment, maintenance, troubleshooting, performance tuning, and security.
• Root-cause complex problems and involve multiple stakeholders, network, hardware and software that relate to scaling and performance.
• Ensure proper monitoring, alerting, capacity planning and reporting in the production environment.
• Contribute to the evolving design and architecture of reliable and scalable infrastructure.
• First line of defense during working hours should any alerts or incidents arise.
• Collaborate with product engineering teams to ensure Operations standards are observed, determine resource impacts for upcoming product deployments, and ensure successful product rollouts.
• Participate in an on-call rotation and be willing to jump on escalated issues as needed.
What you bring to the team
Demonstrable skills and 3-5 years’ experience managing, troubleshooting, and tuning Linux systems.
A friendly and collaborative demeanor, working well with cross functional teams.
Experience with industry-standard foundation technologies such as TCP/IP, HTTP, DNS, SMTP, and LDAP.
Experience in management of a large distributed computing environment.
Experience with virtualization – KVM, VMware vSphere, and/or OpenStack
Excellent verbal and written communication skills.
Experience with monitoring and alerting systems.
Experience with industry-standard operational practices such as change management, incident management, and working in colocation datacenters.
Experience with configuration management such as Puppet or Chef.
Experience automating management of systems and applications using Perl, Python, or Ruby desired.
Experience with load-balancing technologies – F5, Netscaler or similar desired.
Experience with Kafka, Elastic Search, Cassandra, and MySQL desired.
Experience with public cloud providers such as Amazon EC2 or Microsoft Azure
BS or in Computer Science, Engineering or related technical discipline, or equivalent experience required
US citizenship required
Why Proofpoint
Protecting people is at the heart of our award-winning lineup of cybersecurity solutions, and the people who work here are the key to our success. We’re a customer-focused and a driven-to-win organization with leading-edge products. We are an inclusive, diverse, multinational company that believes in culture fit, but more importantly ‘culture-add’, and we strongly encourage people from all walks of life to apply.
We believe in hiring the best and the brightest to help cultivate our culture of collaboration and appreciation. Apply today and explore your future at Proofpoint! #LifeAtProofpoint
#LI-Remote
If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!
Consistent with Proofpoint values and applicable law, we provide the following information to promote pay transparency and equity. Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets as set out below. Pay within these ranges varies and depends on job-related knowledge, skills, and experience. The actual offer will be based on the individual candidate. The range provided may represent a candidate range and may not reflect the full range for an individual tenured employee. This role may be eligible for variable pay and/or equity. We offer a competitive benefits package that includes flexible time off, a robust well-being program that provides for 4 global wellbeing days per year, and a 3-week work from anywhere option.
Base Pay Ranges:
SF Bay Area, New York City Metro Area:
Base Pay Range: 116,690.00 - 183,370.00 USDCalifornia (excludes SF Bay Area), Colorado, Connecticut, Illinois, Washington DC Metro, Maryland, Massachusetts, New Jersey, Texas, Washington, Virginia, and Alaska:
Base Pay Range: 96,880.00 - 152,240.00 USDAll other cities and states excluding those listed above:
Base Pay Range: 87,220.00 - 137,060.00 USD ApplyJob Profile
- Collaborate with engineering teams
- Manage production infrastructure
- Monitor and report on capacity
- Participate in on-call rotation
- Troubleshoot performance issues
Alerting Alerting systems AWS Azure Change Management Chef Cloud-based solutions Communication Configuration Management Cybersecurity DNS F5 HTTP Incident Management KVM LDAP Linux Load balancing Monitoring Netscaler OpenStack Perl Puppet Python Ruby SMTP TCP/IP Virtualization VMWare VMware vSphere
Experience3 - 5 years
TimezonesAmerica/Anchorage America/Chicago America/Denver America/Los_Angeles America/New_York Pacific/Honolulu UTC-10 UTC-5 UTC-6 UTC-7 UTC-8 UTC-9