Infrastructure Software Engineer
Remote
The Opportunity
AI is rapidly transforming the world. Whether it’s developing the next generation of human-level intelligence, enhancing voice assistants, or enabling researchers to analyze genetic markers at scale, AI is increasingly integrated into various aspects of our daily lives.
Arize is the leading AI observability and Evaluation platform to help AI teams discover issues, diagnose problems, and improve the results of their AI Applications. We are here to build world class software that helps make AI applications work better.
The Role
As our Infrastructure Software Engineer reporting directly to the CTO, you'll be at the forefront of designing and implementing the systems that power our platform's rapid growth. This is a unique opportunity to shape the foundation of a product that's becoming essential for AI teams across the world.
We're seeking an exceptional Infrastructure Software Engineer who can thrive in a fast-paced environment and has a proven track record of building scalable systems. You'll be a key player in our engineering organization, working directly with the CTO to architect and implement solutions that support our platform's exponential growth.
We are a fully remote company with offices in Bay Area, NYC
What You’ll Work On
- Partner closely with engineering and security teams to architect and scale our infrastructure, supporting Arize's hyper-growth trajectory
- Design and develop the necessary infrastructure for AI fine tuning workloads
- Design and develop best-in-class tooling for internal systems observability and security
- Lead system optimization initiatives including capacity planning, configuration management, and performance tuning.
- Create and maintain productivity tools and frameworks that increases engineering velocity and decreases operational toil
- Optimize infrastructure costs while maintaining high performance
- Drive technical decisions that impact our entire infrastructure stack
What WIll Set You Apart
- 5+ years of experience building infrastructure and developer tools
- A focus on user needs over technology preferences
- Proven track record of improving developer productivity through pragmatic solutions rather than chasing trendy technologies
- Strong empathy for engineering teams' pain points and ability to prioritize impactful solutions over theoretical perfection
- Working knowledge of Kubernetes, Terraform, Bazel
The estimated annual salary for this role is between $125,000 - $225,000, plus a competitive equity package. Actual compensation is determined based on a variety of job-related factors that may include transferable work experience, skill sets, and qualifications. Total compensation also includes a comprehensive benefits package, including medical, dental, vision, a 401(k) plan, unlimited paid time off, a generous parental leave plan, and additional support for mental health and wellness.
While we are a remote-first company, we have opened offices in New York City and the San Francisco Bay Area, as an option for those in those cities who wish to work in-person. For all other employees, there is a WFH monthly stipend to pay for co-working spaces.
More About Arize
Arize’s mission is to make the world’s AI work and work for the people. Our founders came together through a common frustration: investments in AI are growing rapidly across businesses and organizations of all types, yet it is incredibly difficult to understand why a machine learning model behaves the way it does after it is deployed into the real world.
Learn more about Arize in an interview with our founders: https://www.forbes.com/sites/frederickdaso/2020/09/01/arize-ai-helps-us-understand-how-ai-works/#322488d7753c
Diversity & Inclusion @ Arize
Our company's mission is to make AI work and make AI work for the people, we hope to make an impact in bias industry-wide and that's a big motivator for people who work here. We actively hope that individuals contribute to a good culture
- Regularly have chats with industry experts, researchers, and ethicists across the ecosystem to advance the use of responsible AI
- Culturally conscious events such as LGBTQ trivia during pride month
- We have an active Lady Arizers subgroup
Job Profile
RestrictionsFully remote Remote Remote-first with optional in-person work in specific cities
Benefits/Perks401(k) Competitive equity package Comprehensive benefits package Dental Equity Package Fully remote Generous parental leave Generous parental leave plan Medical Mental health support Parental leave Remote-first company Unlimited Paid Time Off Vision WFH stipend
Tasks- Architect and scale infrastructure
- Create and maintain productivity tools
- Design and develop infrastructure for AI workloads
- Drive technical decisions
- Lead system optimization initiatives
- Optimize infrastructure costs
AI Bazel Capacity planning Configuration Management Developer tools Infrastructure Kubernetes Machine Learning Observability Organization Performance Tuning Security System Optimization Terraform
Experience5 years