FreshRemote.Work

Data Center Strategy - Facility Engineering

Remote

Lambda's GPU cloud is used by deep learning engineers at Stanford, Berkeley, and Carnegie Mellon. Lambda's on-prem systems power research and engineering at Intel, Microsoft, Kaiser Permanente, major universities, and the Department of Defense.

If you'd like to build the world's best deep learning cloud, join us.

About the Job 

Become a key member of our Data Center Infrastructure Services team as a Principal Data Center Strategist. In this role, you will be instrumental in shaping the future of our data centers. Your responsibilities will include direct engagement with data center providers to evaluate the electrical, mechanical, and operational components of our facilities. You will report to the Vice President of Infrastructure and leverage your extensive knowledge in data center construction and operations. Your expertise will drive thought leadership and ensure optimal performance of our facility portfolio. Additionally, you will spearhead efficiency and build initiatives in both existing facilities and new construction. The ideal candidate will possess profound expertise in data center facilities management and a proven track record of successful implementation of cost saving strategies, and the ability to provide comprehensive technical guidance.

What You'll Do

  • Act as a technical advisor on data center infrastructureAssess new data centers for suitability and compliance with our operational standards.Evaluate and interface directly with data center providers to ensure operational efficiency, appropriate power utilization, and optimal resource allocation.
  • Provide expert troubleshooting support for data center operational issues.
  • Lead after-action reporting and problem remediation processes to continually enhance data center operations.
  • Ensure adherence to best practices for infrastructure concurrent maintainability, server cooling and power configurations, and maintenance to ensure adherence to operational SLAs Serve as a customer-facing data center expert.Provide strategic input on new technologies, building designs, and retrofitting projects to ensure future-ready infrastructure.Collaborate closely with the VP of Infrastructure and other senior leaders to align data center strategies with Lambda's overarching infrastructure goals.
  • Lead the design, deployment, and optimization of data center infrastructure, focusing on power distribution, cooling systems, and environmental controls
  • Drive data center lifecycle controls to ensure technology deployment is aligned and right sized
  • Develop and maintain comprehensive documentation of data center layout and infrastructure topologies to aid in optimizing the costing controls
  • Establish and enforce installation standards and documentation to ensure consistency and efficiency across all data center facilities

You

  • You will know how to build, manage, run and operate a data center at scale.
  • Bring 15+ years of experience in operating, designing, deploying, and optimizing critical data center infrastructure, with a focus on power systems, cooling solutions, and environmental controls
  • Demonstrate advanced proficiency in infrastructure deployment for high power compute environments
  • Have a proven track record of deploying data center operational controls across multiple data center locations
  • Possess a strong character for negotiating terms for design, build, operate and decommission of data center space
  • Detail-oriented with a strong commitment to following established procedures and standards
  • Action-oriented with a passion for continuous learning and professional development
  • Willingness to travel for the setup and optimization of new data center locations

Nice to have

  • Construction Management experience
  • Experience troubleshooting and theoretical knowledge of HPC computer designs
  • Experience working in large-scale campus and portfolio type business models for distributed data center environments
  • Experience collaborating with auditors to ensure compliance with industry standards
  • Previous experience in a leadership or managerial capacity within a data center engineering and operations team

Salary Range Information 

Based on market data and other factors, the salary range for this position is $200,000- $ 247,000 However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

  • We offer generous cash & equity compensation
  • Investors include Gradient Ventures, Google’s AI-focused venture fund
  • We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability
  • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
  • We have a wildly talented team of 300, and growing fast
  • Health, dental, and vision coverage for you and your dependents
  • Commuter/Work from home stipends for select roles
  • 401k Plan with 2% company match
  • Flexible Paid Time Off Plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Apply

Job Profile

Benefits/Perks

401k Plan with 2% company match Cash & equity compensation Commuter/Work from home stipends Flexible paid time off Flexible Paid Time Off Plan Health, dental, and vision coverage

Tasks
  • Assess data centers
  • Engage with providers
  • Establish installation standards
  • Lead after-action reporting
  • Optimize infrastructure
  • Provide strategic input
  • Troubleshoot issues
Skills

Cost-saving strategies Data center management Deep Learning Infrastructure Deployment Technical Troubleshooting

Experience

15+ years