Implementation Manager, Data Center Operations
Remote / Near one of our Data Centers
CoreWeave is a specialized cloud provider, delivering a massive scale of GPU compute resources on top of the industry’s fastest and most flexible infrastructure. CoreWeave builds cloud solutions for compute intensive use cases — VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming — that are up to 35 times faster and 80% less expensive than the large, generalized public clouds. Learn more at www.coreweave.com.
We are seeking a highly skilled Implementation Manager to join our team, and work alongside our Lead Project Manager to lead the establishment of processes and procedures for all aspects of data center operations, including managing the training and evaluation of data center technicians. This role can be Remote, near any of our Data Centers located across the U.S.
Responsibilities:
-
Establish and Optimize Processes: Develop and implement standardized processes and procedures for efficient and effective data center operations, ensuring maximum uptime, scalability, and security.
-
Data Center Infrastructure Management: Oversee the planning, deployment, and maintenance of data center infrastructure, including servers, storage systems, networking equipment, and power and cooling systems.
-
Resource Allocation and Capacity Planning: Collaborate with cross-functional teams to forecast and allocate resources to meet customer demands and optimize utilization of data center infrastructure.
-
Incident Management: Lead the incident response process and manage critical situations, working closely with technical teams to quickly resolve issues and minimize downtime.
-
Vendor Management: Collaborate with vendors and suppliers to ensure the availability of necessary equipment, negotiate contracts, and maintain strong working relationships.
-
Training and Evaluation: Develop training programs for data center technicians to ensure their competence in handling data center operations. Evaluate and provide feedback on the performance of technicians regularly.
-
Compliance and Security: Ensure compliance with industry standards, regulations, and security protocols. Implement and monitor robust security measures to protect data center infrastructure and customer data.
-
Performance Monitoring and Reporting: Establish monitoring mechanisms to track data center performance, identify bottlenecks, and generate regular reports to management on key operational metrics.
-
Continual Improvement: Identify opportunities for process optimization and operational efficiency. Implement best practices and innovative solutions to enhance data center operations.
Qualifications:
-
Minimum of 5 years of experience in implementation or technical project management in data center operations …
This job isn't fresh anymore!
Search Fresh JobsJob Profile
RestrictionsHybrid workplace
Benefits/PerksGrowth Opportunities Hybrid work Hybrid workplace
Tasks- Develop training programs
- Ensure compliance
- Incident management
- Monitoring
- Monitor performance
- Performance monitoring
- Reporting
- Resource Allocation
- Support
- Vendor management
AI Batch processing Cloud Computing Cloud solutions Communication Compliance Data Center Operations GPU technologies Implementation Implementation management Incident Management Incident Response Leadership Machine Learning Management Monitoring Networking Organization Performance monitoring Process Optimization Project Management Project Management Methodologies Python Rendering Resource Allocation Scripting Scripting Languages Security Storage Team Management Training Development Vendor Management VFX Virtualization
Experience5 years
EducationBachelor's Bachelor's degree Computer Science Information Technology
Certifications