Deputy Cloud Operations Manager
100% Remote
Since 2016, T-Rex Solutions (T-Rex) has supported large-scale hybrid-cloud migrations and systems modernization, including the successful deployment of the first-ever online U.S. Census in 2020. T-Rex is actively expanding its IT Modernization services within the Federal Civilian market, specifically supporting the Department of Education’s strategic priorities.
T-Rex is seeking a Deputy Operations and Maintenance (O&M) Manager to help lead our Cloud O&M team as the Department of Education transitions from on-premises infrastructure to the cloud. The ideal candidate will possess a deep understanding of cloud technologies and leadership experience overseeing patching, metrics and KPIs, configuration and inventory management, vulnerability management, system monitoring, backup and disaster recovery, change management, and other operational activities.
Responsibilities:
- Metrics and KPIs: Define, monitor, and report on key performance indicators (KPIs) and metrics related to cloud infrastructure performance and operational efficiency.
- Vulnerability Management: Implement and oversee vulnerability management processes in cloud environments, ensuring timely identification and remediation of security vulnerabilities.
- System Monitoring: Monitor cloud system uptime, downtime, and page response times using monitoring tools to proactively identify and resolve performance issues.
- Backup and Disaster Recovery: Develop and implement backup and disaster recovery plans for cloud-based systems to ensure data integrity and business continuity.
- Change Management: Lead change management processes for cloud infrastructure changes, including planning, testing, and implementation while minimizing impact on operations.
- Operational Communications: Facilitate effective communication within the IT team and with stakeholders regarding cloud system status, incidents, and maintenance activities.
- Patching and Updates: Manage and coordinate patching and updates for cloud-based systems to ensure security and compliance with industry standards.
- Configuration and VM Inventory Management: Maintain accurate records and configurations of virtual machines (VMs) and cloud resources to optimize resource utilization and scalability.
- Provisioning New Infrastructure: Manage the provisioning and deployment of new cloud infrastructure resources based on business requirements and growth projections.
- Network Administration: Oversee network administration activities in cloud environments, including configuration, optimization, and troubleshooting of cloud-based networks.
- STIG Implementations: Ensure compliance with Security Technical Implementation Guides (STIGs) and other regulatory requirements for cloud-based systems and services.
- Ticket Triage: Participate in the triage and resolution of escalated tickets and incidents related to cloud infrastructure, ensuring timely resolution and minimal disruption to business operations.
Requirements:
- Must be a U.S. citizen with the ability to obtain a Public Trust clearance.
- Proven experience in IT operations management, with 5+ years in a managerial or leadership role.
- Strong knowledge of cloud platforms (e.g., AWS, Azure, Google Cloud) and cloud services (IaaS, PaaS, SaaS).
- Experience defining, implementing and managing system metrics and KPIs. Experience presenting metrics to customers.
- Strong understanding of system monitoring, patch management, vulnerability assessment, and mitigation strategies in cloud environments.
- Experience with backup and disaster recovery strategies and implementation for cloud-based systems.
- Proficiency in change and configuration management processes and best practices for cloud deployments.
- Excellent communication skills, with the ability to convey technical concepts to non-technical stakeholders.
Desired Education & Experience:
- Bachelor’s degree desired.
- Hands-on experience with cloud-based infrastructure management, including configuration, automation, and monitoring tools.
- Relevant certifications (e.g., AWS Certified Solutions Architect, Azure Administrator) are a plus.
T-Rex Overview
Established in 1999, T-Rex Solutions, LLC is a proven mid-tier business providing data-centric mission services to the Federal government as it increasingly tries to secure and leverage the power of data. We design, integrate, secure, and deploy advanced technical solutions for our customers so they can efficiently fulfill their critical objectives. T-Rex offers both IT and professional services to numerous Federal agencies and is a leader in providing high quality and innovative solutions in the areas of Cloud and Infrastructure Services, Cyber Security, and Big Data Engineering.
T-Rex is constantly seeking qualified people to join our growing team. We have built a broad client base through our devotion to delivering quality products and customer service, and to do that we need quality individuals. But more than that, we at T-Rex are committed to creating a culture that supports every employee's personal and professional development. T-Rex is committed to remaining an industry leader in compensation and benefits, which include competitive salaries, performance bonuses, training and educational reimbursement, a Transamerica 401(k), and Cigna healthcare benefits.
T-Rex is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, family medical history or genetic information, political affiliation, military service, or other non-merit based factors.
In compliance with pay transparency guidelines, the annual base salary range for this position is $100,000 - $150,000. Please note that the salary information is a general guideline only. T-Rex considers factors such as (but not limited to) scope and responsibilities of the position, candidate’s work experience, education/training, key skills, internal peer equity, as well as market and business considerations when extending an offer.
T-Rex offers a diverse and collaborative work environment, exciting opportunities for professional growth, and generous benefits, including: PTO available to use immediately upon joining (prorated based on start date); paid parental leave; individual and family health, vision, and dental benefits; an annual budget for training, professional development, and tuition reimbursement; and a 401(k) plan with company match fully vested after 60 days of employment, among other benefits.